Hi there Welcome to my Homepage!

Hi! I am Qifan Liang (Simon Leong), a first-year master student (MComp AI) at the Sound and Music Computing Lab, National University of Singapore, supervised by Prof. Ye Wang. Previously, I conducted computer vision research under Prof. Jens Rittscher at Oxford University and Prof. Zhen Han, Prof. Zhongyuan Wang at National Multimedia Software Engineering Technology Research Center, Wuhan University.

Feel free to reach out if you are interested in collaboration or potential opportunities!

๐Ÿ”ฌ Research Interests

โ€œComputer vision will open our eyes to a new world of possibilities, where machines not only see but understand the world as we do.โ€ โ€” Inspired by Fei-Fei Li

  • Controllable generative modeling in Audio-Visual Multimodal Learning.
  • Low- and high-level computer vision for effective visual representation and understanding.
  • Foundations and applications of human-centric perception and 3D scene understanding.
๐Ÿ“ข I am actively looking for 27 Fall PhD positions and research internship opportunities. Feel free to reach out!
๐Ÿ“ฐ News
  • 2026.05 ๐ŸŽ‰๐ŸŽ‰ Two papers submitted to NeurIPS 2026. Thanks to all co-authors!
  • 2026.04 ๐ŸŽ‰๐ŸŽ‰ One paper accepted to ACL 2026, nominated for Oral by SAC. Thanks to all co-authors!
  • 2025.12 ๐ŸŽ‰๐ŸŽ‰ One paper accepted to AAAI 2026. Thanks to all co-authors!
  • 2025.11 ๐ŸŽ‰ I joined the Sound and Music Computing Lab at National University of Singapore, supervised by Prof. Ye Wang.
  • 2025.08 ๐ŸŽ‰ I joined the MComp AI program at National University of Singapore.
  • 2025.05 My undergraduate thesis was awarded as an Outstanding Graduation Thesis at Wuhan University.
  • 2025.05 I received the Lei Jun Computer Innovation and Development Grant and the Lei Jun Computer Research Grant from Xiaomi.
  • 2025.03 I received the honor of Outstanding Graduate of the School of Computer Science at Wuhan University.
  • 2024.11 I won the DiDi Inc. Outstanding Undergraduate Scholarship (6/685 in department) at Wuhan University.
  • 2024.10 I won the Merit Student Scholarship (Top 10%, school-wide) at Wuhan University.
  • 2024.09 One Chinese Software Copyright "All-weather High-precision Transmission Tower Vibration Monitoring System" has been published.
  • 2024.06 My project has won 2 national awards in 2024.
  • 2024.05 ๐ŸŽ‰๐ŸŽ‰ One paper accepted by TOMM 2024. Thanks to all co-authors!
  • 2024.03 I won the Lei Jun Computer Innovation and Development Fund (Top 7.2%, major-wide) at Wuhan University.
  • 2023.11 I won the Samsung Group Outstanding Undergraduate Scholarship (1/685 in department) at Wuhan University.
  • 2023.10 I won the First-Class Scholarship (Top 5%, school-wide) at Wuhan University.
  • 2023.09 One Chinese Software Copyright "Innovative Integrated Platform for Emergency Monitoring of Forest Fires" has been published.
  • 2023.06 My project has won 3 national awards in 2023.
๐ŸŽ“ Experience
National University of Singapore - SMC Lab
2025.11 โ€“ Present
Master of Computing (AI) at Sound and Music Computing Lab, supervised by Prof. Ye Wang
Oxford University โ€“ Big Data Institute
2024.06 โ€“ 2024.10
Remote Summer Research Intern at Oxford Big Data Institute, supervised by Prof. Jens Rittscher
Wuhan University โ€“ Multimedia Lab
2023.05 โ€“ 2025.06
Research Assistant at National Multimedia Software Engineering Technology Research Center, supervised by Prof. Zhen Han
๐Ÿ“„ Publications

(* corresponding author ย ยทย  โ€  equal contribution)

PersonaGest
PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation
Junchuan Zhaoโ€ , Qifan Liangโ€ , Ye Wang*
A two-stage framework for personalized co-speech gesture generation that disentangles content from style via a semantic-guided RVQ-VAE, then uses masked generative and style residual transformers to produce semantically coherent, style-consistent gestures with fine-grained part-wise control.
arXiv 2026   [arXiv] [website]
TED-TTS
TED-TTS: Training-Free Intra-Utterance Emotion and Duration Control for Text-to-Speech Synthesis
Qifan Liangโ€ , Y Liuโ€ , R Weiโ€ , N Lu, J Zhao*, Ye Wang
A training-free framework for fine-grained intra-utterance emotion and duration control in TTS, using segment-aware emotion and duration conditioning with causal masking and monotonic stream alignment filtering.
ACL 2026   ๐ŸŽ™ Oral Nominee (SAC) [arXiv] [website] [code]
STSVD
Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset
Qifan Liang, Junlin Li, Zhen Han*, Xihao Wang, Zhongyuan Wang, Bin Mei
A smoke-type-aware network (STANet) for laparoscopic video desmoking, and introduce STSVD, a large-scale synthetic dataset with smoke type annotations across 28 surgical scenarios.
AAAI 2026   [arXiv] [website] [code]
TOMM
Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation
Yongkang Li, Qifan Liang, Wenjun Mai, Zhen Han*, Zhongyuan Wang
A global-local asymmetric translation framework for few-shot face sketch-to-photo synthesis, combining style transfer and structural alignment to produce realistic photo-realistic outputs.
TOMM 2024   [paper]
๐Ÿ’ก Projects
Forest Fire
Multi-Source Perception and Intelligent Prediction Emergency Monitoring Platform for Forest Fires
Qifan Liang (Team Lead)
An intelligent forest fire monitoring platform integrating multi-source perception, real-time prediction, and emergency response.
Project   [code]
๐Ÿ† National Grand Prize (Top 0.5%) ยท 2023 National University Student Surveying & Mapping Competition
๐Ÿ† National First Prize (Top 1.7%) ยท 2023 "Dingxin Cup" National Youth Innovation Competition
๐Ÿ† National Second Prize (Top 0.7%) ยท 2023 Chinese Collegiate Computing Competition
๐Ÿ† Honors & Awards
  • 2025.05 Outstanding Graduation Thesis ยท Wuhan University
  • 2025.05 Lei Jun Computer Innovation and Development Fund & Lei Jun Computer Research Grant (Rate: 7.2%, major-wide) ยท Xiaomi Corporation
  • 2025.03 Outstanding Graduate ยท School of Computer Science, Wuhan University
  • 2024.11 Outstanding Undergraduate Scholarship (10/685, major-wide) ยท DiDi Inc.
  • 2024, 2023.10 Merit Student (Top 5%, school-wide) ยท Wuhan University
  • 2024.05 National University Studentsโ€™ Innovation & Entrepreneurship Fund (Top 3%, school-wide) ยท Wuhan University
  • 2024.03 Lei Jun Computer Innovation and Development Fund (Rate: 7.2%, major-wide) ยท Xiaomi Corporation
  • 2023.11 Outstanding Undergraduate Scholarship (1/685, major-wide) ยท Samsung Group
  • 2023.10 First-Class Scholarship (Top 5%, school-wide) ยท Wuhan University
  • 2023.06 National Grand Prize (Top 0.5%, nation-wide) ยท 2023 National University Student Surveying & Mapping Competition
  • 2022.10 Outstanding Student (Top 30%, school-wide) ยท Wuhan University
๐Ÿค Services
  • Reviewer, AAAI Conference on Artificial Intelligence (AAAI 2026)
  • Poster Presenter, AAAI Conference on Artificial Intelligence (AAAI 2026)
  • Presenter, Annual Meeting of the Association for Computational Linguistics (ACL 2026)
  • Conference Participant, China Computer Federation (CCF) Computer Networking Conference, Wenzhou, China, 2023
  • Conference Participant, Intelligent Transportation Networking and Telematics Yongjia Forum, Yongjia, China, 2023