Hi there
Welcome to my Homepage!
Hi! I am Qifan Liang (Simon Leong), a first-year master student (MComp AI) at the Sound and Music Computing Lab, National University of Singapore, supervised by Prof. Ye Wang. Previously, I conducted computer vision research under Prof. Jens Rittscher at Oxford University and Prof. Zhen Han, Prof. Zhongyuan Wang at National Multimedia Software Engineering Technology Research Center, Wuhan University.
Feel free to reach out if you are interested in collaboration or potential opportunities!
โComputer vision will open our eyes to a new world of possibilities, where machines not only see but understand the world as we do.โ โ Inspired by Fei-Fei Li
- Controllable generative modeling in Audio-Visual Multimodal Learning.
- Low- and high-level computer vision for effective visual representation and understanding.
- Foundations and applications of human-centric perception and 3D scene understanding.
- 2026.05 ๐๐ Two papers submitted to
NeurIPS 2026. Thanks to all co-authors!
- 2026.04 ๐๐ One paper accepted to
ACL 2026, nominated for Oral by SAC. Thanks to all co-authors!
- 2025.12 ๐๐ One paper accepted to
AAAI 2026. Thanks to all co-authors! - 2025.11 ๐ I joined the Sound and Music Computing Lab at
National University of Singapore, supervised by Prof. Ye Wang. - 2025.08 ๐ I joined the MComp AI program at
National University of Singapore. - 2025.05 My undergraduate thesis was awarded as an Outstanding Graduation Thesis at
Wuhan University. - 2025.05 I received the Lei Jun Computer Innovation and Development Grant and the Lei Jun Computer Research Grant from
Xiaomi. - 2025.03 I received the honor of Outstanding Graduate of the School of Computer Science at
Wuhan University. - 2024.11 I won the
DiDi Inc. Outstanding Undergraduate Scholarship (6/685 in department) at
Wuhan University. - 2024.10 I won the Merit Student Scholarship (Top 10%, school-wide) at
Wuhan University. - 2024.09 One Chinese Software Copyright "All-weather High-precision Transmission Tower Vibration Monitoring System" has been published.
- 2024.06 My project has won 2 national awards in 2024.
- 2024.05 ๐๐ One paper accepted by TOMM 2024. Thanks to all co-authors!
- 2024.03 I won the
Lei Jun Computer Innovation and Development Fund (Top 7.2%, major-wide) at
Wuhan University. - 2023.11 I won the
Samsung Group Outstanding Undergraduate Scholarship (1/685 in department) at
Wuhan University. - 2023.10 I won the First-Class Scholarship (Top 5%, school-wide) at
Wuhan University. - 2023.09 One Chinese Software Copyright "Innovative Integrated Platform for Emergency Monitoring of Forest Fires" has been published.
- 2023.06 My project has won 3 national awards in 2023.
2025.11 โ Present
Master of Computing (AI) at Sound and Music Computing Lab, supervised by Prof. Ye Wang
2024.06 โ 2024.10
Remote Summer Research Intern at Oxford Big Data Institute, supervised by Prof. Jens Rittscher
2023.05 โ 2025.06
Research Assistant at National Multimedia Software Engineering Technology Research Center, supervised by Prof. Zhen Han
(* corresponding author ย ยทย โ equal contribution)
Junchuan Zhaoโ , Qifan Liangโ , Ye Wang*
A two-stage framework for personalized co-speech gesture generation that disentangles content from style via a semantic-guided RVQ-VAE, then uses masked generative and style residual transformers to produce semantically coherent, style-consistent gestures with fine-grained part-wise control.
arXiv 2026 [arXiv] [website]
Qifan Liangโ , Y Liuโ , R Weiโ , N Lu, J Zhao*, Ye Wang
A training-free framework for fine-grained intra-utterance emotion and duration control in TTS, using segment-aware emotion and duration conditioning with causal masking and monotonic stream alignment filtering.
ACL 2026 ๐ Oral Nominee (SAC) [arXiv] [website] [code]
Qifan Liang, Junlin Li, Zhen Han*, Xihao Wang, Zhongyuan Wang, Bin Mei
A smoke-type-aware network (STANet) for laparoscopic video desmoking, and introduce STSVD, a large-scale synthetic dataset with smoke type annotations across 28 surgical scenarios.
AAAI 2026 [arXiv] [website] [code]
Yongkang Li, Qifan Liang, Wenjun Mai, Zhen Han*, Zhongyuan Wang
A global-local asymmetric translation framework for few-shot face sketch-to-photo synthesis, combining style transfer and structural alignment to produce realistic photo-realistic outputs.
TOMM 2024 [paper]
-
arXiv 2026
PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation
[arXiv] [website] -
ACL 2026
๐ Oral Nominee (SAC)
TED-TTS: Training-Free Intra-Utterance Emotion and Duration Control for Text-to-Speech Synthesis
[arXiv] [website] [code] -
AAAI 2026
Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and Dataset
[arXiv] [website] [code] -
TOMM 2024
Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation
[paper]
Qifan Liang (Team Lead)
An intelligent forest fire monitoring platform integrating multi-source perception, real-time prediction, and emergency response.
Project [code]
๐ National Grand Prize (Top 0.5%) ยท 2023 National University Student Surveying & Mapping Competition
๐ National First Prize (Top 1.7%) ยท 2023 "Dingxin Cup" National Youth Innovation Competition
๐ National Second Prize (Top 0.7%) ยท 2023 Chinese Collegiate Computing Competition
- 2025.05 Outstanding Graduation Thesis ยท Wuhan University
- 2025.05 Lei Jun Computer Innovation and Development Fund & Lei Jun Computer Research Grant (Rate: 7.2%, major-wide) ยท Xiaomi Corporation
- 2025.03 Outstanding Graduate ยท School of Computer Science, Wuhan University
- 2024.11 Outstanding Undergraduate Scholarship (10/685, major-wide) ยท DiDi Inc.
- 2024, 2023.10 Merit Student (Top 5%, school-wide) ยท Wuhan University
- 2024.05 National University Studentsโ Innovation & Entrepreneurship Fund (Top 3%, school-wide) ยท Wuhan University
- 2024.03 Lei Jun Computer Innovation and Development Fund (Rate: 7.2%, major-wide) ยท Xiaomi Corporation
- 2023.11 Outstanding Undergraduate Scholarship (1/685, major-wide) ยท Samsung Group
- 2023.10 First-Class Scholarship (Top 5%, school-wide) ยท Wuhan University
- 2023.06 National Grand Prize (Top 0.5%, nation-wide) ยท 2023 National University Student Surveying & Mapping Competition
- 2022.10 Outstanding Student (Top 30%, school-wide) ยท Wuhan University
- Reviewer, AAAI Conference on Artificial Intelligence (AAAI 2026)
- Poster Presenter, AAAI Conference on Artificial Intelligence (AAAI 2026)
- Presenter, Annual Meeting of the Association for Computational Linguistics (ACL 2026)
- Conference Participant, China Computer Federation (CCF) Computer Networking Conference, Wenzhou, China, 2023
- Conference Participant, Intelligent Transportation Networking and Telematics Yongjia Forum, Yongjia, China, 2023