CVPR 2026
Learning Latent Proxies for Controllable Single-Image Relighting
Haoze Zheng, Zihao Wang, Xianfeng Wu, Yajing Bai, Yexin Liu, Yun Li, Xiaogang Xu,
Harry Yang
CVPR 2026
Group Editing: Edit Multiple Images in One Go
Yue Ma, Xinyu Wang, Qianli Ma, Qinghe Wang, Mingzhe Zheng, Xiangpeng Yang, Hao
Li, Chongbo Zhao, Jixuan Ying, Harry Yang, Hongyu Liu, Qifeng Chen
CVPR 2026 (Findings)
DenDiff: Density-Guided Diffusion for Quantity-Aware Image Synthesis
Bo Gao, Haoyu Liang, Harry Yang, Ser-Nam Lim
CVPR 2026 (Findings)
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for
Subject-driven Image Generation and Manipulation
Yexin Liu, Manyuan Zhang, Yueze Wang, Hongyu Li, Dian Zheng, Weiming Zhang,
Changsheng Lu, Xunliang Cai, Yan Feng, Peng Pei, Harry Yang
CVPR 2026 (Findings)
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
Shunian Chen, Hejin Huang, Yexin Liu, Zihan Ye, Pengcheng Chen, Chenghao Zhu,
Michael Guan, Rongsheng Wang, Junying Chen, Jianye Hou, Bo Li, Guanbin Li, Ser-Nam Lim, Harry Yang,
Benyou Wang
ICLR 2026
AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer
Pengjun Fang, Yingqing He, Yazhou Xing, Qifeng Chen, Ser-Nam Lim, Harry Yang
ICLR 2026
EditAnyShape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Zeqian Long, Mingzhe Zheng, Kunyu Feng, Xinhua Zhang, Hongyu Liu, Harry Yang,
Linfeng Zhang, Qifeng Chen, Yue Ma
ICLR 2026
Deforming Videos to Masks: Flow Matching for Referring Video Segmentation
Zanyi Wang, Dengyang Jiang, Liuzhuozheng Li, Sizhe Dang, Chengzu Li, Harry Yang,
Guang Dai, Mengmeng Wang, Jingdong Wang
Technical Report
INT4 Quantization for FlashAttention
Yaofu Liu, Harry Yang
arXiv 2026
Thinking in Loops: Scaling Visual ARC with Looped Transformers
Wen-Jie Shu, Xuerui Qiu, Rui-Jie Zhu, Harold Haodong Chen, Yexin Liu, Harry Yang
Journal of Technology in Behavioral Science
Reducing depressive symptoms through AI-guided narrative self-films: Results from a
randomized controlled trial
Elvin Yao, Harry Yang
arXiv 2025
AlignVid: Training-Free Attention Scaling for Semantic Fidelity in Text-Guided
Image-to-Video Generation
Yexin Liu, Wen-Jie Shu, Zile Huang, Haoze Zheng, Yueze Wang, Manyuan Zhang,
Ser-Nam Lim, Harry Yang
arXiv 2025
Distribution Matching Distillation Meets Reinforcement Learning
Dengyang Jiang, Dongyang Liu, Zanyi Wang, Qilong Wu, Liuzhuozheng Li, Hengzhuang
Li, Xin Jin, David Liu, Zhen Li, Bo Zhang, Mengmeng Wang, Steven Hoi, Peng Gao, Harry Yang
AAAI 2026 (Poster)
Next Patch Prediction for AutoRegressive Visual Generation
Yatian Pang, Peng Jin, Shuo Yang, Bin Lin, Bin Zhu, Zhenyu Tang, Liuhan Chen,
Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan
COLM 2025
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments
Yuzhe Yang, Yipeng Du, Ahmad Farhan, Claudio Angione, Yue Zhao, Harry Yang,
Fielding Johnston, James Buban, Patrick Colangelo
NeurIPS 2025
Hierarchical Fine-Grained Preference Optimization for Physically Plausible Video
Generation
Harold Haodong Chen, Haojian Huang, Qifeng Chen, Harry Yang, Ser-Nam Lim
NeurIPS 2025
When Semantics Mislead Vision: Mitigating Hallucinations in MLLMs
Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam
Lim, Harry Yang, Nicu Sebe
NeurIPS 2025 NextVid Workshop (Oral)
VideoGen-of-Thought: Step-by-Step Generation of Multi-Shot Videos
Mingzhe Zheng, Yongqi Xu, Haojian Huang, Xuran Ma, Yexin Liu, Wenjie Shu, Yatian
Pang, Feilong Tang, Qifeng Chen, Harry Yang, Ser-Nam Lim
ICCV 2025
DreamDance: Animating Human Images by Enriching 3D Geometry Cues
Yatian Pang, Bin Zhu, Bin Lin, Mingzhe Zheng, Francis E. H. Tay, Ser-Nam Lim,
Harry Yang, Li Yuan
ICCV 2025
Model Reveals What to Cache: Profiling-Based Feature Reuse
Xuran Ma, Yexin Liu, Yaofu Liu, Xianfeng Wu, Mingzhe Zheng, Zihao Wang, Ser-Nam
Lim, Harry Yang
CVPR 2025
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
Yexin Liu, Zhengyang Liang, Yueze Wang, Xianfeng Wu, Feilong Tang, Muyang He,
Jian Li, Zheng Liu, Harry Yang, Ser-Nam Lim, Bo Zhao
ICLR 2025
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations
Feilong Tang, Zile Huang, Chengzhi Liu, Qiang Sun, Harry Yang, Ser-Nam Lim
ICLR 2023
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan
Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman
ECCV 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin
Huang, Devi Parikh
CVPR 2017
High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis
Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li
Feb 2026
Five papers accepted to CVPR 2026 (2 in main conference and 3 in
findings).
Feb 2026
Visit XbotPark in Songshanhu and meet with Prof. Zexiang
Li.
Feb 2026
Panel speaker at Desci Hong Kong 2026.
Jan 2026
Three papers accepted to ICLR 2026.
Dec 2025
Awarded 2026 HKUST-UStA Global Knowledge Network Awards/Joint Seed
Funding.
Dec 2025
Exhibiting Cosmos Mapping: Unlimited Exploration (with Yuyang
Jiang) at Touching the Void: Art Without an Object. Dec 19-21. Blanc Gallery, 15 E 40th
St, New York.
Dec 2025
Selected by
Hong Kong Monetary Authority (HKMA) for GenAI Sandbox
testing (Phase 2).
SCMP
Dec 2025
Giving talk and serving as panel at SIGGRAPH Asia Birds of a
Feather: "Working in an Interdisciplinary Department: When Art and Technology Intertwine".
Dec 2025
Organizer for the exhibition 【AMC ×
CMA】共息信号:缠联的感知与算法|港科大
× 港科广 第二届跨校区艺术展览 (Entangled Signals: Perceptions and Algorithms Entwined).
Dec 2025
Invited Mr. Xiangchen Kong (Zhejiang TV) and Mr. Yu
Chen (Renowned Designer) for job talks.
Nov 2025
Giving a keynote speech at From Vibe to Viable, Build Real Apps with AI
Coding + No-Code Workshop - Hong Kong.
Oct 2025
Serving as Area Chair for ICLR 2026 and
AAAI 2026.
Sep 2025
Giving a
talk
at
HKUST-GZ CMA Seminar.
July 2025
Internship placements: Congratulations to my first-year PhD students for securing
research internships at Kuaishou (Kling),
Tencent, and Bytedance.
June 2025
Awarded GRF/ECS 2025-26 funding.
June 2025
Serving as Associate Editor for APSIPA Transactions on Signal and
Information Processing.
June 2025
Visiting Meta AI, New York.
June 2025
- Interviewed by RTHK on GenAI and video generation,
discussing applications in Hong Kong local community. (Cantonese)
- Interviewed by CNN on Embodied AI, discussing the current status,
challenges, and future directions.
Feb 2025
- Panelist at the Hong Kong Web3 & AI Builder
Workshop, invited by Prof. Xiaofan Liu, speaking on “When AI Meets
Web3: Redefining the Future for Developers and Builders.”
- Panel speaker at DeSci HK 2025 during the 2025 HKG Consensus Web3
Conference (hosted by CityU), invited by Prof. Yu Wang.
Jan 2025
Awarded HKUST-POSTECH Joint Research Seed Grant.
Dec 2024
Video project approved for HKSTP Incubation Program (3 years).
Aug 2024
Hosted a roundtable at Foresight 2024 in Hong Kong.
Sep 2024
Visited Abu Dhabi (Royal Family meeting) and Token2049 Singapore.