Research

(Click here for my full publication list or here for my Google Scholar profile)

Selected Projects

[Image/Video Generation] [3D Human Modeling] [Face Image/Video Processing] [Photometric Stereo] [Scene Text Recognition] [Transparent Object Reconstruction] [Mirror Surface Reconstruction] [Camera Calibration]

Image/Video Generation

	Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers. Zhengyao Lv, Tianlin Pan, Chenyang Si, Zhaoxi Chen, Wangmeng Zuo, Ziwei Liu, and Kwan-Yee K. Wong. ICCV 2025 (to appear) [BibTeX] [Paper] [Project] [Code]
	Dual-Expert Consistency Model for Efficient and High-Quality Video Generation. Zhengyao Lv, Chenyang Si, Tianlin Pan, Zhaoxi Chen, Kwan-Yee K. Wong, Yu Qiao, and Ziwei Liu. ICCV 2025 (to appear) [BibTeX] [Paper] [Project] [Code]
	ArtiFade: Learning to Generate High-quality Subject from Blemished Image. Shuya Yang, Shaozhe Hao, Yukang Cao, and Kwan-Yee K. Wong. CVPR 2025 [BibTeX] [Paper] [Project] [Code]
	FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality. Zhengyao Lv, Chenyang Si, Junhao Song, Zhenyu Yang, Yu Qiao, Ziwei Liu, and Kwan-Yee K. Wong. ICLR 2025 [BibTeX] [Paper] [Project] [Code]
	BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities. Shaozhe Hao, Xuantong Liu, Xianbiao Qi, Shihao Zhao, Bojia Zi, Rong Xiao, Kai Han, and Kwan-Yee K. Wong. ICLR 2025 [BibTeX] [Paper] [Project] [Code]
	Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation. Shihao Zhao, Shaozhe Hao, Bojia Zi, Huaizhe Xu, and Kwan-Yee K. Wong. ECCV 2024 [BibTeX] [Paper] [Project] [Code]
	ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction. Shaozhe Hao, Kai Han, Zhengyao Lv, Shihao Zhao, and Kwan-Yee K. Wong. ECCV 2024 [BibTeX] [Paper] [Project] [Code]
	PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis. Zhengyao Lv, Yuxiang Wei, Wangmeng Zuo, and Kwan-Yee K. Wong. CVPR 2024 [BibTeX] [Paper] [Project] [Code]
	Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models. Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, and Kwan-Yee K. Wong. NeurIPS 2023 [BibTeX] [Paper] [Project] [Code]

3D Human Modeling

	AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation. Yukang Cao, Liang Pan, Kai Han, Kwan-Yee K. Wong, and Ziwei Liu. ICLR 2025 [BibTeX] [Paper] [Project] [Code]
	DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models. Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, and Kwan-Yee K. Wong. CVPR 2024 [BibTeX] [Paper] [Project] [Code]
	HeadSculpt: Crafting 3D Head Avatars with Text. Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, and Kwan-Yee K. Wong. NeurIPS 2023 [BibTeX] [Paper] [Project] [Code]
	SeSDF: Self-evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction. Yukang Cao, Kai Han, and Kwan-Yee K. Wong. CVPR 2023 [BibTeX] [Paper] [Project] [Code]
	JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction. Yukang Cao, Guanying Chen, Kai Han, Wenqi Yang, and Kwan-Yee K. Wong. CVPR 2022 [BibTeX] [Paper] [Project] [Code]

Face Image/Video Processing

	RIGID: Recurrent GAN Inversion and Editing of Real Face Videos and Beyond. Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, and Ping Luo. IJCV 2025 [BibTeX] [Paper] [Project] [Code]
	RIGID: Recurrent GAN Inversion and Editing of Real Face Videos. Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, and Ping Luo. ICCV 2023 [BibTeX] [Paper] [Project] [Code]
	Deep Face Video Inpainting via UV Mapping. Wenqi Yang, Zhenfang Chen, Chaofeng Chen, Guanying Chen, and Kwan-Yee K. Wong. TIP 2023 [BibTeX] [Paper]
	Semi-supervised Cycle-GAN for face photo-sketch translation in the wild. Chaofeng Chen, Wei Liu, Xiao Tan, and Kwan-Yee K. Wong. CVIU 2023 [BibTeX] [Paper] [Code]
	Progressive semantic-aware style transformation for blind face restoration. Chaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, and Kwan-Yee K. Wong. CVPR 2021 [BibTeX] [Paper] [Code]
	Learning Spatial Attention for Face Super-Resolution. Chaofeng Chen, Dihong Gong, Hao Wang, Zhifeng Li, and Kwan-Yee K. Wong. TIP 2020 [BibTeX] [Paper] [Code]
	Semi-supervised Learning for Face Sketch Synthesis in the Wild. Chaofeng Chen, Wei Liu, Xiao Tan, and Kwan-Yee K. Wong. ACCV 2018 [BibTeX] [Paper] [Code]
	Face Sketch Synthesis with Style Transfer using Pyramid Column Feature. Chaofeng Chen, Xiao Tan, and Kwan-Yee K. Wong. WACV 2018 [BibTeX] [Paper] [Code]
	Markov Weight Fields for Face Sketch Synthesis. Hao Zhou, Zhanghui Kuang, and Kwan-Yee K. Wong. CVPR 2012 [BibTeX] [Paper]

Photometric Stereo

	PS-NeRF: Neural Inverse Rendering for Mulit-view Photometric Stereo. Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, and Kwan-Yee K. Wong. ECCV 2022 [BibTeX] [Paper] [Project] [Code]
	Deep Photometric Stereo for Non-Lambertian Surfaces. Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, and Kwan-Yee K. Wong. TPAMI 2022 [BibTeX] [Paper] [Project] [Code]
	What is Learned in Deep Uncalibrated Photometric Stereo? Guanying Chen, Michael Waechter, Boxin Shi, Kwan-Yee K. Wong, and Yasuyuki Matsushita. ECCV 2020 [BibTeX] [Paper] [Project] [Code]
	Self-calibrating Deep Photometric Stereo Networks. Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, and Kwan-Yee K. Wong. CVPR 2019 [BibTeX] [Paper] [Project] [Code]
	PS-FCN: A Flexible Learning Framework for Photometric Stereo. Guanying Chen, Kai Han, and Kwan-Yee K. Wong. ECCV 2018 [BibTeX] [Paper] [Project] [Code]

Scene Text Recognition

	SAFE: Scale Aware Feature Encoder for Scene Text Recognition. Wei Liu, Chaofeng Chen, and Kwan-Yee K. Wong. ACCV 2018 [BibTeX] [Paper]
	Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition. Wei Liu, Chaofeng Chen, and Kwan-Yee K. Wong. AAAI 2018 [BibTeX] [Paper]
	STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition. Wei Liu, Chaofeng Chen, Kwan-Yee K. Wong, Zhizhong Su, and Junyu Han. BMVC 2016 [BibTeX] [Paper]

Transparent Object Reonstruction

	Dense Reconstruction of Transparent Objects by Altering Incident Light Paths Through Refraction. Kai Han, Kwan-Yee K. Wong, and Miaomiao Liu. IJCV 2018 [BibTeX] [Paper]
	A Fixed Viewpoint Approach for Dense Reconstruction of Transparent Objects. Kai Han, Kwan-Yee K. Wong, and Miaomiao Liu. CVPR 2015 [BibTeX] [Paper]
	Depth from Refraction Using a Transparent Medium with Unknown Pose and Refractive Index. Zhihu Chen, Kwan-Yee K. Wong, Yasuyuki Matsushita, and Xiaolong Zhu. IJCV 2013 [BibTeX] [Paper]
	Self-Calibrating Depth from Refraction. Zhihu Chen, Kwan-Yee K. Wong, Yasuyuki Matsushita, Xiaolong Zhu, and Miaomiao Liu. ICCV 2011 [BibTeX] [Paper]

Mirror Surface Reconstruction

	Fixed Viewpoint Mirror Surface Reconstruction Under an Uncalibrated Camera. Kai Han, Miaomiao Liu, Dirk Schnieders, and Kwan-Yee K. Wong. TIP 2021 [BibTeX] [Paper] [Project] [Code]
	Mirror Surface Reconstruction under an Uncalibrated Camera. Kai Han, Kwan-Yee K. Wong, Dirk Schnieders, and Miaomiao Liu. CVPR 2016 [BibTeX] [Paper]
	Pose Estimation from Reflections for Specular Surface Recovery. Miaomiao Liu, Kwan-Yee K. Wong, Zhenwen Dai, and Zhihu Chen. ICCV 2011 [BibTeX] [Paper]
	Estimating the Unknown Poses of a Reference Plane for Specular Shape Recovery. Miaomiao Liu and Kwan-Yee K. Wong. CPCV 2011 [BibTeX] [Paper]
	Specular Surface Recovery from Reflections of a Planar Pattern Undergoing an Unknown Pure Translation Miaomiao Liu, Kwan-Yee K. Wong, Zhenwen Dai, and Zhihu Chen. ACCV 2010 [BibTeX] [Paper]

Camera Calibration

	A Stratified Approach for Camera Calibration Using Spheres. Kwan-Yee K. Wong, Guoqiang Zhang, and Zhihu Chen. TIP 2011 [BibTeX] [Paper]
	Camera Calibration from Images of Spheres. Hui Zhang, Kwan-Yee K. Wong, and Guoqiang Zhang. TPAMI 2007 [BibTeX] [Paper]
	Camera Calibration with Spheres: Linear Approaches. Hui Zhang, Guoqiang Zhang, and Kwan-Yee K. Wong. ICIP 2005 [BibTeX] [Paper]
	Camera Calibration from Surfaces of Revolution. Kwan-Yee K. Wong, Paulo R. S. Mendonça, and Roberto Cipolla. TPAMI 2003 [BibTeX] [Paper]
	Camera Calibration from Symmetry. Kwan-Yee. K. Wong, Paulo R. S. Mendonça, and Roberto Cipolla. Mathematics of Surfaces 2000 [BibTeX] [Paper]

Last modified: