Research

(Click here for my full publication list or here for my Google Scholar profile)

Selected Projects

[Text-to-Image/Video Generation] [3D Human Modeling] [Face Image/Video Processing] [Photometric Stereo] [Face Sketch Synthesis] [Scene Text Recognition] [Transparent Object Reconstruction] [Mirror Surface Reconstruction] [Camera Calibration]

Text-to-Image/Video Generation

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality.
Zhengyao Lv, Chenyang Si, Junhao Song, Zhenyu Yang, Yu Qiao, Ziwei Liu, and Kwan-Yee K. Wong.
ICLR 2025
[BibTeX] [Paper] [Project] [Code]

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities.
Shaozhe Hao, Xuantong Liu, Xianbiao Qi, Shihao Zhao, Bojia Zi, Rong Xiao, Kai Han, and Kwan-Yee K. Wong.
ICLR 2025
[BibTeX] [Paper] [Project] [Code]

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation.
Shihao Zhao, Shaozhe Hao, Bojia Zi, Huaizhe Xu, and Kwan-Yee K. Wong.
ECCV 2024
[BibTeX] [Paper] [Project] [Code]

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction.
Shaozhe Hao, Kai Han, Zhengyao Lv, Shihao Zhao, and Kwan-Yee K. Wong.
ECCV 2024
[BibTeX] [Paper] [Project] [Code]

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis.
Zhengyao Lv, Yuxiang Wei, Wangmeng Zuo, and Kwan-Yee K. Wong.
CVPR 2024
[BibTeX] [Paper] [Project] [Code]

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models.
Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, and Kwan-Yee K. Wong.
NeurIPS 2023
[BibTeX] [Paper] [Project] [Code]

3D Human Modeling

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation.
Yukang Cao, Liang Pan, Kai Han, Kwan-Yee K. Wong, and Ziwei Liu.
ICLR 2025
[BibTeX] [Paper] [Project] [Code]

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models.
Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, and Kwan-Yee K. Wong.
CVPR 2024
[BibTeX] [Paper] [Project] [Code]

HeadSculpt: Crafting 3D Head Avatars with Text.
Xiao Han, Yukang Cao, Kai Han, Xiatian. Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, and Kwan-Yee K. Wong.
NeurIPS 2023
[BibTeX] [Paper] [Project] [Code]

SeSDF: Self-evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction.
Yukang Cao, Kai Han, and Kwan-Yee K. Wong.
CVPR 2023
[BibTeX] [Paper] [Project] [Code]

JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction.
Yukang Cao, Guanying Chen, Kai Han, Wenqi Yang, and Kwan-Yee K. Wong.
CVPR 2022
[BibTeX] [Paper] [Project] [Code]

Face Image/Video Processing

RIGID: Recurrent GAN Inversion and Editing of Real Face Videos and Beyond.
Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, and Ping Luo.
IJCV 2025
[BibTeX] [Paper] [Project] [Code]

Deep Face Video Inpainting via UV Mapping.
Wenqi Yang, Zhenaeng. Chen, Chaofeng Chen, Guanying Chen, and Kwan-Yee K. Wong.
TIP 2023
[BibTeX] [Paper]

RIGID: Recurrent GAN Inversion and Editing of Real Face Videos.
Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, and Ping Luo.
ICCV 2023
[BibTex] [Paper] [Project] [Code]

Progressive semantic-aware style transformation for blind face restoration.
Chaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, and Kwan-Yee K. Wong.
CVPR 2021
[BibTeX] [Paper] [Code]

Learning Spatial Attention for Face Super-Resolution.
Chaofeng Chen, Dihong Gong, Hao Wang, Zhifeng Li, and Kwan-Yee K. Wong.
TIP 2020
[BibTeX] [Paper] [Code]

Photometric Stereo

PS-NeRF: Neural Inverse Rendering for Mulit-view Photometric Stereo.
Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, and Kwan-Yee K. Wong.
ECCV 2022
[BibTeX] [Paper] [Project] [Code]

What is Learned in Deep Uncalibrated Photometric Stereo?
Guanying Chen, Michael Waechter, Boxin Shi, Kwan-Yee K. Wong, and Yasuyuki Matsushita.
ECCV 2020
[BibTeX] [Paper] [Project] [Code]

Deep Photometric Stereo for Non-Lambertian Surfaces.
Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, and Kwan-Yee K. Wong.
TPAMI 2020
[BibTeX] [Paper] [Project] [Code]

Self-calibrating Deep Photometric Stereo Networks.
Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, and Kwan-Yee K. Wong.
CVPR 2019
[BibTeX] [Paper] [Project] [Code]

PS-FCN: A Flexible Learning Framework for Photometric Stereo.
Guanying Chen, Kai Han, and Kwan-Yee K. Wong.
ECCV 2018
[BibTeX] [Paper] [Project] [Code]

Face Sketch Synthesis

Semi-supervised Cycle-GAN for face photo-sketch translation in the wild.
Chaofeng Chen, Wei Liu, Xiao Tan, and Kwan-Yee K. Wong.
CVIU 2023
[BibTeX] [Paper] [Code]

Semi-supervised Learning for Face Sketch Synthesis in the Wild.
Chaofeng Chen, Wei Liu, Xiao Tan, and Kwan-Yee K. Wong.
ACCV 2018
[BibTeX] [Paper] [Code]

Face Sketch Synthesis with Style Transfer using Pyramid Column Feature.
Chaofeng Chen, Xiao Tan, and Kwan-Yee K. Wong.
WACV 2018)
[BibTeX] [Paper] [Code]

Markov Weight Fields for Face Sketch Synthesis.
Hao Zhou, Zhanghui Kuang, and Kwan-Yee K. Wong.
CVPR 2012
[BibTeX] [Paper]

Scene Text Recognition

SAFE: Scale Aware Feature Encoder for Scene Text Recognition.
Wei Liu, Chaofeng Chen, and Kwan-Yee K. Wong.
ACCV 2018
[BibTeX] [Paper]

Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition.
Wei Liu, Chaofeng Chen, and Kwan-Yee K. Wong.
AAAI 2018
[BibTeX] [Paper]

STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition.
Wei Liu, Chaofeng Chen, Kwan-Yee K. Wong, Zhizhong Su, and Junyu Han.
BMVC 2016
[BibTeX] [Paper]

Transparent Object Reonstruction

Dense Reconstruction of Transparent Objects by Altering Incident Light Paths Through Refraction.
Kai Han, Kwan-Yee K. Wong, and Miaomiao Liu.
IJCV 2018
[BibTeX] [Paper]

A Fixed Viewpoint Approach for Dense Reconstruction of Transparent Objects.
Kai Han, Kwan-Yee K. Wong, and Miaomiao Liu.
CVPR 2015
[BibTeX] [Paper]

Depth from Refraction Using a Transparent Medium with Unknown Pose and Refractive Index.
Zhihu Chen, Kwan-Yee K. Wong, Yasuyuki Matsushita, and Xiaolong Zhu.
IJCV 2013
[BibTeX] [Paper]

Self-Calibrating Depth from Refraction.
Zhihu Chen, Kwan-Yee K. Wong, Yasuyuki Matsushita, Xiaolong Zhu, and Miaomiao Liu.
ICCV 2011
[BibTeX] [Paper]

Mirror Surface Reconstruction

Fixed Viewpoint Mirror Surface Reconstruction Under an Uncalibrated Camera.
Kai Han, Miaomiao Liu, Dirk Schnieders, and Kwan-Yee K. Wong.
TIP 2021
[BibTeX] [Paper] [Project] [Code]

Mirror Surface Reconstruction under an Uncalibrated Camera.
Kai Han, Kwan-Yee K. Wong, Dirk Schnieders, and Miaomiao Liu.
CVPR 2016
[BibTeX] [Paper]

Pose Estimation from Reflections for Specular Surface Recovery.
Miaomiao Liu, Kwan-Yee K. Wong, Zhenwen Dai, and Zhihu Chen.
ICCV 2011
[BibTeX] [Paper]

Estimating the Unknown Poses of a Reference Plane for Specular Shape Recovery.
Miaomiao Liu and Kwan-Yee K. Wong.
CPCV 2011
[BibTeX] [Paper]

Specular Surface Recovery from Reflections of a Planar Pattern Undergoing an Unknown Pure Translation
Miaomiao Liu, Kwan-Yee K. Wong, Zhenwen Dai, and Zhihu Chen.
ACCV 2010
[BibTeX] [Paper]

Camera Calibration

A Stratified Approach for Camera Calibration Using Spheres.
Kwan-Yee K. Wong, Guoqiang Zhang, and Zhihu Chen. TIP 2011
[BibTeX] [Paper]

Camera Calibration from Images of Spheres.
Hui Zhang, Kwan-Yee K. Wong, and Guoqiang Zhang.
TPAMI 2007
[BibTeX] [Paper]

Camera Calibration with Spheres: Linear Approaches.
Hui Zhang, Guoqiang Zhang, and Kwan-Yee K. Wong.
ICIP 2005
[BibTeX] [Paper]

Camera Calibration from Surfaces of Revolution.
Kwan-Yee K. Wong, Paulo R. S. Mendonça, and Roberto Cipolla.
TPAMI 2003
[BibTeX] [Paper]

Camera Calibration from Symmetry.
Kwan-Yee. K. Wong, Paulo R. S. Mendonça, and Roberto Cipolla.
Mathematics of Surfaces 2000
[BibTeX] [Paper]

Last modified: