Zehuan Huang (黄泽桓) 

Final-Year Master's Student @ Beihang University


Email: huangzehuan@buaa.edu.cn
[GitHub] [Google Scholar] [Twitter] [CV]

Biography

I am currently a master's student in the School of Software at Beihang University, supervised by Prof. Lu Sheng.

My prior research focused on applying deep generative models to 3D asset creation, encompassing the generation of 3D objects, scenes, textures, and animations. My current research interests lie in world models and simulation, including (i) generalizable 3D geometry foundation models, (ii) interactive digital world creation, and (iii) physical property simulation.

I am grateful to all my collaborators and mentors along the way. I first started doing research under the guidance of Prof. Miao Wang, and then began working on deep learning projects under the supervision of Prof. Lu Sheng. I have also interned at MiniMax, Shanghai AI Lab, and VAST, in that order, and I'm fortunate to have worked closely with Junting Dong, Yuan-Chen Guo, and Yanpei Cao.

I am actively seeking PhD opportunities for the Spring or Fall 2026 intake.

News

Selected Preprints

MV-Adapter: Multi-view Consistent Image Generation Made Easy
Zehuan Huang, Yuan-Chen Guo, Haoran Wang, Ran Yi, Lizhuang Ma, Yan-Pei Cao, Lu Sheng
Under Review
TL;DR: Versatile multi-view generation with various base models and conditions, and high-quality 3D texture generation.

Selected Publications

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
Zehuan Huang, Yuan-Chen Guo, Xingqiao An, Yunhan Yang, Yangguang Li, Zi-Xin Zou, Ding Liang, Xihui Liu, Yan-Pei Cao, Lu Sheng
CVPR 2025
TL;DR: MIDI-3D extends image-to-3D object generation models to multi-instance diffusion models for compositional 3D scene generation.
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Hao Wen*, Zehuan Huang*, Yaohui Wang, Xinyuan Chen, Yu Qiao, Lu Sheng
CVPR 2025
TL;DR: Transforms the two-stage image-to-3D pipeline into a unified recursive diffusion process, thereby reducing the data bias of each stage and improving the quality of the generated 3D results.
TELA: Text to Layer-wise 3D Clothed Human Generation
Junting Dong, Qi Fang, Zehuan Huang, Xudong Xu, Jingbo Wang, Sida Peng, Bo Dai
ECCV 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
Zehuan Huang*, Hao Wen*, Junting Dong*, Yaohui Wang, Yangguang Li, Xinyuan Chen, Yan-Pei Cao, Ding Liang, Yu Qiao, Bo Dai, Lu Sheng
CVPR 2024

Honors & Awards

Education

Industrial Experience

Services

Reviewer

ICLR, CVPR, ICCV, ACM MM, TCSVT

Contributor

huggingface/diffusers, the most widely used library for diffusion models.
threestudio, a popular repo for 3D generation.

In-School

Fall 2023 ~ Spring 2025, part-time technology counselor in the School of Software, Beihang University
Spring 2024, TA for Image Processing and Computer Vision, taught by Prof. Lu Sheng
© 2025 Zehuan Huang