Friday, Oct. 10, 2025
10:20 – 11:10 a.m. in ETB 1020
Dr. Zhiwen Fan
Electrical & Computer Engineering Assistant Professor
Texas A&M University
Title: “Learning the World in 3D: Real-Time Algorithms and Scalable Data“
Abstract:
3D foundation models are a key step toward spatial and physical intelligence, yet their impact is often constrained by heavy computation and scarce, well-curated 3D data. In this talk, I will present a family of 3D algorithms that make reconstruction, rendering, and generation practical on real workloads, together with a scalable 3D data engine spanning human avatars, Martian-surface simulation, and cryo-EM microscopy.
First, I will cover: (1) city-scale 3D reconstruction and a real-time 3D perception Transformer that operates from uncalibrated images; (2) efficient neural rendering techniques that achieve 60% speedup over 3D Gaussian Splatting while preserving fidelity; and (3) methods by which explicit 3D understanding augments 2D vision–language models to improve spatial awareness prior to reasoning.
Next, I will introduce systematic remedies for 3D data scarcity: (1) an automated pipeline for multimodal human-avatar and environment annotation; (2) a reliable system for simulating Martian surfaces under limited texture availability; and (3) a learned inverse Fourier-slicing map that enables cryo-EM reconstructions up to 10× faster without sacrificing accuracy.
Taken together, these advances move us toward AI systems that interact with the physical world with genuine spatial understanding and real-time performance.
Biography:
Zhiwen (“Aaron”) Fan is an Assistant Professor in the Department of Electrical and Computer Engineering at Texas A&M University. He received his Ph.D. from The University of Texas at Austin. He was awarded the 2022 Qualcomm Innovation Fellowship and a Best Paper award at the CVPR AI4CC Workshop. He has served as an Area Chair for multiple AI/ML conferences and has completed research internships at Meta, NVIDIA, and Google.
Zhiwen Fan’s homepage: zhiwenfan.github.io