PhD candidate, Statistics · The University of Texas at Austin · graduating May 2026. Research areas: deep generative modeling, diffusion models, multimodal learning, trustworthy AI.
Education
Advised by Prof. Mingyuan Zhou, Department of Information, Risk, and Operations Management (IROM), McCombs School of Business.
Selected publications
A teacher–student score distillation that removes target classes or concepts from diffusion models without any real data — and as a side effect, gets up to 1000× sampling speedup.
Trajectory-level dense reward signals for aligning text-to-image diffusion to human preference.
A non-Gaussian diffusion model on the simplex via Beta distributions, well-suited for bounded data.
A binomial / Poisson hierarchical VAE for sparse, skewed, heavy-tailed, and heterogeneous data.
An information-theoretic surrogate for DkNN classification with matching attack and defense, achieving SOTA adversarial results.
An immune-system-inspired adversarial learning framework that defends against unseen attacks.
A general biomimetic evolutionary algorithm for robustifying deep neural networks against adversarial attacks.
Industry experience
Designed an end-to-end data curation and preprocessing pipeline for video super-resolution (VSR), including a temporally consistent simulator of real-world video degradation. Evaluated existing VSR baselines and developed a diffusion model based on CogVideoX 1.5.
Designed an automatic short-form video localization pipeline that adapts textual and graphical elements per locale using SOTA detection, OCR, inpainting, and segmentation models. Built a complete landscape-to-portrait conversion workflow with KMeans + Gaussian-process smoothing for stable subject tracking.
Identified four major limitations of existing visual in-context learning methods and proposed iPromptDiff — an SD-based architecture that decouples content vs. task and routes visual perception through text embeddings, beating baselines in-domain and OOD even when text prompts are missing.
Academic research
Studied non-Gaussian iterative corruption / recovery from a Bregman-divergence perspective. Proposed binomial / Poisson hierarchical VAE frameworks for sparse, skewed, heavy-tailed data. Built and open-sourced a PyTorch codebase covering DDPM, DDIM, and classifier-free guidance — 230★+ on GitHub.
Introduced an information-theoretic surrogate loss (ASK) for DkNN classification with matching attack and defense algorithms achieving SOTA results. Co-developed an immune-inspired adversarial framework (RAILS) capable of defending against unseen attacks.
Awards & service
Fellowships & awards
- University Graduate Continuing Fellowship 2025
- McCombs Dean's Fellowship 2022 – 2024
- NeurIPS Scholar Award 2023
- UT Professional Development Award 2023
- Graduate School Recruitment Fellowship 2021
- Fudan Excellent Freshman Scholarship — Top 1% 2015
Service & teaching
- Reviewer · ICLR '24 – '26
- Reviewer · NeurIPS '23 – '25
- Reviewer · ICML '23 – '25
- Reviewer · AISTATS '21, '26
- Teaching Assistant · UT Austin '21 – '22, '24
- Graduate Student Instructor · U-M '20 – '21
Toolbox
Programming & ML
- Python
- R
- PyTorch
- JAX
- TensorFlow
- NumPy
- SciPy
- scikit-learn
- CUDA
Generative modeling
- Diffusion models
- VAEs
- RLHF / DPO
- Flow matching
- CogVideoX
- Stable Diffusion
- Score distillation
Languages
- English
- Chinese (native)
- Japanese (intermediate)