I am currently an MPhil student at HKUST(GZ), where I am advised by Prof. Xuming Hu. Concurrently, I am fortunate to conduct research on AI Governance under the guidance of Prof. Mingxun Zhou at HKUST. Previously, I obtained my B.S. in Data Science from Tongji University, under the supervision of Prof. Zhihua Wei.
My research interests lie at the intersection of Multimodal Learning and Trustworthy AI, with a specific focus on generative models, model watermarking, and interpretability. Beyond research, I am deeply passionate about mathematics, programming, and data visualization. Previously I also have the privilege of collaborating with distinguished researchers, including Yibo Yan, Kaichen Huang, and Na Min An.
π’ I am actively seeking academic collaborations! If you are interested in my research, please feel free to reach out at jiahaohuotj@gmail.com.
π₯ News
2026
- 2026.02: π Code of CausalEmbed released.
- 2026.02: π Arriving in Chicago as visting student, advised by Prof. Philip S. Yu.
- 2026.01: π One paper accepted by ICLRβ26!
- 2026.01: π One paper submitted to ICMLβ26.
2025
- 2025.09: π Code of PMark released.
- 2025.09: π One paper submitted to ICLRβ26.
- 2025.07: π€ Glad to contribute to MemOS (GitHub 4.6k Stars)!
- 2025.06: π Code of MMUnlearner released.
- 2025.05: π Two papers accepted by ACLβ25 Findings; One paper accepted by ACLβ25 (Industry) Oral!
- 2025.02: π One paper submitted to ACLβ25.
2024
- 2024.09: π Code of MMNeuron released.
- 2024.09: π One paper accepted by EMNLPβ24!
- 2024.06: π One paper submitted to EMNLPβ24.
π Publications

CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding
Jiahao Huo, Yu Huang, Yibo Yan, Ye Pan, Yi Cao, Mingdong Ou, Philip S. Yu, Xuming Hu
- Auto-regressive embedding generation for multi-vector visual document retrieval.
- 30-155x reduction in token count and test-time sclaing for retrieval performance.

PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints
Jiahao Huo, Shuliang Liu, Bin Wang, Junyan Zhang, Yibo Yan, Aiwei Liu, Xuming Hu, Mingxun Zhou
- Novel and unified theoretical framework on semantic-level watermarking.
- Distortion-free and robust semantic-level watermarking for LLMs.

Jiahao Huo, Yibo Yan, Xu Zheng, Yuanhuiyi Lyu, Xin Zou, Zhihua Wei, Xuming Hu
- Reformulate the task of multimodal MU in the era of MLLMs.
- Aims to erase only the visual patterns associated with a given entity while preserving the corresponding textual knowledge encoded within the original parameters of the language model backbone.
- Develop a novel geometry-constrained gradient ascent method MMUnlearner.

Yibo Yan, Shen Wang, Jiahao Huo, Philip S. Yu, Xuming Hu, Qingsong Wenpng
- Decomposes error detection into three phases
- Each phase handled by a specialized agent: an image-text consistency validator, a visual semantic interpreter, and an integrative error analyzer.

MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu
- Investigating the distribution of domain-specific neurons and the mechanism of how MLLMs process features from diverse domains.
π Educations
- 2025.10 - present, MPhil Student, HKUST(GZ).
- 2024.10 - 2025.03, Exchange Student, Technical University of Munich.
- 2021.09 - 2025.07, Undergraduate Student, Tongji University.
π» Internships
| University of Illinois Chicago | Chicago, USA | Β |
| Visiting Student | 2026.02 - Present |
Supervisor: Prof. Philip S. Yu |
| Alibaba Cloud Computing | Hangzhou, China | Β |
| Remote Intern | 2026.02 - Present |
Mentor: Dr. Mingdong Ou |
| Alibaba Group | Hangzhou, China | Β |
| Research Intern | 2025.06 - 2025.09 |
Mentor: Dr. Chengfei Lv |
| MemTensor | Shanghai, China | Β |
| Visiting Student | 2025.05 - 2025.07 |
Mentor: Dr. Zhiyu Li |
π Services
- Conference Reviewer: ICLR 2025, ACL 2025/2026, ICML 2026, SIGIR 2026