I received my Ph.D. in computer science from Sun Yat-sen University,
advised by Professor Xiaodan Liang and Professor Liang Lin.
My work has been recognized with the 2024 ACM SIGCSE China Excellent Doctoral Dissertation Award
and the 2024 SAAI Outstanding Doctoral Dissertation Award.
I am grateful to be supported by the ETH Career Seed Award and the ETH AI Center Fellowship.
I have published in TPAMI, ICLR, NeurIPS, ICML, ACL, and other top venues.
My research lies at the intersection of formal reasoning and large language models,
building AI systems whose reasoning is rigorous and verifiable.
Rooted in formal logic, my work bridges LLMs with proof assistants, and extends verifiable
reasoning to scientific domains where verification is hard but increasingly necessary.
Methodologically, I combine automated data synthesis, rigorous benchmark design, and reinforcement learning with verifiable rewards.
My long-term goal is AI that reasons with the rigor of mathematics and the openness of natural language.
Formal Reasoning with LLMs:
bridging LLMs with proof assistants for automatic theorem proving, autoformalization, and formal-informal alignment.
[MUSTARD][FVEL][FormalRx][FormalAlign]
Reliable Reasoning:
studying the logical structure, step-wise verification, and counterfactual robustness that underpin trustworthy machine reasoning.
[DoVerifier][CLOMO][MetaLogic][DAGN]
CauSciBench: Evaluating LLM Causal Inference for Scientific Research
Sawal Acharya*, Terry Jingchen Zhang*, Andrew Kim, Anahita Haghighat, Xianlin Sun, Pepijn Cobben, Rahul Babu Shrestha, Maximilian Mordig, Jacob T. Emmerson, Furkan Danisman, Yuen Chen, Clijo Jose, Andrei Ioan Muresanu, Justin Cui, Jiarui Liu, Yahang Qi, Punya Syon Pandey, Yinya Huang, Bernhard Schölkopf, Zhijing Jin
ICML 2026
Test of Time: Rethinking Temporal Signal of Benchmark Contamination
Terry Jingchen Zhang*, Gopal Dev*, Ning Wang, Max Obreiter, Wenyuan Jiang, Punya Syon Pandey, Keenan Samway, Yinya Huang, Bernhard Schölkopf, Mrinmaya Sachan, Zhijing Jin
ACL 2026
LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Yu Fan*, Jingwei Ni*, Jakob Merane*, Yang Tian*, Yoan Hermstrüwer, Yinya Huang, Mubashara Akhtar, Etienne Salimbeni, Florian Geering, Oliver Dreyer, Daniel Brunner, Markus Leippold, Mrinmaya Sachan, Alexander Stremitzer, Christoph Engel, Elliott Ash, Joel Niklaus
ICLR 2026
AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning
Kun Xiang*, Zhili Liu*, Terry Jingchen Zhang, Yinya Huang, Yunshuang Nie, Kaixin Cai, Yiyang Yin, Runhui Huang, Hanhui Li, Yihan Zeng, Yu-Jie Yuan, Jianhua Han, Lanqing Hong, Hang Xu, Xiaodan Liang
IEEE TPAMI 2026
Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation
Sichun Luo, Yuxuan Yao, Bowei He, Wei Shao, Jian Xu, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Hanxu Hou, Mingjie Zhan, Linqi Song
IEEE JSTSP 2026
SeePhys: Does Seeing Help Thinking? – Benchmarking Vision-Based Physics Reasoning
Kun Xiang*, Heng Li*, Terry Jingchen Zhang*, Yinya Huang*, Zirong Liu, Peixin Qu, Jixi He, Jiaqi Chen, Yu-Jie Yuan, Jianhua Han, Hang Xu, Hanhui Li, Mrinmaya Sachan, Xiaodan Liang
NeurIPS 2025 (Datasets & Benchmarks Track) [Code] [Data] [Project]
ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research
Zhiyuan Wang, Bokui Chen, Yinya Huang, Qingxing Cao, Ming He, Jianping Fan, Xiaodan Liang
ACL 2025 (Industry Track)