Yinya Huang

I am a Postdoctoral Fellow at the ETH AI Center, ETH Zürich, working with Professor Mrinmaya Sachan and Professor Elliott Ash.

I received my Ph.D. in computer science from Sun Yat-sen University, advised by Professor Xiaodan Liang and Professor Liang Lin. My work has been recognized with the 2024 ACM SIGCSE China Excellent Doctoral Dissertation Award and the 2024 SAAI Outstanding Doctoral Dissertation Award. I am grateful to be supported by the ETH Career Seed Award and the ETH AI Center Fellowship. I have published in TPAMI, ICLR, NeurIPS, ICML, ACL, and other top venues.

My research lies at the intersection of formal reasoning and large language models, building AI systems whose reasoning is rigorous and verifiable. Rooted in formal logic, my work bridges LLMs with proof assistants, and extends verifiable reasoning to scientific domains where verification is hard but increasingly necessary. Methodologically, I combine automated data synthesis, rigorous benchmark design, and reinforcement learning with verifiable rewards. My long-term goal is AI that reasons with the rigor of mathematics and the openness of natural language.


News



Selected Publications


(* equal contribution    corresponding author)
A full list is available on my Google Scholar page.


Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification
Paul He, Yinya Huang, Mrinmaya Sachan, Zhijing Jin
EACL 2026 

FormalRx: Rectify and eXamine Semantic Failures in Autoformalization
Haocheng Wang*, Baiyu Huang*, Yingjia Wan*, Xiao Zhu, Xiaoyang Liu, Yinya Huang†, Zhijiang Guo†
ICML 2026

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Wenlei Shi, Yiwei Wang, Xiaodan Liang, Jing Tang
ICML 2026

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Dongchun Xie, Yiwei Wang, Xiaodan Liang, Jing Tang
ICML 2026

CauSciBench: Evaluating LLM Causal Inference for Scientific Research
Sawal Acharya*, Terry Jingchen Zhang*, Andrew Kim, Anahita Haghighat, Xianlin Sun, Pepijn Cobben, Rahul Babu Shrestha, Maximilian Mordig, Jacob T. Emmerson, Furkan Danisman, Yuen Chen, Clijo Jose, Andrei Ioan Muresanu, Justin Cui, Jiarui Liu, Yahang Qi, Punya Syon Pandey, Yinya Huang, Bernhard Schölkopf, Zhijing Jin
ICML 2026

Test of Time: Rethinking Temporal Signal of Benchmark Contamination
Terry Jingchen Zhang*, Gopal Dev*, Ning Wang, Max Obreiter, Wenyuan Jiang, Punya Syon Pandey, Keenan Samway, Yinya Huang, Bernhard Schölkopf, Mrinmaya Sachan, Zhijing Jin
ACL 2026

LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Yu Fan*, Jingwei Ni*, Jakob Merane*, Yang Tian*, Yoan Hermstrüwer, Yinya Huang, Mubashara Akhtar, Etienne Salimbeni, Florian Geering, Oliver Dreyer, Daniel Brunner, Markus Leippold, Mrinmaya Sachan, Alexander Stremitzer, Christoph Engel, Elliott Ash, Joel Niklaus
ICLR 2026 

AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning
Kun Xiang*, Zhili Liu*, Terry Jingchen Zhang, Yinya Huang, Yunshuang Nie, Kaixin Cai, Yiyang Yin, Runhui Huang, Hanhui Li, Yihan Zeng, Yu-Jie Yuan, Jianhua Han, Lanqing Hong, Hang Xu, Xiaodan Liang
IEEE TPAMI 2026

Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation
Sichun Luo, Yuxuan Yao, Bowei He, Wei Shao, Jian Xu, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Hanxu Hou, Mingjie Zhan, Linqi Song
IEEE JSTSP 2026 

SeePhys: Does Seeing Help Thinking? – Benchmarking Vision-Based Physics Reasoning
Kun Xiang*, Heng Li*, Terry Jingchen Zhang*, Yinya Huang*, Zirong Liu, Peixin Qu, Jixi He, Jiaqi Chen, Yu-Jie Yuan, Jianhua Han, Hang Xu, Hanhui Li, Mrinmaya Sachan, Xiaodan Liang
NeurIPS 2025 (Datasets & Benchmarks Track)  [Code]  [Data]  [Project] 

ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research
Zhiyuan Wang, Bokui Chen, Yinya Huang, Qingxing Cao, Ming He, Jianping Fan, Xiaodan Liang
ACL 2025 (Industry Track) 

FormalAlign: Automated Alignment Evaluation for Autoformalization
Jianqiao Lu*, Yingjia Wan*, Yinya Huang, Jing Xiong, Zhengying Liu, Zhijiang Guo
ICLR 2025  [Code] 

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling
Zhicheng Yang, Yiwei Wang, Yinya Huang, Zhijiang Guo, Wei Shi, Xiongwei Han, Liang Feng, Linqi Song, Xiaodan Liang, Jing Tang
ICLR 2025  [Code] 

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Xiaohan Lin*, Qingxing Cao*, Yinya Huang*, Haiming Wang*, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang
NeurIPS 2024 (Datasets & Benchmarks Track)  [Code] 

Proving Theorems Recursively
Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang
NeurIPS 2024  [Code] 

CLOMO: Counterfactual Logical Modification with Large Language Models
Yinya Huang*, Ruixin Hong*, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song
ACL 2024  [Project]  [Code] 

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang
ICLR 2024  [Code]  [Slides] 

ATG: Benchmarking Automated Theorem Generation for Generative Language Models
Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang
NAACL 2024 (Findings) 

LEGO-Prover: Neural Theorem Proving with Growing Libraries
Haiming Wang*, Huajian Xin*, Chuanyang Zheng, Lin Li, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang
ICLR 2024  [Code] 

AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang
EMNLP 2024 (Findings)  [Code] 

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation
Sichun Luo, Bowei He, Haohan Zhao, Yinya Huang, Aojun Zhou, Zongpeng Li, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
ACM TOIS 2024 

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu
EMNLP 2023  [Code] 

Discourse-Aware Graph Networks for Textual Logical Reasoning
Yinya Huang, Lemao Liu, Kun Xu, Meng Fang, Liang Lin, Xiaodan Liang
IEEE TPAMI 2023  [Code] 

MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu
EMNLP 2022  [Code] 

DAGN: Discourse-Aware Graph Network for Logical Reasoning
Yinya Huang, Meng Fang, Yu Cao, Liwei Wang, Xiaodan Liang
NAACL 2021  [Code] 

REM-Net: Recursive Erasure Memory Network for Commonsense Evidence Refinement
Yinya Huang, Meng Fang, Xunlin Zhan, Qingxing Cao, Xiaodan Liang, Liang Lin
AAAI 2021  [Code] 

PathReasoner: Explainable Reasoning Paths for Commonsense Question Answering
Xunlin Zhan*, Yinya Huang*, Xiao Dong, Qingxing Cao, Xiaodan Liang
KBS 2021 

SeePhys Pro: Diagnosing Modality Transfer and Blind-Training Effects in Multimodal RLVR for Physics Reasoning
Kun Xiang, Terry Jingchen Zhang, Zirong Liu, Bokai Zhou, Yueling Tang, Junjie Yu, Jiacong Lu, Shangrui Huang, Heng Li, Likui Zhang, Kunkun Liu, Changzheng Zhang, Yangle Fang, Boqiang Guo, Hui-Ling Zhen, Dandan Tu, Yinya Huang, Xiaodan Liang
arXiv:2605.09266 

Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang*, Terry Jingchen Zhang*, Yinya Huang*, Jixi He, Zirong Liu, Yueling Tang, Ruizhe Zhou, Lijing Luo, Youpeng Wen, Xiuwei Chen, Bingqian Lin, Jianhua Han, Hang Xu, Hanhui Li, Bin Dong, Xiaodan Liang
arXiv:2510.04978 

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning
Hanxu Hu, Yuxuan Wang, Maggie Huan, Jannis Vamvas, Yinya Huang, Zhijiang Guo, Rico Sennrich
arXiv:2603.11193 

Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Yiwei Wang, Xiaodan Liang, Jing Tang
arXiv:2509.23152 

TreeRPO: Tree Relative Policy Optimization
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Xiaodan Liang, Yiwei Wang, Jing Tang
arXiv:2506.05183 

Process-Driven Autoformalization in Lean 4
Jianqiao Lu, Zhengying Liu, Yingjia Wan, Yinya Huang, Haiming Wang, Zhicheng Yang, Jing Tang, Zhijiang Guo
arXiv:2406.01940  [Code] 


Teaching



Mentorship & Supervision


I am fortunate to work with a group of talented students.

ETH Zürich (Primary Supervisor)

Informal Mentorship


Professional Service


Conferences

Workshops

Program Committee Member

Journal Reviewer


Selected Awards


Grants

Fellowships

Honors & Awards


Contact


Andreasstrasse 5, 8092 Zürich, Switzerland
yinya [dot] el [dot] huang [at] gmail [dot] com


© Yinya Huang 2026