Yinya Huang

I am a Postdoctoral Fellow at the ETH AI Center, ETH Zürich, working with Professor Mrinmaya Sachan and Professor Elliott Ash.

I received my Ph.D. in computer science from Sun Yat-sen University, advised by Professor Xiaodan Liang and Professor Liang Lin. My work has been recognized with the ACM SIGCSE China Excellent Doctoral Dissertation Award and the SAAI Outstanding Doctoral Dissertation Award. I am grateful to be supported by the ETH Career Seed Award and the ETH AI Center Fellowship. I have published in TPAMI, ICLR, NeurIPS, ICML, ACL, and other top venues.

My research lies at the intersection of formal reasoning and large language models, building AI systems whose reasoning is rigorous and verifiable. Rooted in formal logic, my work bridges LLMs with proof assistants, and extends verifiable reasoning to scientific domains where verification is hard but increasingly necessary. Methodologically, I combine automated data synthesis, rigorous benchmark design, reinforcement learning with verifiable rewards, and formal-informal alignment. My long-term goal is AI that reasons with the rigor of mathematics and the openness of natural language.

Formal Reasoning with LLMs: bridging LLMs with proof assistants for automatic theorem proving, autoformalization, and formal-informal alignment. [MUSTARD] [FVEL] [FormalRx] [FormalAlign]
Reliable Reasoning: studying the logical structure, step-wise verification, and counterfactual robustness that underpin trustworthy machine reasoning. [DoVerifier] [CLOMO] [MetaLogic] [DAGN]
AI for Science and Verification: extending verifiable reasoning to physics, causal reasoning, and other scientific fields. [SeePhys] [SeePhys Pro] [OptiBench] [ORMind] [CauSciBench]

News

[2026-06] Our AI for physics survey is accepted to TPAMI!
[2026-05] Excited to have four papers accepted to ICML 2026!
[2026-03] Glad to be invited to serve as a Senior Area Chair for EMNLP 2026!
[2026-03] Glad to be invited to serve as an Area Chair for NeurIPS 2026!
[2026-01] One paper is accepted to ICLR 2026!
[2026-01] One paper on formalized verification for causal reasoning is accepted to EACL 2026 (main conference)!
[2026-01] One co-authored paper on multimodal math step reasoning is accepted to TPAMI!
[2025-12] Glad to be invited to serve as a Senior Area Chair for ACL 2026!
[2025-12] I am honored to receive the ETH Zürich Career Seed Award!
[2025-09] Glad to be invited to serve as an Area Chair for ICLR 2026!
[2025-09] The full-spectrum vision-based physics reasoning benchmark SeePhys is accepted to NeurIPS 2025 D&B Track!
[2025-05] One paper on multi-agent systems for operations problem solving is accepted to ACL 2025 Industry Track!
[2025-04] Glad to be invited to serve as a Senior Area Chair for EMNLP 2025!
[2025-03] I am co-organizing the 2nd AI for Math Workshop at ICML 2025!
[2025-03] Our workshop proposal for AI for Math is accepted to ICML 2025!
[2025-03] Happy to receive the ETH AI Center Postdoctoral Fellowship!
[2025-01] Two papers are accepted to ICLR 2025!
[2024-09] Two papers are accepted to NeurIPS 2024 (one Main Track, one D&B Track).
[2024-09] One paper on LLM math reasoning is accepted to EMNLP 2024 Findings.
[2024-07] Thanks mlcontests.com for the coverage of our 2024 ICML AI for Math Workshop: Challenge tracks, morning session, and afternoon session!

[2024-06] Three papers on multi-level theorem proving, autoformalization, and complex reasoning are available at Preprint.
[2024-05] One paper on LLM counterfactual reasoning is accepted to ACL 2024.
[2024-05] Honored to deliver an invited talk on LLM formal reasoning at UiT Machine Learning Group.
[2024-04] Excited to receive Outstanding Doctoral Dissertation Award from SAAI (3 awardees per year)!
[2024-03] I am co-organizing the AI for Math Workshop and Challenges at ICML 2024.
[2024-03] Our workshop proposal for AI for Math is accepted to ICML 2024.
[2024-03] One paper on formal math reasoning is accepted to NAACL 2024 Findings.
[2024-01] Two papers (one spotlight, one oral) on formal math reasoning are accepted to ICLR 2024.
[2023-10] One paper on formal math reasoning is accepted to EMNLP 2023.
[2023-06] Excited to receive Honors Graduate from Sun Yat-sen University!
[2023-05] I successfully defended my Ph.D. dissertation!
[2023-05] One first-authored paper is accepted to IEEE TPAMI.

Selected Publications

(* equal contribution † corresponding author)
A full list is available on my Google Scholar page.

Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification
Paul He, Yinya Huang, Mrinmaya Sachan, Zhijing Jin
EACL 2026

FormalRx: Rectify and eXamine Semantic Failures in Autoformalization
Haocheng Wang*, Baiyu Huang*, Yingjia Wan*, Xiao Zhu, Xiaoyang Liu, Yinya Huang†, Zhijiang Guo†
ICML 2026

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Wenlei Shi, Yiwei Wang, Xiaodan Liang, Jing Tang
ICML 2026

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Dongchun Xie, Yiwei Wang, Xiaodan Liang, Jing Tang
ICML 2026

CauSciBench: Evaluating LLM Causal Inference for Scientific Research
Sawal Acharya*, Terry Jingchen Zhang*, Andrew Kim, Anahita Haghighat, Xianlin Sun, Pepijn Cobben, Rahul Babu Shrestha, Maximilian Mordig, Jacob T. Emmerson, Furkan Danisman, Yuen Chen, Clijo Jose, Andrei Ioan Muresanu, Justin Cui, Jiarui Liu, Yahang Qi, Punya Syon Pandey, Yinya Huang, Bernhard Schölkopf, Zhijing Jin
ICML 2026

Test of Time: Rethinking Temporal Signal of Benchmark Contamination
Terry Jingchen Zhang*, Gopal Dev*, Ning Wang, Max Obreiter, Wenyuan Jiang, Punya Syon Pandey, Keenan Samway, Yinya Huang, Bernhard Schölkopf, Mrinmaya Sachan, Zhijing Jin
ACL 2026

LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Yu Fan*, Jingwei Ni*, Jakob Merane*, Yang Tian*, Yoan Hermstrüwer, Yinya Huang, Mubashara Akhtar, Etienne Salimbeni, Florian Geering, Oliver Dreyer, Daniel Brunner, Markus Leippold, Mrinmaya Sachan, Alexander Stremitzer, Christoph Engel, Elliott Ash, Joel Niklaus
ICLR 2026

AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning
Kun Xiang*, Zhili Liu*, Terry Jingchen Zhang, Yinya Huang, Yunshuang Nie, Kaixin Cai, Yiyang Yin, Runhui Huang, Hanhui Li, Yihan Zeng, Yu-Jie Yuan, Jianhua Han, Lanqing Hong, Hang Xu, Xiaodan Liang
IEEE TPAMI 2026

Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation
Sichun Luo, Yuxuan Yao, Bowei He, Wei Shao, Jian Xu, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Hanxu Hou, Mingjie Zhan, Linqi Song
IEEE JSTSP 2026

Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang*, Terry Jingchen Zhang*, Yinya Huang*, Jixi He, Zirong Liu, Yueling Tang, Ruizhe Zhou, Lijing Luo, Youpeng Wen, Xiuwei Chen, Bingqian Lin, Jianhua Han, Hang Xu, Hanhui Li, Bin Dong, Xiaodan Liang
IEEE TPAMI 2026 [Project]

SeePhys: Does Seeing Help Thinking? – Benchmarking Vision-Based Physics Reasoning
Kun Xiang*, Heng Li*, Terry Jingchen Zhang*, Yinya Huang*, Zirong Liu, Peixin Qu, Jixi He, Jiaqi Chen, Yu-Jie Yuan, Jianhua Han, Hang Xu, Hanhui Li, Mrinmaya Sachan, Xiaodan Liang
NeurIPS 2025 (Datasets & Benchmarks Track) [Code] [Data] [Project]

ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research
Zhiyuan Wang, Bokui Chen, Yinya Huang, Qingxing Cao, Ming He, Jianping Fan, Xiaodan Liang
ACL 2025 (Industry Track)

FormalAlign: Automated Alignment Evaluation for Autoformalization
Jianqiao Lu*, Yingjia Wan*, Yinya Huang, Jing Xiong, Zhengying Liu, Zhijiang Guo
ICLR 2025 [Code]

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling
Zhicheng Yang, Yiwei Wang, Yinya Huang, Zhijiang Guo, Wei Shi, Xiongwei Han, Liang Feng, Linqi Song, Xiaodan Liang, Jing Tang
ICLR 2025 [Code]

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
Xiaohan Lin*, Qingxing Cao*, Yinya Huang*, Haiming Wang*, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang
NeurIPS 2024 (Datasets & Benchmarks Track) [Code]

Proving Theorems Recursively
Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang
NeurIPS 2024 [Code]

CLOMO: Counterfactual Logical Modification with Large Language Models
Yinya Huang*, Ruixin Hong*, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song
ACL 2024 [Project] [Code]

MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang
ICLR 2024 [Code] [Slides]

ATG: Benchmarking Automated Theorem Generation for Generative Language Models
Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang
NAACL 2024 (Findings)

LEGO-Prover: Neural Theorem Proving with Growing Libraries
Haiming Wang*, Huajian Xin*, Chuanyang Zheng, Lin Li, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang
ICLR 2024 [Code]

AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang
EMNLP 2024 (Findings) [Code]

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation
Sichun Luo, Bowei He, Haohan Zhao, Yinya Huang, Aojun Zhou, Zongpeng Li, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
ACM TOIS 2024

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu
EMNLP 2023 [Code]

Discourse-Aware Graph Networks for Textual Logical Reasoning
Yinya Huang, Lemao Liu, Kun Xu, Meng Fang, Liang Lin, Xiaodan Liang
IEEE TPAMI 2023 [Code]

MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu
EMNLP 2022 [Code]

DAGN: Discourse-Aware Graph Network for Logical Reasoning
Yinya Huang, Meng Fang, Yu Cao, Liwei Wang, Xiaodan Liang
NAACL 2021 [Code]

REM-Net: Recursive Erasure Memory Network for Commonsense Evidence Refinement
Yinya Huang, Meng Fang, Xunlin Zhan, Qingxing Cao, Xiaodan Liang, Liang Lin
AAAI 2021 [Code]

PathReasoner: Explainable Reasoning Paths for Commonsense Question Answering
Xunlin Zhan*, Yinya Huang*, Xiao Dong, Qingxing Cao, Xiaodan Liang
KBS 2021

SeePhys Pro: Diagnosing Modality Transfer and Blind-Training Effects in Multimodal RLVR for Physics Reasoning
Kun Xiang, Terry Jingchen Zhang, Zirong Liu, Bokai Zhou, Yueling Tang, Junjie Yu, Jiacong Lu, Shangrui Huang, Heng Li, Likui Zhang, Kunkun Liu, Changzheng Zhang, Yangle Fang, Boqiang Guo, Hui-Ling Zhen, Dandan Tu, Yinya Huang, Xiaodan Liang
arXiv:2605.09266

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning
Hanxu Hu, Yuxuan Wang, Maggie Huan, Jannis Vamvas, Yinya Huang, Zhijiang Guo, Rico Sennrich
arXiv:2603.11193

Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Yiwei Wang, Xiaodan Liang, Jing Tang
arXiv:2509.23152

TreeRPO: Tree Relative Policy Optimization
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Xiaodan Liang, Yiwei Wang, Jing Tang
arXiv:2506.05183

Process-Driven Autoformalization in Lean 4
Jianqiao Lu, Zhengying Liu, Yingjia Wan, Yinya Huang, Haiming Wang, Zhicheng Yang, Jing Tang, Zhijiang Guo
arXiv:2406.01940 [Code]

Invited Talks

Toward Bridging the Informal-Formal Gap [Slides]
3rd AI for Math Workshop, ICML 2026, Jul 2026
Does Seeing Help Thinking? Physics Reasoning with Vision Language Model [Slides]
ETH Cosmology Research Group, Physics AI Seminar, Mar 2026 · Host: Prof. Dr. Alexandre Refregier

Teaching

Lecturer, ETH Zürich, Spring 2026
Course: AI Center Projects in Machine Learning Research (FS 2026)
Lecturer, University of Zürich, Spring 2026
Course: Introduction to Artificial Intelligence (FS 2026)
Teaching Assistant, Sun Yat-sen University, Fall 2019
Course: Introduction to Deep Learning
Teaching Assistant, Sun Yat-sen University, Spring 2018
Course: Deep Learning in Practice

Mentorship & Supervision

I am fortunate to work with a group of talented students.

ETH Zürich (Primary Supervisor)

Zixuan Chen (ETH Zürich) — M.Sc. thesis student
Ning Wang (ETH Zürich) — M.Sc. thesis student
Rongchuan Liu (ETH Zürich) — M.Sc. thesis student
Terry Jingchen Zhang (ETH Zürich)
Clijo Jose (École Polytechnique & ETH Zürich Summer Fellowship)

Informal Mentorship

Paul He (UToronto → NTU CS Ph.D.)
Zhicheng Yang (HKUST-GZ CS Ph.D.)

Professional Service

Conferences

Senior Area Chair: ACL 2026, EMNLP 2026, EMNLP 2025
Area Chair (2026): NeurIPS, ICLR
Area Chair (2025): ACL ARR October, ACL ARR May, ACL ARR February

Workshops

Primary Organizer & Program Chair, 2nd AI for Math Workshop and Challenge at ICML 2025
Primary Organizer & Program Chair, AI for Math Workshop and Challenge at ICML 2024

Program Committee Member

2026: ICML
2025: ICML, ICLR, NeurIPS
2024: ICLR, NeurIPS, ACL, EMNLP, NAACL, COLM, ACM MM, IJCAI
2023: NeurIPS, ACL, EMNLP
2022 and before: EMNLP (2022), COLING (2020), AAAI (2020)

Journal Reviewer

Selected Awards

Grants

ETH Career Seed Award (CHF 30,000), ETH Zürich, 2025 — PI
CSCS SwissAI Compute Grant (50,000 GPU-hours), Swiss National Supercomputing Centre, 2025 — PI
CSCS SwissAI Compute Grant (25,000 GPU-hours), Swiss National Supercomputing Centre, 2025 — PI

Fellowships

ETH AI Center Postdoctoral Fellowship, ETH AI Center, ETH Zürich, 2025

Honors & Awards

ACM SIGCSE China Excellent Doctoral Dissertation Award, ACM SIGCSE China, 2024 (3 awardees/year)
Outstanding Doctoral Dissertation Award, Shenzhen Association for Artificial Intelligence (SAAI), 2024 (3 awardees/year)
ICLR Financial Assistance Award, ICLR, 2024
Honors Graduate, Sun Yat-sen University, 2023
Excellent Student Award, Rhino-Bird Elite Talent Development Program, Tencent, 2021
1st Class Scholarship, Sun Yat-sen University, 2017
Outstanding Undergraduate Thesis, Sun Yat-sen University, 2015

Contact

Andreasstrasse 5, 8092 Zürich, Switzerland
yinya [dot] el [dot] huang [at] gmail [dot] com