Yinya Huang
Yinya Huang
Home
News
Featured
Publications
Service
Awards
Light
Dark
Automatic
Computer Science - Machine Learning
LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Long-form legal reasoning remains a key challenge for large language models (LLMs) in spite of recent advances in test-time scaling. We …
Yu Fan
,
Jingwei Ni
,
Jakob Merane
,
Etienne Salimbeni
,
Yang Tian
,
Yoan Hermstrüwer
,
Yinya Huang
,
Mubashara Akhtar
,
Florian Geering
,
Oliver Dreyer
,
Daniel Brunner
,
Markus Leippold
,
Mrinmaya Sachan
,
Alexander Stremitzer
,
Christoph Engel
,
Elliott Ash
,
Joel Niklaus
Cite
DOI
URL
TreeRPO: Tree Relative Policy Optimization
Large Language Models (LLMs) have shown remarkable reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR) …
Zhicheng Yang
,
Zhijiang Guo
,
Yinya Huang
,
Xiaodan Liang
,
Yiwei Wang
,
Jing Tang
Cite
DOI
URL
Cite
×