Computer Science - Artificial Intelligence

ORMind: A Cognitive-Inspired End-to-End Reasoning Framework for Operations Research

Operations research (OR) is widely deployed to solve critical decision-making problems with complex objectives and constraints, …

Zhiyuan Wang, Bokui Chen, Yinya Huang, Qingxing Cao, Ming He, Jianping Fan, Xiaodan Liang

TreeRPO: Tree Relative Policy Optimization

Large Language Models (LLMs) have shown remarkable reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR) …

Zhicheng Yang, Zhijiang Guo, Yinya Huang, Xiaodan Liang, Yiwei Wang, Jing Tang