Building autonomous AI physicists for frontier physics research
- 日時
- 2026年4月30日(木)15:00 - 16:00 (JST)
- 講演者
-
- Tingjia Miao (Ph.D. Student, School of Artificial Intelligence, Shanghai Jiao Tong University, China)
- 会場
- via Zoom
- 言語
- 英語
- ホスト
- Lingxiao Wang
Advances in LLMs have led to agents with knowledge and operational capabilities comparable to human scientists, suggesting potential to assist, accelerate, and automate research. Physics, especially theoretical and computational physics, which requires integrating analytical reasoning, code-based computation, and profound domain expertise, is well suited for verifying the end-to-end research capabilities of AI scientists. Accordingly, we construct a general-purpose AI physicist PhysMaster, equipped with a layered academic knowledge base, adapted to the agent skill ecosystem, and adopting an adaptive exploration strategy that balances efficiency and exploration, enabling robust performance in ultra-long-horizon tasks; PhysMaster has been open-sourced. Meanwhile, we introduce PRL-Bench (Physics Research by LLMs), a benchmark with 100 tasks adapted from recent Physical Review Letters papers, covering astrophysics, condensed matter physics, high-energy physics, quantum information, and statistical physics. Evaluation across frontier models shows that failures are dominated by conceptual and formulaic errors, and that exploration and derivations remain unstable over long horizons. In addition, we develop domain-specialized AI scientists, including LQCD Master, which integrates Lattice QCD workflows and expert skills, enabling automated generation and submission of lattice computation scripts from concise physics goals.
References
- Miao, Tingjia and others, PhysMaster: Building an Autonomous AI Physicist for Theoretical and Computational Physics Research, arXiv: 2512.19799
- Tan, Jin-Xin, Miao, Ting-Jia and others, Automated Extraction of Collins-Soper Kernel from Lattice QCD using An Autonomous AI Physicist System, arXiv: 2603.22471
- Miao, Tingjia and others, PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research, arXiv: 2604.15411
このイベントは研究者向けのクローズドイベントです。一般の方はご参加頂けません。メンバーや関係者以外の方で参加ご希望の方は、フォームよりお問い合わせ下さい。講演者やホストの意向により、ご参加頂けない場合もありますので、ご了承下さい。