摘要 Abstract
主体大型语言模型(agentic LLMs)因其能够作为主体行动而备受关注。本文回顾了该领域的研究成果,并提出了研究议程。主体大型语言模型是指(1)推理、(2)行动以及(3)交互的大型语言模型。我们根据这三个类别组织文献。第一类研究聚焦于推理、反思与检索,旨在提升决策能力;第二类研究聚焦于行动模型、机器人及工具,旨在打造有用的辅助工具;第三类研究聚焦于多主体系统,旨在实现协作任务解决并模拟交互以研究涌现的社会行为。我们发现,各领域研究相互受益:检索支持工具使用,反思改善多主体协作,推理惠及所有领域。本文讨论了主体大型语言模型的应用,并提出了进一步研究的议程。重要应用包括医学诊断、物流和金融市场分析。同时,自我反思型主体扮演角色并相互互动可以增强科学研究本身的过程。此外,主体大型语言模型可能为大型语言模型训练数据耗尽的问题提供解决方案:推理时的行为可生成新的训练状态,从而使大型语言模型无需更大的数据集即可持续学习。我们注意到,大型语言模型助手在现实世界中采取行动存在风险,但主体大型语言模型也可能造福社会。
There is great interest in agentic LLMs, large language models that act as agents. We review the growing body of work in this area and provide a research agenda. Agentic LLMs are LLMs that (1) reason, (2) act, and (3) interact. We organize the literature according to these three categories. The research in the first category focuses on reasoning, reflection, and retrieval, aiming to improve decision making; the second category focuses on action models, robots, and tools, aiming for agents that act as useful assistants; the third category focuses on multi-agent systems, aiming for collaborative task solving and simulating interaction to study emergent social behavior. We find that works mutually benefit from results in other categories: retrieval enables tool use, reflection improves multi-agent collaboration, and reasoning benefits all categories. We discuss applications of agentic LLMs and provide an agenda for further research. Important applications are in medical diagnosis, logistics and financial market analysis. Meanwhile, self-reflective agents playing roles and interacting with one another augment the process of scientific research itself. Further, agentic LLMs may provide a solution for the problem of LLMs running out of training data: inference-time behavior generates new training states, such that LLMs can keep learning without needing ever larger datasets. We note that there is risk associated with LLM assistants taking action in the real world, while agentic LLMs are also likely to benefit society.