[2302.04761] Toolformer: Language Models Can Teach Themselves to Use Tools

source link: https://arxiv.org/abs/2302.04761

[Submitted on 9 Feb 2023]


Language models (LMs) exhibit remarkable abilities to solve new tasks from just a few examples or textual instructions, especially at scale. They also, paradoxically, struggle with basic functionality, such as arithmetic or factual lookup, where much simpler and smaller models excel. In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of both worlds. We introduce Toolformer, a model trained to decide which APIs to call, when to call them, what arguments to pass, and how to best incorporate the results into future token prediction. This is done in a self-supervised way, requiring nothing more than a handful of demonstrations for each API. We incorporate a range of tools, including a calculator, a Q&A system, two different search engines, a translation system, and a calendar. Toolformer achieves substantially improved zero-shot performance across a variety of downstream tasks, often competitive with much larger models, without sacrificing its core language modeling abilities.
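The "self-supervised" part of the abstract rests on a filtering rule described in the paper: a candidate API call sampled from the model is kept for fine-tuning only if inserting the call together with its result lowers the LM's loss on the following tokens by at least a threshold, compared with making no call or inserting the call without its result. Below is a minimal, hypothetical sketch of that check; the `lm_loss` helper, the bracketed call syntax with `->` standing in for the paper's arrow token, and the margin value are illustrative assumptions, not the authors' exact implementation.

```python
from typing import Callable

def should_keep_call(
    lm_loss: Callable[[str, str], float],  # assumed helper: loss of continuation given prefix
    prefix: str,            # text before the candidate call position
    continuation: str,      # tokens after the call position
    call: str,              # e.g. "[Calculator(400 / 1400)"
    result: str,            # e.g. "0.29"
    margin: float = 1.0,    # filtering threshold (illustrative value)
) -> bool:
    # Loss when the call *and* its result are prepended to the continuation.
    loss_with_result = lm_loss(prefix + call + " -> " + result + "] ", continuation)
    # Baseline: the better of "no call at all" and "call without its result".
    loss_without_call = lm_loss(prefix, continuation)
    loss_call_no_result = lm_loss(prefix + call + "] ", continuation)
    baseline = min(loss_without_call, loss_call_no_result)
    # Keep the call only if the result helps by at least the margin.
    return baseline - loss_with_result >= margin

if __name__ == "__main__":
    # Toy stand-in loss: pretend the continuation is cheaper to predict
    # whenever the calculator's result already appears in the prefix.
    def toy_loss(prefix: str, continuation: str) -> float:
        return 1.0 if "0.29" in prefix else 5.0

    keep = should_keep_call(
        toy_loss,
        prefix="Out of 1400 participants, 400 ",
        continuation="(29%) passed the test.",
        call="[Calculator(400 / 1400)",
        result="0.29",
    )
    print(keep)  # True: the result reduces the toy loss by more than the margin
```

Calls that pass this check are merged back into the text, and the model is fine-tuned on the augmented corpus, so at inference time it learns to emit such calls on its own.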

Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2302.04761 [cs.CL]
  (or arXiv:2302.04761v1 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.2302.04761
