
Time is encoded in the weights of finetuned language models

source link: https://arxiv.org/abs/2312.13401


[Submitted on 20 Dec 2023]

Time is Encoded in the Weights of Finetuned Language Models


We present time vectors, a simple tool to customize language models to new time periods. Time vectors are created by finetuning a language model on data from a single time (e.g., a year or month), and then subtracting the weights of the original pretrained model. This vector specifies a direction in weight space that, as our experiments show, improves performance on text from that time period. Time vectors specialized to adjacent time periods appear to be positioned closer together in a manifold. Using this structure, we interpolate between time vectors to induce new models that perform better on intervening and future time periods, without any additional training. We demonstrate the consistency of our findings across different tasks, domains, model sizes, and time scales. Our results suggest that time is encoded in the weight space of finetuned models.
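The abstract describes two weight-space operations: forming a time vector by subtracting the pretrained weights from a checkpoint finetuned on a single time period, and linearly interpolating between two time vectors to target an intervening period without further training. The snippet below is a minimal sketch of both operations over PyTorch state dicts; the file names, the 2012/2016 years, and the helper function names are illustrative assumptions, not the authors' released code.

```python
# Minimal sketch of time-vector arithmetic over PyTorch state dicts.
# Assumes three checkpoints of the *same* architecture exist on disk:
# a pretrained model and two models finetuned on text from single years.
# File names and years below are hypothetical placeholders.
import torch


def time_vector(finetuned, pretrained):
    """tau_t = theta_finetuned(t) - theta_pretrained, parameter by parameter."""
    return {k: finetuned[k] - pretrained[k] for k in pretrained}


def apply_vector(pretrained, vector, scale=1.0):
    """Add an (optionally scaled) time vector back onto the pretrained weights."""
    return {k: pretrained[k] + scale * vector[k] for k in pretrained}


def interpolate(vec_a, vec_b, alpha):
    """Linearly interpolate between two time vectors, alpha in [0, 1]."""
    return {k: (1 - alpha) * vec_a[k] + alpha * vec_b[k] for k in vec_a}


# Example: target a year roughly halfway between the two finetuning years.
pretrained = torch.load("pretrained.pt")        # hypothetical paths
ft_2012 = torch.load("finetuned_2012.pt")
ft_2016 = torch.load("finetuned_2016.pt")

v_2012 = time_vector(ft_2012, pretrained)
v_2016 = time_vector(ft_2016, pretrained)
v_mid = interpolate(v_2012, v_2016, alpha=0.5)  # aim roughly at 2014
state_2014 = apply_vector(pretrained, v_mid)
```

Operating directly on state dicts keeps the sketch architecture-agnostic; in practice the interpolated state dict would be loaded back into the same model class (e.g., via `model.load_state_dict(state_2014)`) before evaluation on text from the target period.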
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2312.13401 [cs.CL]
  (or arXiv:2312.13401v1 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.2312.13401

Submission history

From: Kai Nylund
[v1] Wed, 20 Dec 2023 20:04:45 UTC (8,942 KB)
