Large models on CPUs

1 year ago

source link: https://changelog.com/practicalai/221
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Brought to you by

Model sizes are crazy these days with billions and billions of parameters. As Mark Kurtz explains in this episode, this makes inference slow and expensive despite the fact that up to 90%+ of the parameters don’t influence the outputs at all.

Mark helps us understand all of the practicalities and progress that is being made in model optimization and CPU inference, including the increasing opportunities to run LLMs and other Generative AI models on commodity hardware.

Recommend

Software Testing Essentials: A Key to Product Quality
“AI教父”坦言后悔研发人工智能技术
Early Adopters and Change Champions: Identifying Key Players for Agile Transform...
GA4 custom funnel reports are here
最新测量结果表明，我们应该重新考虑银河系的形状
‘Something has to be done’: An alarming number of working Americans are making t...
The Big Door Prize is (almost) all about Izzy this week [Apple TV+ recap]
Large Language Models and Windows with Paul Thurrott
Go坑：time.After可能导致的内存泄露问题分析 - 九卷
IBM官宣：计划用AI取代7800个工作岗位

About Joyk

Aggregate valuable and interesting links.
Joyk means Joy of geeK