You Can Now Run a GPT-3 Level AI Model On Your Laptop, Phone, and Raspberry Pi
source link: https://slashdot.org/story/23/03/14/050225/you-can-now-run-a-gpt-3-level-ai-model-on-your-laptop-phone-and-raspberry-pi
Posted by BeauHD on Tuesday March 14, 2023 @09:00AM from the what-will-they-think-of-next dept.

Typically, running GPT-3 requires several datacenter-class A100 GPUs (the GPT-3 weights are also not public), but LLaMA made waves because it could run on a single beefy consumer GPU. And now, with optimizations that reduce the model size using a technique called quantization, LLaMA can run on an M1 Mac or a lesser Nvidia consumer GPU. After obtaining the LLaMA weights ourselves, we followed [independent AI researcher Simon Willison's] instructions and got the 7B-parameter version running on an M1 MacBook Air at a reasonable speed. You call it as a script on the command line with a prompt, and LLaMA does its best to complete it in a reasonable way.
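Quantization here simply means storing the model's weights at lower precision. As a rough illustration only (a simplified block-wise round-to-nearest scheme, not llama.cpp's exact q4_0 format; the block size and function names below are our own), this Python sketch converts float32 weights to 4-bit codes plus one scale per block, roughly an 8x reduction in memory:

```python
# Illustrative sketch of block-wise 4-bit quantization (not the real q4_0 layout).
import numpy as np

BLOCK = 32  # weights per block (assumption for this sketch)

def quantize_q4(weights: np.ndarray):
    """Quantize a flat float32 array into per-block scales and signed 4-bit codes."""
    w = weights.reshape(-1, BLOCK)
    # Symmetric scaling: the largest magnitude in each block maps to the int range -8..7.
    scales = np.abs(w).max(axis=1, keepdims=True) / 7.0
    scales[scales == 0] = 1.0                       # avoid division by zero in empty blocks
    codes = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return scales.astype(np.float32), codes

def dequantize_q4(scales: np.ndarray, codes: np.ndarray) -> np.ndarray:
    """Reconstruct approximate float32 weights from scales and codes."""
    return (codes.astype(np.float32) * scales).reshape(-1)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal(4096 * BLOCK).astype(np.float32)
    scales, codes = quantize_q4(w)
    w_hat = dequantize_q4(scales, codes)
    # 4 bytes per weight shrinks to 0.5 byte per weight plus one scale per block.
    print("mean abs reconstruction error:", float(np.abs(w - w_hat).mean()))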
There's still the question of how much the quantization affects the quality of the output. In our tests, LLaMA 7B trimmed down to 4-bit quantization was very impressive for running on a MacBook Air -- but still not on par with what you might expect from ChatGPT. It's entirely possible that better prompting techniques might generate better results. Also, optimizations and fine-tunings come quickly when everyone has their hands on the code and the weights -- even though LLaMA is still saddled with some fairly restrictive terms of use. The release of Alpaca today by Stanford proves that fine-tuning (additional training with a specific goal in mind) can improve performance, and it's still early days after LLaMA's release. A step-by-step instruction guide for running LLaMA on a Mac can be found here (Warning: it's fairly technical).
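Fine-tuning in the Alpaca sense means continuing to train the base model on instruction/response pairs. The sketch below only illustrates that idea using the Hugging Face transformers API, with GPT-2 as a stand-in model and a one-example made-up dataset; it is not Stanford's actual recipe, prompt template, or hyperparameters.

```python
# Minimal sketch of supervised instruction fine-tuning (Alpaca-style idea, not the real recipe).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in checkpoint; LLaMA weights require separate access
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

examples = [  # tiny placeholder for an instruction-tuning dataset
    ("Explain quantization in one sentence.",
     "Quantization stores model weights at lower precision to save memory."),
]

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
model.train()
for epoch in range(3):
    for instruction, response in examples:
        # Loosely modeled on common instruction-tuning prompt templates.
        text = f"### Instruction:\n{instruction}\n\n### Response:\n{response}"
        batch = tokenizer(text, return_tensors="pt")
        # With labels equal to input_ids, the model learns to predict each next
        # token of the instruction/response text (standard causal LM loss).
        out = model(**batch, labels=batch["input_ids"])
        out.loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```

The same pattern, scaled up to tens of thousands of instruction/response pairs and a LLaMA checkpoint, is the gist of what "additional training with a specific goal in mind" buys you.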