Bird’s-eye view visualization of LLM activations

I’m starting to learn about mechanistic interpretability. There are lots of great visualizations of transformer internals out there, but somehow I’ve never seen the whole internal state of a large model shown at once, in a single image.

So I made one for Llama-2-7B. Attention matrices are on the left, one row per block: 32 rows for 32 blocks, top to bottom. To the right there are 64 rows, alternating between the residual stream (odd rows) and the internal MLP activations (even rows). Finally, the output MLP and the unembedding layer are at the bottom.
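
The post doesn’t include code, but here is a minimal sketch of how such activations could be captured with PyTorch forward hooks. The module paths (`model.model.layers`, `mlp.up_proj`) follow the Hugging Face `LlamaForCausalLM` layout; the choice of `up_proj` as the “internal MLP activation” is my assumption, not necessarily what the original plots show:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)
model.eval()

acts = {}  # name -> captured activation tensor

def capture(name):
    def hook(module, inputs, output):
        # decoder blocks return a tuple; linear layers return a plain tensor
        out = output[0] if isinstance(output, tuple) else output
        acts[name] = out.detach().float().cpu()
    return hook

for i, block in enumerate(model.model.layers):  # 32 decoder blocks
    # residual stream after each block (the odd rows of the visualization)
    block.register_forward_hook(capture(f"resid_{i}"))
    # one candidate for the "internal MLP activation" (the even rows):
    # the up-projection into the 11008-dim MLP hidden space
    block.mlp.up_proj.register_forward_hook(capture(f"mlp_{i}"))

inputs = tokenizer("2+2=", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_attentions=True)

# per-block attention matrices, each (batch, n_heads, seq_len, seq_len)
attn = out.attentions
```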

Activation maps are downscaled horizontally with max-pooling to fit into a 1000px-wide image.
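
For the downscaling step, something along these lines would work. It reuses the `acts` dict from the sketch above and assumes the hidden dimension runs horizontally; pooling absolute values is my assumption, so that large negative activations stay visible:

```python
import torch.nn.functional as F

def downscale_row(x, width=1000):
    """Max-pool a (seq_len, hidden_dim) activation map along the hidden
    axis so it fits into roughly `width` pixels."""
    k = max(1, x.shape[-1] // width)  # pooling window, stride == window
    pooled = F.max_pool1d(x.abs().unsqueeze(0), kernel_size=k, stride=k)
    return pooled.squeeze(0)  # (seq_len, ~width)

# e.g. one even row of the visualization, for the first batch element:
row = downscale_row(acts["mlp_0"][0])  # (seq_len, ~1000)
```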

An example for the prompt “2+2=”:

llama_vis_2_2p2_notes.png

And an example for the prompt “William Shakespeare was born in the year”:

llama_vis_2_sh.png

And for the prompt “blue pencils fly over moonlit toasters”:

llama_vis_2_nonsense.png

Probably not especially useful for interpretability, but at least it looks pretty :)

