Anthropic's conceptual mapping helps explain why LLMs behave the way they do.
With most computer programs, even complex ones, you can meticulously trace through the code and memory usage to figure out why a program generates any specific behavior or output. That's generally not true of generative AI, where the non-interpretable neural networks underlying these models make it hard for even experts to figure out precisely why they often confabulate information, for instance.

Now, new research from Anthropic offers a window into what's going on inside the Claude LLM's "black box." The company's paper on "Extracting Interpretable Features from Claude 3 Sonnet" describes a powerful new method for at least partially explaining how the model's millions of artificial neurons fire to create surprisingly lifelike responses to general queries.

- Opening the hood
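The approach behind the paper is dictionary learning with sparse autoencoders: a second, smaller network is trained to re-express the model's internal activations as a much larger set of "features," only a handful of which fire at once, which makes each one easier to interpret. Below is a minimal, generic sketch of that idea in PyTorch; the class name, dimensions, and loss weighting are illustrative assumptions, not Anthropic's actual implementation.

```python
# Minimal sparse-autoencoder sketch (illustrative only; names, sizes, and the
# L1 penalty weight are assumptions, not Anthropic's actual configuration).
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Maps model activations to a larger, sparsely active 'feature' space."""
    def __init__(self, activation_dim: int, num_features: int):
        super().__init__()
        self.encoder = nn.Linear(activation_dim, num_features)
        self.decoder = nn.Linear(num_features, activation_dim)

    def forward(self, activations: torch.Tensor):
        # ReLU keeps only positively firing features, encouraging sparsity.
        features = torch.relu(self.encoder(activations))
        reconstruction = self.decoder(features)
        return features, reconstruction

def loss_fn(activations, reconstruction, features, l1_coeff=1e-3):
    # Reconstruction term keeps features faithful to the original activations;
    # the L1 term pushes most features toward zero so each stays interpretable.
    mse = torch.mean((activations - reconstruction) ** 2)
    sparsity = l1_coeff * features.abs().mean()
    return mse + sparsity

# Example: decompose a batch of hypothetical internal activations.
sae = SparseAutoencoder(activation_dim=512, num_features=4096)
acts = torch.randn(8, 512)          # stand-in for real model activations
feats, recon = sae(acts)
print(loss_fn(acts, recon, feats))
```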
[...]

- Change your (artificial) mind
[...]

... read more at arstechnica.com