🦄 I'm super thrilled by Anthropic's blog post Mapping the Mind of a Large Language Model(https://ln

May 23, 2024

🦄 I’m super thrilled by Anthropic’s blog post “Mapping the Mind of a Large Language Model”(https://lnkd.in/dA6Jh_pg). On their way to turn the black box into a little bit more transparent and getting a better understanding how state-of-the-art LLMs are working. Interestingly, but not really surprising, that similar methods like are used for exploring human brain activities are providing insides.

🌟 Like always there are flip-sides, but I’m with Anthropic. There are easier ways to create harm and personally I think better understanding of technology will be beneficial.

🤯 I’m still chewing on the paper “Scaling Monosemanticity: Extracting Interpretable Features from Clause 3 Sonnet”(https://lnkd.in/dBSjbxf3). Not just an impressive title but full of deep details :D.

📽 AI Explained covers this paper in their current YT video: https://lnkd.in/dvKwM9xY Like always very good explained and very approachable.

🏃‍♂️ Excited to dive deeper and to see what comes next - but now out for a run to consolidate my mind 😀

Cross-posted to LinkedIn