🦄 I'm super thrilled by Anthropic's blog post Mapping the Mind of a Large Language Model(https://ln
🦄 I’m super thrilled by Anthropic’s blog post “Mapping the Mind of a Large Language Model”(https://lnkd.in/dA6Jh_pg). On their way to turn the black box into a little bit more transparent and getting a better understanding how state-of-the-art LLMs are working. Interestingly, but not really surprising, that similar methods like are used for exploring human brain activities are providing insides.
🌟 Like always there are flip-sides, but I’m with Anthropic. There are easier ways to create harm and personally I think better understanding of technology will be beneficial.
🤯 I’m still chewing on the paper “Scaling Monosemanticity: Extracting Interpretable Features from Clause 3 Sonnet”(https://lnkd.in/dBSjbxf3). Not just an impressive title but full of deep details :D.
📽 AI Explained covers this paper in their current YT video: https://lnkd.in/dvKwM9xY Like always very good explained and very approachable.
🏃♂️ Excited to dive deeper and to see what comes next - but now out for a run to consolidate my mind 😀
Cross-posted to LinkedIn