🧠All Things AI — by Subhojit DeyAll Things AI
🌱Start Here🔧Build with AIDaily StackDevelopersVibe CodingOthersLocal🏢Industry🛡️Legal🔬Deep Dive📰News
🧠 All Things AI
🌱🧠🔧⚡⚡🤖✨🔍🔶🎯💜⚡🪟🦙🤗🦞🔁🌊✕🔀🛠️🏢🛡️✅🏭🔬📰
🔬Deep Dive
Math Foundations
Neural Networks
Transformer Architecture
Scaling
LLM Pre-training
Alignment Techniques
Reasoning Internals
Interpretability
Model Architectures
Hardware & Compute
Fine-tuning & Adaptation
Research Skills
AI Economics & Impact
🔬Deep Dive
Math Foundations
Neural Networks
Transformer Architecture
Scaling
LLM Pre-training
Alignment Techniques
Reasoning Internals
Interpretability
Model Architectures
Hardware & Compute
Fine-tuning & Adaptation
Research Skills
AI Economics & Impact
Deep DiveInterpretability

Interpretability

Understanding what neural networks are computing — from mechanistic circuit analysis to sparse autoencoders, probing classifiers, and what Anthropic's research has found.

In This Section

Mechanistic Interpretability — What & Why

Features, circuits, superposition, and the goal of reverse-engineering model algorithms.

Attention Visualization & Probing

Induction heads, probing classifiers, causal tracing, and the limits of attention as explanation.

Circuits & Features in LLMs

Induction circuits, GPT-2 IOI task, monosemanticity, and safety-relevant features.

Previous← DeepSeek-R1 InternalsNextMechanistic Interpretability →

Page built: 01 Jun 2026