Download

Skill UI

Browse and discover 11126+ curated skills

All Development Artificial Intelligence Design & Creative Product & Business Data Science Marketing Soft Skills Productivity Engineering Languages

Search Reverse Engineering , found 1 results

Default Newest Most Downloaded

Mechanistic Interpretability for Transformers

transformer-lens-interpretability

Orchestra-Research/AI-Research-SKILLs

TransformerLens is the standard library for performing mechanistic interpretability research on large language models. It provides clean interfaces to inspect and manipulate transformer internals—such as attention patterns, residual streams, and MLP outputs—via activation caching and HookPoints. This tool is essential for reverse-engineering model algorithms, performing causal tracing, and studying the internal circuits responsible for model behavior.

1

Language