Skill UI
Browse and discover 7065+ curated skills
Search: Reverse Engineering (1 result found)
Mechanistic Interpretability for Transformers
transformer-lens-interpretability
Orchestra-Research/AI-Research-SKILLs
468
TransformerLens is a widely used library for mechanistic interpretability research on large language models. It provides clean interfaces for inspecting and manipulating transformer internals (attention patterns, residual streams, and MLP outputs) through activation caching and HookPoints. It is suited to reverse-engineering model algorithms, performing causal tracing, and studying the internal circuits responsible for model behavior.
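The two mechanisms named in the description, hook points and activation caching, can be illustrated with a toy sketch in plain Python. This is not TransformerLens code and makes no claim about how the library is implemented: `ToyModel`, its two hook points, and the `zero_ablate` function are hypothetical names invented for the illustration, and the "model" is just two arithmetic steps on a list of floats. The method name `run_with_cache` is borrowed only to echo the idea of returning the output together with the recorded intermediate values.

```python
from typing import Callable, Dict, List, Optional, Tuple

Vector = List[float]
# A hook receives the activation and the hook point's name.
# Returning None leaves the activation unchanged; returning a list replaces it.
Hook = Callable[[Vector, str], Optional[Vector]]


class HookPoint:
    """Identity step that attached hooks can observe or overwrite (toy stand-in)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.hooks: List[Hook] = []

    def __call__(self, x: Vector) -> Vector:
        for hook in self.hooks:
            replacement = hook(x, self.name)
            if replacement is not None:
                x = replacement
        return x


class ToyModel:
    """Hypothetical two-step model: scale by 2, then add 1, with a hook point after each step."""

    def __init__(self) -> None:
        self.hook_scaled = HookPoint("hook_scaled")
        self.hook_shifted = HookPoint("hook_shifted")

    def hook_points(self) -> List[HookPoint]:
        return [self.hook_scaled, self.hook_shifted]

    def forward(self, x: Vector) -> Vector:
        x = self.hook_scaled([2.0 * v for v in x])
        x = self.hook_shifted([v + 1.0 for v in x])
        return x

    def run_with_cache(self, x: Vector) -> Tuple[Vector, Dict[str, Vector]]:
        """Run forward once while recording every hook point's activation."""
        cache: Dict[str, Vector] = {}

        def save(act: Vector, name: str) -> None:
            cache[name] = list(act)  # copy so later edits cannot alter the record

        for point in self.hook_points():
            point.hooks.append(save)
        try:
            out = self.forward(x)
        finally:
            # Always detach the temporary caching hook, even if forward raises.
            for point in self.hook_points():
                point.hooks.remove(save)
        return out, cache


model = ToyModel()

# Observation: cache intermediate activations during a normal run.
output, cache = model.run_with_cache([1.0, 2.0])


# Intervention: overwrite an intermediate activation and see how the output changes.
def zero_ablate(act: Vector, name: str) -> Vector:
    return [0.0 for _ in act]


model.hook_scaled.hooks.append(zero_ablate)
ablated = model.forward([1.0, 2.0])
model.hook_scaled.hooks.clear()
```

Comparing `output` with `ablated` is the basic shape of a causal experiment: the same input is run twice, once untouched and once with a single internal value replaced, and the difference in output is attributed to that value.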