Mechanistic Interpretability Technique

From GM-RKB
Jump to navigation Jump to search

A Mechanistic Interpretability Technique is an AI interpretability technique that analyzes neural network internals to understand computational mechanisms.