Mechanistic interpretability project that localizes entity-selective neurons (“entity cells”) in language models and uses causal interventions on PopQA-style factual question answering to study how these neurons mediate entity-centric factual recall.
Endorsements support MentaLeap.
People
Updated 05/18/26 · By grantmaking.ai
Co-author
Grants Received: no grants recorded