Paper accepted at Mechanistic Interpretability Workshop at ICML 2024

We are pleased to announce that the following paper was accepted for publication in the Mechanistic Interpretability Workshop at ICML 2024.

Congratulations to the authors!

  1. Benchmarking Mental State Representations in Language Models

    Matteo Bortoletto, Constantin Ruhdorfer, Lei Shi, Andreas Bulling

    Proc. ICML 2024 Workshop on Mechanistic Interpretability, pp. 1–21, 2024.
