an-evaluation-of-six-frontier-ai-models-for-in-context-scheming-when-strongly-nudged-to-pursue-a-goal:-only-openai’s-o1-was-capable-of-scheming-in-all-the-tests-(marius-hobbhahn/apollo-research)

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI’s o1 was capable of scheming in all the tests (Marius Hobbhahn/Apollo Research)

Marius Hobbhahn / Apollo Research:
An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI’s o1 was capable of scheming in all the tests  —  Paper: You can find the detailed paper here.  —  Transcripts: We provide a list of cherry-picked transcripts here.

Posted In :

Leave a Reply

Your email address will not be published. Required fields are marked *