motives Archives - techno.express

Researchers astonished by tool’s apparent success at revealing AI’s hidden motives

by Benj Edwards
March 14, 2025
I.T. Today
3 min read

In a new paper published Thursday titled “Auditing language models for hidden objectives,” Anthropic researchers described how models trained to deliberately conceal certain motives from evaluators could still inadvertently reveal secrets, thanks to their ability…