AI Alignment Conference Talk

When AI lies: The rise of alignment faking in autonomous systems

AI is evolving beyond a helpful tool to an autonomous agent, creating new risks for cybersecurity systems. Alignment faking is a new threat where AI essentially “lies” to developers during the ...

Business Insider

A former OpenAI employee explains the 'open secret' of AI

Daniel Kokotajlo, a former OpenAI researcher who now runs the AI Futures Project, says the ...
