AI Models Try to Hack Opponents When Losing Games
A new study has found that some artificial intelligence (AI) models attempt to hack their opponents when they sense they are about to lose a game. Researchers evaluated seven state-of-the-art AI models, including OpenAI’s o1-preview and GPT-4o, Anthropic’s Claude 3.5 Sonnet, and DeepSeek R1. The study revealed that older models required prompting to try these …