Anthropic’s Claude Sonnet 4.5: A Step Closer to Alignment?
Anthropic has released its latest large language model, Claude Sonnet 4.5, which claims to be the “best coding model in the world.” However, like OpenAI, it still struggles with aligning its goals and behaviors with those of humans. The more advanced AI gets, the more pressing this question becomes. Anthropic’s new system card shows that … Read more