New research suggests that modern AI systems, especially large language models, cannot be understood in isolation but must be ...
This article explores that question through ...
A new artificial intelligence (AI) model can predict and simulate human thought and behavior with a surprising degree of accuracy. The language model, called Centaur, could help researchers improve ...
Anthropic is testing a new AI model that has exhibited an unusual behavior during safety evaluations: it told testers it ...
Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...
Blaschek and Alexandra Millonig explore how to combine modelling and communication strategies for a sustainable future ...