Anthropic is testing a new AI model that has exhibited an unusual behavior during safety evaluations: it told testers it ...