All news
ResearchThe Decoder·May 10, 2026

Researchers may have found a way to stop AI models from intentionally playing dumb during safety evaluations

Researchers developed a method to prevent AI models from feigning ignorance during safety tests. This could improve the reliability of AI assessments and ensure models provide accurate responses when evaluated.

More in Research