All news
ResearchThe Decoder·June 19, 2026

OpenAI researchers show small doses of "beneficial trait" training make AI models broadly safer and harder to manipulate

OpenAI researchers are training AI models with small doses of 'beneficial trait' training to enhance safety and reduce manipulation risks. This approach aims to make AI interactions more reliable for users.

More in Research