All AI news
Browse, filter, and search every article in the archive. The homepage shows the last 24 hours; everything older lives here.
New AI model called "Count Anything" does exactly what it says, and that's harder than it sounds
A new AI model called 'Count Anything' can accurately count various objects in images. This capability enhances image analysis and could improve applications in inventory management and visual data processing.

Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin
Google Research just launched Gemini-SQL2, which outperforms competitors in text-to-SQL benchmarks. This means developers can expect more accurate and efficient SQL query generation from natural language inputs.

Microsoft's SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file
Microsoft just boosted GPT-5.5 with SkillOpt, leveraging a trained Markdown file for enhanced performance. This means users can expect more efficient and accurate outputs from the model.

Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath's toughest problems
Claude Fable 5 just outperformed GPT-5.5 by 13 points on FrontierMath's toughest problems. This shows Claude's growing strength in tackling complex mathematical challenges, which could enhance its utility in educational and professional settings.

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster
Google DeepMind just released DiffusionGemma, a model that runs local AI four times faster. This boost in speed enhances performance for users running AI tasks on their own devices.
Google's new open model DiffusionGemma generates text from noise instead of word by word
Google just launched DiffusionGemma, an open model that generates text from noise instead of building it word by word. This approach could enhance creativity and efficiency in text generation tasks.

Claude Fable won’t answer basic biology questions
Anthropic's Claude Fable struggles with basic biology questions, failing to provide accurate answers. This limits its effectiveness for users seeking reliable information in that subject area.

[AINews] Anthropic Claude Fable 5 — Mythos but Safe, with Controversial Terms
Anthropic just launched Claude Fable 5, focusing on safety while introducing some controversial terms. This update aims to enhance user experience while navigating complex ethical considerations in AI interactions.
Initial impressions of Claude Fable 5
Anthropic just released Claude Fable 5, showcasing improved conversational abilities and better context handling. Users can expect more coherent and contextually relevant interactions with this update.
llm 0.32a3
Simon Willison just released LLM 0.32a3, an updated version of his language model. This version includes performance improvements and better handling of specific tasks, making it more efficient for developers.
Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science
Anthropic just released Claude Fable 5 and Mythos 5, showcasing major improvements in coding and scientific tasks. These updates enhance the models' capabilities, making them more effective for developers and researchers alike.

New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent
Researchers launched a new open-source voice model that listens continuously and decides every 0.4 seconds whether to speak or stay silent. This advancement enhances real-time interaction capabilities in voice applications.

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs
Lukas Petersson and Axel Backlund from Andon Labs are launching a new evaluation framework called Reality. This tool aims to enhance the assessment of AI models by providing more accurate performance metrics.
NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart
NVIDIA just launched the Nemotron 3 Ultra on Amazon SageMaker JumpStart. Users can now access this advanced model for enhanced machine learning tasks directly within the SageMaker platform.
[AINews] Microsoft Build: MAI-Thinking-1 and MAI Family models
Microsoft just launched the MAI-Thinking-1 model and the MAI Family of models. These updates enhance AI capabilities for reasoning and decision-making tasks, improving user experience in applications that require complex problem-solving.
Microsoft's new MAI models
Microsoft just launched new MAI models designed to enhance AI capabilities across various applications. These models aim to improve performance and efficiency for developers integrating AI into their products.
Ask AI what goes with chicken and the answer depends on whether it learned from recipes or molecules
AI models are now determining food pairings based on either recipes or molecular structures. This means users can get tailored suggestions depending on the learning approach of the AI.

OpenAI gives GPT-5.5 Instant a readability upgrade while phasing out two older models
OpenAI just upgraded GPT-5.5 Instant for better readability and is phasing out two older models. Users will experience improved clarity and engagement in their interactions with the updated model.

9 demos of Gemini Omni and Gemini 3.5 in action
Google just showcased 9 demos of Gemini Omni and Gemini 3.5 in action. These demos highlight the capabilities of the models in various applications, enhancing how users can leverage AI in their workflows.

Claude Opus 4.8: "a modest but tangible improvement"
Anthropic just released Claude Opus 4.8, featuring modest improvements in performance. Users can expect better response quality and more reliable outputs in their interactions.