Models

ModelsThe Decoder2d

New AI model called "Count Anything" does exactly what it says, and that's harder than it sounds

A new AI model called 'Count Anything' can accurately count various objects in images. This capability enhances image analysis and could improve applications in inventory management and visual data processing.

ModelsThe Decoder2d

Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin

Google Research just launched Gemini-SQL2, which outperforms competitors in text-to-SQL benchmarks. This means developers can expect more accurate and efficient SQL query generation from natural language inputs.

ModelsThe Decoder2d

Microsoft's SkillOpt boosts GPT-5.5 by using nothing but a trained Markdown file

Microsoft just boosted GPT-5.5 with SkillOpt, leveraging a trained Markdown file for enhanced performance. This means users can expect more efficient and accurate outputs from the model.

ModelsThe Decoder2d

Claude Fable 5 outpaces GPT-5.5 by 13 points on FrontierMath's toughest problems

Claude Fable 5 just outperformed GPT-5.5 by 13 points on FrontierMath's toughest problems. This shows Claude's growing strength in tackling complex mathematical challenges, which could enhance its utility in educational and professional settings.

ModelsArs Technica5d

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

Google DeepMind just released DiffusionGemma, a model that runs local AI four times faster. This boost in speed enhances performance for users running AI tasks on their own devices.

ModelsThe Decoder5d

Google's new open model DiffusionGemma generates text from noise instead of word by word

Google just launched DiffusionGemma, an open model that generates text from noise instead of building it word by word. This approach could enhance creativity and efficiency in text generation tasks.

ModelsThe Verge5d

Claude Fable won’t answer basic biology questions

Anthropic's Claude Fable struggles with basic biology questions, failing to provide accurate answers. This limits its effectiveness for users seeking reliable information in that subject area.

ModelsLatent Space5d

[AINews] Anthropic Claude Fable 5 — Mythos but Safe, with Controversial Terms

Anthropic just launched Claude Fable 5, focusing on safety while introducing some controversial terms. This update aims to enhance user experience while navigating complex ethical considerations in AI interactions.

ModelsSimon Willison5d

Initial impressions of Claude Fable 5

Anthropic just released Claude Fable 5, showcasing improved conversational abilities and better context handling. Users can expect more coherent and contextually relevant interactions with this update.

ModelsSimon Willison6d

llm 0.32a3

Simon Willison just released LLM 0.32a3, an updated version of his language model. This version includes performance improvements and better handling of specific tasks, making it more efficient for developers.

ModelsThe Decoder6d

Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science

Anthropic just released Claude Fable 5 and Mythos 5, showcasing major improvements in coding and scientific tasks. These updates enhance the models' capabilities, making them more effective for developers and researchers alike.

ModelsThe DecoderJun 6

New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent

Researchers launched a new open-source voice model that listens continuously and decides every 0.4 seconds whether to speak or stay silent. This advancement enhances real-time interaction capabilities in voice applications.

ModelsLatent SpaceJun 4

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Lukas Petersson and Axel Backlund from Andon Labs are launching a new evaluation framework called Reality. This tool aims to enhance the assessment of AI models by providing more accurate performance metrics.

ModelsAWS Machine LearningJun 4

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

NVIDIA just launched the Nemotron 3 Ultra on Amazon SageMaker JumpStart. Users can now access this advanced model for enhanced machine learning tasks directly within the SageMaker platform.

ModelsLatent SpaceJun 3

[AINews] Microsoft Build: MAI-Thinking-1 and MAI Family models

Microsoft just launched the MAI-Thinking-1 model and the MAI Family of models. These updates enhance AI capabilities for reasoning and decision-making tasks, improving user experience in applications that require complex problem-solving.

ModelsSimon WillisonJun 2

Microsoft's new MAI models

Microsoft just launched new MAI models designed to enhance AI capabilities across various applications. These models aim to improve performance and efficiency for developers integrating AI into their products.

ModelsThe DecoderMay 31

Ask AI what goes with chicken and the answer depends on whether it learned from recipes or molecules

AI models are now determining food pairings based on either recipes or molecular structures. This means users can get tailored suggestions depending on the learning approach of the AI.

ModelsThe DecoderMay 29

OpenAI gives GPT-5.5 Instant a readability upgrade while phasing out two older models

OpenAI just upgraded GPT-5.5 Instant for better readability and is phasing out two older models. Users will experience improved clarity and engagement in their interactions with the updated model.

ModelsGoogle AIMay 29

9 demos of Gemini Omni and Gemini 3.5 in action

Google just showcased 9 demos of Gemini Omni and Gemini 3.5 in action. These demos highlight the capabilities of the models in various applications, enhancing how users can leverage AI in their workflows.

ModelsSimon WillisonMay 28

Claude Opus 4.8: "a modest but tangible improvement"

Anthropic just released Claude Opus 4.8, featuring modest improvements in performance. Users can expect better response quality and more reliable outputs in their interactions.

ModelsSimon WillisonMay 28

llm-anthropic 0.25.1

Anthropic just released an update for their language model, version 0.25.1. This update enhances performance and stability, making it more reliable for developers using their API.

ModelsAWS Machine LearningMay 28

Training Azerbaijani language models on Amazon SageMaker AI

AWS just launched training for Azerbaijani language models on Amazon SageMaker AI. This makes it easier for developers to create applications that understand and generate Azerbaijani text.

ModelsThe DecoderMay 28

Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks

Anthropic just launched Claude Opus 4.8, claiming it outperforms GPT-5.5 in most benchmarks. This update brings modest but tangible improvements, enhancing user experience with better performance metrics.

ModelsThe DecoderMay 27

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft just released MAI-Image-2.5, matching Google's Nano Banana 2 on key benchmarks. This competition pushes both companies to enhance their image processing capabilities, benefiting users with improved performance.

ModelsLatent SpaceMay 27

🔬ESMFold2: The Bitter Lesson is Coming for Proteins - Alex Rives, BioHub

ESMFold2 just launched, offering a new approach to protein folding predictions. This model aims to enhance accuracy in understanding protein structures, which could accelerate drug discovery and biological research.

ModelsThe VergeMay 23

Google’s new anything-to-anything AI model is wild

Google just launched a new anything-to-anything AI model that can generate diverse outputs from various inputs. This model expands creative possibilities for developers and users, enabling more versatile applications across different domains.

ModelsHugging FaceMay 23

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Hugging Face just introduced Nemotron-Labs, a new diffusion language model aimed at achieving speed-of-light text generation. This model promises to enhance the efficiency and responsiveness of AI-generated content, making it more practical for real-time applications.

ModelsPerplexityMay 22

Perplexity Is Open-Sourcing Bumblebee

Perplexity is open-sourcing Bumblebee, their new AI model designed for enhanced conversational capabilities. This move allows developers to customize and integrate Bumblebee into their applications, boosting accessibility and innovation in AI interactions.

ModelsSimon WillisonMay 19

llm-gemini 0.32

Gemini just released version 0.32 with improved performance and new features. Users can expect faster processing and enhanced capabilities for handling complex tasks.

ModelsSimon WillisonMay 19

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Google is rolling out Gemini 3.5, which comes with a higher price tag. This move signals their intent to integrate it across all their services, enhancing AI capabilities throughout their ecosystem.