News archive

All AI news

Browse, filter, and search every article in the archive. The homepage shows the last 24 hours; everything older lives here.

Models · Simon Willison · May 28

llm-anthropic 0.25.1

Simon Willison released llm-anthropic 0.25.1, an update to his plugin that gives the LLM command-line tool access to Anthropic's models.

Models · AWS Machine Learning · May 28

Training Azerbaijani language models on Amazon SageMaker AI

AWS Machine Learning describes how to train Azerbaijani language models on Amazon SageMaker AI. The work is aimed at developers building applications that understand and generate Azerbaijani text.

Models · The Decoder · May 28

Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks

Anthropic launched Claude Opus 4.8, described as a "modest but tangible improvement" over its predecessor. The new model tops GPT-5.5 in most benchmarks.

Models · The Decoder · May 27

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft released MAI-Image-2.5, which matches Google's Nano Banana 2 on benchmarks. The result puts the two companies' image models level with each other.

Models · Latent Space · May 27

🔬ESMFold2: The Bitter Lesson is Coming for Proteins - Alex Rives, BioHub

Latent Space talks with Alex Rives of BioHub about ESMFold2, a protein structure prediction model. Better structure prediction could accelerate drug discovery and biological research.

Models · The Verge · May 23

Google’s new anything-to-anything AI model is wild

Google just launched a new anything-to-anything AI model that can generate diverse outputs from various inputs. This model expands creative possibilities for developers and users, enabling more versatile applications across different domains.

Models · Hugging Face · May 23

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

A post on the Hugging Face blog introduces the Nemotron-Labs diffusion language models, which aim for "speed-of-light" text generation. Faster generation would make AI-generated text more practical for real-time applications.

Models · Perplexity · May 22

Perplexity Is Open-Sourcing Bumblebee

Perplexity is open-sourcing Bumblebee. The release lets developers inspect, customize, and integrate Bumblebee into their own applications.

Models · Simon Willison · May 19

llm-gemini 0.32

Simon Willison released llm-gemini 0.32, a new version of his plugin that gives the LLM command-line tool access to Google's Gemini models.

Models · Simon Willison · May 19

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Google is rolling out Gemini 3.5 Flash at a higher price. Despite the added cost, Google plans to use the model for everything.

Models · Simon Willison · May 19

llm-gemini 0.32a0

Simon Willison released llm-gemini 0.32a0, an alpha version of his plugin that gives the LLM command-line tool access to Google's Gemini models.

Models · Latent Space · May 14

[AINews] Codex Rises, Claude Meters Programmatic Usage

This AINews issue from Latent Space covers two stories: the rise of OpenAI's Codex, and Claude metering programmatic usage.

Models · Latent Space · May 13

[AINews] The End of Finetuning

This AINews issue from Latent Space discusses the idea that finetuning AI models is coming to an end.

Models · Simon Willison · May 12

llm 0.32a2

Simon Willison released LLM 0.32a2, an alpha version of his command-line tool and Python library for working with large language models.

Models · Latent Space · May 12

[AINews] Thinking Machines' Native Interaction Models - TML-Interaction-Small 276B-A12B - advances SOTA Realtime Voice and kills standard VAD

Thinking Machines launched TML-Interaction-Small 276B-A12B, a native interaction model that advances the state of the art in real-time voice. According to AINews, it makes standard voice activity detection (VAD) obsolete.

Models · The Verge · May 11

OpenAI just released its answer to Claude Mythos

OpenAI released a new model positioned as its answer to Anthropic's Claude Mythos.

Models · The Decoder · May 10

GPT-5.5 costs 49 to 92 percent more than its predecessor, depending on the input length

OpenAI's GPT-5.5 costs 49 to 92 percent more than its predecessor. The size of the increase depends on the input length.

Models · Hugging Face · May 8

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

A post on the Hugging Face blog presents CyberSecQwen-4B, a small, specialized model for defensive cybersecurity. Because it can run locally, organizations can use it without sending sensitive data to external servers.

Models · Ars Technica · May 8

Chrome's 4GB AI model isn't new, but you're not wrong for being confused

Ars Technica explains that the 4GB AI model in Chrome is not a new release, and why the confusion around it is understandable.

Models · Hugging Face · May 8

EMO: Pretraining mixture of experts for emergent modularity

A post on the Hugging Face blog introduces EMO, an approach to pretraining mixture-of-experts models for emergent modularity.

Showing 21–40 of 84