All news
ProductsAWS Machine Learning·May 4, 2026

Capacity-aware inference: Automatic instance fallback for SageMaker AI endpoints

AWS just introduced capacity-aware inference for SageMaker AI endpoints. This feature automatically adjusts instance types based on workload, improving efficiency and reducing costs for users.

More in Products