All tools
Evals & observability

Braintrust

By Braintrust

Eval platform for AI products — define test sets, run them across models, and track regressions over time. The default choice for teams shipping LLM features.

Best for

  • systematic AI evals
  • comparing prompts and models
  • catching regressions before deploy

Other Evals & observability