Phoenix by Arize is an open-source observability and evaluation tool for AI and LLM applications. It provides tracing, dataset management, and built-in evaluators that measure hallucination, relevance, and toxicity. It is designed for ML engineers and data scientists who need deep insight into model and agent behavior during development and in production.
Phoenix
phoenix.arize.com
Paid tool. Visit the site to view current pricing plans.