Opik is an open-source LLM evaluation and observability tool from Comet that helps teams trace, evaluate, and debug AI applications. It provides tracing of LLM calls, dataset management, and built-in and custom evaluators for measuring output quality. Designed for ML engineers who need a production-ready evaluation pipeline integrated with their experiment tracking.
Opik
comet.com
Paid tool. Visit the site to view current pricing plans.