Open-source LLM observability and evaluation platform for tracing, debugging, monitoring, and optimizing generative AI applications including RAG systems and agentic workflows. Provides comprehensive tracing infrastructure, LLM-as-a-judge evaluation metrics, experiment management, and production dashboards.