evalview-mcp
Regression testing for AI agents. Golden baselines, CI/CD, LangGraph, CrewAI, OpenAI, Claude.
Homepage ↗
Repo ↗
AI & Machine Learning
agent
agent-benchmark
agent-evaluation
agentic-ai
ai-agents
anthropic
crewai
crewai-tools
evaluation
langchain
langgraph
langgraph-python
llm
llmops
mlops
openai-assistants
pytest
testing
tools
⚙ Agent Friendliness
N/A
Not evaluated
Can an agent use this?
🔒 Security
N/A
Not evaluated
Is it safe for agents?
⚡ Reliability
N/A
Not evaluated
Does it work consistently?
Scores are editorial opinions as of unknown date.