eval-view
Python · Medium · hidai25/eval-view
80 stars
17 forks
38 open issues
Active Mar 2026
Beginner-Friendly Issues 38
Issues tagged for new contributors
Show token cost breakdown in check output
#146 · Mar 31, 2026
enhancement help wanted good first issue
Add `--json` flag to `evalview snapshot`
#145 · Mar 31, 2026
enhancement help wanted good first issue
`evalview validate` — lint test files without running them
#144 · Mar 31, 2026
enhancement help wanted good first issue
Support TOML as alternative to YAML for test cases
#143 · Mar 31, 2026
enhancement help wanted good first issue
Add slow-agent warning before timeout fires
#142 · Mar 31, 2026
enhancement help wanted good first issue
`evalview diff` — pretty-print what changed between two runs
#141 · Mar 31, 2026
enhancement help wanted good first issue
🐕 Dogfood failed: regression, dogfood (2026-03-25)
#132 · Mar 25, 2026
dogfood
Test coverage report — identify untested tool paths
#131 · Mar 24, 2026
enhancement
`evalview sweep` — batch evaluation of N agent configs
#130 · Mar 24, 2026
enhancement
Agent Test Format spec — publish open standard
#129 · Mar 24, 2026
enhancement
Add AutoGen/AG2 integration guide
#128 · Mar 24, 2026
documentation help wanted
Add Vercel AI SDK integration guide
#127 · Mar 24, 2026
documentation help wanted
Community test registry — seed templates
#126 · Mar 24, 2026
documentation help wanted
Interactive HTML report — add filtering and search
#125 · Mar 24, 2026
enhancement help wanted
Watch mode — Textual TUI upgrade
#124 · Mar 24, 2026
enhancement help wanted
Badge — GitHub Actions step to auto-commit badge
#123 · Mar 24, 2026
enhancement help wanted
Add `evalview import --from pydantic-evals`
#122 · Mar 24, 2026
enhancement help wanted
Add `evalview export --format openai-evals`
#121 · Mar 24, 2026
enhancement help wanted
CrewAI — track cost per agent
#120 · Mar 24, 2026
enhancement help wanted
CrewAI adapter — streaming support
#119 · Mar 24, 2026
enhancement help wanted
OpenClaw — auto-research loop example
#118 · Mar 24, 2026
documentation help wanted
MCP contract drift — auto-detect schema changes
#117 · Mar 24, 2026
enhancement help wanted
Pydantic AI adapter — structured output validation
#116 · Mar 24, 2026
enhancement help wanted
CrewAI adapter — support hierarchical process type
#115 · Mar 24, 2026
enhancement help wanted
CrewAI adapter — capture per-agent reasoning steps
#114 · Mar 24, 2026
enhancement help wanted
Add test templates to `evalview init`
#113 · Mar 24, 2026
enhancement help wanted good first issue
Add `evalview doctor` command
#112 · Mar 24, 2026
enhancement help wanted good first issue
Improve error messages for common adapter failures
#111 · Mar 24, 2026
enhancement help wanted good first issue
Add Ollama example — local agent with no API keys
#110 · Mar 24, 2026
documentation help wanted good first issue
Write "EvalView vs pydantic_evals" comparison doc
#109 · Mar 24, 2026
documentation help wanted good first issue
Write "EvalView vs crewai test" comparison doc
#108 · Mar 24, 2026
documentation help wanted good first issue
Add Pydantic AI example — customer support agent
#107 · Mar 24, 2026
documentation help wanted good first issue
Add CrewAI example — Trip Planner crew
#106 · Mar 24, 2026
documentation help wanted good first issue
documentation help wanted
ci: add example GitHub Actions workflow for generated suite review
#96 · Mar 13, 2026
documentation enhancement help wanted
feat(importers): add generic HTTP trace export format support
#95 · Mar 13, 2026
enhancement help wanted good first issue
feat(importers): add CSV support for generate --from-log
#94 · Mar 13, 2026
enhancement help wanted good first issue
Add cron schedule syntax to evalview monitor
#84 · Mar 11, 2026
help wanted good first issue