# eval-driven-development

Here are 7 public repositories matching this topic...

mega-edo / mega-security

Security optimization for AI agent systems.

security-optimization agent-security agent-optimization eval-driven-development eval-driven-optimization agent-security-optimization system-prompt-security

Updated May 7, 2026
Python

zircote / autoresearch

Autonomous skill improvement loop for Claude Code plugins — inspired by Karpathy's autoresearch. Modify → evaluate → keep/discard → repeat until convergence. Zero-touch quality iteration at scale.

python convergence quality-assurance autonomous-agents ai-agents karpathy claude-code skill-improvement claude-code-plugin eval-driven-development autoresearch improvement-loop

Updated Mar 27, 2026
Python
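
The modify, evaluate, keep/discard, repeat-until-convergence loop named in the description above can be sketched roughly as follows. This is a minimal illustration, not the autoresearch repository's actual code: the function name `improve_until_converged`, the `mutate` and `evaluate` callables, and the `patience`-based stopping rule are all assumptions made for the example.

```python
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


def improve_until_converged(
    candidate: T,
    mutate: Callable[[T], T],
    evaluate: Callable[[T], float],
    patience: int = 20,
    max_steps: int = 1000,
) -> Tuple[T, float]:
    """Hypothetical keep/discard loop: higher scores are better.

    "Convergence" is assumed here to mean `patience` consecutive
    modifications that failed to improve the score.
    """
    best, best_score = candidate, evaluate(candidate)
    stale = 0  # consecutive discarded modifications
    for _ in range(max_steps):
        if stale >= patience:
            break  # treated as converged
        trial = mutate(best)            # modify
        trial_score = evaluate(trial)   # evaluate
        if trial_score > best_score:    # keep only strict improvements
            best, best_score = trial, trial_score
            stale = 0
        else:                           # discard
            stale += 1
    return best, best_score
```

In a real setup `candidate` might be a skill or prompt file and `evaluate` an eval suite; here both are left abstract so the loop itself stays visible.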

yosuancrespo / specforge-ai

AI-augmented QA platform for spec-driven development and testing, RAG-grounded analysis, eval-driven development and contract validation across Python, Go, Rust and Solidity.

Updated Apr 2, 2026
Python

GeniusTechnoMystic / agentic-swe-grounding-system

Modular self-referencing Markdown grounding system for agentic AI software engineering and architecture

Updated Apr 30, 2026
Python

SAY-5 / genai-eval

Multilingual GenAI evaluation service across 5 task types and 3 languages, with regression-trend dashboard

multilingual nextjs fastapi llm-eval eval-driven-development

Updated May 7, 2026
Python

shahcolate / Product-Kit

Most AI plugins hope they work. These prove it. Eval-driven Claude plugins for product teams.

product-management claude product-strategy ai-tools llm-as-judge claude-plugin eval-driven-development llm-plugins behavioral-evals

Updated Mar 26, 2026
Python

jleonceo / llm-eval-contable

Eval-driven development for LLM accounting skills. 50 automated test cases. 66% to 94% in 5 iterations. AI bias mitigation techniques.

python ai accounting claude prompt-engineering anthropic llm-evaluation eval-driven-development

Updated May 27, 2026
Python
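
A figure such as "66% to 94%" over a fixed set of automated test cases is typically a pass rate measured after each iteration. The sketch below shows one way such a number could be computed. It is an assumption-laden illustration, not the llm-eval-contable repository's method: `EvalCase`, `pass_rate`, and exact-match grading are hypothetical choices for the example.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class EvalCase:
    """One automated test case (hypothetical shape)."""
    prompt: str
    expected: str


def pass_rate(cases: Iterable[EvalCase], skill: Callable[[str], str]) -> float:
    """Return the fraction of cases whose output matches `expected`.

    Exact string match after stripping whitespace is assumed; a real
    suite might use a rubric or an LLM judge instead.
    """
    case_list = list(cases)
    if not case_list:
        raise ValueError("need at least one eval case")
    passed = sum(1 for c in case_list if skill(c.prompt).strip() == c.expected)
    return passed / len(case_list)
```

Re-running `pass_rate` on the same cases after each prompt or skill revision gives a comparable score per iteration.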
