Skip to content

feat: establish robust evaluation framework for workflow benchmarks#457

Merged
cocosheng-g merged 8 commits intomainfrom
feat/eval-framework
Feb 9, 2026
Merged

feat: establish robust evaluation framework for workflow benchmarks#457
cocosheng-g merged 8 commits intomainfrom
feat/eval-framework