feat(eval): public benchmark harness (PaperAudit, LimitGen, CLAIMCHECK, MMReview, PeerQA)
April 17, 2026 ยท #153
Python
Difficulty: Medium
Labels
enhancement area: pipeline priority: high
Parent Repository
Davidvandijcke/coarse
Python repository
124 21