Starter
Local-first evaluation for solo builders
$80 /yr
$6.67/month equivalent
Build datasets, define rubrics, and evaluate model behavior on your own machine.
2 Projects
5 Rubrics / project
50 Items / dataset
25 Runs / month
- Local-first evaluation workspace
- Dataset import (CSV / JSON)
- Rubric builder
- Rule-based scoring
- JSON schema validation
- Run history
- Run comparison
- Export results