Included
- Curated supervised fine-tuning JSONL records where available in the packaged build.
- Training schema reference.
- Benchmark scripts.
- Architecture notes for the training context.
The Vault is a starter product for people who want inspectable SFT data and benchmark references without pulling the whole monorepo into their first experiment.
Inspect one JSONL file, confirm it matches the schema, then run or review one benchmark script before training anything. The goal is to understand the data path before spending compute.