CyberRanger/training_data/README.md
ranger d770ca89d5 Add README to every folder — guided tour for reviewers
Each folder now explains what's inside, why it matters,
and what to look at first. Teacher-friendly navigation.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-21 18:18:14 +01:00

# Training Data — QLoRA Fine-Tuning Datasets
This folder contains 30 files: the training datasets and strategy documents used across the CyberRanger version lineage.
## What's Here
- **22 JSON files** — Version-specific training data (V6–V22), each containing paired attack/refusal examples
- **1 JSONL file** — Caring awareness training data
- **7 Markdown files** — Training strategy documents (Seven Pillars, Caring Patterns, System Prompt additions)
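
The inventory above can be checked with a short script. This is a minimal sketch, not part of the repository: the helper names `summarize_folder` and `count_records` are hypothetical, and the on-disk schema is an assumption (each `.json` file is taken to hold a single top-level list of examples, and the `.jsonl` file one JSON object per line). Adjust it if the actual files are structured differently.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_folder(folder):
    """Count files by extension, skipping sub-directories."""
    return Counter(
        p.suffix.lower() for p in Path(folder).iterdir() if p.is_file()
    )


def count_records(path):
    """Count examples in a .json or .jsonl file.

    Assumption (not confirmed by this README): a .json file holds one
    top-level list of examples; a .jsonl file holds one JSON object
    per non-empty line.
    """
    path = Path(path)
    text = path.read_text(encoding="utf-8")
    if path.suffix.lower() == ".jsonl":
        return sum(1 for line in text.splitlines() if line.strip())
    data = json.loads(text)
    if not isinstance(data, list):
        raise ValueError(f"{path.name}: expected a top-level list")
    return len(data)
```

Calling `summarize_folder("CyberRanger/training_data")` from the repository root returns a per-extension tally to compare against the counts listed above. Whether this README counts toward the Markdown total is not stated here, so treat an off-by-one on `.md` with that in mind.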
## Training Data Evolution
| Version Range | Dataset Size | Key Change |
|---------------|-------------|------------|
| V6–V9 | ~500 pairs | Early identity training |
| V10–V15 | ~1,000 pairs | Bicameral/hive/fractal architectures |
| V16–V19 | ~2,000 pairs | Nervous system + sentinel training |
| V20–V22 | ~5,000 pairs | Complete mind + refined responses |
The V42 Gold training dataset (~10,000 pairs with Claude Haiku gold-standard refusals) is published on HuggingFace, not stored here.