No description
sft renders via apply_chat_template; normalize_for_template folds system and merges consecutive turns; the loss mask trains assistant turns only and raises on a non-additive template; --inspect reports mask stats. |
||
|---|---|---|
| src/tiararodney/sekft | ||
| .gitignore | ||
| Dockerfile | ||
| Pipfile | ||
| pyproject.toml | ||
| TODO | ||
| tox.ini | ||