todo(2): open

This commit is contained in:
Tiara Rodney 2026-06-16 20:14:09 +02:00
parent 703cb863ec
commit fab16d175d
Signed by: tiara
GPG key ID: 5CD8EC1D46106723

15
TODO
View file

@ -31,3 +31,18 @@ Description: Turn the flat trainer scripts into an installable tiararodney.sekft
posix-sdc dependency and an optional gpu extra, console scripts, a posix-sdc dependency and an optional gpu extra, console scripts, a
Pipfile pinning posix-sdc as a local editable override, and tox Pipfile pinning posix-sdc as a local editable override, and tox
environments. environments.
--ISSUE
Content-Type: application/issue
ID: 2
Type: feature
Title: SFT trainer with chat-template render and assistant-only mask
Status: open
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the supervised fine-tuner: render trajectories through the
tokenizer's own chat template (matching serving), canonicalise
turns (fold system, merge consecutive), derive an assistant-only
loss mask by token-prefix differencing, and train a QLoRA adapter.