todo(2): open
This commit is contained in:
parent
703cb863ec
commit
fab16d175d
1 changed files with 15 additions and 0 deletions
15
TODO
15
TODO
|
|
@ -31,3 +31,18 @@ Description: Turn the flat trainer scripts into an installable tiararodney.sekft
|
||||||
posix-sdc dependency and an optional gpu extra, console scripts, a
|
posix-sdc dependency and an optional gpu extra, console scripts, a
|
||||||
Pipfile pinning posix-sdc as a local editable override, and tox
|
Pipfile pinning posix-sdc as a local editable override, and tox
|
||||||
environments.
|
environments.
|
||||||
|
|
||||||
|
--ISSUE
|
||||||
|
Content-Type: application/issue
|
||||||
|
ID: 2
|
||||||
|
Type: feature
|
||||||
|
Title: SFT trainer with chat-template render and assistant-only mask
|
||||||
|
Status: open
|
||||||
|
Priority: medium
|
||||||
|
Created: 2026-06-16
|
||||||
|
Module: sekft
|
||||||
|
Relationships:
|
||||||
|
Description: Add the supervised fine-tuner: render trajectories through the
|
||||||
|
tokenizer's own chat template (matching serving), canonicalise
|
||||||
|
turns (fold system, merge consecutive), derive an assistant-only
|
||||||
|
loss mask by token-prefix differencing, and train a QLoRA adapter.
|
||||||
|
|
|
||||||
Loading…
Add table
Add a link
Reference in a new issue