todo(2): open
This commit is contained in:
parent
703cb863ec
commit
fab16d175d
1 changed files with 15 additions and 0 deletions
15
TODO
15
TODO
|
|
@ -31,3 +31,18 @@ Description: Turn the flat trainer scripts into an installable tiararodney.sekft
|
|||
posix-sdc dependency and an optional gpu extra, console scripts, a
|
||||
Pipfile pinning posix-sdc as a local editable override, and tox
|
||||
environments.
|
||||
|
||||
--ISSUE
|
||||
Content-Type: application/issue
|
||||
ID: 2
|
||||
Type: feature
|
||||
Title: SFT trainer with chat-template render and assistant-only mask
|
||||
Status: open
|
||||
Priority: medium
|
||||
Created: 2026-06-16
|
||||
Module: sekft
|
||||
Relationships:
|
||||
Description: Add the supervised fine-tuner: render trajectories through the
|
||||
tokenizer's own chat template (matching serving), canonicalise
|
||||
turns (fold system, merge consecutive), derive an assistant-only
|
||||
loss mask by token-prefix differencing, and train a QLoRA adapter.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue