sekft/TODO
Tiara Rodney 12f10854c3
todo(5): in-progress
README describes the trainer/eval/resident roles, the posix-sdc dependency, the chat-template render contract, and the box workflow.
2026-06-16 20:15:11 +02:00

91 lines
2.5 KiB
Text

--ISSUE
Content-Type: application/sprints
Sprints:
--ISSUE
Content-Type: application/modules
Modules:
- Name: sekft
Path: .
--ISSUE
Content-Type: application/bugzilla
URL: https://bugs.code.tiararodney.com/rest
Mappings:
- Module: sekft
Product: sek
Component: sekft
--ISSUE
Content-Type: application/issue
ID: 1
Type: feature
Title: Package sekft as an installable namespace package
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Turn the flat trainer scripts into an installable tiararodney.sekft
namespace package: src layout, pyproject with the abstract
posix-sdc dependency and an optional gpu extra, console scripts, a
Pipfile pinning posix-sdc as a local editable override, and tox
environments.
--ISSUE
Content-Type: application/issue
ID: 2
Type: feature
Title: SFT trainer with chat-template render and assistant-only mask
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the supervised fine-tuner: render trajectories through the
tokenizer's own chat template (matching serving), canonicalise
turns (fold system, merge consecutive), derive an assistant-only
loss mask by token-prefix differencing, and train a QLoRA adapter.
--ISSUE
Content-Type: application/issue
ID: 3
Type: feature
Title: Behavioural evaluator
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the behavioural eval: load base plus LoRA adapter, drop it into
held-out scenarios with no scaffold, drive them through a local
operator that renders with the model's chat template, and report
reach/terminate/checker rates.
--ISSUE
Content-Type: application/issue
ID: 4
Type: feature
Title: Resident-base train/eval harness
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the resident harness that loads the 14GB base once and keeps it
hot, training fresh LoRA adapters and evaluating them without
reloading the base, for the slow-OcuLink iterate loop.
--ISSUE
Content-Type: application/issue
ID: 5
Type: feature
Title: Pipeline overview README
Status: in-progress
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Document the sekft pipeline: the trainer, evaluator, and resident
harness; how they consume the posix-sdc dataset; the render
contract; and how to run on the GPU box.