91 lines
2.5 KiB
Text
--ISSUE
Content-Type: application/sprints
Sprints:

--ISSUE
Content-Type: application/modules
Modules:
- Name: sekft
  Path: .

--ISSUE
Content-Type: application/bugzilla
URL: https://bugs.code.tiararodney.com/rest
Mappings:
- Module: sekft
  Product: sek
  Component: sekft

--ISSUE
Content-Type: application/issue
ID: 1
Type: feature
Title: Package sekft as an installable namespace package
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Turn the flat trainer scripts into an installable tiararodney.sekft
  namespace package: src layout, pyproject with the abstract
  posix-sdc dependency and an optional gpu extra, console scripts, a
  Pipfile pinning posix-sdc as a local editable override, and tox
  environments.

--ISSUE
Content-Type: application/issue
ID: 2
Type: feature
Title: SFT trainer with chat-template render and assistant-only mask
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the supervised fine-tuner: render trajectories through the
  tokenizer's own chat template (matching serving), canonicalise
  turns (fold system, merge consecutive), derive an assistant-only
  loss mask by token-prefix differencing, and train a QLoRA adapter.

--ISSUE
Content-Type: application/issue
ID: 3
Type: feature
Title: Behavioural evaluator
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the behavioural eval: load base plus LoRA adapter, drop it into
  held-out scenarios with no scaffold, drive them through a local
  operator that renders with the model's chat template, and report
  reach/terminate/checker rates.

--ISSUE
Content-Type: application/issue
ID: 4
Type: feature
Title: Resident-base train/eval harness
Status: done
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Add the resident harness that loads the 14GB base once and keeps it
  hot, training fresh LoRA adapters and evaluating them without
  reloading the base, for the slow-OcuLink iterate loop.

--ISSUE
Content-Type: application/issue
ID: 5
Type: feature
Title: Pipeline overview README
Status: open
Priority: medium
Created: 2026-06-16
Module: sekft
Relationships:
Description: Document the sekft pipeline: the trainer, evaluator, and resident
  harness; how they consume the posix-sdc dataset; the render
  contract; and how to run on the GPU box.