sekft

Author	SHA1	Message	Date
Tiara Rodney	e4e88c5c18	feat(10): structured logging for the trainer The trainer was nearly silent: outside an example count and a save line it printed nothing through tokenizer load, the base-model load, example building, or the training loop, and trajectories dropped for length or empty mask vanished without a trace. Add _log.py (a shared stderr logging setup so stdout stays clean for results) and a module logger. sekft-train gains -v/--verbose and -q/--quiet. Log the run config and each phase; report dataset accounting (keepers -> usable, with counts dropped for over-length and empty-mask and a warning when any are dropped); inside the build loop, a per-trajectory debug line and a progress line every 100; raise transformers' verbosity during training so the per-step curve shows. Prints in train() and inspect() are routed through the logger.	2026-06-17 23:47:42 +02:00
Tiara Rodney	47b84a0dce	todo(10): in-progress sekft-train logs run config and each phase (tokenizer load, model load, training, save) via a module logger to stderr; -v/--verbose and -q/--quiet control level; dataset accounting reports keepers->usable with counts dropped for length and empty-mask and warns when any are dropped; transformers verbosity raised so the per-step curve shows during training; inspect() logs likewise; existing sft tests stay green; mypy strict clean.	2026-06-17 23:43:21 +02:00
Tiara Rodney	814261dc56	todo(10): open	2026-06-17 23:43:08 +02:00
Tiara Rodney	86df915524	Merge branch 'feature/9'	2026-06-17 14:03:55 +02:00
Tiara Rodney	90e0ebbb45	todo(9): done mypy --strict src tests passes (no issues, 6 files); ML/posix-sdc imports ignored via overrides; sft/eval/resident/tests fully annotated; mypy is a declared dev dep; py.typed ships in the package.	2026-06-17 14:03:54 +02:00
Tiara Rodney	64020c321d	test: annotate the sft test helpers	2026-06-17 14:03:52 +02:00
Tiara Rodney	9397280e6f	refactor: annotate the trainer modules under mypy strict	2026-06-17 14:03:52 +02:00
Tiara Rodney	e60495b2ce	chore: set up mypy strict checking and ship py.typed	2026-06-17 14:03:46 +02:00
Tiara Rodney	ee2a729438	todo(9): in-progress mypy --strict src tests passes; ML imports are configured to not error; eval/resident/sft are fully annotated; py.typed ships in the package.	2026-06-17 13:53:09 +02:00
Tiara Rodney	4fa082478c	todo(9): open	2026-06-17 13:53:02 +02:00
Tiara Rodney	e46e12c70b	Merge branch 'feature/8'	2026-06-16 23:49:04 +02:00
Tiara Rodney	26f956f25a	todo(8): done README rewritten for the trainer (sft/eval/resident, posix-sdc dependency, gpu extra, render contract, console-script usage); module docstrings updated to sekft-train/eval/resident and apply_chat_template.	2026-06-16 23:49:04 +02:00
Tiara Rodney	9e487bc2bf	docs: refresh module docstrings for the packaged layout and render contract	2026-06-16 23:49:02 +02:00
Tiara Rodney	a0b1fbc0c1	docs: rewrite README for the packaged trainer	2026-06-16 23:49:01 +02:00
Tiara Rodney	9cdb2bdc97	todo(8): in-progress README describes the trainer/eval/resident, the posix-sdc dependency, install via the gpu extra, the chat-template render contract, and console-script usage; module docstrings reference sekft-train/eval/resident and apply_chat_template, not the old flat scripts.	2026-06-16 23:47:43 +02:00
Tiara Rodney	4646d34d9d	todo(8): open	2026-06-16 23:47:41 +02:00
Tiara Rodney	9731dd250c	Merge branch 'feature/7'	2026-06-16 20:28:20 +02:00
Tiara Rodney	a34ce78dc5	todo(7): done GPL-2.0 LICENSE added with pyproject license-files and GPLv2 classifier; Dockerfile removed (now in posix-sdc docker/alpine-dash).	2026-06-16 20:28:19 +02:00
Tiara Rodney	f471db2cde	chore: remove Dockerfile relocated to posix-sdc	2026-06-16 20:28:18 +02:00
Tiara Rodney	66365f3d5f	chore: add GPL-2.0 license	2026-06-16 20:28:17 +02:00
Tiara Rodney	acaf8dd061	todo(7): in-progress LICENSE holds the GPL-2.0 text; pyproject declares it via license-files and a GPLv2 classifier; the Dockerfile is gone from sekft.	2026-06-16 20:27:51 +02:00
Tiara Rodney	0a03dd0cfa	todo(7): open	2026-06-16 20:27:50 +02:00
Tiara Rodney	d84b02bba3	Merge branch 'feature/6'	2026-06-16 20:15:22 +02:00
Tiara Rodney	95ce275301	todo(6): done 6 tests pass (4 unit render/mask, 2 smoke entry points); mask test covers assistant-only training and the non-additive guard.	2026-06-16 20:15:21 +02:00
Tiara Rodney	0ea77d434f	test: add entry-point smoke tests	2026-06-16 20:15:20 +02:00
Tiara Rodney	4d1ec9e7fa	test: add SFT render and mask unit tests	2026-06-16 20:15:19 +02:00
Tiara Rodney	45120dea97	todo(6): in-progress tests/unit and tests/smoke pass under pytest; the mask test proves assistant-only training and raises on non-additive templates; entry-point smoke tests pass without torch.	2026-06-16 20:15:18 +02:00
Tiara Rodney	9bd61f99a3	todo(6): open	2026-06-16 20:15:16 +02:00
Tiara Rodney	17fade4b3f	Merge branch 'feature/5'	2026-06-16 20:15:14 +02:00
Tiara Rodney	cf130e032e	todo(5): done README documents the pipeline, the posix-sdc dependency, and the box workflow.	2026-06-16 20:15:13 +02:00
Tiara Rodney	ea31b7fceb	docs: add pipeline overview	2026-06-16 20:15:12 +02:00
Tiara Rodney	12f10854c3	todo(5): in-progress README describes the trainer/eval/resident roles, the posix-sdc dependency, the chat-template render contract, and the box workflow.	2026-06-16 20:15:11 +02:00
Tiara Rodney	a24c6c9681	todo(5): open	2026-06-16 20:15:09 +02:00
Tiara Rodney	8726e3c9b4	Merge branch 'feature/4'	2026-06-16 20:14:49 +02:00
Tiara Rodney	fd30ab47ed	todo(4): done Resident base loads once; fit/evaluate cycle adapters without reloading; renders match the trainer.	2026-06-16 20:14:49 +02:00
Tiara Rodney	23e749c878	feat: add resident-base train and eval harness	2026-06-16 20:14:47 +02:00
Tiara Rodney	e810d0e442	todo(4): in-progress Resident loads the base once; fit trains an adapter and unloads it; evaluate attaches an adapter (or the base baseline) and renders via the shared chat-template canonicalisation.	2026-06-16 20:14:46 +02:00
Tiara Rodney	6f5245cda6	todo(4): open	2026-06-16 20:14:44 +02:00
Tiara Rodney	a374e3a69c	Merge branch 'feature/3'	2026-06-16 20:14:41 +02:00
Tiara Rodney	0dc0380537	todo(3): done Evaluator drives held-out scenarios with a chat-template local operator and reports the behavioural rates.	2026-06-16 20:14:41 +02:00
Tiara Rodney	121d79c89b	feat: add behavioural evaluator	2026-06-16 20:14:39 +02:00
Tiara Rodney	7ee0d63522	todo(3): in-progress eval reuses the posix-sdc rollout with a local operator; renders via apply_chat_template + normalize_for_template; reports command-mode, terminate, and verified rates over held-out scenarios.	2026-06-16 20:14:38 +02:00
Tiara Rodney	1d672ff8d9	todo(3): open	2026-06-16 20:14:35 +02:00
Tiara Rodney	908f59f5a0	Merge branch 'feature/2'	2026-06-16 20:14:16 +02:00
Tiara Rodney	4e45bacf0b	todo(2): done Trainer renders with the model's chat template, canonicalises turns, masks to assistant-only by token-prefix differencing, and trains a QLoRA adapter.	2026-06-16 20:14:15 +02:00
Tiara Rodney	0bb9c1983b	feat: add SFT trainer with chat-template render and assistant-only mask	2026-06-16 20:14:12 +02:00
Tiara Rodney	4533a04021	todo(2): in-progress sft renders via apply_chat_template; normalize_for_template folds system and merges consecutive turns; the loss mask trains assistant turns only and raises on a non-additive template; --inspect reports mask stats.	2026-06-16 20:14:11 +02:00
Tiara Rodney	fab16d175d	todo(2): open	2026-06-16 20:14:09 +02:00
Tiara Rodney	703cb863ec	Merge branch 'feature/1'	2026-06-16 20:13:52 +02:00
Tiara Rodney	046683a371	todo(1): done Namespace package builds; gpu extra isolates torch/transformers/peft/datasets; posix-sdc declared in pyproject and overridden editable in Pipfile; console scripts wired; tox envs defined.	2026-06-16 20:13:51 +02:00

1 2

56 commits