sekft

Author	SHA1	Message	Date
Tiara Rodney	b8843557d7	Merge branch 'feature/12' feat(12): load training data from a raw dir, a curated jsonl, or the Hub	2026-06-18 00:05:36 +02:00
Tiara Rodney	15e598bda1	todo(12): done sft.load_turns(data, hub, revision) yields turns from a raw rollout dir (keep-filtered), a curated .jsonl file, or the published corpus via posix-sdc load_trajectories (lazy import; Hub fallback) - verified the hub path yields all 787 trajectories via the editable checkout's data/; sekft-train gains --hub/--revision and dispatches --data by dir-vs-.jsonl; train()+inspect() use it; 9 tests pass (3 new for raw-dir/jsonl/reject); mypy strict clean (5 files). No submodule changes.	2026-06-18 00:05:36 +02:00
Tiara Rodney	414e963825	feat(12): load training data from a raw dir, a curated jsonl, or the Hub iter_keepers read only raw per-trajectory .json -- one of three input shapes. Add load_turns(data, hub, revision) yielding assistant-bearing turns from a raw rollout dir (keep-filtered), a curated .jsonl corpus (one record per line), or the published corpus via posix-sdc's load_trajectories (the in-repo data/ of a checkout, else the Hugging Face Hub). sekft-train gains --hub and --revision and dispatches --data by dir-vs-.jsonl; train() and inspect() use it. Raw-rollout reading stays sekft-local; curated + Hub reuse posix-sdc's loader, imported lazily so the raw/jsonl paths need neither posix-sdc nor huggingface_hub installed. Unit tests cover the raw-dir and jsonl dispatch.	2026-06-18 00:05:27 +02:00
Tiara Rodney	d78a8028d2	todo(12): in-progress sft.load_turns(data, hub, revision) yields turns from a raw rollout dir (keep-filtered), a curated .jsonl file, or the published corpus via posix-sdc load_trajectories (Hub fallback), imported lazily; sekft-train gains --hub and --revision and dispatches --data by dir-vs-.jsonl; train() and inspect() use it; unit tests cover the raw-dir and jsonl paths; existing tests stay green; mypy strict clean.	2026-06-18 00:03:19 +02:00
Tiara Rodney	d47ba8a56e	todo(12): open	2026-06-18 00:03:10 +02:00
Tiara Rodney	2209ade52c	Merge branch 'bugfix/11' bugfix(11): operate_rate must not sum a None	2026-06-17 23:49:27 +02:00
Tiara Rodney	c6939c0a64	todo(11): done bool() wraps the operate_rate predicate in eval.py and resident.py so the sum counts cleanly-operated trajectories without summing None; mypy strict clean across the whole package (5 files); 6 tests pass.	2026-06-17 23:49:27 +02:00
Tiara Rodney	157bb4955d	bugfix(11): operate_rate must not sum a None sum(t.steps > 0 and t.meta.get("clean") for t in rows) yields the right operand of `and` when steps>0, so a trajectory whose meta lacks the "clean" key contributes None and sum() raises TypeError. Wrap the predicate in bool() so it counts trajectories that operated and are clean. Surfaced by mypy once posix-sdc began shipping py.typed (Trajectory is now typed).	2026-06-17 23:49:26 +02:00
Tiara Rodney	637b746d1d	todo(11): in-progress operate_rate in eval.py and resident.py wraps the steps>0 and meta.get('clean') predicate in bool() so the sum counts cleanly-operated trajectories without summing None; mypy strict clean across the whole package (5 files); tests pass.	2026-06-17 23:48:29 +02:00
Tiara Rodney	299b2ce488	todo(11): open	2026-06-17 23:48:21 +02:00
Tiara Rodney	1c890c703f	Merge branch 'feature/10' feat(10): structured logging for the trainer	2026-06-17 23:47:52 +02:00
Tiara Rodney	7673e47002	todo(10): done sekft-train logs run config and each phase (tokenizer/model load, build, training, save) via a sekft.train logger to stderr; -v/--verbose and -q/--quiet added; dataset accounting reports keepers->usable with over-length/empty-mask drop counts and a warning; build loop logs per-trajectory (debug) and every 100 (info); transformers verbosity raised so the per-step curve shows; inspect() logs likewise; 6 tests pass; mypy strict clean on sft.py + _log.py (verified loop logging fires with FakeTok). No submodule changes.	2026-06-17 23:47:52 +02:00
Tiara Rodney	e4e88c5c18	feat(10): structured logging for the trainer The trainer was nearly silent: outside an example count and a save line it printed nothing through tokenizer load, the base-model load, example building, or the training loop, and trajectories dropped for length or empty mask vanished without a trace. Add _log.py (a shared stderr logging setup so stdout stays clean for results) and a module logger. sekft-train gains -v/--verbose and -q/--quiet. Log the run config and each phase; report dataset accounting (keepers -> usable, with counts dropped for over-length and empty-mask and a warning when any are dropped); inside the build loop, a per-trajectory debug line and a progress line every 100; raise transformers' verbosity during training so the per-step curve shows. Prints in train() and inspect() are routed through the logger.	2026-06-17 23:47:42 +02:00
Tiara Rodney	47b84a0dce	todo(10): in-progress sekft-train logs run config and each phase (tokenizer load, model load, training, save) via a module logger to stderr; -v/--verbose and -q/--quiet control level; dataset accounting reports keepers->usable with counts dropped for length and empty-mask and warns when any are dropped; transformers verbosity raised so the per-step curve shows during training; inspect() logs likewise; existing sft tests stay green; mypy strict clean.	2026-06-17 23:43:21 +02:00
Tiara Rodney	814261dc56	todo(10): open	2026-06-17 23:43:08 +02:00
Tiara Rodney	86df915524	Merge branch 'feature/9'	2026-06-17 14:03:55 +02:00
Tiara Rodney	90e0ebbb45	todo(9): done mypy --strict src tests passes (no issues, 6 files); ML/posix-sdc imports ignored via overrides; sft/eval/resident/tests fully annotated; mypy is a declared dev dep; py.typed ships in the package.	2026-06-17 14:03:54 +02:00
Tiara Rodney	64020c321d	test: annotate the sft test helpers	2026-06-17 14:03:52 +02:00
Tiara Rodney	9397280e6f	refactor: annotate the trainer modules under mypy strict	2026-06-17 14:03:52 +02:00
Tiara Rodney	e60495b2ce	chore: set up mypy strict checking and ship py.typed	2026-06-17 14:03:46 +02:00
Tiara Rodney	ee2a729438	todo(9): in-progress mypy --strict src tests passes; ML imports are configured to not error; eval/resident/sft are fully annotated; py.typed ships in the package.	2026-06-17 13:53:09 +02:00
Tiara Rodney	4fa082478c	todo(9): open	2026-06-17 13:53:02 +02:00
Tiara Rodney	e46e12c70b	Merge branch 'feature/8'	2026-06-16 23:49:04 +02:00
Tiara Rodney	26f956f25a	todo(8): done README rewritten for the trainer (sft/eval/resident, posix-sdc dependency, gpu extra, render contract, console-script usage); module docstrings updated to sekft-train/eval/resident and apply_chat_template.	2026-06-16 23:49:04 +02:00
Tiara Rodney	9e487bc2bf	docs: refresh module docstrings for the packaged layout and render contract	2026-06-16 23:49:02 +02:00
Tiara Rodney	a0b1fbc0c1	docs: rewrite README for the packaged trainer	2026-06-16 23:49:01 +02:00
Tiara Rodney	9cdb2bdc97	todo(8): in-progress README describes the trainer/eval/resident, the posix-sdc dependency, install via the gpu extra, the chat-template render contract, and console-script usage; module docstrings reference sekft-train/eval/resident and apply_chat_template, not the old flat scripts.	2026-06-16 23:47:43 +02:00
Tiara Rodney	4646d34d9d	todo(8): open	2026-06-16 23:47:41 +02:00
Tiara Rodney	9731dd250c	Merge branch 'feature/7'	2026-06-16 20:28:20 +02:00
Tiara Rodney	a34ce78dc5	todo(7): done GPL-2.0 LICENSE added with pyproject license-files and GPLv2 classifier; Dockerfile removed (now in posix-sdc docker/alpine-dash).	2026-06-16 20:28:19 +02:00
Tiara Rodney	f471db2cde	chore: remove Dockerfile relocated to posix-sdc	2026-06-16 20:28:18 +02:00
Tiara Rodney	66365f3d5f	chore: add GPL-2.0 license	2026-06-16 20:28:17 +02:00
Tiara Rodney	acaf8dd061	todo(7): in-progress LICENSE holds the GPL-2.0 text; pyproject declares it via license-files and a GPLv2 classifier; the Dockerfile is gone from sekft.	2026-06-16 20:27:51 +02:00
Tiara Rodney	0a03dd0cfa	todo(7): open	2026-06-16 20:27:50 +02:00
Tiara Rodney	d84b02bba3	Merge branch 'feature/6'	2026-06-16 20:15:22 +02:00
Tiara Rodney	95ce275301	todo(6): done 6 tests pass (4 unit render/mask, 2 smoke entry points); mask test covers assistant-only training and the non-additive guard.	2026-06-16 20:15:21 +02:00
Tiara Rodney	0ea77d434f	test: add entry-point smoke tests	2026-06-16 20:15:20 +02:00
Tiara Rodney	4d1ec9e7fa	test: add SFT render and mask unit tests	2026-06-16 20:15:19 +02:00
Tiara Rodney	45120dea97	todo(6): in-progress tests/unit and tests/smoke pass under pytest; the mask test proves assistant-only training and raises on non-additive templates; entry-point smoke tests pass without torch.	2026-06-16 20:15:18 +02:00
Tiara Rodney	9bd61f99a3	todo(6): open	2026-06-16 20:15:16 +02:00
Tiara Rodney	17fade4b3f	Merge branch 'feature/5'	2026-06-16 20:15:14 +02:00
Tiara Rodney	cf130e032e	todo(5): done README documents the pipeline, the posix-sdc dependency, and the box workflow.	2026-06-16 20:15:13 +02:00
Tiara Rodney	ea31b7fceb	docs: add pipeline overview	2026-06-16 20:15:12 +02:00
Tiara Rodney	12f10854c3	todo(5): in-progress README describes the trainer/eval/resident roles, the posix-sdc dependency, the chat-template render contract, and the box workflow.	2026-06-16 20:15:11 +02:00
Tiara Rodney	a24c6c9681	todo(5): open	2026-06-16 20:15:09 +02:00
Tiara Rodney	8726e3c9b4	Merge branch 'feature/4'	2026-06-16 20:14:49 +02:00
Tiara Rodney	fd30ab47ed	todo(4): done Resident base loads once; fit/evaluate cycle adapters without reloading; renders match the trainer.	2026-06-16 20:14:49 +02:00
Tiara Rodney	23e749c878	feat: add resident-base train and eval harness	2026-06-16 20:14:47 +02:00
Tiara Rodney	e810d0e442	todo(4): in-progress Resident loads the base once; fit trains an adapter and unloads it; evaluate attaches an adapter (or the base baseline) and renders via the shared chat-template canonicalisation.	2026-06-16 20:14:46 +02:00
Tiara Rodney	6f5245cda6	todo(4): open	2026-06-16 20:14:44 +02:00

1 2

68 commits