todo(10): open
This commit is contained in:
parent
86df915524
commit
814261dc56
1 changed files with 23 additions and 0 deletions
23
TODO
23
TODO
|
|
@ -149,3 +149,26 @@ Description: Make the lint env honestly pass: add mypy as a dev dependency,
|
||||||
ignore_missing_imports for the ML libs, fully annotate
|
ignore_missing_imports for the ML libs, fully annotate
|
||||||
eval/resident/sft (including the inner operator callables), and
|
eval/resident/sft (including the inner operator callables), and
|
||||||
ship a py.typed marker so the Typing::Typed claim is real.
|
ship a py.typed marker so the Typing::Typed claim is real.
|
||||||
|
|
||||||
|
--ISSUE
|
||||||
|
Content-Type: application/issue
|
||||||
|
ID: 10
|
||||||
|
Type: feature
|
||||||
|
Title: structured logging for the trainer (sft)
|
||||||
|
Status: open
|
||||||
|
Priority: medium
|
||||||
|
Created: 2026-06-17
|
||||||
|
Module: sekft
|
||||||
|
Relationships:
|
||||||
|
Description: The trainer is nearly silent: outside an example count and a save
|
||||||
|
line it prints nothing through tokenizer load, the ~14GB base-model
|
||||||
|
load, example building, and the whole training loop, and
|
||||||
|
trajectories dropped for exceeding --max-len or having an empty
|
||||||
|
loss mask vanish without a trace. Add a small shared logging setup
|
||||||
|
(_log.py, stderr so stdout stays clean for results) and a module
|
||||||
|
logger; give sekft-train -v/--verbose and -q/--quiet. Log the run
|
||||||
|
config and each phase, report dataset accounting (keepers ->
|
||||||
|
usable, with counts dropped for length / empty-mask and a warning
|
||||||
|
when any are dropped), and raise transformers' verbosity during
|
||||||
|
training so the per-step curve shows. Apply to train() and
|
||||||
|
inspect().
|
||||||
|
|
|
||||||
Loading…
Add table
Add a link
Reference in a new issue