todo(3): in-progress

eval reuses the posix-sdc rollout with a local operator; renders via apply_chat_template + normalize_for_template; reports command-mode, terminate, and verified rates over held-out scenarios.
This commit is contained in:
Tiara Rodney 2026-06-16 20:14:38 +02:00
parent 1d672ff8d9
commit 7ee0d63522
Signed by: tiara
GPG key ID: 5CD8EC1D46106723

2
TODO
View file

@ -52,7 +52,7 @@ Content-Type: application/issue
ID: 3 ID: 3
Type: feature Type: feature
Title: Behavioural evaluator Title: Behavioural evaluator
Status: open Status: in-progress
Priority: medium Priority: medium
Created: 2026-06-16 Created: 2026-06-16
Module: sekft Module: sekft