Tail contracts work on scored research rows; serving-slot drainage is now the blocker.
v0.62 was designed to generalize the v0.61 tail-output contract across research, story, agent, relationship, and psychology memory domains. The live run produced clean research-domain quality evidence, then stopped on repeated single-flight backend-busy responses before the remaining domains could be scored.
Institutional Finding
The memory/control-plane path can preserve current-fact identity, entity binding, memory-key binding, rejected-key recall, stale exclusion, and foreign exclusion under 1024 and 2048 pressure for the research-update slice when the output contract is placed at the tail.
- Both tail variants passed 1024 and 2048 pressure for research update precedence.
- The 2048 rows took about 195-196 seconds each, which is inside the 300s harness wall but long enough to expose serving-slot lifecycle problems afterward.
- The next optimization is serving-slot drainage/backpressure, not recall prompt quality for this scored slice.
Do Not Overclaim
v0.62 did not score story, agent, relationship, or psychology rows live. Those domains were present in the planned matrix and dry-run, but the live run stopped before their quality evidence due to repeated 503 backend-busy responses.
- Deck-safe: scored research rows passed under pressure.
- Not deck-safe yet: cross-domain generalization passed live.
- Engineering action: add post-long-row slot-idle drain checks before moving to the next row.
Run Table
| Row | Status | Elapsed | Attempts | Message |
|---|---|---|---|---|
v062-research_update_precedence-controller_expands_tail_contract-p1024 | 200 | 77.29s | 1 | {"choices":[{"finish_reason":"stop","index":0,"message":{"role":"assistant","content":"{\"answer\": \"The controller should combine selected-current payloads with controller-owned metadata, pressure recall must preserve current facts before adding more envelope formatting, and rejected handles should be filled by the controller.\", \"current_fact_ids\": [\"C1\", \"C2\", \"C3\"], \"confidence\": 1.0}"}}],"created":1781074633,"model":"ibm-granite_granite-4.0-micro-Q4_K_M.gguf","system_fingerprint":"b2122-5e335ba11","object":"chat.completion","usage":{"completion_tokens":64,"prompt_tokens":41472,"total_tokens":41536},"id":"chatcmpl-lYEcZ49pji0dt6lDU5XGV4W21W8mNdLZ","timings":{"cache_n":3,"prompt_n":41469,"prompt_ms":76471.733,"prompt_per_token_ms":1.8440698594130553,"prompt_per_second":542.2788051632098,"predicted_n":64,"predicted_ms":358.055,"predicted_per_token_ms":5.594609375,"predicted_per_second":178.74348912876513}} |
v062-research_update_precedence-controller_expands_tail_contract-p2048 | 200 | 194.595s | 1 | {"choices":[{"finish_reason":"stop","index":0,"message":{"role":"assistant","content":"{\"answer\": \"The selected current memory payload combines the selected-current payloads with controller-owned metadata, preserving current facts before adding more envelope formatting. Rejected handles are filled by the controller.\", \"current_fact_ids\": [\"C1\", \"C2\", \"C3\"], \"confidence\": 1.0}"}}],"created":1781074837,"model":"ibm-granite_granite-4.0-micro-Q4_K_M.gguf","system_fingerprint":"b2122-5e335ba11","object":"chat.completion","usage":{"completion_tokens":63,"prompt_tokens":82477,"total_tokens":82540},"id":"chatcmpl-Yh44qEgppXqelCMmUFtPT3q0bXZaEo0d","timings":{"cache_n":0,"prompt_n":82477,"prompt_ms":192931.749,"prompt_per_token_ms":2.339218800392837,"prompt_per_second":427.49314422065385,"predicted_n":63,"predicted_ms":351.811,"predicted_per_token_ms":5.584301587301587,"predicted_per_second":179.0734229458431}} |
v062-research_update_precedence-controller_expands_tail_schema_example-p1024 | 200 | 79.334s | 1 | {"choices":[{"finish_reason":"stop","index":0,"message":{"role":"assistant","content":"{\"answer\":\"The controller should combine the selected-current payloads with its own metadata, preserve the current facts before adding more envelope formatting, and fill rejected handles with the controller's own data.\",\"current_fact_ids\":[\"C1\",\"C2\",\"C3\"],\"confidence\":\"high\"}"}}],"created":1781074927,"model":"ibm-granite_granite-4.0-micro-Q4_K_M.gguf","system_fingerprint":"b2122-5e335ba11","object":"chat.completion","usage":{"completion_tokens":55,"prompt_tokens":41504,"total_tokens":41559},"id":"chatcmpl-8oD8uRrgTcn048y4uuF5y4eZzJYNwQ8m","timings":{"cache_n":0,"prompt_n":41504,"prompt_ms":77686.198,"prompt_per_token_ms":1.8717761661526602,"prompt_per_second":534.2519143490584,"predicted_n":55,"predicted_ms":307.851,"predicted_per_token_ms":5.597290909090909,"predicted_per_second":178.65785721014387}} |
v062-research_update_precedence-controller_expands_tail_schema_example-p2048 | 200 | 196.426s | 1 | {"choices":[{"finish_reason":"stop","index":0,"message":{"role":"assistant","content":"{\"answer\":\"The controller should combine selected-current payloads with its own metadata, preserve current facts before adding more envelope formatting, and fill rejected handles with controller-owned data.\",\"current_fact_ids\":[\"C1\",\"C2\",\"C3\"],\"confidence\":\"high\"}"}}],"created":1781075133,"model":"ibm-granite_granite-4.0-micro-Q4_K_M.gguf","system_fingerprint":"b2122-5e335ba11","object":"chat.completion","usage":{"completion_tokens":51,"prompt_tokens":82509,"total_tokens":82560},"id":"chatcmpl-UaYsM3rhIEVidPHqXXjbUNtFOzz25vp4","timings":{"cache_n":0,"prompt_n":82509,"prompt_ms":194601.955,"prompt_per_token_ms":2.358554278927147,"prompt_per_second":423.9885462610075,"predicted_n":51,"predicted_ms":286.045,"predicted_per_token_ms":5.608725490196079,"predicted_per_second":178.29362512891328}} |
v062-story_canon_supersession-controller_expands_tail_contract-p2048 | 503 | 0.492s | 4 | Backend busy: 1 request(s) already in flight (limit: 1). Try again in 10s. |
v062-story_canon_supersession-controller_expands_tail_schema_example-p2048 | 503 | 0.559s | 4 | Backend busy: 1 request(s) already in flight (limit: 1). Try again in 10s. |
v062-agent_loop_directive-controller_expands_tail_contract-p2048 | 503 | 0.559s | 4 | Backend busy: 1 request(s) already in flight (limit: 1). Try again in 10s. |
Scored Metrics
- Parse success: 100.0%
- Current-fact ID recall: 100.0%
- Entity ID exactness: 100.0%
- Memory-key exactness: 100.0%
- Rejected-key recall: 100.0%
- Stale absence: 100.0%
- Foreign absence: 100.0%
Next Harness Requirement
v0.62 shows that a successful 2048 row can leave the single inference slot unavailable to the immediate next row. The next runner should add a post-row chat-slot idle probe and cooldown loop after any row exceeding 120 seconds or any 2048-pressure row.
- Current preflight idle probe is necessary but not sufficient.
- Fast 503 after long rows should be classified as slot-drainage/backpressure.
- Do not retry all remaining rows blindly; resume from the first unscored domain after slot-idle verification.
Data Trace
Every claim on this page points to a local artifact. These are the handles a CTO or future agent should use to audit the run.
| Evidence | Path | Use |
|---|---|---|
| Live scores | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/scores.json | Aggregate v0.62 result and stop reason. |
| Run manifest | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/run-manifest.json | Endpoint, dry-run flag, and run metadata. |
| Research 1024 tail-contract attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-research_update_precedence-controller_expands_tail_contract-p1024-attempts.json | 77.29s successful scored row. |
| Research 2048 tail-contract attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-research_update_precedence-controller_expands_tail_contract-p2048-attempts.json | 194.59s successful scored row. |
| Research 1024 schema-example attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-research_update_precedence-controller_expands_tail_schema_example-p1024-attempts.json | 79.33s successful scored row. |
| Research 2048 schema-example attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-research_update_precedence-controller_expands_tail_schema_example-p2048-attempts.json | 196.43s successful scored row. |
| Story tail-contract attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-story_canon_supersession-controller_expands_tail_contract-p2048-attempts.json | Fast 503 backend-busy after the long research rows. |
| Agent-loop tail-contract attempts | research/tracks/hypernym-infinite-mim/results/v0.62-tail-contract-cross-domain-pressure/20260610T_tail_contract_cross_domain_pressure_live_codex_v1/v062-agent_loop_directive-controller_expands_tail_contract-p2048-attempts.json | Final stop condition: repeated backend-busy. |
| v0.62 strategy | research/tracks/hypernym-infinite-mim/v0.62-tail-contract-cross-domain-pressure-strategy.md | Hypothesis and success criteria. |
| v0.62 fixed runner | research/tracks/hypernym-infinite-mim/run_v0_62_tail_contract_cross_domain_pressure.py | Harness implementation after recursion repair. |
Compound Research Chain
| Prior / Related Artifact | Pointer |
|---|---|
| Current public board | https://hypernym-infinite-memory-v09.pages.dev/ |
| Previous immutable v0.61 board | https://b8f30d9c.hypernym-infinite-memory-v09.pages.dev/ |
| Post-v0.61 CXDB snapshot | .forge/artifacts/cxdb-hypernym-infinite-mim-post-v061-snapshot-20260610T061900Z.md |
| System-reset handoff | .forge/artifacts/cxdb-hypernym-infinite-mim-system-reset-handoff-20260610T063034Z.md |
| Portable CXDB export | .forge/artifacts/cxdb-hypernym-infinite-mim-full-export-20260610T063146Z.json |
| Compound visualization standard | research/tracks/hypernym-infinite-mim/compound-research-visualization-standard.md |
One-Sentence CTO Frame
Hypernym Infinite Memory is a memory control plane for model fleets: per-user memory stores, controller-curated recall, exact provenance handles, and lower long-memory serving cost for small local models, with v0.62 showing that serving-slot lifecycle is now the constraint to engineer around.