docs: resolve open questions, recompute ambiguity ~10%→~5% (v2.3)

Fold post-plan decisions into the spec and consensus plan:
- Q1 deploy HW: undecided/mixed → delegate to hardware-adaptive auto-sizing
- Q2 model strategy: collapse to single turbo model if P1 bench entity ≥95%
- Q3 cancellation: cooperative (segment-boundary) is sufficient; no hard-kill
- Q4 concurrency N: delegate to boot-time auto-sizing (AC-8 = ≤5s within auto N)

Recompute clarity with the deep-interview model (Goal 0.96 / Constraint 0.95 / Success 0.95 → Total 0.954): ambiguity ~10% → ~5%. Residual is now entirely measurement/code-gated (AC-4 R-WER baseline, hybrid→single confirmation, CT2 GIL); the next lever is the P1 bench, not further interview.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-07 11:07:36 +09:00
parent fbe13dddcc
commit 84faa121fe
2 changed files with 33 additions and 8 deletions
@@ -1,6 +1,6 @@
 # Consensus Implementation Plan: luke_scribe, a local STT transcription API

- **Status:** `pending approval` (consensus **v2.2**: v2.1 consensus + CCG external review (Codex/Gemini) incorporated; §3.6 capability tiers, §3.10 provisioning/WS/shared store/Colab)
+- **Status:** `pending approval` (consensus **v2.3**: v2.2 + §11 Open Q follow-up decisions, ambiguity recomputed to ~5% (2026-06-07); v2.1 consensus + CCG external review (Codex/Gemini) incorporated; §3.6 capability tiers, §3.10 provisioning/WS/shared store/Colab)
 - **Mode:** `--consensus --direct --deliberate`
 - **Source spec:** `.omc/specs/deep-interview-luke-scribe-stt-api.md` (ambiguity ~10%, PASSED)
 - **Project:** greenfield `/root/luke_scribe`
@@ -220,10 +220,11 @@ luke_scribe/

 ---

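-## 11. Open Questions (user confirmation recommended)
-1. Does the actual deployment need **multi-GPU workers**, or is it mostly a single T4/Colab? (If the latter, the worker-count formula carries less risk.)
-2. If turbo's KO entity preservation rate is ≥95% in the P1 bench, are you willing to **simplify to a single model** (saving VRAM and complexity)?
-3. Is **cooperative cancellation (at segment boundaries)** sufficient, or is an immediate hard-kill required?
+## 11. Open Questions → Follow-up Decisions Applied (RESOLVED · 2026-06-07)
+1. **Deployment target HW:** **confirmed as undecided/mixed** → no single target is pinned; the choice is delegated to the §3.6 hardware-adaptive auto-sizing. The worker-count formula and the shared store are decided at runtime from boot-time measurements (consistent with design principle #1). *(Remaining verification: specific deployments are checked with P1 `detect`/`bench`.)*
+2. **Model strategy rule (confirmed):** if turbo's entity preservation rate in the P1 bench is **≥95% (and R-WER ≤ baseline), batch is also unified on the single turbo model** (default direction = simplification; saves VRAM and complexity, avoids the T2 swap). Hybrid (batch=large-v3) is kept only if the gate is missed. → The architecture branch is fixed as a *measurement-based decision rule* (branch ambiguity removed).
+3. **Cancellation semantics (confirmed):** **cooperative cancellation (at segment boundaries) is sufficient**. The hard-kill path is not adopted (follow-up if needed). §3.7a stays as is.
+4. **Concurrency cap N (confirmed):** **delegated to auto-sizing** → the N in AC-8 is computed automatically from the hardware at boot (no fixed target). AC-8 = "≤5s within auto-sized N".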
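
The model strategy rule in item 2 can be read as a small pure function over the P1 bench output. The following is a minimal sketch of that reading, not code from the plan: `BenchResult`, `choose_batch_model`, and the field names are hypothetical, and it assumes the entity preservation rate is expressed as a fraction in [0, 1] and that a lower R-WER is better.

```python
from dataclasses import dataclass

# Gate value taken from item 2 of the plan (entity preservation >= 95%).
ENTITY_GATE = 0.95


@dataclass(frozen=True)
class BenchResult:
    """Hypothetical container for the P1 bench numbers of the turbo model."""
    entity_preservation: float  # fraction in [0, 1] (assumed representation)
    r_wer: float                # measured R-WER for turbo
    r_wer_baseline: float       # R-WER baseline (still to be measured in P1)


def choose_batch_model(bench: BenchResult) -> str:
    """Return the model to use for batch jobs under the item 2 rule.

    Both conditions must hold to unify on turbo; otherwise the hybrid
    setup (batch=large-v3) is kept.
    """
    passes_gate = (
        bench.entity_preservation >= ENTITY_GATE
        and bench.r_wer <= bench.r_wer_baseline
    )
    return "turbo" if passes_gate else "large-v3"
```

Keeping the rule as a function of measured values only matches the intent of item 2: the architecture branch is decided by the bench, not by further discussion.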

 ---

@@ -263,4 +264,11 @@ luke_scribe/
 - Source: `.omc/artifacts/ask/{codex,gemini}-20260603-095739.md`. Not adopted (deferred): webhook, Idempotency-Key, pagination, `410`.

 ---
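-*Consensus v2.2: `pending approval`. Execution (team/ralph/autopilot) proceeds only with the user's separate, explicit approval. Before approval, there is no source modification, commit, or execution-skill invocation.*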
+
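+## 15. v2.3 Changelog: Follow-up Decisions Applied + Ambiguity Recomputed (2026-06-07)
+- **Open Q1~Q4 confirmed (§11):** deployment HW = adaptive delegation (undecided/mixed), model = single-model unification rule (P1 bench gate), cancellation = cooperative, concurrency N = delegated to auto-sizing.
+- **Ambiguity recomputed (deep-interview model, combined):** ~10% (interview) → ~6% (plan v2.2) → **~5% (follow-up decisions)**. Goal 0.96×0.40 + Constraint 0.95×0.30 + Success 0.95×0.30 = Total Clarity ≈**0.954** → Ambiguity ≈**0.046**.
+- **Residual ambiguity (all known-unknowns that close only at measurement or code time):** ① AC-4 R-WER baseline (P1 bench), ② final hybrid→single confirmation (P1 bench result), ③ CT2 GIL → real-time process separation (P3). The conversation levers are exhausted → **the next lever is running the P1 bench.**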
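
The clarity arithmetic in the changelog can be reproduced with a few lines. This is a sketch under the assumption that ambiguity is simply one minus the weighted clarity, which is what the figures above imply; the weights and scores are copied from the changelog, while `WEIGHTS`, `total_clarity`, and `ambiguity` are hypothetical names.

```python
# Dimension weights as stated in the changelog (they sum to 1.0).
WEIGHTS = {"goal": 0.40, "constraint": 0.30, "success": 0.30}


def total_clarity(scores: dict[str, float]) -> float:
    """Weighted sum of the per-dimension clarity scores."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def ambiguity(scores: dict[str, float]) -> float:
    """Assumed definition: ambiguity = 1 - total clarity."""
    return 1.0 - total_clarity(scores)


v23_scores = {"goal": 0.96, "constraint": 0.95, "success": 0.95}
print(f"clarity={total_clarity(v23_scores):.3f}")
print(f"ambiguity={ambiguity(v23_scores):.3f}")
```

Term by term, 0.96×0.40 = 0.384 and 0.95×0.30 = 0.285 (twice), giving 0.384 + 0.285 + 0.285 = 0.954 and an ambiguity of 0.046, consistent with the rounded ~5%.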
+
+---
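+*Consensus v2.3: `pending approval`. Execution (team/ralph/autopilot) proceeds only with the user's separate, explicit approval. Before approval, there is no source modification, commit, or execution-skill invocation.*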