luke_scribe

Author	SHA1	Message	Date
lukehemmin	a5e6d56568	docs: add Colab notebook for full-talk transcription (notebooks/colab_full_transcribe.ipynb) GPU(T4) 셀: ffmpeg+uv → 익명 clone → uv sync(engine+gpu) → detect → 오디오 업로드 → large-v3-turbo 풀 전사 → transcript.txt 다운로드. (Colab은 사내 게이트 미도달이라 전사 전용; 보정은 온프렘.) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 07:33:54 +09:00
lukehemmin	cd2f807557	chore(omc): hotpaths (beam-size/correct/COLAB) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 07:29:37 +09:00
lukehemmin	7a8cc12cb3	feat(cli): --beam-size + --correct; add COLAB.md GPU full-transcribe guide - transcribe: --beam-size(CPU 속도), --correct(사내 LLM 청크 보정, SCRIBE_LLM_*), config.beam_size(CPU 1~2 권장). 보정 시 전체 수집 후 한 번에 출력. - COLAB.md: Colab(전사 전용·게이트 미도달) + 온프렘 GPU(전사+보정 풀 파이프라인) 가이드. 23 tests pass, ruff clean. --correct 미설정 시 우아한 에러 검증. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 07:29:37 +09:00
lukehemmin	1a91060c43	chore(omc): hotpaths (chunked correction) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 07:09:51 +09:00
lukehemmin	b721ca6419	feat(api): chunk LLM correction for small context windows (+running glossary) 사내 GPT-4o 컨텍스트(<30k)에 맞춰 긴 전사를 문장 경계로 청크 분할하고, 각 청크 보정의 영문 용어를 '러닝 글로서리'로 다음 청크 system에 전달 → 큰 창 없이 강연 전체 용어 일관성 유지. config.llm_max_chars(기본 3000; ~8k창→1500/~16k→3000/~30k→6000). 과대 단일문장은 글자단위 강제 분할 안전망. 23 tests pass(청크 분할/글로서리 주입 포함), ruff clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 07:09:51 +09:00
lukehemmin	1ea96c36c8	chore(omc): record GPT-4o correction finding + P2 API progress (hotpaths) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 23:20:01 +09:00
lukehemmin	8f6f8969fd	feat(api): sync test API (serve) + opt-in LLM correction + cloudflared tunnel - api/: FastAPI app, X-API-Key 인증(미설정 시 임시키), 엔진 load-once 풀 (+transcribe lock), POST /v1/transcribe(multipart, 동기), /health, /v1/system, /v1/models. 업로드 임시파일 finally 삭제(프라이버시). - postprocess/: llm.correct(scripts/llm_correct.py 승격; opt-in·allowlist·감사로그·재시도) + rules.normalize(EmbeddingGemma 등 정규화). - results/formats.py: txt/srt/vtt. connectivity/tunnel.py: cloudflared quick tunnel(Colab). - cli serve: uvicorn 단일워커 + --tunnel cloudflare; config llm_* 필드; pyproject api/queue extra 분리(+python-multipart, dev httpx). 검증: 22 단위테스트(API TestClient·formats·postprocess) + 실서버 e2e (/health·auth 401·실제 전사(JFK)·SRT·임시파일 삭제). KO 품질은 turbo/large-v3 필요(tiny는 한국어 degenerate). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 23:20:01 +09:00
lukehemmin	480a36edfe	chore: scaffold samples/ko_en/ (clips/ + manifest template) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 15:14:25 +09:00
lukehemmin	45690371c3	docs: add samples/ bench dataset spec (KO+EN) + broaden audio gitignore Document the exact format for the KO+EN labeled clips that the bench gate needs (manifest.jsonl + ground-truth text + optional entities). Ignore audio/video under samples/** while keeping manifests tracked. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 15:12:20 +09:00
lukehemmin	518c03174a	chore(omc): record P1 progress note (engine+transcribe) + hotpaths Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 15:08:07 +09:00
lukehemmin	73380bebf9	feat(p1): faster-whisper engine + audio ingest + transcribe (CPU verified) - engine/: FasterWhisperEngine 래퍼 + model_registry (turbo→CT2 repo) - audio/ingest.py: ffprobe duration/size probe + 413 상한 훅 - cli transcribe: device-auto, model 오버라이드, 413 가드, model_used 출력 - 단위 테스트 3 (resolve_model, probe_media); README 갱신 검증(CPU): JFK 11s 클립 → 정확 전사, detected_lang=en. 10 tests pass, ruff clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 15:07:41 +09:00
lukehemmin	d75d60671e	chore(omc): seed build commands + hotpaths from P1 scaffolding Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 12:56:07 +09:00
lukehemmin	5d2604105b	feat(p1): scaffolding + Device Manager / VRAM probe + CLI detect - pyproject (uv, src layout) + extras: engine/gpu/api/diarize/llm - config.py (pydantic-settings, SCRIBE_ env) - devices/: vram_probe (NVML/psutil/disk) + DeviceManager → capability tier T0–T3, precision by cc/VRAM, worker estimate (계획 §3.6, AC-2/3) - cli.py (typer): detect (구현) + transcribe/bench/serve (스텁) - run.sh, .env.example, README Verified on GTX 1050/2GB: detect → T0_CPU (turbo doesn't fit → explicit downgrade, fail-explicit). Overrides (--device/--workers) work. 7 unit tests cover T0–T3 + overrides via synthetic VRAM. ruff clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 12:56:07 +09:00
lukehemmin	612b353105	docs(omc): seed project memory — directives, notes, tech stack Populate the previously-empty .omc/project-memory.json so teammates and future OMC sessions inherit context: 4 user directives (SoT location, greenfield/next-step, locked design decisions, measurement-gated residual), 3 notes (architecture, tech stack, env), and the decided tech stack. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 12:30:17 +09:00
lukehemmin	84faa121fe	docs: resolve open questions, recompute ambiguity ~10%→~5% (v2.3) Fold post-plan decisions into the spec and consensus plan: - Q1 deploy HW: undecided/mixed → delegate to hardware-adaptive auto-sizing - Q2 model strategy: collapse to single turbo model if P1 bench entity ≥95% - Q3 cancellation: cooperative (segment-boundary) is sufficient; no hard-kill - Q4 concurrency N: delegate to boot-time auto-sizing (AC-8 = ≤5s within auto N) Recompute clarity with the deep-interview model (Goal 0.96 / Constraint 0.95 / Success 0.95 → Total 0.954): ambiguity ~10% → ~5%. Residual is now entirely measurement/code-gated (AC-4 R-WER baseline, hybrid→single confirmation, CT2 GIL) — next lever is P1 bench, not further interview. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 11:07:36 +09:00
lukehemmin	fbe13dddcc	chore: initial commit — planning docs and omc project context Greenfield setup for luke_scribe (local STT transcription API). No source code yet; this captures the completed design phase so teammates can ramp through oh-my-claudecode. Includes: - .omc/plans/consensus-luke-scribe-stt-api.md — consensus impl plan v2.2 - .omc/specs/deep-interview-luke-scribe-stt-api.md — deep-interview spec - .omc/artifacts/ask/{codex,gemini}-*.md — external review (CCG) - .omc/project-memory.json — omc project memory - opencode.json, .claude/settings.json — shared tooling config - .gitignore — excludes ephemeral omc state/session logs and local settings Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 10:08:17 +09:00

16 Commits