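⚠ Spoiler warning: This site analyzes the sci-fi anime "SOLAR LINE" in detail. Proceed with caution if you have not watched it yet.
📝 AI-generated content: Most of this analysis was generated by AI (Claude Code and others). Please check the original work and the cited sources to verify accuracy.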

Task 046: Whisper STT for Episodes 01-04

Status: DONE

Motivation

Task 036 built Whisper STT infrastructure and successfully transcribed EP05 (Niconico-only, no YouTube subtitles). Episodes 01-04 have YouTube auto-generated VTT subtitles, but these are known to be unreliable for VOICEROID/software-talk content.

Running Whisper on EP01-04 provides a second, independent transcription source with higher accuracy.

Results

Whisper Quality (small model, CPU)

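| EP | Segments | Reliable | Avg LogProb | Whisper Lines | VTT Lines | Whisper Chars | VTT Chars |
|----|----------|----------|-------------|---------------|-----------|---------------|-----------|
| 1 | 440 | 430 (98%) | -0.144 | 419 | 87 | 5,258 | 4,305 |
| 2 | 426 | 415 (97%) | -0.142 | 408 | 80 | 5,056 | 3,874 |
| 3 | 448 | 432 (96%) | -0.145 | 420 | 88 | 4,821 | 4,141 |
| 4 | 162 | 159 (98%) | -0.215 | 159 | 85 | 5,255 | 4,209 |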

Key findings:

New Scripts

Whisper-Specific Extraction Strategy

Standard VTT extraction uses aggressive merging (no_terminal_punctuation + small_gap → merge) because VTT auto-generated subtitles split mid-sentence. Whisper segments are already semantically complete utterances, so we use minimal merging: only merge fragments (<3 chars) with zero-gap adjacency.
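The minimal-merging rule can be illustrated with a short Python sketch. It is not the project's actual script: the `Segment` type, the `merge_fragments` function, and the `GAP_EPSILON` tolerance are hypothetical names introduced here. The sketch assumes that a "fragment" is any text shorter than 3 characters, that "zero-gap" means the next segment starts exactly where the previous one ends (within a small floating-point tolerance), and that a merge happens when either neighbour is a fragment.

```python
from dataclasses import dataclass

FRAGMENT_MAX_CHARS = 3  # texts shorter than this count as fragments (from the rule above)
GAP_EPSILON = 1e-3      # assumed tolerance, in seconds, for treating a gap as zero


@dataclass
class Segment:
    """Hypothetical stand-in for one Whisper segment."""
    start: float  # seconds
    end: float    # seconds
    text: str


def merge_fragments(segments: list[Segment]) -> list[Segment]:
    """Merge only short fragments that directly adjoin a neighbouring segment."""
    merged: list[Segment] = []
    for seg in segments:
        text = seg.text.strip()
        if not text:
            continue  # drop empty segments
        current = Segment(seg.start, seg.end, text)
        if merged:
            prev = merged[-1]
            zero_gap = abs(current.start - prev.end) <= GAP_EPSILON
            has_fragment = (
                len(prev.text) < FRAGMENT_MAX_CHARS
                or len(current.text) < FRAGMENT_MAX_CHARS
            )
            if zero_gap and has_fragment:
                # Japanese text is joined without a separator (assumption).
                merged[-1] = Segment(prev.start, current.end, prev.text + current.text)
                continue
        merged.append(current)
    return merged


example = [
    Segment(0.0, 1.2, "こんにちは"),
    Segment(1.2, 1.5, "ね"),                  # fragment, zero gap: merged into previous
    Segment(2.0, 3.0, "はい"),                # fragment, but 0.5 s gap before it: kept
    Segment(3.0, 5.0, "次の目的地は火星です"),  # zero gap after a fragment: merged
]
result = merge_fragments(example)
```

Under these assumptions, complete utterances are never joined to each other, which reflects the contrast with the aggressive VTT merging described above: only fragments that directly adjoin a neighbour are absorbed.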

Files Modified

Files Created (gitignored, in raw_data/)

Depends on