⚠ ネタバレ注意: 本サイトはSFアニメ「SOLAR LINE」の内容を詳細に分析しています。未視聴の方はご注意ください。
📝 AI生成コンテンツ: 本考証の大部分は AI(Claude Code 等)によって生成されています。内容の正確性については原作および引用元をご確認ください。

Task 284: EP01 Transcription Accuracy Comparison

完了 ← タスク一覧

Task 284: EP01 Transcription Accuracy Comparison

Objective

Build automated accuracy measurement comparing EP01 official script (Layer 0) against VTT and Whisper transcriptions. This quantifies transcription quality and helps prioritize future STT improvements.

Scope

  1. Implement line-level text similarity metrics between:

- Official script (ep01_script.json) vs VTT (ep01_lines.json)

- Official script vs Whisper (ep01_lines_whisper.json)

  1. Produce structured accuracy report (JSON + rendered on site)
  2. Unit tests for the comparison functions

Results

MetricYouTube VTTWhisper (medium)
Corpus accuracy68.3%82.6%
Mean line accuracy8.6%*83.0%
Median line accuracy0.0%*90.0%

*VTT line-level metrics are low due to segmentation mismatch (87 VTT vs 229 script lines).

Corpus-level is the fair comparison metric.

Files Created/Modified

Also Fixed

Status: DONE