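⚠ Spoiler warning: This site analyzes the content of the SF anime "SOLAR LINE" in detail. Please take care if you have not watched it yet.
📝 AI-generated content: Most of this analysis was generated by AI (Claude Code, etc.). For the accuracy of the content, please check the original work and the cited sources.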

VOICEROID ASR Quality Issue

Status: RESOLVED

Mitigations implemented:

Problem

YouTube's auto-generated subtitles for SOLAR LINE have very poor accuracy. The series uses VOICEROID-style software-talk voices (CeVIO, VOICEVOX, etc.), and YouTube's speech recognition does not handle this synthetic speech well.

Observed Issues (Episode 1: CQ_OkDjEwRk)

Impact on Pipeline

- Speakers cannot be identified by voice from the ASR output alone

- Speaker attribution must instead rely on dialogue patterns (AI = polite/technical, きりたん = casual/decisions)
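
The dialogue-pattern idea above could be sketched roughly as follows. This is only an illustration: the note says AI lines are polite/technical and きりたん lines are casual, but the specific polite endings in `POLITE_ENDINGS` and the function name `guess_speaker` are assumptions, not part of the actual pipeline. Because the ASR text itself is unreliable, any label produced this way should be treated as a hint to confirm against the video.

```python
import re

# Assumed markers of polite speech (desu/masu forms), optionally followed
# by a sentence-final particle and punctuation. Hypothetical cue list.
POLITE_ENDINGS = re.compile(
    r"(です|ます|ました|ません|でしょう|ください)(か|ね|よ)?[。！？!?]*$"
)


def guess_speaker(line: str) -> str:
    """Return a tentative speaker label for one dialogue line."""
    text = line.strip()
    if not text:
        return "unknown"
    # Polite sentence ending -> tentatively the AI character.
    if POLITE_ENDINGS.search(text):
        return "AI"
    # Everything else falls back to the casual speaker.
    return "きりたん"
```

For example, `guess_speaker("軌道遷移は可能です。")` yields the AI label, while a casual line such as `"よし、行こう"` falls through to きりたん.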

Possible Mitigations

  1. Manual transcription: Most accurate but labor-intensive
  2. Whisper ASR: Run OpenAI Whisper locally on the audio; it may handle synthetic speech better
  3. Video OCR: Some series display text/subtitles on screen that could be extracted
  4. Human correction pass: Use ASR as starting point, correct against video
  5. Multiple ASR engines: Cross-reference YouTube auto-subs with Whisper output
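
As a rough illustration of mitigation 5, the two transcripts can be compared line by line and the low-agreement lines queued for human correction (mitigation 4). The sketch below assumes the YouTube and Whisper outputs have already been aligned into lists of equal length, which is not trivial in practice; `flag_disagreements` and the `threshold` value are hypothetical choices, not something taken from the project.

```python
from difflib import SequenceMatcher


def flag_disagreements(youtube_lines, whisper_lines, threshold=0.6):
    """Return (index, similarity, youtube_text, whisper_text) for each
    pair of aligned lines whose character-level similarity is below
    the threshold."""
    flagged = []
    for i, (yt, wh) in enumerate(zip(youtube_lines, whisper_lines)):
        # ratio() is 1.0 for identical strings and 0.0 when nothing matches.
        ratio = SequenceMatcher(None, yt, wh).ratio()
        if ratio < threshold:
            flagged.append((i, round(ratio, 2), yt, wh))
    return flagged
```

Agreement between two engines does not prove a line is correct, since both may mishear synthetic speech in the same way; it only helps prioritize which lines to review first.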

Speaker Diarization Investigation (Task 056)

Tested speaker diarization tools on VOICEROID content:

Recommendation

For now, use YouTube auto-subs as structural scaffolding (timing, line breaks) but treat the text content as unreliable. Speaker attribution and quote correction should be done by reviewing the actual video content.
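
One way to picture this scaffolding approach is a small parser that keeps the cue timings and marks every line of text as unverified until someone checks it against the video. The sketch assumes the auto-subs were downloaded in WebVTT format; the `Cue` fields and the `parse_vtt` helper are hypothetical names for illustration, not the project's actual data model.

```python
import re
from dataclasses import dataclass

# Matches a WebVTT cue timing line; trailing cue settings are ignored.
CUE_TIMING = re.compile(
    r"(\d{2}:\d{2}:\d{2}\.\d{3}) --> (\d{2}:\d{2}:\d{2}\.\d{3})"
)


@dataclass
class Cue:
    start: str
    end: str
    asr_text: str           # raw ASR text, treated as unreliable
    speaker: str = ""       # filled in after reviewing the video
    verified: bool = False  # set to True once corrected by a human


def parse_vtt(vtt: str) -> list:
    """Extract cues from WebVTT text, keeping timing as the scaffold."""
    cues = []
    current = None
    for line in vtt.splitlines():
        match = CUE_TIMING.match(line)
        if match:
            current = Cue(match.group(1), match.group(2), "")
            cues.append(current)
        elif current is not None:
            if line.strip():
                # Join multi-line cue text with single spaces.
                current.asr_text = (current.asr_text + " " + line.strip()).strip()
            else:
                # A blank line ends the current cue.
                current = None
    return cues
```

Keeping `asr_text` separate from the `speaker` and `verified` fields makes it explicit which parts came from YouTube and which were confirmed by a reviewer.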