How YouTube Transcript Generators Work: ASR vs Caption Retrieval