What is a YouTube transcript?
A YouTube transcript is the full written text of everything spoken in a video — the same words you'd hear if you watched it, but in a form you can read, search, copy, and study. Transcripts make long lectures skimmable, podcasts quotable, and language-learning videos far more useful.
YouTube Translate generates transcripts two ways. When a video has captions available — either auto-generated by YouTube or uploaded by the creator — we fetch them directly and clean them up for readability. When captions are missing, our AI transcription takes the audio track and turns it into a precise transcript using Google's Gemini speech model, with punctuation, paragraph breaks, and proper nouns intact.