Transcription is the first step in creating a captioned video. A transcript is a text equivalent of an audio recording. Beyond its role in captioning, a transcript is useful in its own right as a learning tool: students can read the text and search it for keywords. Transcripts are a valuable addition to any instructional video or podcast because they give students another way to comprehend and interact with the material.
Although numerous automatic transcription tools are available to convert speech to text, none provide satisfactory results at this time. The transcripts produced by automatic transcription require extensive edits and corrections. We therefore recommend that transcripts be typed manually in Microsoft Word.
Tools used in this tutorial
- Microsoft Word
The following guidelines are adapted from the Described and Captioned Media Program’s (DCMP) Captioning Key. The DCMP is funded by the U.S. Department of Education and administered by the National Association of the Deaf. We have found these guidelines to be comprehensive. For additional information, follow the link to the Captioning Key website.
A quality transcript should include speaker identification as well as descriptions of sound effects, music, and other notations that promote comprehension when the audio cannot be heard.
Place the speaker’s name in parentheses, on its own line:
Descriptions of sound effects should be placed in square brackets and should identify the source of the sound:
Sound effects should appear after the sound description, on their own line, in lower case:
If possible, use words that imitate the sound being described:
Woof (dog bark)
If a piece of music is significant to the story, introduce the artist’s name and song title in brackets:
If the music has lyrics that are pertinent to the story, caption them. Mark lyrics with a music symbol (♪) at the beginning and end of each caption (to insert the symbol in Windows, hold Alt and type 13 on the numeric keypad):
Use two music symbols after the last line of the song to indicate the end of the music:
Use an ellipsis (…) to indicate a pause:
Use italics for foreign words and phrases:
Use ALL CAPS, not italics, for emphasis:
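For anyone who prepares transcripts with a script rather than by hand, the conventions above can be sketched as a few small Python helpers. This is only an illustration: the function names are hypothetical, and the exact spacing around the music symbols is an assumption rather than something the guidelines specify here.

```python
MUSIC = "\u266a"  # the music symbol (♪)


def speaker_line(name: str) -> str:
    """Speaker's name in parentheses, meant to sit on its own line."""
    return f"({name.strip()})"


def sound_description(description: str) -> str:
    """Sound description in square brackets, naming the sound source."""
    return f"[{description.strip()}]"


def sound_effect(effect: str) -> str:
    """Imitative sound effect in lower case, for the line after the description."""
    return effect.strip().lower()


def lyric_line(text: str, last: bool = False) -> str:
    """Lyric wrapped in music symbols; two symbols close the last line.

    The single space around the symbols is an assumption.
    """
    closing = MUSIC * 2 if last else MUSIC
    return f"{MUSIC} {text.strip()} {closing}"


def emphasize(word: str) -> str:
    """ALL CAPS, not italics, for emphasis."""
    return word.upper()


if __name__ == "__main__":
    print(speaker_line("Narrator"))
    print(sound_description("dog barking"))
    print(sound_effect("Woof"))
    print(lyric_line("la la la", last=True))
```

Italics for foreign words are left out of the sketch because plain text has no italic style; that formatting would be applied in Word itself.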
Follow these steps to create a transcript:
- Open a new Microsoft Word document.
- Open the video file.
- Play the video in small increments (20–30 seconds) while transcribing the audio and dialogue in Microsoft Word.
- Identify speakers when they appear in the video.
- If the scene contains multiple speakers, identify the person speaking in each caption.
- If the camera focuses on one speaker at a time, speaker identification is not needed after the initial identification.
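The speaker-identification steps above could be approximated in code as follows. This is a minimal sketch under one simplifying assumption: a speaker label is emitted only when the speaker changes, which matches the one-speaker-at-a-time case. A scene with several people on screen would need a label on every caption instead. The function name and the (speaker, text) input format are hypothetical.

```python
from typing import Iterable, List, Optional, Tuple


def build_transcript(segments: Iterable[Tuple[str, str]]) -> List[str]:
    """Turn (speaker, text) pairs into transcript lines.

    A speaker label in parentheses is added on its own line
    whenever the speaker differs from the previous segment.
    """
    lines: List[str] = []
    previous: Optional[str] = None
    for speaker, text in segments:
        if speaker != previous:
            lines.append(f"({speaker})")
            previous = speaker
        lines.append(text)
    return lines


if __name__ == "__main__":
    demo = [
        ("Ann", "Welcome to the course."),
        ("Ann", "Today we cover captions."),
        ("Ben", "Thanks for having me."),
    ]
    print("\n".join(build_transcript(demo)))
```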
Your new transcript file can be used in several ways:
- The text can be copied from Word and pasted into Camtasia Studio, where it will be synchronized with the video to create captions (see Adding Captions in Camtasia Studio).
- The Word document can be saved as a text file (.txt) and uploaded to YouTube for synchronization with the video.
- The document can be posted in Word or PDF format on the course website, where the text will become a searchable learning resource.
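If the transcript lines already exist in a script, the plain-text file mentioned above can also be written directly instead of saved from Word. The sketch below assumes UTF-8 encoding so that characters such as the music symbol survive; the function name and file name are hypothetical.

```python
from pathlib import Path
from typing import Iterable


def save_transcript(lines: Iterable[str], path: str) -> Path:
    """Write transcript lines to a plain-text (.txt) file, one per line."""
    target = Path(path)
    # UTF-8 keeps non-ASCII characters such as the music symbol intact.
    target.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return target


if __name__ == "__main__":
    saved = save_transcript(["(Narrator)", "Welcome to the course."], "transcript.txt")
    print(f"Saved {saved}")
```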