Rich Transcription Time Marked (RTTM) is a widely used, text-based format for annotating audio and video, representing results of speech recognition, speaker diarization, and related metadata. Developed by NIST in the early 2000s, RTTM files consist ...