Automatic Speech Recognition and Transcription for a topic-related Segmentation Tool in Audio Streams
Citation key 1312Xu2011
Author Shangbo Xu
Year 2011
Month May
School Technische Universität Berlin
Abstract This master's thesis investigates how to extract the structure of spoken audio streams by means of speech transcription. The focus is on news data, which can be structured according to diverse types of content: speech, music, particular speakers, advertisements, or topics. The thesis applies the Term Frequency-Inverse Document Frequency (TF-IDF) method from text indexing to develop a topic detection system based on news from ABC World News. Fifteen vocabularies are designed for the system, which aims to detect the topics of news reports in text form and achieves a recognition rate of up to 80%. The system's performance is evaluated with common tools such as the truth table, the confusion matrix, and per-topic precision, recall, and F-measure values.
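As a rough illustration of the approach the abstract describes, the following minimal sketch scores transcribed reports against per-topic vocabularies using TF-IDF weights and then computes per-topic precision, recall and F-measure. It is not the thesis's implementation: the example texts, the three small vocabularies, the scoring rule (sum of TF-IDF weights of vocabulary terms) and all function names are assumptions made purely for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and keep alphabetic word tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    tf  = term count / document length
    idf = log(number of documents / documents containing the term)
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(t for tokens in tokenized for t in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        weights.append(
            {t: (c / total) * math.log(n_docs / doc_freq[t])
             for t, c in counts.items()}
        )
    return weights


def detect_topic(doc_weights, vocabularies):
    """Pick the topic whose vocabulary terms carry the most TF-IDF weight.

    Assumed scoring rule; ties resolve to the first topic in the dict.
    """
    scores = {
        topic: sum(doc_weights.get(term, 0.0) for term in vocab)
        for topic, vocab in vocabularies.items()
    }
    return max(scores, key=scores.get)


def precision_recall_f(truth, predicted, topic):
    """Single-topic precision, recall and F-measure (F1)."""
    pairs = list(zip(truth, predicted))
    tp = sum(1 for t, p in pairs if t == topic and p == topic)
    fp = sum(1 for t, p in pairs if t != topic and p == topic)
    fn = sum(1 for t, p in pairs if t == topic and p != topic)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f = (2 * precision * recall / (precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f


# Hypothetical toy data: the thesis uses 15 vocabularies and ABC World News.
docs = [
    "The senate passed the budget vote after a long election debate",
    "The team won the game with a late goal in the final match",
    "Storm and heavy rain bring flood warnings to the coast",
]
vocabularies = {
    "politics": {"senate", "election", "vote", "budget"},
    "sports": {"team", "game", "goal", "match"},
    "weather": {"storm", "rain", "flood"},
}
truth = ["politics", "sports", "weather"]

predicted = [detect_topic(w, vocabularies) for w in tf_idf(docs)]
for topic in vocabularies:
    print(topic, precision_recall_f(truth, predicted, topic))
```

Words that occur in every document, such as "the", receive an IDF of zero and so contribute nothing to any topic score, which is the property that makes TF-IDF useful for separating topic-specific terms from common ones.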
