Digital Library
 
 
  Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream
 
 
Title: Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream
Author: J. Silovsky, J. Nouza
Appeared in: Radioengineering
Paging: Volume 15 (2006), no. 3, pages 42-48
Year: 2006
Contents: This paper presents a set of techniques for classifying audio segments in a system for automatic transcription of broadcast programs. The task consists of deciding (a) whether a segment should be labeled as speech or non-speech; if it is speech, (b) whether the talker is one of the speakers in the database; and if not, (c) the speaker's gender. The classification result extends the information provided by the transcription system and also improves the performance of the speech recognition module. Like most state-of-the-art speaker recognition systems, the proposed one is based on Gaussian Mixture Models (GMMs). Because the number of database speakers can be large, we introduce a technique that significantly speeds up the identification process. Furthermore, we compare several approaches to estimating GMM parameters. Finally, we present the results achieved in classifying 230 minutes of real broadcast data.
Publisher: Spolecnost pro radioelektronicke inzenyrstvi (provided by DOAJ)
Source file: Elektronische Wetenschappelijke Tijdschriften
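
Illustration (not part of the catalogued record): the three-stage decision described in the abstract can be sketched roughly as below. This is a minimal sketch under stated assumptions, not the authors' implementation. The function names, the use of a background model with a score threshold for the "speaker not in the database" decision, and the diagonal-covariance form of the GMMs are all assumptions made for illustration; the abstract does not specify them, and the speed-up technique it mentions is not reproduced here.

```python
import math

def gmm_avg_loglik(frames, gmm):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    frames: list of feature vectors (lists of floats).
    gmm: list of (weight, means, variances) tuples, one per component.
    """
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in gmm:
            ll = math.log(w)
            for xi, mi, vi in zip(x, mu, var):
                ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
            comp_logs.append(ll)
        # log-sum-exp over components for numerical stability
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)

def classify_segment(frames, speech_gmm, nonspeech_gmm, speaker_gmms,
                     background_gmm, gender_gmms, threshold=0.5):
    """Hypothetical cascade: (a) speech/non-speech, (b) known speaker, (c) gender."""
    # (a) speech vs. non-speech: pick the model with the higher likelihood
    if gmm_avg_loglik(frames, speech_gmm) <= gmm_avg_loglik(frames, nonspeech_gmm):
        return ("non-speech", None)

    # (b) best-matching database speaker, accepted only if it beats an
    # assumed background model by more than `threshold` (an assumption)
    scores = {name: gmm_avg_loglik(frames, g) for name, g in speaker_gmms.items()}
    best = max(scores, key=scores.get)
    if scores[best] - gmm_avg_loglik(frames, background_gmm) > threshold:
        return ("known-speaker", best)

    # (c) otherwise fall back to gender identification
    g_scores = {g: gmm_avg_loglik(frames, m) for g, m in gender_gmms.items()}
    return ("unknown-speaker", max(g_scores, key=g_scores.get))
```

In this sketch every database speaker is scored exhaustively in stage (b), which is the cost the paper's speed-up technique is meant to reduce when the database is large.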
 
 

 
 Koninklijke Bibliotheek - National Library of the Netherlands