Date of Award
Spring 2005
Document Type
Thesis
Degree Name
Master of Science (MS)
Department
Electrical & Computer Engineering
Program/Concentration
Computer Engineering
Committee Director
Stephen A. Zahorian
Committee Member
Vijayan K. Asari
Committee Member
Frederic D. McKenzie
Call Number for Print
Special Collections LD4331.E55 S255 2005
Abstract
This thesis is an extension of previous work on speaker identification, but with a primary focus on the generalization performance of patterns classifiers for speaker identification. The goal for all pattern classifiers is to generalize to new data, not just perform well on the training data set. The error on the training data is typically a biased estimate of the generalization error. The training set error tends to be smaller than the generalization error if the model is selected such that it minimizes the training error. Therefore, the training error alone should not be used as a good predictor of classifier performance.
The main focus of this research is to investigate the issues related to the commonly observed phenomena that classifiers have high identification performance on training data, but poor accuracy for test data. Understanding these issues is crucial in order to improve the generalization performance.
As detailed in this work, the basic algorithms from our previous work on speaker identification have not been changed, but many details in the signal processing were changed to improve identification results. Also, a recently developed non-parametric classifier, called a Support Vector Machine (SVM), which is claimed to have good generalization performance, was tested on the NTIMIT database. However, only negligible improvements were obtained in using an SVM versus a Neural Network (NN) classifier. In other experiments, performed with the SPIDRE speaker identification database, using the changes in the signal processing details, speaker identification accuracy of over 84% was obtained on test sentences, using all 45 target speakers in the database. These results are among the best reported for the SPIDRE database, and indicate that excellent results can be obtained using simple classifiers.
Rights
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
DOI
10.25777/y1kf-3r48
Recommended Citation
Sall, Mame C..
"Generalization Issues in Pattern Classification Applied to Speaker Identification"
(2005). Master of Science (MS), Thesis, Electrical & Computer Engineering, Old Dominion University, DOI: 10.25777/y1kf-3r48
https://digitalcommons.odu.edu/ece_etds/511