Date of Award

Winter 2006

Document Type

Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Electrical & Computer Engineering

Committee Director

Stephen A. Zahorian

Committee Member

Charlie H. Cooke

Committee Member

Oscar R. Gonzalez

Committee Member

David Streight

Abstract

Front-end feature extraction techniques have long been a critical component in Automatic Speech Recognition (ASR). Nonlinear filtering techniques are becoming increasingly important in this application, and are often better than linear filters at removing noise without distorting speech features. However, design and analysis of nonlinear filters are more difficult than for linear filters. Mathematical morphology, which creates filters based on shape and size characteristics, is a design structure for nonlinear filters. These filters are limited to minimum and maximum operations that introduce a deterministic bias into filtered signals.

This work develops filtering structures based on a mathematical morphology that utilizes the bias while emphasizing spectral peaks. The combination of peak emphasis via LP analysis with morphological filtering results in more noise robust speech recognition rates.

To help understand the behavior of these pre-processing techniques the deterministic and statistical properties of the morphological filters are compared to the properties of feature extraction techniques that do not employ such algorithms. The robust behavior of these algorithms for automatic speech recognition in the presence of rapidly fluctuating speech signals with additive and convolutional noise is illustrated. Examples of these nonlinear feature extraction techniques are given using the Aurora 2.0 and Aurora 3.0 databases. Features are computed using LP analysis alone to emphasize peaks, morphological filtering alone, or a combination of the two approaches. Although absolute best results are normally obtained using a combination of the two methods, morphological filtering alone is nearly as effective and much more computationally efficient.

Rights

In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).

DOI

10.25777/ehk1-gs02

Recommended Citation

Hix, Penny. "Automatic Speech Recognition Using LP-DCTC/DCS Analysis Followed by Morphological Filtering" (2006). Doctor of Philosophy (PhD), Dissertation, Electrical & Computer Engineering, Old Dominion University, DOI: 10.25777/ehk1-gs02
https://digitalcommons.odu.edu/ece_etds/88

Download

Included in

Electrical and Computer Engineering Commons

COinS

ODU Digital Commons

Electrical & Computer Engineering Theses & Dissertations

Automatic Speech Recognition Using LP-DCTC/DCS Analysis Followed by Morphological Filtering

Date of Award

Document Type

Degree Name

Department

Committee Director

Committee Member

Committee Member

Committee Member

Abstract

Rights

DOI

Recommended Citation

Included in

Search

Browse

Contribute

Links

Contact Us

ODU Digital Commons

Electrical & Computer Engineering Theses & Dissertations

Automatic Speech Recognition Using LP-DCTC/DCS Analysis Followed by Morphological Filtering

Author

Date of Award

Document Type

Degree Name

Department

Committee Director

Committee Member

Committee Member

Committee Member

Abstract

Rights

DOI

Recommended Citation

Included in

Share

Search

Browse

Contribute

Links

Contact Us