Publication: Feature extraction techniques for speech processing: A review
Loading...
Date
2019
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
World Academy of Research in Science and Engineering
Abstract
In digital signal processing, speech processing is one of the areas that is used in many type of applications. It is one of an intensive field of research. The major criterion for good speech processing system is the selection of feature extraction technique, which plays a major role in achieving higher accuracy. In this paper, most commonly used techniques for feature extraction such as Linear Predictive Coefficient (LPC), Mel Frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP), Relative Spectral Perceptual Linear Prediction (RASTA-PLP) and Wavelet Transform (WT) are presented. Comparisons that highlight the strengths and the weaknesses of these techniques are also presented. Studies show that feature extraction techniques are mainly selected based on the requirement of the applications. Wavelet transform outperform other techniques for the analysis of non-stationary signals in audio signal. Enhanced Wavelet transform technique is a way forward and studies can be focused on its coefficients. Hybrid methods can be further explored to increase the performance in speech processing. A number of hybrid methods were reviewed, and studies show that Mel-Frequency Cepstral Coefficients (WPCC) provide better results for speech processing applications with standard coefficient for classification. � 2019, World Academy of Research in Science and Engineering. All rights reserved.
Description
Keywords
Linear Predictive Coefficient (LPC), Mel Frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP), Relative Spectral Perceptual Linear Prediction (RASTA-PLP), Wavelet Transform (WT)