PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION.

Kazunori Mano, Shunichi Ishige, Katsuhiko Shirai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The authors describe an approach to phoneme recognition based on a clustering method which considers phonemic featuers in each frame. In the clustering, both acoustic and phonemic features of speech are used. The acoustic features are linear predictive coding (LPC) coefficients, the cepstral changes between adjacent frames, and the power changes. The combination of these features shows both the static and dynamic properties of the spectrum. The phonemic feature in a frame is composed of a triplet of phonemic symbols. A vector quantization method is used for the clustering. An experimental extraction of phonemic label sequences is performed, considering a contiguity of code sequences between input and the reference phonemic patterns. 8 refs.

Original languageEnglish
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
PublisherIEEE
Pages2243-2246
Number of pages4
Publication statusPublished - 1986
Externally publishedYes

Fingerprint

phonemics
phonemes
vector quantization
Vector quantization
dynamic characteristics
Acoustics
Labels
acoustics
coding
coefficients

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Cite this

Mano, K., Ishige, S., & Shirai, K. (1986). PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 2243-2246). IEEE.

PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION. / Mano, Kazunori; Ishige, Shunichi; Shirai, Katsuhiko.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. IEEE, 1986. p. 2243-2246.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mano, K, Ishige, S & Shirai, K 1986, PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. IEEE, pp. 2243-2246.
Mano K, Ishige S, Shirai K. PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. IEEE. 1986. p. 2243-2246
Mano, Kazunori ; Ishige, Shunichi ; Shirai, Katsuhiko. / PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. IEEE, 1986. pp. 2243-2246
@inproceedings{ea65ed597b1f4e838a1504f1b32ad0c3,
title = "PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION.",
abstract = "The authors describe an approach to phoneme recognition based on a clustering method which considers phonemic featuers in each frame. In the clustering, both acoustic and phonemic features of speech are used. The acoustic features are linear predictive coding (LPC) coefficients, the cepstral changes between adjacent frames, and the power changes. The combination of these features shows both the static and dynamic properties of the spectrum. The phonemic feature in a frame is composed of a triplet of phonemic symbols. A vector quantization method is used for the clustering. An experimental extraction of phonemic label sequences is performed, considering a contiguity of code sequences between input and the reference phonemic patterns. 8 refs.",
author = "Kazunori Mano and Shunichi Ishige and Katsuhiko Shirai",
year = "1986",
language = "English",
pages = "2243--2246",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "IEEE",

}

TY - GEN

T1 - PHONEME RECOGNITION IN CONNECTED SPEECH USING BOTH STATIC AND DYNAMIC PROPERTIES OF SPECTRUM DESCRIBED BY VECTOR QUANTIZATION.

AU - Mano, Kazunori

AU - Ishige, Shunichi

AU - Shirai, Katsuhiko

PY - 1986

Y1 - 1986

N2 - The authors describe an approach to phoneme recognition based on a clustering method which considers phonemic featuers in each frame. In the clustering, both acoustic and phonemic features of speech are used. The acoustic features are linear predictive coding (LPC) coefficients, the cepstral changes between adjacent frames, and the power changes. The combination of these features shows both the static and dynamic properties of the spectrum. The phonemic feature in a frame is composed of a triplet of phonemic symbols. A vector quantization method is used for the clustering. An experimental extraction of phonemic label sequences is performed, considering a contiguity of code sequences between input and the reference phonemic patterns. 8 refs.

AB - The authors describe an approach to phoneme recognition based on a clustering method which considers phonemic featuers in each frame. In the clustering, both acoustic and phonemic features of speech are used. The acoustic features are linear predictive coding (LPC) coefficients, the cepstral changes between adjacent frames, and the power changes. The combination of these features shows both the static and dynamic properties of the spectrum. The phonemic feature in a frame is composed of a triplet of phonemic symbols. A vector quantization method is used for the clustering. An experimental extraction of phonemic label sequences is performed, considering a contiguity of code sequences between input and the reference phonemic patterns. 8 refs.

UR - http://www.scopus.com/inward/record.url?scp=0022863492&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0022863492&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0022863492

SP - 2243

EP - 2246

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

PB - IEEE

ER -