SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.

Ken'iti Kido, Shozo Makino, Jouji Miwa, Yoshihiro Niitsu

Research output: Contribution to conferencePaper

Abstract

A description is given of the newly developed spoken word recognition system which is composed of the following four stages. In the first stage, seven parameters are extracted every 10 msec from the outputs of the filter bank. The parameters are the frequencies (channel numbers) of three spectral local peaks, the speech power and three parameters expressing the gross pattern of the spectrum. In the second stage, the segmentation and phoneme recognition are carried out. In the third stage, the errors in the segmentation and phoneme recognition are corrected by means of phoneme connecting rules. And, in the last stage, the item of the dictionary having maximum similarity to the recognized phonemic sequence is chosen. Every item of the dictionary is written in phonemic symbols which is transformed from the word written in Japanese ″Kana″ letters by simple rules.

Original languageEnglish
Pages980-984
Number of pages5
Publication statusPublished - 1979 Jan 1
Externally publishedYes
EventProc of the Int Jt Conf on Pattern Recognition, 4th - Kyoto, Jpn
Duration: 1978 Nov 71978 Nov 10

Conference

ConferenceProc of the Int Jt Conf on Pattern Recognition, 4th
CityKyoto, Jpn
Period78/11/778/11/10

Fingerprint

Glossaries
Linguistics
Filter banks

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Kido, K., Makino, S., Miwa, J., & Niitsu, Y. (1979). SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.. 980-984. Paper presented at Proc of the Int Jt Conf on Pattern Recognition, 4th, Kyoto, Jpn, .

SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION. / Kido, Ken'iti; Makino, Shozo; Miwa, Jouji; Niitsu, Yoshihiro.

1979. 980-984 Paper presented at Proc of the Int Jt Conf on Pattern Recognition, 4th, Kyoto, Jpn, .

Research output: Contribution to conferencePaper

Kido, K, Makino, S, Miwa, J & Niitsu, Y 1979, 'SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.' Paper presented at Proc of the Int Jt Conf on Pattern Recognition, 4th, Kyoto, Jpn, 78/11/7 - 78/11/10, pp. 980-984.
Kido K, Makino S, Miwa J, Niitsu Y. SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.. 1979. Paper presented at Proc of the Int Jt Conf on Pattern Recognition, 4th, Kyoto, Jpn, .
Kido, Ken'iti ; Makino, Shozo ; Miwa, Jouji ; Niitsu, Yoshihiro. / SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION. Paper presented at Proc of the Int Jt Conf on Pattern Recognition, 4th, Kyoto, Jpn, .5 p.
@conference{1d10e910e2694a8daa3b43f3fff75b8a,
title = "SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.",
abstract = "A description is given of the newly developed spoken word recognition system which is composed of the following four stages. In the first stage, seven parameters are extracted every 10 msec from the outputs of the filter bank. The parameters are the frequencies (channel numbers) of three spectral local peaks, the speech power and three parameters expressing the gross pattern of the spectrum. In the second stage, the segmentation and phoneme recognition are carried out. In the third stage, the errors in the segmentation and phoneme recognition are corrected by means of phoneme connecting rules. And, in the last stage, the item of the dictionary having maximum similarity to the recognized phonemic sequence is chosen. Every item of the dictionary is written in phonemic symbols which is transformed from the word written in Japanese ″Kana″ letters by simple rules.",
author = "Ken'iti Kido and Shozo Makino and Jouji Miwa and Yoshihiro Niitsu",
year = "1979",
month = "1",
day = "1",
language = "English",
pages = "980--984",
note = "Proc of the Int Jt Conf on Pattern Recognition, 4th ; Conference date: 07-11-1978 Through 10-11-1978",

}

TY - CONF

T1 - SPOKEN WORD RECOGNITION SYSTEM FOR UNLIMITED SPEAKERS USING GROSS PATTERN OF SPECTRUM AND LINGUISTIC INFORMATION.

AU - Kido, Ken'iti

AU - Makino, Shozo

AU - Miwa, Jouji

AU - Niitsu, Yoshihiro

PY - 1979/1/1

Y1 - 1979/1/1

N2 - A description is given of the newly developed spoken word recognition system which is composed of the following four stages. In the first stage, seven parameters are extracted every 10 msec from the outputs of the filter bank. The parameters are the frequencies (channel numbers) of three spectral local peaks, the speech power and three parameters expressing the gross pattern of the spectrum. In the second stage, the segmentation and phoneme recognition are carried out. In the third stage, the errors in the segmentation and phoneme recognition are corrected by means of phoneme connecting rules. And, in the last stage, the item of the dictionary having maximum similarity to the recognized phonemic sequence is chosen. Every item of the dictionary is written in phonemic symbols which is transformed from the word written in Japanese ″Kana″ letters by simple rules.

AB - A description is given of the newly developed spoken word recognition system which is composed of the following four stages. In the first stage, seven parameters are extracted every 10 msec from the outputs of the filter bank. The parameters are the frequencies (channel numbers) of three spectral local peaks, the speech power and three parameters expressing the gross pattern of the spectrum. In the second stage, the segmentation and phoneme recognition are carried out. In the third stage, the errors in the segmentation and phoneme recognition are corrected by means of phoneme connecting rules. And, in the last stage, the item of the dictionary having maximum similarity to the recognized phonemic sequence is chosen. Every item of the dictionary is written in phonemic symbols which is transformed from the word written in Japanese ″Kana″ letters by simple rules.

UR - http://www.scopus.com/inward/record.url?scp=0018315315&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0018315315&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0018315315

SP - 980

EP - 984

ER -