Vector quantization of LSP parameters using moving average interframe prediction

Hitoshi Ohmuro, Takehiro Moriya, Kazunori Mano, Satoshi Miki

研究成果: Article

8 引用 (Scopus)

抄録

This paper proposes a new method for vector quantization (VQ) of LSP parameters, using moving average (MA) interframe predictions. MA predictive VQ executes the prediction using the code vectors (codebook outputs) in the previous frames. Moreover, this method has the following features, compared to autoregressive (AR) predictive coding, which executes the prediction based on the previous quantized (decoded) values: (1) even if a bit error is produced in the transmission channel, its effect to the succeeding frames remains finite; and (2) the stored codes can be decoded from any time point. This paper discusses the quantization performance and the robustness against bit errors of the MA predictive VQ, as well as the training method for the codebook. Assuming an application to low‐bit‐rate speech coding, the configuration considered has a frame length of 40 ms, with each frame composed of four subframes. A method for further improvement of the efficiency is reported. As a result of evaluation using actual speech data, bit reductions of approximately 16 percent and 23 percent are achieved for the quantization for each 20 ms and 40 ms, respectively, compared to the conventional method which does not use interframe prediction.

元の言語English
ページ(範囲)12-26
ページ数15
ジャーナルElectronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)
77
発行部数10
DOI
出版物ステータスPublished - 1994
外部発表Yes

Fingerprint

Vector quantization
Speech coding

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

これを引用

@article{2ce4ca2f2cce44faa9b8870a37b8f7ff,
title = "Vector quantization of LSP parameters using moving average interframe prediction",
abstract = "This paper proposes a new method for vector quantization (VQ) of LSP parameters, using moving average (MA) interframe predictions. MA predictive VQ executes the prediction using the code vectors (codebook outputs) in the previous frames. Moreover, this method has the following features, compared to autoregressive (AR) predictive coding, which executes the prediction based on the previous quantized (decoded) values: (1) even if a bit error is produced in the transmission channel, its effect to the succeeding frames remains finite; and (2) the stored codes can be decoded from any time point. This paper discusses the quantization performance and the robustness against bit errors of the MA predictive VQ, as well as the training method for the codebook. Assuming an application to low‐bit‐rate speech coding, the configuration considered has a frame length of 40 ms, with each frame composed of four subframes. A method for further improvement of the efficiency is reported. As a result of evaluation using actual speech data, bit reductions of approximately 16 percent and 23 percent are achieved for the quantization for each 20 ms and 40 ms, respectively, compared to the conventional method which does not use interframe prediction.",
keywords = "bit error, interframe prediction, LSP, moving average, vector quantization",
author = "Hitoshi Ohmuro and Takehiro Moriya and Kazunori Mano and Satoshi Miki",
year = "1994",
doi = "10.1002/ecjc.4430771002",
language = "English",
volume = "77",
pages = "12--26",
journal = "Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)",
issn = "1042-0967",
publisher = "John Wiley and Sons Inc.",
number = "10",

}

TY - JOUR

T1 - Vector quantization of LSP parameters using moving average interframe prediction

AU - Ohmuro, Hitoshi

AU - Moriya, Takehiro

AU - Mano, Kazunori

AU - Miki, Satoshi

PY - 1994

Y1 - 1994

N2 - This paper proposes a new method for vector quantization (VQ) of LSP parameters, using moving average (MA) interframe predictions. MA predictive VQ executes the prediction using the code vectors (codebook outputs) in the previous frames. Moreover, this method has the following features, compared to autoregressive (AR) predictive coding, which executes the prediction based on the previous quantized (decoded) values: (1) even if a bit error is produced in the transmission channel, its effect to the succeeding frames remains finite; and (2) the stored codes can be decoded from any time point. This paper discusses the quantization performance and the robustness against bit errors of the MA predictive VQ, as well as the training method for the codebook. Assuming an application to low‐bit‐rate speech coding, the configuration considered has a frame length of 40 ms, with each frame composed of four subframes. A method for further improvement of the efficiency is reported. As a result of evaluation using actual speech data, bit reductions of approximately 16 percent and 23 percent are achieved for the quantization for each 20 ms and 40 ms, respectively, compared to the conventional method which does not use interframe prediction.

AB - This paper proposes a new method for vector quantization (VQ) of LSP parameters, using moving average (MA) interframe predictions. MA predictive VQ executes the prediction using the code vectors (codebook outputs) in the previous frames. Moreover, this method has the following features, compared to autoregressive (AR) predictive coding, which executes the prediction based on the previous quantized (decoded) values: (1) even if a bit error is produced in the transmission channel, its effect to the succeeding frames remains finite; and (2) the stored codes can be decoded from any time point. This paper discusses the quantization performance and the robustness against bit errors of the MA predictive VQ, as well as the training method for the codebook. Assuming an application to low‐bit‐rate speech coding, the configuration considered has a frame length of 40 ms, with each frame composed of four subframes. A method for further improvement of the efficiency is reported. As a result of evaluation using actual speech data, bit reductions of approximately 16 percent and 23 percent are achieved for the quantization for each 20 ms and 40 ms, respectively, compared to the conventional method which does not use interframe prediction.

KW - bit error

KW - interframe prediction

KW - LSP

KW - moving average

KW - vector quantization

UR - http://www.scopus.com/inward/record.url?scp=0003784212&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0003784212&partnerID=8YFLogxK

U2 - 10.1002/ecjc.4430771002

DO - 10.1002/ecjc.4430771002

M3 - Article

AN - SCOPUS:0003784212

VL - 77

SP - 12

EP - 26

JO - Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)

JF - Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)

SN - 1042-0967

IS - 10

ER -