Pitch synchronous innovation code excited linear prediction (PSI-CELP)

Satoshi Miki, Takehiro Moriya, Kazunori Mano, Hitoshi Ohmuro

Research output: Contribution to journal (Article)

Abstract

This paper proposes a new speech coding method, pitch synchronous innovation code excited linear prediction (PSI-CELP). The method is based on CELP but adds pitch synchronous innovation, so that even random codevectors are adaptively converted to have pitch periodicity in voiced frames. This scheme can improve the synthesized speech quality of voiced frames in low bit-rate CELP without increasing either computational complexity or bit rate. In addition to pitch synchronous innovation, this paper proposes four other methods that improve the quality of synthesized speech or reduce computational complexity for low bit-rate CELP: (1) the perceptual weighting filter is an MA-type filter and uses nonquantized LPC parameters; (2) the fixed codebook is prepared as part of the adaptive codebook for nonperiodic frames; (3) the random codebook consists of two-channel conjugate sub-codebooks; and (4) codebook searches are made using delayed decision. In a speech coding system with pitch synchronization of random codevectors, the improvements in SNR, segmental SNR, and opinion equivalent Q value are 1.6 dB, 1.0 dB, and 0.5 dB, respectively. These basic methods were adopted in the standard codec for the Japanese half-rate digital mobile telephone service.
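
The abstract does not spell out how a random codevector is given pitch periodicity. As a minimal sketch, assuming the conversion simply repeats the leading samples of the codevector at the pitch lag whenever the lag is shorter than the frame, the idea might look like the following. The function name `pitch_synchronize`, its parameters, and the `voiced` flag are hypothetical and are not taken from the paper.

```python
def pitch_synchronize(codevector, pitch_lag, voiced=True):
    """Return a copy of codevector that repeats every pitch_lag samples.

    Assumption (not from the paper): periodicity is imposed by repeating
    the first pitch_lag samples across the whole frame. Unvoiced frames
    and lags at least as long as the frame are left unchanged.
    """
    if pitch_lag <= 0:
        raise ValueError("pitch_lag must be a positive number of samples")
    n = len(codevector)
    if not voiced or pitch_lag >= n:
        return list(codevector)
    # Sample i is taken from position i modulo the pitch lag, so the
    # leading segment of length pitch_lag tiles the frame.
    return [codevector[i % pitch_lag] for i in range(n)]


# Hypothetical 7-sample random codevector and a pitch lag of 3 samples.
random_codevector = [0.9, -0.4, 0.1, 0.7, -0.8, 0.3, -0.2]
periodic = pitch_synchronize(random_codevector, pitch_lag=3)
print(periodic)
```

Because the repetition is derived from the pitch lag that the decoder already receives, a scheme of this kind needs no extra side information, which is consistent with the abstract's claim that the bit rate does not increase.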

Original language: English
Pages (from-to): 36-48
Number of pages: 13
Journal: Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)
Volume: 77
Issue number: 12
Publication status: Published - Dec 1994
Externally published: Yes

Fingerprint

Speech coding
Innovation
Computational complexity
Telephone
Synchronization

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

@article{6dc90c8d1c3141f59e045f4a30569e99,
title = "Pitch synchronous innovation code excited linear prediction ({PSI-CELP})",
abstract = "This paper proposes a new speech coding method, pitch synchronous innovation code excited linear prediction (PSI-CELP). The method is based on CELP but adds pitch synchronous innovation, so that even random codevectors are adaptively converted to have pitch periodicity in voiced frames. This scheme can improve the synthesized speech quality of voiced frames in low bit-rate CELP without increasing either computational complexity or bit rate. In addition to pitch synchronous innovation, this paper proposes four other methods that improve the quality of synthesized speech or reduce computational complexity for low bit-rate CELP: (1) the perceptual weighting filter is an MA-type filter and uses nonquantized LPC parameters; (2) the fixed codebook is prepared as part of the adaptive codebook for nonperiodic frames; (3) the random codebook consists of two-channel conjugate sub-codebooks; and (4) codebook searches are made using delayed decision. In a speech coding system with pitch synchronization of random codevectors, the improvements in SNR, segmental SNR, and opinion equivalent Q value are 1.6 dB, 1.0 dB, and 0.5 dB, respectively. These basic methods were adopted in the standard codec for the Japanese half-rate digital mobile telephone service.",
author = "Satoshi Miki and Takehiro Moriya and Kazunori Mano and Hitoshi Ohmuro",
year = "1994",
month = "12",
language = "English",
volume = "77",
pages = "36--48",
journal = "Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)",
issn = "1042-0967",
publisher = "John Wiley and Sons Inc.",
number = "12",

}

TY - JOUR

T1 - Pitch synchronous innovation code excited linear prediction (PSI-CELP)

AU - Miki, Satoshi

AU - Moriya, Takehiro

AU - Mano, Kazunori

AU - Ohmuro, Hitoshi

PY - 1994/12

Y1 - 1994/12

AB - This paper proposes a new speech coding method, pitch synchronous innovation code excited linear prediction (PSI-CELP). The method is based on CELP but adds pitch synchronous innovation, so that even random codevectors are adaptively converted to have pitch periodicity in voiced frames. This scheme can improve the synthesized speech quality of voiced frames in low bit-rate CELP without increasing either computational complexity or bit rate. In addition to pitch synchronous innovation, this paper proposes four other methods that improve the quality of synthesized speech or reduce computational complexity for low bit-rate CELP: (1) the perceptual weighting filter is an MA-type filter and uses nonquantized LPC parameters; (2) the fixed codebook is prepared as part of the adaptive codebook for nonperiodic frames; (3) the random codebook consists of two-channel conjugate sub-codebooks; and (4) codebook searches are made using delayed decision. In a speech coding system with pitch synchronization of random codevectors, the improvements in SNR, segmental SNR, and opinion equivalent Q value are 1.6 dB, 1.0 dB, and 0.5 dB, respectively. These basic methods were adopted in the standard codec for the Japanese half-rate digital mobile telephone service.

UR - http://www.scopus.com/inward/record.url?scp=0028754881&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0028754881&partnerID=8YFLogxK

M3 - Article

VL - 77

SP - 36

EP - 48

JO - Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)

JF - Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi)

SN - 1042-0967

IS - 12

ER -