New 2-kbit/s speech coder based on normalized pitch waveform

Yuusuke Hiwasaki, Kazunori Mano

研究成果: Conference contribution

2 引用 (Scopus)

抄録

Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is difficult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an efficient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.

元の言語English
ホスト出版物のタイトルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
編集者 Anon
出版者IEEE
ページ1583-1586
ページ数4
2
出版物ステータスPublished - 1997
外部発表Yes
イベントProceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. Part 1 (of 5) - Munich, Ger
継続期間: 1997 4 211997 4 24

Other

OtherProceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. Part 1 (of 5)
Munich, Ger
期間97/4/2197/4/24

Fingerprint

Speech coding
coders
waveforms
coding
Speech communication
Computer networks
voice communication
computer networks

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

これを引用

Hiwasaki, Y., & Mano, K. (1997). New 2-kbit/s speech coder based on normalized pitch waveform. : Anon (版), ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (巻 2, pp. 1583-1586). IEEE.

New 2-kbit/s speech coder based on normalized pitch waveform. / Hiwasaki, Yuusuke; Mano, Kazunori.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 版 / Anon. 巻 2 IEEE, 1997. p. 1583-1586.

研究成果: Conference contribution

Hiwasaki, Y & Mano, K 1997, New 2-kbit/s speech coder based on normalized pitch waveform. : Anon (版), ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻. 2, IEEE, pp. 1583-1586, Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. Part 1 (of 5), Munich, Ger, 97/4/21.
Hiwasaki Y, Mano K. New 2-kbit/s speech coder based on normalized pitch waveform. : Anon, 編集者, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 巻 2. IEEE. 1997. p. 1583-1586
Hiwasaki, Yuusuke ; Mano, Kazunori. / New 2-kbit/s speech coder based on normalized pitch waveform. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 編集者 / Anon. 巻 2 IEEE, 1997. pp. 1583-1586
@inproceedings{6b38b63622074b5ca499e0f60ca1da73,
title = "New 2-kbit/s speech coder based on normalized pitch waveform",
abstract = "Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is difficult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an efficient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.",
author = "Yuusuke Hiwasaki and Kazunori Mano",
year = "1997",
language = "English",
volume = "2",
pages = "1583--1586",
editor = "Anon",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "IEEE",

}

TY - GEN

T1 - New 2-kbit/s speech coder based on normalized pitch waveform

AU - Hiwasaki, Yuusuke

AU - Mano, Kazunori

PY - 1997

Y1 - 1997

N2 - Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is difficult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an efficient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.

AB - Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is difficult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an efficient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.

UR - http://www.scopus.com/inward/record.url?scp=0030643566&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030643566&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0030643566

VL - 2

SP - 1583

EP - 1586

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

A2 - Anon, null

PB - IEEE

ER -