Predictive vector quantized variational autoencoder for spectral envelope quantization

Tanasan Srikotr, Kazunori Mano

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The Predictive Vector Quantized Variational AutoEncoder is proposed to improve the reconstruction error of the conventional VQ-VAE. The proposed model can predict the current data from the previous data. The performance of the quantized spectral envelope parameters of the high-quality 48 kHz WORLD vocoder is evaluated. The results indicate that the Predictive Vector Quantized Variational AutoEncoder has a lower distortion with four target bitrates in term of log-spectral distortion, compared with the conventional VQ-VAE.

Original languageEnglish
Title of host publication2020 International Conference on Electronics, Information, and Communication, ICEIC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728162898
DOIs
Publication statusPublished - 2020 Jan
Event2020 International Conference on Electronics, Information, and Communication, ICEIC 2020 - Barcelona, Spain
Duration: 2020 Jan 192020 Jan 22

Publication series

Name2020 International Conference on Electronics, Information, and Communication, ICEIC 2020

Conference

Conference2020 International Conference on Electronics, Information, and Communication, ICEIC 2020
Country/TerritorySpain
CityBarcelona
Period20/1/1920/1/22

Keywords

  • Predictive vector quantization
  • Spectral envelope
  • Vector quantized variational autoencoder
  • WORLD vocoder

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems and Management
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Predictive vector quantized variational autoencoder for spectral envelope quantization'. Together they form a unique fingerprint.

Cite this