Details of the submission

Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh, ,,Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion'', submitted.

Samples of the listening test #1

sentenceNaturalAnchorHMM
(Continuous vocoder)
DNN
(WORLD vocoder)
DNN
(Continuous vocoder)
DNN
(Continuous vocoder
+ envelope)
SLT / 001
SLT / 002
SLT / 003
SLT / 004
SLT / 005
SLT / 006
SLT / 007
SLT / 008
SLT / 009
SLT / 010
SLT / 011
SLT / 012
SLT / 013
SLT / 014
SLT / 015

Samples of the listening test #2

sentenceNaturalAnchorDNNBLSTMHybrid
AWB / 01
AWB / 02
AWB / 03
AWB / 04
AWB / 05
AWB / 06
AWB / 07
AWB / 08
AWB / 09
AWB / 10
SLT / 01
SLT / 02
SLT / 03
SLT / 04
SLT / 05
SLT / 06
SLT / 07
SLT / 08
SLT / 09
SLT / 10

Samples of the listening test #3 (VC)

sentenceTargetSourceVoice Conversion
(Sprocket)
Voice Conversion
(MagPhase)
Voice Conversion
(WORLD)
Voice Conversion
(CSM - Proposed)
BDL-to-CLB
BDL-to-JMK
BDL-to-SLT
CLB-to-BDL
CLB-to-JMK
CLB-to-SLT
JMK-to-BDL
JMK-to-CLB
JMK-to-SLT
SLT-to-BDL
SLT-to-CLB
SLT-to-JMK