語音處理實驗室 (speech processing laboratory)
近年學術著作:
期刊論文
[1]
C. Yu, K-H. Hung, S-S. Wang,
Y. Tsao and J-W. Hung, “Time-domain multi-modal bone/air conducted speech
enhancement,” IEEE Signal Processing Letters, 2020.
[2]
S-S. Wang, P. Lin, Y. Tsao,
J-W. Hung, B. Su, “Suppression by selecting wavelets
for feature compression in distributed speech recognition,” IEEE/ACM Trans. on Audio, Speech, and
Language Processing, March 2018 (SCI)
[3]
J-W. Hung, J-S. Lin and P-J.
Wu, “Employing robust principal component analysis for noise-robust speech
feature extraction in automatic speech recognition with the structure of a deep
neural network,” Applied System
Innovation, Aug 2018
[4]
S-K. Lee, J-W. Hung, “An
evaluation study of using various SNR-level training data in the denoising
autoencoder (DAE) technique for speech enhancement,” International Journal of Electrical, Electronics and Data Communication,
Apr 2018
[5]
S-S. Wang, A. Chern, Y. Tsao, J-W. Hung, X. Lu, Y-H. Lai and B. Su, “Wavelet speech enhancement based on nonnegative matrix
factorization,” IEEE Signal Processing Letters, May 2016 (SCI)
[6]
J-W. Hung, H-J. Hsieh and B.
Chen, “Robust speech recognition via enhancing the complex-valued acoustic
spectrum in modulation Domain,” IEEE/ACM Trans. on Audio,
Speech, and Language Processing, Feb 2016 (SCI)
[7]
Y-D. Wang, J-H. Jheng, H-J. Hsieh and J-W. Hung, “An evaluation study of
speaker and noise adaptation for nonnegative matrix factorization
based speech enhancement,” International Journal of
Electrical, Electronics and Data Communication, Nov 2015
[8]
H-J. Hsieh, H-T. Fan and
J-W. Hung, “Leveraging jointly spatial, temporal and modulation enhancement in
creating noise-robust features for speech recognition,” International Journal of Electrical, Electronics and Data
Communication, Nov 2015
國際會議論文
[1]
Y-J. Lu, C-F. Liao, X. Lu,
J-W. Hung, Y. Tsao, “Incorporating Broad Phonetic Information for Speech
Enhancement”, Interspeech 2020
[2]
C-L. Lin, Z-Q. Lin, S-S.
Wang, Y. Tsao and J-W. Hung, “Exponentiated magnitude spectrogram-based
relative-to-maximum masking for speech enhancement in adverse environments,”
IEEE International Conference on Consumer Electronics –Taiwan, 2020
[3]
Z-Q. Lin, C-L. Lin and J-W.
Hung, “Lowpass-filtered relative-to-maximum masking for speech enhancement in
noise-corrupted environments,” IEEE International Conference on Consumer
Electronics –Taiwan, 2020
[4]
S-K. Lee, S-S. Wang, Y.
Tsao, J-W. Hung, “Speech enhancement based on reducing the detail portion of
speech spectrograms in modulation domain via discrete wavelet transform,” in Proc. ISCSLP, 2018
[5]
J-W. Hung, J-S. Lin and P-J.
Wu, “Employing robust principal component analysis for noise-robust speech
feature extraction in automatic speech recognition with the structure of deep
neural network,” in Proc. ICASI, 2018
[6]
J-W. Hung, J-S. Lin, L-M
Lee, S-Yu Wang, “A study of integrating noise-robustness feature extraction
techniques with the reduced frame-rate acoustic models in mobile-device speech
recognition,” in Proc. AROB, 2018
[7]
C-L. Wu, H-P. Hsu, S-S.
Wang, J-W. Hung, Y-H. Lai, H-M. Wang, Y. Tsao, “Wavelet speech enhancement
based on robust principal component analysis,” in Proc. Interspeech, 2017
[8]
J-W. Hung and J-S. Lin,
“Enhancing the acoustic spectrogram in modulation domain via sparse nonnegative
matrix factorization for speech enhancement,” in Proc. AROB, 2017
[9]
J. C. Yang, S-S. Wang, Y.
Tsao and J-W. Hung, “Speech enhancement via ensemble modeling NMF adaptation,” in Proc. ICCE-TW, 2016
[10] H-J. Hsieh, J-H. Jheng, J-S.
Lin and J-W. Hung, “Linear prediction filtering on cepstral time series for
noise-robust speech recognition,” in Proc. ICCE-TW, 2016
[11] S-S. Wang, J. C. Yang; Y. Tsao and J-W. Hung, “Leveraging
nonnegative matrix factorization in processing the temporal modulation spectrum
for speech enhancement,” in Proc. ICCE-TW, 2016
[12] J-W. Hung and J-S. Lin, “A study of the noise-robustness
algorithms on various types of cepstral feature representation for real-world
speech recognition,” in Proc. AROB, 2016