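ROC Year 95 National Science Council, Department of Engineering: Principal Investigator's Self-Assessment of Research Performance over the Past Five Years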

Speech Processing Laboratory

Professor Jeih-weih Hung (洪志偉)

Recent academic publications:

Journal papers

[1] C. Yu, K-H. Hung, S-S. Wang, Y. Tsao and J-W. Hung, “Time-domain multi-modal bone/air conducted speech enhancement,” IEEE Signal Processing Letters, 2020.

[2] S-S. Wang, P. Lin, Y. Tsao, J-W. Hung, B. Su, “Suppression by selecting wavelets for feature compression in distributed speech recognition,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, March 2018 (SCI)

[3] J-W. Hung, J-S. Lin and P-J. Wu, “Employing robust principal component analysis for noise-robust speech feature extraction in automatic speech recognition with the structure of a deep neural network,” Applied System Innovation, Aug 2018

[4] S-K. Lee, J-W. Hung, “An evaluation study of using various SNR-level training data in the denoising autoencoder (DAE) technique for speech enhancement,” International Journal of Electrical, Electronics and Data Communication, Apr 2018

[5] S-S. Wang, A. Chern, Y. Tsao, J-W. Hung, X. Lu, Y-H. Lai and B. Su, “Wavelet speech enhancement based on nonnegative matrix factorization,” IEEE Signal Processing Letters, May 2016 (SCI)

[6] J-W. Hung, H-J. Hsieh and B. Chen, “Robust speech recognition via enhancing the complex-valued acoustic spectrum in modulation domain,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, Feb 2016 (SCI)

[7] Y-D. Wang, J-H. Jheng, H-J. Hsieh and J-W. Hung, “An evaluation study of speaker and noise adaptation for nonnegative matrix factorization based speech enhancement,” International Journal of Electrical, Electronics and Data Communication, Nov 2015

[8] H-J. Hsieh, H-T. Fan and J-W. Hung, “Leveraging jointly spatial, temporal and modulation enhancement in creating noise-robust features for speech recognition,” International Journal of Electrical, Electronics and Data Communication, Nov 2015

International conference papers

[1] Y-J. Lu, C-F. Liao, X. Lu, J-W. Hung, Y. Tsao, “Incorporating broad phonetic information for speech enhancement,” in Proc. Interspeech, 2020

[2] C-L. Lin, Z-Q. Lin, S-S. Wang, Y. Tsao and J-W. Hung, “Exponentiated magnitude spectrogram-based relative-to-maximum masking for speech enhancement in adverse environments,” IEEE International Conference on Consumer Electronics – Taiwan, 2020

[3] Z-Q. Lin, C-L. Lin and J-W. Hung, “Lowpass-filtered relative-to-maximum masking for speech enhancement in noise-corrupted environments,” IEEE International Conference on Consumer Electronics – Taiwan, 2020

[4] S-K. Lee, S-S. Wang, Y. Tsao, J-W. Hung, “Speech enhancement based on reducing the detail portion of speech spectrograms in modulation domain via discrete wavelet transform,” in Proc. ISCSLP, 2018

[5] J-W. Hung, J-S. Lin and P-J. Wu, “Employing robust principal component analysis for noise-robust speech feature extraction in automatic speech recognition with the structure of deep neural network,” in Proc. ICASI, 2018

[6] J-W. Hung, J-S. Lin, L-M. Lee, S-Yu Wang, “A study of integrating noise-robustness feature extraction techniques with the reduced frame-rate acoustic models in mobile-device speech recognition,” in Proc. AROB, 2018

[7] C-L. Wu, H-P. Hsu, S-S. Wang, J-W. Hung, Y-H. Lai, H-M. Wang, Y. Tsao, “Wavelet speech enhancement based on robust principal component analysis,” in Proc. Interspeech, 2017

[8] J-W. Hung and J-S. Lin, “Enhancing the acoustic spectrogram in modulation domain via sparse nonnegative matrix factorization for speech enhancement,” in Proc. AROB, 2017

[9] J. C. Yang, S-S. Wang, Y. Tsao and J-W. Hung, “Speech enhancement via ensemble modeling NMF adaptation,” in Proc. ICCE-TW, 2016

[10] H-J. Hsieh, J-H. Jheng, J-S. Lin and J-W. Hung, “Linear prediction filtering on cepstral time series for noise-robust speech recognition,” in Proc. ICCE-TW, 2016

[11] S-S. Wang, J. C. Yang, Y. Tsao and J-W. Hung, “Leveraging nonnegative matrix factorization in processing the temporal modulation spectrum for speech enhancement,” in Proc. ICCE-TW, 2016

[12] J-W. Hung and J-S. Lin, “A study of the noise-robustness algorithms on various types of cepstral feature representation for real-world speech recognition,” in Proc. AROB, 2016