References |
Type |
Details |
[11] Blauth, D. A., V. P. Minotto, C. R. Jung, B. Lee, and T. Kalker (2012b). Voice activity detection and speaker localization using audiovisual cues. Pattern Recognition Letters 33 (4), 373–380. Intelligent Multimedia Interactivity
Book, journal
Title:
Voice activity detection and speaker localization using audiovisual cues
Authors:
D. A. Blauth, V. P. Minotto, C. R. Jung, B. Lee, T. Kalker
Publisher:
Pattern Recognition Letters
Year:
2012
|
[12] Boashash, B. (2015). Time-Frequency Signal Analysis with Applications. UK: Academic Press
Book, journal
Title:
Time-Frequency Signal Analysis with Applications
Authors:
B. Boashash
Publisher:
Academic Press
Year:
2015
|
[15] Chen, Y., M. Ding, and J. A. S. Kelso (1997, December). Long Memory Processes (1/f^α Type) in Human Coordination. Physical Review Letters 79, 4501–4504
Book, journal
Title:
Long Memory Processes (1/f^α Type) in Human Coordination
Authors:
Y. Chen, M. Ding, J. A. S. Kelso
Publisher:
Physical Review Letters
Year:
1997
|
[16] Cho, Y. D., K. Al-Naimi, and A. Kondoz (2001, Apr). Mixed decision-based noise adaptation for speech enhancement. Electronics Letters 37 (8), 540–542
Book, journal
Title:
Mixed decision-based noise adaptation for speech enhancement
Authors:
Y. D. Cho, K. Al-Naimi, A. Kondoz
Publisher:
Electronics Letters
Year:
2001
|
[20] Davis, A., S. Nordholm, and R. Togneri (2006). Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold. IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 412–
Book, journal
Title:
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
Authors:
A. Davis, S. Nordholm, R. Togneri
Publisher:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2006
|
[22] Dimitriadis, D., P. Maragos, and A. Potamianos (2002, May). Modulation features for speech recognition. In Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, Volume 1, pp. I–377–I–380
Book, journal
Title:
Modulation features for speech recognition
Authors:
D. Dimitriadis, P. Maragos, A. Potamianos
Publisher:
IEEE International Conference on Acoustics, Speech, and Signal Processing
Year:
2002
|
[23] Dov, D., R. Talmon, and I. Cohen (2015, April). Audio-visual voice activity detection using diffusion maps. IEEE/ACM Trans. Audio, Speech and Lang. Proc. 23 (4), 732–745
Book, journal
Title:
Audio-visual voice activity detection using diffusion maps
Authors:
D. Dov, R. Talmon, I. Cohen
Publisher:
IEEE/ACM Trans. Audio, Speech and Lang. Proc.
Year:
2015
|
[25] Eroglu, D., T. K. D. Peron, N. Marwan, F. A. Rodrigues, L. d. F. Costa, M. Sebek, I. Z. Kiss, and J. Kurths (2014, Oct). Entropy of weighted recurrence plots. Phys. Rev. E 90, 042919
Book, journal
Title:
Entropy of weighted recurrence plots
Authors:
D. Eroglu, T. K. D. Peron, N. Marwan, F. A. Rodrigues, L. d. F. Costa, M. Sebek, I. Z. Kiss, J. Kurths
Publisher:
Phys. Rev. E
Year:
2014
|
[26] Farmer, J. D. and J. J. Sidorowich (2013). Exploiting Chaos to Predict the Future and Reduce Noise, pp. 277–330. World Scientific
Book, journal
Title:
Exploiting Chaos to Predict the Future and Reduce Noise
Authors:
J. D. Farmer, J. J. Sidorowich
Publisher:
World Scientific
Year:
2013
|
[27] Fraser, A. M. and H. L. Swinney (1986, Feb). Independent coordinates for strange attractors from mutual information. Phys. Rev. A 33, 1134–1140
Book, journal
Title:
Independent coordinates for strange attractors from mutual information
Authors:
A. M. Fraser, H. L. Swinney
Publisher:
Phys. Rev. A
Year:
1986
|
[28] Freeman, D., G. Cosier, C. Southcott, and I. Boyd (1989). The voice activity detector for the pan-european digital cellular mobile telephone service. In Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 369–372. IEEE
Book, journal
Title:
The voice activity detector for the pan-european digital cellular mobile telephone service
Authors:
D. Freeman, G. Cosier, C. Southcott, I. Boyd
Publisher:
IEEE
Year:
1989
|
[29] Gao, X., H. Cao, J. Zhang, J. Bai, T. Zhang, and L. Jia (2013). A real-time dsp-based system for voice activity detection: Design and implement. International Journal of Signal Processing, Image Processing and Pattern Recognition 6 (6), 27–40
Book, journal
Title:
A real-time dsp-based system for voice activity detection: Design and implement
Authors:
X. Gao, H. Cao, J. Zhang, J. Bai, T. Zhang, L. Jia
Publisher:
International Journal of Signal Processing, Image Processing and Pattern Recognition
Year:
2013
|
[30] Garza, V. R. (1997). Product reviews: Continuous speech-recognition software: Naturallyspeaking edges out viavoice with hands-free editing. In InfoWorld, pp. 116
Book, journal
Title:
Product reviews: Continuous speech-recognition software: Naturallyspeaking edges out viavoice with hands-free editing
Authors:
V. R. Garza
Publisher:
InfoWorld
Year:
1997
|
[31] Gazor, S. and W. Zhang (2003a, Sept). A soft voice activity detector based on a laplacian-gaussian model. IEEE Transactions on Speech and Audio Processing 11 (5), 498–505
Book, journal
Title:
A soft voice activity detector based on a laplacian-gaussian model
Authors:
S. Gazor, W. Zhang
Publisher:
IEEE Transactions on Speech and Audio Processing
Year:
2003
|
[33] Gold, B. and N. Morgan (1999). Speech and Audio Signal Processing: Processing and Perception of Speech and Music (1st ed.). New York, NY, USA: John Wiley & Sons, Inc
Book, journal
Title:
Speech and Audio Signal Processing: Processing and Perception of Speech and Music
Authors:
B. Gold, N. Morgan
Publisher:
John Wiley & Sons, Inc
Year:
1999
|
[34] Haigh, J. A. and J. S. Mason (1993, Oct). Robust voice activity detection using cepstral features. In TENCON ’93. Proceedings. Computer, Communication, Control and Power Engineering. 1993 IEEE Region 10 Conference on, Volume 3, pp. 321–324 vol.3
Book, journal
Title:
Robust voice activity detection using cepstral features
Authors:
J. A. Haigh, J. S. Mason
Publisher:
1993 IEEE Region 10 Conference on
Year:
1993
|
[36] Hamila, R., M. Renfors, M. Gabbouj, and J. Astola (1997). Time-frequency signal analysis using teager energy. In Proc. Fourth International Conference on Electronics, Circuits and Systems, (Cairo, Egypt), pp. 911–914, December 1997
Book, journal
Title:
Time-frequency signal analysis using teager energy
Authors:
R. Hamila, M. Renfors, M. Gabbouj, J. Astola
Publisher:
Proc. Fourth International Conference on Electronics, Circuits and Systems
Year:
1997
|
[37] Haykin, S. (2001). Adaptive Filter Theory (4th ed.). New York, NY, USA: Prentice Hall
Book, journal
Title:
Adaptive Filter Theory
Authors:
S. Haykin
Publisher:
Prentice Hall
Year:
2001
|
[38] Hermansky, H. (1990). Perceptual linear predictive (plp) analysis of speech. The Journal of the Acoustical Society of America 87 (4), 1738–1752
Book, journal
Title:
Perceptual linear predictive (plp) analysis of speech
Authors:
H. Hermansky
Publisher:
The Journal of the Acoustical Society of America
Year:
1990
|
[39] Hermansky, H., N. Morgan, and H. G. Hirsch (1993, April). Recognition of speech in additive and convolutional noise based on rasta spectral processing. In Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on, Volume 2, pp. 83–86 vol.2
Book, journal
Title:
Recognition of speech in additive and convolutional noise based on rasta spectral processing
Authors:
H. Hermansky, N. Morgan, H. G. Hirsch
Publisher:
ICASSP-93, 1993 IEEE International Conference on
Year:
1993
|