References |
Type |
Details |
[3] William Chan, Navdeep Jaitly, Quoc Le, and Oriol Vinyals. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition. In Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pages 4960–4964. IEEE, 2016 |
Book, journal |
Title: |
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition |
Authors: |
William Chan, Navdeep Jaitly, Quoc Le, Oriol Vinyals |
Publisher: |
IEEE |
Year: |
2016 |
|
[4] Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, and Navdeep Jaitly. A Comparison of sequence-to-sequence models for speech recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, volume 2017-August, pages 939–943, 2017 |
Book, journal |
Title: |
A Comparison of sequence-to-sequence models for speech recognition |
Authors: |
Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, Navdeep Jaitly |
Publisher: |
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Year: |
2017 |
|
[5] Hiroaki Sakoe and Seibi Chiba. Readings in speech recognition. Chapter Dynamic Programming Algorithm Optimization for Spoken Word Recognition, pages 159–165. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1990 |
Book, journal |
Title: |
Readings in speech recognition |
Authors: |
Hiroaki Sakoe, Seibi Chiba |
Publisher: |
Morgan Kaufmann Publishers Inc. |
Year: |
1990 |
|
[6] Xuedong Huang, James Baker, and Raj Reddy. A Historical Perspective of Speech Recognition. Commun. ACM, 57(1):94–103, 2014 |
Book, journal |
Title: |
A Historical Perspective of Speech Recognition |
Authors: |
Xuedong Huang, James Baker, Raj Reddy |
Publisher: |
Commun. ACM |
Year: |
2014 |
|
[7] Alex Graves, Santiago Fernández, Faustino Gomez, and Jürgen Schmidhuber. Connectionist temporal classification. Proceedings of the 23rd international conference on Machine learning - ICML ’06, pages 369–376, 2006 |
Book, journal |
Title: |
Connectionist temporal classification |
Authors: |
Alex Graves, Santiago Fernández, Faustino Gomez, Jürgen Schmidhuber |
Publisher: |
Proceedings of the 23rd international conference on Machine learning - ICML '06 |
Year: |
2006 |
|
[8] Theodore Bluche, Hermann Ney, Jerome Louradour, and Christopher Kermorvant. Framewise and ctc training of neural networks for handwriting recognition. In Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR), ICDAR ’15, pages 81– |
Book, journal |
Title: |
Framewise and ctc training of neural networks for handwriting recognition |
Authors: |
Theodore Bluche, Hermann Ney, Jerome Louradour, Christopher Kermorvant |
Publisher: |
Proceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR) |
Year: |
2015 |
|
[9] Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, and Andrew Y. Ng. Deep speech: Scaling up end-to-end speech recognition. CoRR, abs/1412.5567, 2014 |
Book, journal |
Title: |
Deep speech: Scaling up end-to-end speech recognition |
Authors: |
Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng |
Publisher: |
CoRR |
Year: |
2014 |
|
[10] Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, and Yoshua Bengio. Attention-based models for speech recognition. CoRR, abs/1506.07503, 2015 |
Book, journal |
Title: |
Attention-based models for speech recognition |
Authors: |
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, Yoshua Bengio |
Publisher: |
CoRR |
Year: |
2015 |
|
[11] Albert Zeyer, Kazuki Irie, Ralf Schlüter, and Hermann Ney. Improved training of end-to-end attention models for speech recognition. CoRR, abs/1805.03294, 2018 |
Book, journal |
Title: |
Improved training of end-to-end attention models for speech recognition |
Authors: |
Albert Zeyer, Kazuki Irie, Ralf Schlüter, Hermann Ney |
Publisher: |
CoRR |
Year: |
2018 |
|
[13] Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, and Zhenyao Zhu. Exploring neural transducers for end-to-end speech recognition. CoRR, abs/1707.07413, 2017 |
Book, journal |
Title: |
Exploring neural transducers for end-to-end speech recognition |
Authors: |
Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu |
Publisher: |
CoRR |
Year: |
2017 |
|
[16] Yann LeCun, Léon Bottou, Genevieve B. Orr, and Klaus-Robert Müller. Efficient backprop. In Neural Networks: Tricks of the Trade - Second Edition, pages 9–48. 2012 |
Book, journal |
Title: |
Efficient backprop |
Authors: |
Yann LeCun, Léon Bottou, Genevieve B. Orr, Klaus-Robert Müller |
Year: |
2012 |
|
[17] Luong Chi Mai and Dang Ngoc Duc. Design of Vietnamese speech corpus and current status. In Proceedings of the International Symposium on Chinese Spoken Language Processing (ISCSLP), volume 6, pages 748–758, 2006 |
Book, journal |
Title: |
Design of Vietnamese speech corpus and current status |
Authors: |
Luong Chi Mai, Dang Ngoc Duc |
Publisher: |
Proceedings of the International Symposium on Chinese Spoken Language Processing (ISCSLP) |
Year: |
2006 |
|
[20] Hugo Van Hamme and Filip Van Aelten. An adaptive-beam pruning technique for continuous speech recognition. In Fourth International Conference on Spoken Language Processing, 1996 |
Book, journal |
Title: |
An adaptive-beam pruning technique for continuous speech recognition |
Authors: |
Hugo Van Hamme, Filip Van Aelten |
Year: |
1996 |
|
[22] B. H. Tran, H. Ney, and V. Steinbiss. Improvements in Beam Search. Proc. of the International Conference on Spoken Language Processing (ICSLP), (July 2014):2140–2143, 1994 |
Book, journal |
Title: |
Improvements in Beam Search |
Authors: |
B. H. Tran, H. Ney, V. Steinbiss |
Publisher: |
Proc. of the International Conference on Spoken Language Processing (ICSLP) |
Year: |
1994 |
|
[23] Stefan Ortmanns, Hermann Ney, and Andreas Eiden. Language-model look-ahead for large vocabulary speech |
Book, journal |
Title: |
Language-model look-ahead for large vocabulary speech |
Authors: |
Stefan Ortmanns, Hermann Ney, Andreas Eiden |
|
[25] Kenneth Heafield. KenLM: Faster and smaller language model queries. In Proceedings of the Sixth Workshop on Statistical Machine Translation, WMT ’11, pages 187–197, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics |
Book, journal |
Title: |
KenLM: Faster and smaller language model queries |
Authors: |
Kenneth Heafield |
Publisher: |
Association for Computational Linguistics |
Year: |
2011 |
|
[12] Alex Graves, Abdel-rahman Mohamed, and Geoffrey E. Hinton. Speech recognition with deep recurrent neural networks. CoRR, abs/1303.5778, 2013 |
Other |
|
[14] Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, and Sanjeev Khudanpur. Purely sequence-trained neural networks for asr based on lattice-free mmi. In Interspeech, pages 2751–2755, 2016 |
Other |
|
[15] Abdel-rahman Mohamed, George Dahl, and Geoffrey Hinton. Deep belief networks for phone recognition. In NIPS workshop on deep learning for speech recognition and related applications, volume 1, page 39. Vancouver, Canada, 2009 |
Other |
|
[18] Andrew L. Maas, Awni Y. Hannun, Daniel Jurafsky, and Andrew Y. Ng. First-pass large vocabulary continuous speech recognition using bi-directional recurrent dnns. CoRR, abs/1408.2873, 2014 |
Other |
|