Overview of Acoustic Models

An overview of acoustic models in automatic speech recognition.

References

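Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Pearson) - A comprehensive textbook that covers speech recognition in detail, including the principles of acoustic modeling, feature extraction, and the full ASR pipeline.

Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks, Alex Graves, Santiago Fernández, Faustino Gomez, and Jürgen Schmidhuber, 2006. Proceedings of the 23rd International Conference on Machine Learning (ICML) (Association for Computing Machinery). DOI: 10.1145/1143844.1143891 - The foundational paper introducing Connectionist Temporal Classification (CTC), a key algorithm for training recurrent neural networks on sequence-to-sequence tasks such as acoustic modeling without explicitly pre-segmenting the input.

Deep Neural Networks for Acoustic Modeling in Speech Recognition, Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, and Andrew Senior, 2012. IEEE Signal Processing Magazine, Vol. 29 (IEEE). DOI: 10.1109/MSP.2012.2205597 - A seminal survey and tutorial that established deep neural networks (DNNs) as the dominant approach to acoustic modeling in speech recognition, with discussion of their architectures and training.

© 2025 ApX Machine Learning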