Advancing Speech Processing with End-to-End Modeling and LLM Integration
Room: 1302, Bldg: Sobrato Campus for Discovery and Innovation Building , Santa Clara University, 500 El Camino Real, Santa Clara, California, United States, 95053Abstract The field of speech processing is currently dominated by end-to-end (E2E) models, which utilize a single model to optimize directly towards the final objective function rather than optimizing multiple sub-models separately. This trend is particularly notable in automatic speech recognition (ASR). In this talk, we will provide an overview of E2E ASR models and […]