BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Silicon Valley Engineering Council - ECPv6.15.20//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://svec.org
X-WR-CALDESC:Events for Silicon Valley Engineering Council
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20240310T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20241103T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20250309T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20251102T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20260308T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20261101T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20250307T183000
DTEND;TZID=America/Los_Angeles:20250307T210000
DTSTAMP:20260422T025345
CREATED:20250207T074809Z
LAST-MODIFIED:20250207T074809Z
UID:65625-1741372200-1741381200@svec.org
SUMMARY:Advancing Speech Processing with End-to-End Modeling and LLM Integration
DESCRIPTION:Abstract\nThe field of speech processing is currently dominated by end-to-end (E2E) models\, which utilize a single model to optimize directly towards the final objective function rather than optimizing multiple sub-models separately. This trend is particularly notable in automatic speech recognition (ASR). In this talk\, we will provide an overview of E2E ASR models and discuss recent advancements from an industry perspective. Subsequently\, we will examine the trend of E2E modeling beyond ASR\, with applications such as multi-speaker ASR and simultaneous speech translation\, where ASR traditionally serves as only one of several components. This trend ultimately unlocks multimodal intelligence by integrating speech capabilities into large language models (LLM). We will highlight the most recent developments in this area\, which present unprecedented opportunities for the field.\nSpeaker(s): Jinyu Li\,\nAgenda:\n6:30 – 7:00 Check-in\, networking\, food\, and drink\n7:00 – 8:30 PM – Presentation by Dr. Jinyu Li\n8:30 – 9:00 PM – Q & A\nRoom: 1302\, Bldg: Sobrato Campus for Discovery and Innovation Building \, Santa Clara University\, 500 El Camino Real\, Santa Clara\, California\, United States\, 95053
URL:https://svec.org/event/advancing-speech-processing-with-end-to-end-modeling-and-llm-integration/
LOCATION:Room: 1302\, Bldg: Sobrato Campus for Discovery and Innovation Building \, Santa Clara University\, 500 El Camino Real\, Santa Clara\, California\, United States\, 95053
END:VEVENT
END:VCALENDAR