Speech Recognition Python Tutorial

Speech To Speech: Build local voice agents with open-source models

The pipeline provides a fully open and modular approach, with a focus on leveraging models available through the Transformers library on the Hugging Face hub. The code is designed for easy ...

IEEE

Curriculum Learning aided Audio-Visual Speech Recognition with Arbitrary Speaker Number

Abstract: Recently, audio-visual speech recognition has attracted increasing attention. However, most existing works only focused on scenarios with two speakers. In this work, we study the effect of ...

IEEE

Keyword Guided Target Speech Recognition

Abstract: This letter presents a new target speech recognition problem, where the target speech is defined by a keyword. For instance, when a person speaks “Hey Google” or “Help Me”, we hope the model ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果

Speech To Speech: Build local voice agents with open-source models

Curriculum Learning aided Audio-Visual Speech Recognition with Arbitrary Speaker Number

Keyword Guided Target Speech Recognition

今日热点