News

There are about 7 000 languages in the world, but just a small fraction of them are supported by AI language models. Nvidia is tackling the problem with a new dataset and models that support the ...
From the voice-to-text feature on your phone to the captions that make videos more accessible, speech transcription is ...
They then adapted Whisper, an open-source, large-scale speech recognition model powered by artificial intelligence (AI), to decode the vibrations into recognizable speech transcriptions.
Mistral has released Voxtral, a large language model aimed at speech recognition (ASR) applications that seek to integrate more advanced LLM-based capabilities and go beyond simple transcription ...
Justin Antonipillai, counselor to Commerce Secretary Penny Pritzker, wrote in a blog post published Sept. 26 that the Alexa Skills Open Data Challenge held from Oct. 7 to 8 focused on using ...
Discover the pros and cons of ElevenLabs vs open-source voice cloning tools. Learn which option suits your needs for lifelike voice replication ...
VCG. The US tech industry is endorsing a new plan this week called the ATOM Project, namely American Truly Open Models, which aims to win back the US lead in open-source ...