


OpenAI Open-Sources Whisper, a Multilingual Speech Recognition System


News category: IT Security News
Source: news.slashdot.org

Speech recognition remains a challenging problem in AI and machine learning. In a step toward solving it, OpenAI today open-sourced Whisper, an automatic speech recognition system that the company claims enables "robust" transcription in multiple languages as well as translation from those languages into English.

TechCrunch reports: Countless organizations have developed highly capable speech recognition systems, which sit at the core of software and services from tech giants like Google, Amazon and Meta. But what makes Whisper different, according to OpenAI, is that it was trained on 680,000 hours of multilingual and "multitask" data collected from the web, which leads to improved recognition of unique accents, background noise and technical jargon.

"The primary intended users of [the Whisper] models are AI researchers studying robustness, generalization, capabilities, biases and constraints of the current model. However, Whisper is also potentially quite useful as an automatic speech recognition solution for developers, especially for English speech recognition," OpenAI wrote in the GitHub repo for Whisper, where several versions of the system can be downloaded. "[The models] show strong ASR results in ~10 languages. They may exhibit additional capabilities ... if fine-tuned on certain tasks like voice activity detection, speaker classification or speaker diarization but have not been robustly evaluated in these areas."

Whisper has its limitations, particularly in the area of text prediction. Because the system was trained on a large amount of "noisy" data, OpenAI cautions that Whisper might include words in its transcriptions that weren't actually spoken -- possibly because it is simultaneously trying to predict the next word in the audio and to transcribe the audio itself. Moreover, Whisper doesn't perform equally well across languages: its error rate is higher for speakers of languages that aren't well represented in the training data.

Despite this, OpenAI sees Whisper's transcription capabilities being used to improve existing accessibility tools.
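
The "error rate" mentioned above is not defined in the summary. In speech recognition it is commonly reported as word error rate (WER): the number of word substitutions, insertions and deletions needed to turn the transcript into the reference, divided by the number of reference words. The sketch below assumes that metric; the function name `word_error_rate` and the sample sentences are illustrative and do not come from OpenAI's evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: simple lowercase + whitespace tokenization. Real evaluations
    usually apply heavier text normalization before scoring.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from transcript
                curr[j - 1] + 1,     # insertion: extra word in transcript
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    spoken = "the cat sat on the mat"
    # A transcript with one word that was never spoken counts as one insertion.
    transcript = "the cat sat on the mat today"
    print(f"WER: {word_error_rate(spoken, transcript):.3f}")
```

Under this metric, a word that appears in the transcript but was never spoken (the failure mode OpenAI cautions about) is scored as an insertion, so it raises the error rate even when every spoken word was transcribed correctly.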














