📚 HPR3219: Linux Inlaws S01E18: Voice Recognition and Text to Speech

🕛 Time since publication: 1220 days, 10 hours 23 minutes
📆 Published on: 3 December 2020 at 01:00
💡 News category: Podcasts
🔗 Source: hackerpublicradio.org

In this episode, Chris is harassed by quite a few artificial nuisance callers, among them drug lords, Irish nurses and some random Linux Inlaws Chief Financial Officer. Based on these examples, our two heroes discuss the history and current state of text-to-speech (TTS) and voice recognition. We attempted to use voice recognition software to produce a transcript of the show.

Shownotes:

WaveNet: https://deepmind.com/blog/article/wavenet-generative-model-raw-audio
Tacotron: https://ai.googleblog.com/2017/12/tacotron-2-generating-human-like-speech.html
DeepSpeech: https://github.com/mozilla/DeepSpeech
Lyrebird / Welcome.AI: https://www.welcome.ai/lyrebird
Nvidia Tacotron 2: https://github.com/NVIDIA/tacotron2
TensorFlow: https://www.tensorflow.org
PyTorch: https://pytorch.org
Mel spectrograms: https://medium.com/analytics-vidhya/understanding-the-mel-spectrogram-fca2afa2ce53
Graphcore: https://www.graphcore.ai
FPGA: https://en.wikipedia.org/wiki/Field-programmable_gate_array
IBM ROMP: https://en.wikipedia.org/wiki/IBM_ROMP
Google's TTS: https://cloud.google.com/text-to-speech
Apple M1: https://www.gsmarena.com/the_apple_m1_is_the_first_armbased_chipset_for_macs_with_the_fastest_cpu_cores_and_top_igpu-news-46222.php
Secure Enclaves: https://support.apple.com/guide/security/secure-enclave-overview-sec59b0b31ff/web
OSDU: https://www.opengroup.org/osdu/forum-homepage
Jack Kerouac's On the Road: https://en.wikipedia.org/wiki/On_the_Road

...
