Premium Only Content

How to do Free Speech-to-Text Transcription Better Than Google Premium API with OpenAI Whisper Model
If you want to transcribe your videos and audio into text for free but with high quality, you have come to the correct video.
In this tutorial video, I will guide you on how to use #OpenAI #Whisper model. I will show you how to install and run Open AI's Whisper from scratch. I will demonstrate to you how to convert audio/speech into text.
Whisper is a general-purpose speech recognition model released for free by Open AI. I claim that Whisper is the best available Speech-to-Text model (Natural Language Processing - #NLP) released to public usage including premium paid ones such as Amazon Web Services, Microsoft Azure Cloud Platform, or Google Cloud API. And Whisper is free to use.
I will show you how to install the necessary Python code and the dependent libraries. I will show you how to download a video from YouTube with YT-DLP, how to cut certain parts of the video with LosslessCut, and how to extract the audio of a video with FFMPEG. I will show you how to do a transcription of a video or a sound. I will show you how to generate subtitles for any video. Finally, I will show you how to generate translated transcription and subtitles of any language video.
With the translation feature of the Whisper model, you can watch any language (Whisper supports 99 languages) with English subtitles. Let's say you can find English subtitles for your favorite video in German or Japanese or Arabic. It is not a problem. Just follow my tutorial and generated English translated subtitles.
Actually, to be precise, Whisper is able to transcribe speech to text in all the following languages, and therefore, translation of these following languages into English:
{af,am,ar,as,az,ba,be,bg,bn,bo,br,bs,ca,cs,cy,da,de,el,en,es,et,eu,fa,fi,fo,fr,gl,gu,ha,haw,hi,hr,ht,hu,hy,id,is,it,iw,ja,jw,ka,kk,km,kn,ko,la,lb,ln,lo,lt,lv,mg,mi,mk,ml,mn,mr,ms,mt,my,ne,nl,nn,no,oc,pa,pl,ps,pt,ro,ru,sa,sd,si,sk,sl,sn,so,sq,sr,su,sv,sw,ta,te,tg,th,tk,tl,tr,tt,uk,ur,uz,vi,yi,yo,zh,Afrikaans,Albanian,Amharic,Arabic,Armenian,Assamese,Azerbaijani,Bashkir,Basque,Belarusian,Bengali,Bosnian,Breton,Bulgarian,Burmese,Castilian,Catalan,Chinese,Croatian,Czech,Danish,Dutch,English,Estonian,Faroese,Finnish,Flemish,French,Galician,Georgian,German,Greek,Gujarati,Haitian,Haitian Creole,Hausa,Hawaiian,Hebrew,Hindi,Hungarian,Icelandic,Indonesian,Italian,Japanese,Javanese,Kannada,Kazakh,Khmer,Korean,Lao,Latin,Latvian,Letzeburgesch,Lingala,Lithuanian,Luxembourgish,Macedonian,Malagasy,Malay,Malayalam,Maltese,Maori,Marathi,Moldavian,Moldovan,Mongolian,Myanmar,Nepali,Norwegian,Nynorsk,Occitan,Panjabi,Pashto,Persian,Polish,Portuguese,Punjabi,Pushto,Romanian,Russian,Sanskrit,Serbian,Shona,Sindhi,Sinhala,Sinhalese,Slovak,Slovenian,Somali,Spanish,Sundanese,Swahili,Swedish,Tagalog,Tajik,Tamil,Tatar,Telugu,Thai,Tibetan,Turkish,Turkmen,Ukrainian,Urdu,Uzbek,Valencian,Vietnamese,Welsh,Yiddish,Yoruba}
The links and the commands I have shown in the video below:
Open AI Whisper : https://openai.com/blog/whisper/
Whisper Code : https://github.com/openai/whisper
Python : https://www.python.org/downloads/release/python-399/
Whisper install : pip install git+https://github.com/openai/whisper.git
How to install CUDA support for using GPU when doing transcription of audio :
First, delete existing Pytorch : pip3 uninstall torch
Then install Pytorch with CUDA support : pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu116
FFMPEG : https://github.com/BtbN/FFmpeg-Builds/releases
LosslessCut : https://github.com/mifi/lossless-cut/releases
How to extract sound of any video with FFMPEG : ffmpeg -i "test_video.webm" -q:a 0 -map a test_video.mp3
How to transcribe an English video : whisper "C:\speech to text\test_video.mp3" --language en --model base.en --device cpu --task transcribe
How to transcribe an English video with CUDA support : whisper "C:\speech to text\test_video.mp3" --language en --model base.en --device cuda --task transcribe
How to transcribe a Turkish video : whisper "C:\speech to text\test_video.mp3" --language tr --model base.en --device cpu --task transcribe
How to transcribe a Turkish video with translation : whisper "C:\speech to text\test.mp3" --language tr --model small --device cuda -o "C:\speech to text" --task translate
Our Discord for SECourses : https://discord.gg/rfttctFewW
If you are interested in programming but you lack experience and skills I suggest you watch our playlists: https://www.youtube.com/c/SECourses/playlists
[1] Introduction to Programming Full Course with C# playlist
[2] Advanced Programming with C# Full Course Playlist
[3] Object Oriented Programming Full Course with C# playlist
[4] Asp.NET Core V5 - MVC Pattern - Bootstrap V5 - Responsive Web Programming with C# Full Course Playlist
[5] Artificial Intelligence (AI) and Machine Learning (ML) Full Course with C# Examples playlist
[6] Software Engineering Full Course playlist
[7] Security of Information Systems Full Course playlist
-
LIVE
Nerdrotic
2 hours agoLate Night CANCELLED, Taika WOKEtiti DREDD, Absolute STATE of Superheroes! | Friday Night Tights 363
2,077 watching -
LIVE
Dr Disrespect
5 hours ago🔴LIVE - DR DISRESPECT - NEW AAA BATTLE ROYALE ON UNREAL ENGINE 5 #OTG #OFFTHEGRID
3,507 watching -
LIVE
Barry Cunningham
3 hours agoWATCH LIVE: PRESIDENT TRUMP SIGNS LAWS ON CRYPTO AND ENDS NPR, PBS, & USAID!
1,303 watching -
44:02
The White House
2 hours agoPresident Trump Participates in a Signing Ceremony for S.1582 GENIUS Act
21.3K13 -
LIVE
LadyDesireeMusic
1 hour ago $0.04 earnedLive Piano Renditions | Make Ladies Great Again | Live Piano Requests TONS to choose from
121 watching -
4:13:45
Viss
5 hours ago🔴LIVE - Winning with Tactics & Outplay The Competition - How To With Viss!
5.69K1 -
1:09:46
Jeff Ahern
2 hours ago $0.20 earnedFriday Freak out with Jeff Ahern
12.9K7 -
LIVE
Spartan
2 hours agoSpartan - Pro Halo Player for OMiT | Scrims vs FaZe @ 4 EST + Ranked
23 watching -
LIVE
WolfsDenBoxing
59 minutes agoStreaming Fights: New Wave or Distraction
133 watching -
57:12
Crypto Power Hour
9 hours ago $0.23 earnedThe GENIUS Act & The Clarity Act Pass… Whats The Impact?
8.93K5