<div class="statcounter"><a title="Web Analytics" href="https://statcounter.com/" target="_blank"><img class="statcounter" src="https://c.statcounter.com/12795394/0/d64e9537/1/" alt="Web Analytics" referrerPolicy="no-referrer-when-downgrade">

BizTech Q&A 41: Transcribe your audio

BizTech Q&A

Answering your questions about business and technology

Question

I tried to use ChatGPT to transcribe some audio files and it didn’t work.  After doing some research, I found that I can use another OpenAI tool called Whisper but it looks too technical for me.  Is there a simpler way to transcribe audio?
~ T.W.

Answer

Hi T.W.,

Good news, the answer is yes and it is straightforward.

Use Gemini.

Specifically, follow these steps

    • Open Google Gemini 
    • Upload your audio file
    • Prompt Gemini to transcribe the audio file
      • I.e. “transcribe the audio in the attachment verbatim”
    • Wait for output

That’s it!

I tried it to make sure that it works and I had no issues.  In case you do have issues, I will add that I uploaded an MP4 file, it was a 10MB file (~2 minutes of audio), I am using Gemini Pro, and the model that I am using is 2.5 Flash.

The main issues that I can see you having are using an unsupported files type and uploading a file that is too large.

Hope that helps

~ Damien

Comments