Transcribing audio or video to text used to be a complicated and expensive process, but that’s no longer the case. Today, we have tools like Buzz, an AI-powered transcription and translation software that works offline on your computer and is completely free. Whether you’re working with Windows 10, Windows 11, Mac, or Linux, Buzz has got you covered. In this tutorial, we’ll walk you through a simple 3-step process to transcribe and translate your files quickly and accurately.
Key Takeaways
- Buzz AI: Free transcription and translation software available for Windows, Mac, and Linux.
- Offline Usage: Once installed, Buzz runs without needing an internet connection.
- Multiple File Export Options: You can export text as plain text, SRT, or VTT files.
Step 1: Download and Install Buzz AI

The first step in converting your audio or video to text is downloading Buzz from its GitHub page. Once you’re on the Releases page, download the correct version for your operating system. Buzz works on Windows, Mac, and Linux, so make sure you select the right one.

Once the download is complete, click on the file to open it, and follow the straightforward installation instructions. Windows users: If you encounter a SmartScreen alert, click on “More Info” and then “Run Anyway” to proceed with the installation.
Step 2: Transcribing Your Files

Once Buzz is installed and open, you’ll see the home screen where you can either record audio directly from your microphone or add files for transcription. For this tutorial, we’ll focus on transcribing an existing file. Follow these steps:

Add Your File: Click the “+” sign to add the audio or video file you want to transcribe. Buzz will prompt you to select a file from your system. By default, it shows only audio files, but you can change this by selecting “All Files” in the dropdown menu.

Choose Your AI Model: Buzz offers two AI models: OpenAI’s Whisper and the Hugging Face text-to-speech model. Whisper by OpenAI is recommended, especially the “medium” model for a balance of speed and accuracy.

Transcribe or Translate: You can choose to transcribe (convert audio to text in the same language) or translate (convert to text in a different language). For this tutorial, we’ll stick with transcribing.

Language Selection: The AI does a great job of automatically detecting the language of your audio, so you can leave it on “Detect Language” unless you need a specific language.

Once your settings are in place, click “Run.” The first time you use Buzz, the selected model will be downloaded. After that, the tool works offline, so no internet connection is required for future transcriptions.
Step 3: Exporting Your Transcription

After Buzz completes the transcription, the finished job will appear on the screen. Double-click on the job to view the text output. From here, you can export the transcription in three different file formats:
- Text (.txt): Ideal for those who need plain text for documents, blogs, or notes.
- SRT (.srt): Contains timestamps, perfect for creating subtitles for videos.
- VTT (.vtt): Similar to SRT, also used for subtitles and captions.
Most video editing software, including YouTube, supports SRT files, making it easy to add subtitles to your videos.
Tips for Accuracy

Even though Buzz does an impressive job with accuracy, it’s always a good idea to double-check the output. Listen to the audio while reading the text to ensure everything is transcribed correctly, especially if the content will be shared publicly.
Real-Time Transcription (Optional)

Buzz also supports real-time transcription directly from your microphone. This is useful for creating captions for live recordings or voice notes. To use this feature:
- Click the “Record” button on the home screen.
- Adjust the settings, ensuring your microphone is selected.
- Start recording, and Buzz will transcribe your speech as you talk.
However, if real-time transcription doesn’t work for you, as it didn’t on my PC, you can follow a workaround by downloading a GDML model as mentioned in a post (linked in the video description). Alternatively, you can record audio using free software like Audacity and then import it into Buzz for transcription.
Conclusion
Buzz AI makes converting audio or video to text an easy, free, and offline process. Whether you’re transcribing audio for notes, captions, or translations, Buzz simplifies the entire process without needing an internet connection once set up. The flexibility to export in multiple formats is an added bonus, making it a must-have tool for content creators and professionals alike.
FAQs
1. Is Buzz AI completely free?
Yes, Buzz is 100% free and open-source. You can download it from GitHub and use it without any hidden charges.
2. Can Buzz work offline?
Yes, once Buzz is set up and the transcription model is downloaded, it can work offline without needing an internet connection.
3. Does Buzz support multiple languages?
Yes, Buzz can transcribe and translate into multiple languages, making it versatile for various needs.
4. Can I transcribe real-time audio with Buzz?
Yes, Buzz supports real-time transcription via your microphone. If you face issues, there are alternatives and fixes available, as discussed in the post.
5. What formats can I export the transcriptions in?
Buzz allows you to export transcriptions as plain text (.txt), SRT (.srt), and VTT (.vtt) files.