Posted July 07, 2025 by Cyber Apps
#devlog #update #ai #transcription #subtitles #tool #accessibility #announcement #beta #sneak-peek #translation #speech-to-text #vocal-isolation #audio #artificial-intelligence #machine-learning #utility #software #open-source #streamers #content-creators #video-editing #productivity #windows
Hello everyone! As a solo developer, I've been working hard behind the scenes on a massive architectural update for Synthalingua, and I'm incredibly excited to share a preview of what's coming next.
This next update (which will be Beta 1.1.1) is focused on making the application faster, more stable, and packed with powerful new tools. While it's not ready for release just yet, here’s a look at what I've been building.
These are brand-new capabilities designed to solve common frustrations and unlock new possibilities.
This is a game-changer for anyone working with less-than-perfect audio. A new feature will use an AI model (Demucs) to intelligently separate spoken words from background music, in-game sounds, noise, and other interference. This means you'll be able to get clean, high-quality transcriptions from audio that was previously unusable.
Tired of transcribing the wrong audio from a livestream? Soon, when you target a live stream, Synthalingua will show you all available audio tracks. More importantly, it will let you download and listen to a short preview of any track before starting the full transcription. This eliminates the guesswork and ensures you're always transcribing the correct source.
I'm improving the way the AI handles live audio. A new context-aware mode will allow the AI to remember what you just said, using it as a reference for what you're about to say next. The result will be far more fluid and accurate live transcriptions with fewer awkwardly cut-off sentences.
The subtitle generator is getting a massive upgrade. A new Model Comparison Mode will let you generate subtitles using every available AI model with a single command, so you can easily compare the output files and choose the best one. I'm also adding support for Word-Level Timestamps to create karaoke-style subtitles with precise timing.
Alongside new features, I've focused heavily on improving the core experience and fixing common issues.
remote_microphone.py
script has been transformed into a powerful, standalone utility. It will allow you to set up remote audio streaming from one computer to another over your local network, complete with a live audio meter and device testing.
This has been a massive undertaking, and I'm focused on making sure everything is as stable as possible before release. I'm aiming to get this update out to all of you soon!
Thank you for your incredible support!
- Cyberofficial