WhisperScript, an Electron desktop app GUI for Whisper #1028

jonathgh · 2023-03-04T15:06:40Z

jonathgh
Mar 4, 2023

Thanks to the work of @ggerganov and with inspiration from @jordibruin, @kai-shimada and I were able to implement Whisper in a desktop app built with the Electron framework. The app runs on both Mac (Apple Silicon) and Windows. You can download it here: WhisperScript

Currently, our features in Version 2 include:

Super Fast and Responsive ⚡: We’ve completely rebuilt WhisperScript for speed and stability. With a fresh visual update, the app is now more responsive and intuitive to use.
Video Viewer Support 🎥: Now transcribe directly from video files in formats like MOV, MP4, and MKV, and view the files in the new video player, streamlining the workflow for video and subtitle creators.
👨‍🦰 Speaker Separation: (Beta on MacOS; coming soon on Windows)
Enhanced Toolbar 🛠️: Improved access to essential editing actions, making it easier to navigate and refine your transcripts.
Search & Replace 🔍: A new feature (still in beta) for quick text adjustments that makes changes across the entire transcript faster.
Merge Segments by Sentence: Combine segments seamlessly by sentence, available in Edit > Merge Segments by Sentence, for a cleaner transcript flow.
Bookmarks 📌: Mark key points in your transcripts for easier reference and navigation.
Real-Time Transcription Progress: Monitor the progress of your transcription as it happens, enhancing visibility and control.
Abort Running Transcriptions: Easily stop a transcription in progress via the progress bar or by closing the active tab.
Transcription Queue Control: Delete tabs to individually abort or remove queued transcriptions without disrupting active processes.
Filename Dropdown Actions 📝: New dropdown in the header to reveal the file in Finder, re-transcribe, or close the transcript. (Feedback on visibility is welcome—let us know if this action menu is too hidden!)
Large v3 Turbo Model Support 🚀: Experience the speed of Large V3 Turbo—twice as fast as the original Large V3 model and comparable to Distill Whisper speeds.
Expanded Export Options 📄: Export transcripts to HTML, JSON, PDF, Word, and RTF, enabling more flexibility in sharing and documentation.
Open Recent Projects: Quickly access recent projects without navigating through file directories.
Multi-device support: with Wavery accounts, use your software on up to 2 devices.

We hope you enjoy it and let us know if there are any other features you would like to see. You can reach us at [email protected]

tonyho · 2023-03-07T02:54:31Z

tonyho
Mar 7, 2023

@jonathgh Are you going to make it cross-platform for Linux + Windows?
Or is there a github repository for this project?

8 replies

candideu May 1, 2023

If you're looking to use Whisper this way on Linux: Kdenlive's (video editor) latest release integrates Whisper to create subtitle files. It's FOSS

jsmollin May 7, 2023

If you're looking to use Whisper this way on Linux: Kdenlive's (video editor)...

Thank you for this suggestion. My interest in this particular application is the aggregate of features the author has put into the same application.

qhkm May 9, 2023

I've made a similar GUI app compatible with both Windows and Mac. Will share once it's done.

jonathgh Jun 2, 2023
Author

@qhkm We'd love to see what you have made once you are done with the app! Send us a mail, maybe we can send some people your way who are looking for the Windows or Linux versions

jonathgh Jan 29, 2025
Author

We just updated to be available for Windows as well as for MacOS. If you would like to help us build and maintain for Linux, reach out! For Windows, here's the link: Download for Windows

kai-shimada · 2023-03-08T12:58:14Z

kai-shimada
Mar 8, 2023

👉 UPDATE: WhisperScript now added support for MKV, MP4 and MOV Video Import!

3 replies

bbecausereasonss Apr 24, 2023

Is it possible to utilize Whisper Jax in this GUI?

https://github.com/sanchit-gandhi/whisper-jax

kai-shimada Apr 24, 2023

interesting, do you know how it performs compared to whisper.cpp?

bbecausereasonss Apr 24, 2023

Up to 70x faster.

August78 · 2023-03-11T16:43:16Z

August78
Mar 11, 2023

Wow this is great! Subbed the topic, and waiting for a windows client :)

6 replies

nhan000 Apr 9, 2023

You can use a desktop GUI app on Windows here https://github.com/Const-me/Whisper

Makememo Aug 31, 2023

Maybe try https://memo.ac/, which supports Windows GPU.

candideu Sep 1, 2023

@Makememo How can users obtain an invitation code?

And when will the docs be available in English?

Makememo Sep 2, 2023

@candideu Sorry, here are some codes.

WEAE-HPeM-uhR8-nuk6
E9sh-536K-38n7-LdbL
95Ac-r667-WSFZ-nqis
M9UY-ZuHA-4zej-fxb8
Ck7C-vmEf-Ugeg-pifn
j43j-enHE-oqPo-bBfB
N7GF-VVRU-Pcja-5Cks
mape-7ssz-Xj2A-P5Wf
5BRJ-fxAf-pfvA-ZcVZ
LAL9-pjfR-aD9M-7BXf

jonathgh Jan 29, 2025
Author

The Windows version is now live! Download it here

mrgalindo · 2023-03-12T12:46:09Z

mrgalindo
Mar 12, 2023

It would be great if you could add speaker recognition! I would get it right away!

3 replies

jonathgh Mar 18, 2023
Author

Speaker recognition would be great to have – we are looking into it, as it's quite a requested feature. We would also love to include speaker labels in the transcripts. I know that there have been several proposals to use Pyannote to diarize the segments.

zenminimalist May 13, 2023

Yupp, same here. Would LOVE that!

jonathgh Feb 12, 2025
Author

Speaker Recognition is now released as of v2.2.0, in Beta on MacOS, coming soon to Windows. In this first diarization version, we support:

up to 5 speakers,
English audio (other languages also work, but the model is trained on English speech. Other languages might see poorer results, depending on similarity to English)
And renaming speaker labels.

We know this is still an early version of the feature, so please let us know if you encounter any issues.

In the near future, with Speaker Separation v2, we plan to support

Custom number of max speakers
Setting a threshold for speaker differentiation
Merge Speaker Labels
Segment level speaker setting

duttaoindril · 2023-03-24T22:57:38Z

duttaoindril
Mar 24, 2023

Why is the paid feature for better whisper models? It should be paid for features you've written yourself.

6 replies

jonathgh Mar 26, 2023
Author

@duttaoindril how do you like the new update? Let us know how you're using it, we'd be happy to hear from you!

duttaoindril Mar 26, 2023

That's great, and I really appreciate those features.

But the larger models are still behind payment gates, it'd be better to strip down the free experience but offer all models in the free version.

A free trial of a few days or # of transcriptions with the pro features would also help you make sales. Along with an auto updater with constant feature drip for pro users.

Pro feature ideas:

try to find ways to improve the performance of the models, and sell that in the free version - don't artificially slow down performance
when dragging in a video, also show the video as you scrub through the transcript and audio waveform
after adding video add speaker diarization to try to automatically label voices and faces if any in the UI
after adding diarization make it easy to export all transcriptions as time and person labelled subtitles to add them back to sites like YouTube or movies or anything.

Point is, there're a bunch of features you can work toward putting together to make this an increasingly valuable and enticing paid product - I just don't think you should payment gate the larger models since it's open source and not your work.

jonathgh Mar 30, 2023
Author

@duttaoindril Glad you are enjoying the new features. Thank you for the feature ideas, we really appreciate them and are taking user feedback seriously when prioritizing new features. An auto-updater and feature drip for the Pro users are both on the roadmap. Diarization is something we are also working on, but will take a few releases to get right.

As for the paywall, I think there is a bit of a misunderstanding. We aren't paywalling any models, the models are totally free to use and open for everyone to download. We are simply only offering the GUI and transcript editing / visualizations features for the larger models to Pro users, which we did write, and will continue to improve.

innagorda Dec 13, 2024

@duttaoindril I was wondering if you could help me out with something. I'm trying to figure out if there's a way to use the large model without paying for a subscription. Do you know how I could do that? Thanks so much in advance!

jonathgh Feb 12, 2025
Author

@innagorda We now offer the chance to start a trial of 7 days, during which you can use all the features of the app without limitation. Since @duttaoindril commented about features that would make sense, we have been implementing those consistently over the last year and a half:

We have moved from the gated model to a free trial model, so that the Large models are no longer behind the paywall - they are fully usable during the trial period.
We have consistently been updating the app through the auto-updater, not just at a slow-drip pace
We have added video support, and let you scrub through the video along with the transcript and audio waveform
We have added speaker diarization, and label speakers in the transcript UI
We have added the option to export the transcripts with time and person labeled files
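For anyone curious what a time- and speaker-labelled export could look like, here is a minimal sketch that writes diarized segments as SRT text. It is not WhisperScript's actual exporter; the `(start, end, speaker, text)` tuple layout and the function names are assumptions made up for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms_total = int(round(seconds * 1000))
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Render (start, end, speaker, text) tuples as numbered SRT cues.

    The tuple layout is a hypothetical stand-in for whatever a
    transcription tool actually stores per segment.
    """
    blocks = []
    for index, (start, end, speaker, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{speaker}: {text}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


srt_text = to_srt([
    (0.0, 2.5, "Speaker 1", "Hello."),
    (2.5, 5.0, "Speaker 2", "Hi, thanks for having me."),
])
```

Prefixing the speaker name to the cue text keeps the file readable by any player that accepts plain SRT.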

Feature requests are welcome :)

jonathgh · 2023-03-25T22:24:49Z

jonathgh
Mar 25, 2023
Author

We're excited to announce WhisperScript v1.2.1, an update to our Electron desktop Whisper implementation that introduces a lot of new features to speed up your transcription workflow. This update adds a bunch of improvements to the visualization, playback, editing, and exporting of your transcripts. Here's what's new in v1.2.1:

Multiple selection and segment actions (copy, remove, merge, bookmark)
Bookmarking, filtering, and exporting bookmarked or selected segments as individual clips.
Enhanced transcript functionality (auto-saving changes, copy without timestamps)
Improved audio format support (FLAC, OGG, OPUS)
Introducing Regions, transcript segment visualizations on the timeline – to see exactly where the text is being spoken.
Add new Regions, Adjust Region length, move Regions around, merge and delete Regions.
Optimizations for better usability and performance

We hope that these new features will speed up your workflow and make it easier to edit and navigate your transcripts. We’re actively developing more, and we're excited to see how you'll utilize these features in your projects!

Download the latest version here. The features above are only in the Pro version: https://getwavery.com

We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected].

Happy transcribing!

0 replies

sscotti · 2023-03-26T10:48:58Z

sscotti
Mar 26, 2023

Might have to try it. BTW, I started playing around with Whisper in Docker on an Intel Mac, M1 Mac and maybe eventually a Dell R710 server (24 cores, but no GPU). Not sure you can help, but wondering about multi-CPU and/or GPU support in Whisper with that hardware. It sounds like it might be partially possible, but NVIDIA GPUs are the only ones that are supported much.

I want to integrate the thing into a medical IT application stack that I have, just using the Whisper API in the local build. I have an OpenAI API Key for testing also.

0 replies

jwnacnud · 2023-03-26T19:39:14Z

jwnacnud
Mar 26, 2023

Totally agree. Find ways to speed up the larger models. That’s worth paying for. Get people hooked on the larger models, even though slow, and also offer the smaller and faster models. Your service and application are your selling points, not the language models.

-- Jeffrey Duncan

0 replies

leidmew · 2023-04-02T19:35:45Z

leidmew
Apr 2, 2023

Hi! Just updated to the Pro version because I think there's value added for people like me without coding experience. Just one feature I'm missing, which may be easy to implement (speaking with zero idea, so enlighten me if otherwise), is being able to decide where the models are downloaded. By default a new folder is created in the Documents folder (I'm on Mac), so it's annoying to have a high-level folder in there just for that. Thanks for your efforts with the app!!

1 reply

jonathgh Apr 2, 2023
Author

Thanks for your support! We have had some other requests for this feature, and we will try to get this into a future update. I know it can be a big download to keep it on the main drive. Thanks for the feedback.

bbecausereasonss · 2023-04-20T22:48:30Z

bbecausereasonss
Apr 20, 2023

Can this be updated to use https://github.com/guillaumekln/faster-whisper

0 replies

jwnacnud · 2023-04-24T18:09:31Z

jwnacnud
Apr 24, 2023

I saw WAY faster speeds but also more hallucinations. I'm OK with slower if it means more accurate.

-- Jeffrey Duncan

1 reply

kai-shimada May 3, 2023

I agree. I think we'll stick with whisper.cpp. It has also received performance improvements through added support for CoreML (for Apple silicon devices) and 4-/5-bit integer quantization in their latest v1.4.0 beta release. We'll definitely try to implement these improvements once a stable version is available.

Veneration1 · 2023-05-04T19:26:52Z

Veneration1
May 4, 2023

Is the large-v2 model implemented or going to be implemented? Here's what that page says about it: "The large-v2 model on average shows about 5% relative error reduction in English and about 10% in other languages, but please note that it may behave differently depending on the individual audio and in some cases perform worse than large-v1." Here it is on GitHub; it's also on Hugging Face.

1 reply

kai-shimada May 5, 2023

Hi, the large model we use is large-v2. In the upcoming release, we'll also start supporting the English-only models.

Veneration1 · 2023-05-05T13:07:09Z

Veneration1
May 5, 2023

A few questions:

Would be nice to be able to have the option to export a txt without timestamps. I see you can copy the transcript and paste it to do that, but would be nice to have a simple toggle for that and the normal export.
Without knowing if the large model will run on my computer I'm hesitant, as it may not run or may be too slow for our needs, like 3+ hours rather than 1-2 hours to work 1 hour of audio. We need a large model quality for our work. So wondering if there are any simple ways to test this. Does it use GPU? My Mac has intel.
I see you can modify text in-app, which is cool. It would probably be very difficult to code, but I've been looking for an app or way for content we edit to be learned so that we don't have to keep making the same corrections recording after recording. Could almost be a different app, maybe there is one..?
Thank you for your reply to my other question. Wishing you all the best on your continued development. SUPER excited to have speaker identification in the app. Very much looking forward to that.

1 reply

kai-shimada May 5, 2023

Would be nice to be able to have the option to export a txt without timestamps. I see you can copy the transcript and paste it to do that, but would be nice to have a simple toggle for that and the normal export.

I'm currently working on the new export window which will allow you to do that.

Without knowing if the large model will run on my computer I'm hesitant, as it may not run or may be too slow for our needs, like 3+ hours rather than 1-2 hours to work 1 hour of audio. We need a large model quality for our work. So wondering if there are any simple ways to test this. Does it use GPU? My Mac has intel.

I have an old MacBook Pro from Mid 2012 and it takes about 2 times the audio duration to transcribe using the large model (about 2h to transcribe 1h of audio). If you are using a newer Intel Mac, it should transcribe much faster. I assume you don't have the Pro license; in that case, the simplest way to test it out would be to go to https://github.com/ggerganov/whisper.cpp#quick-start, follow the quick start guide and let it run from your terminal. WhisperScript doesn't support GPU yet, but it seems like whisper.cpp is on its way to releasing a stable version with GPU support soon, so when it's out, WhisperScript will support GPU inference too.

I see you can modify text in-app, which is cool. It would probably be very difficult to code, but I've been looking for an app or way for content we edit to be learned so that we don't have to keep making the same corrections recording after recording. Could almost be a different app, maybe there is one..?

Do you mean something like MacWhisper's "Global Find & Replace", which lets you build your own "dictionary" of commonly misspelled words and their correct spellings so that these words get automatically corrected each time? We haven't implemented it yet, but it will be in a near-future update for sure. In the upcoming release, we instead added an option to add a prompt that can be used to introduce new words.
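To make the "dictionary" idea concrete, here is a rough sketch of how a list of known mis-transcriptions could be applied to every segment after transcription. This is not how WhisperScript or MacWhisper implement it; the function name and the sample entries are made up for illustration.

```python
import re


def apply_corrections(text: str, corrections: dict) -> str:
    """Replace each known mis-transcription with its preferred spelling."""
    if not corrections:
        return text
    # Longest keys first, so multi-word entries win over shorter overlaps.
    keys = sorted(corrections, key=len, reverse=True)
    # \b keeps replacements on whole words only; matching is case-sensitive.
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")
    return pattern.sub(lambda match: corrections[match.group(1)], text)


# Hypothetical user-maintained dictionary, reused for every recording.
corrections = {"whisper script": "WhisperScript", "pie annote": "pyannote"}
segments = ["we use whisper script daily", "pie annote handles diarization"]
fixed = [apply_corrections(segment, corrections) for segment in segments]
```

Storing the dictionary in a small JSON file would let the same corrections carry over from one recording to the next.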

Thank you for your reply to my other question. Wishing you all the best on your continued development. SUPER excited to have speaker identification in the app. Very much looking forward to that.

Speaker Diarization is a very requested feature but it also needs a bit more time to implement, we'll let you know soon enough when we are ready ;)

darnn · 2023-05-05T13:29:57Z

darnn
May 5, 2023

@Veneration1 If you can get Subtitle Edit to work on a Mac, with Wine or something, it has a file where you can add automatic corrections:
https://github.com/SubtitleEdit/subtitleedit/
(And the file is eng_OCRFixReplaceList.xml, or whichever language you need. The format is pretty intuitive, just add your own lines.)

0 replies

shruru · 2023-05-05T18:27:23Z

shruru
May 5, 2023

Looks interesting. Glad to see a project that gets updated continually 👍🏻

May I ask what's the main differences between this and MacWhisper if you don't mind? 😁
Speed / Accuracy (by same model) ?

BTW when it's working on Windows, I do hope it can support GPU to speed up processing.

Thank you.

1 reply

matsumurae Jul 28, 2023

At least for me, WhisperScript works faster, I don't know why. Same conditions, same file, same model. The Small model took 17 min on MW vs 4 on WS. After seeing how slow MW was, I didn't want to wait for the Medium; on WS it took 11 min.

Note: I'm using an Intel mac, not M1. 2020 i5 if you're curious.

darnn · 2023-05-05T18:29:37Z

darnn
May 5, 2023

@shruru As mentioned previously in this thread, for a Windows implementation that utilizes the GPU, you might want to look into this: https://github.com/Const-me/Whisper/releases/

1 reply

jonathgh Jan 29, 2025
Author

We also utilize GPU on the Windows version of WhisperScript; it can be set in the advanced options in the transcription view.

edtechdev · 2023-05-22T11:14:28Z

edtechdev
May 22, 2023

Another Windows app is whispercppGUI https://github.com/Topping1/whispercppGUI
I combine it with yt-dlg to generate more accurate YouTube transcripts https://yt-dlg.github.io/yt-dlg/

0 replies

jonathgh · 2023-05-23T16:54:15Z

jonathgh
May 23, 2023
Author

EDIT: We have now released a new version of WhisperScript, with several improvements, including a video player, batch processing and improved performance overall. We'd love to hear your thoughts on the new design! Join the discord to hear about our latest developments.

Exciting news! We're back with an impactful update to improve your transcription experience, making it more efficient for language learners, subtitle creators, interview analysts, or those combing through their media libraries. We're introducing WhisperScript v1.3.4, and here's what's new:

New in WhisperScript v1.3.4:

Transcription Settings Window 🪟: Now you can view and modify imported files, limit the number of characters per segment, and translate any language to English right within the window! Currently we are supporting only any-to-English translation.
Limit number of characters per Segment: customize subtitles by limiting the length of produced segments.
Initial Prompting 🪄: control the output of WhisperScript to add domain-specific vocabulary, example formatting and selecting language variants (e.g. Simplified vs. Traditional Chinese).
Auto-Updater 🌟: Always stay up to date with the latest features.
Find & Replace All 🔍: Easily replace certain words, names, or abbreviations in the transcript.
Model download manager: Save space and time - download multiple models in parallel, and set a custom folder to store the models (e.g. on an external drive).
Merge segments: Combine related segments by sentence.
AAC Audio Format Support: Support for more audio formats.
Paste Command: Transcribe files from the clipboard right onto the WhisperScript window.
New Transcript Export Options: Quickly output the current transcript, current selection, audio clips, document files, SRT or Anki APKG (for language learners) or current bookmarks in the new export window!
Beta Channel Updates: subscribe to the Beta channel within the app to get the cutting-edge features.

These features are available to Pro Users, but Lite Users will also get several UI improvements and bug fixes:

English Only Models: for faster, more accurate transcriptions if you are only working in English.
Status Bar: displays information about the model and language used to transcribe each transcript, access settings and re-transcribe button.
Re-transcribe button: to re-do transcription without having to leave the app.

We’re still actively developing additional features, and we’d love to invite you to join the Discord to be a part of the process, suggest features and report bugs: https://discord.gg/b9TYCgC6

Download the latest version here: https://getwavery.com

We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected].

Happy transcribing!

3 replies

pietrosperoni Sep 24, 2023

Thanks, I love your tool.
How do we get the pro version?
Also the Discord invite is expired :-(.

Cheers.
Pietro

kai-shimada Sep 24, 2023

@pietrosperoni Hi, the information above is slightly outdated. We are currently not offering the pro version. Please contact us at [email protected], and we might be able to help you further.

pietrosperoni Sep 25, 2023

Hello @kai-shimada , thanks for your answer. I did send an email yesterday, just after writing this message when I realised the email was maybe the preferred route. I also added some suggestions I had. I will check if I get an answer in the next days.

Best Regards,
Pietro

ghost · 2023-06-05T06:33:11Z

ghost
Jun 5, 2023

Great job!

1 reply

jonathgh Jun 5, 2023
Author

Thanks!!

JackLawrence6 · 2023-06-30T05:41:00Z

JackLawrence6
Jun 30, 2023

GJ, can it be used for YouTube or Twitch live streaming?

1 reply

candideu Feb 5, 2025

I would use caption.ninja in OBS for streaming: https://caption.ninja/

bits-by-brandon · 2023-09-22T22:38:13Z

bits-by-brandon
Sep 22, 2023

I made an open-source alternative with basic features here. Feel free to fork, contribute, or do whatever you need!
https://github.com/bits-by-brandon/whisper-ui

3 replies

candideu Sep 23, 2023

Nice! Would it be possible to add build instructions? Not sure how to use it on Windows...

ntovarsolorzano Jun 28, 2024

Here are the instructions. Feel free to share it, and/or copy-paste it for others to know.

$ git clone https://github.com/bits-by-brandon/whisper-ui.git
$ cd whisper-ui
$ yarn install
$ sudo apt-get install ffmpeg -y # For Ubuntu/Debian-based systems
$ yarn build # If there's a build script
$ yarn vite # Since it uses Vite

It worked for me.
Ubuntu 20.04 distro.

Update: When clicking the buttons nothing happens. 😢

God bless.

jonathgh Jan 29, 2025
Author

The Windows version of WhisperScript is now live: Download it here

SituDaMan · 2024-07-05T22:37:04Z

SituDaMan
Jul 5, 2024

Hi there, does it only transcribe, or it can do "translate" as well?

1 reply

jonathgh Jul 8, 2024
Author

@SituDaMan It does support translation as well, but at the moment, just all languages to English.

thebigboss9018 · 2024-07-27T06:19:54Z

thebigboss9018
Jul 27, 2024

When windows? T.T

7 replies

Makememo Jul 27, 2024

You're welcome to try MemoAI (https://memo.ac); it supports Windows.

thebigboss9018 Jul 27, 2024

Actually this looks good for my usecase, I will test it later. Thanks for rec

edtechdev Jul 29, 2024

When windows? T.T

See https://github.com/Const-me/Whisper

jswhisperer Oct 17, 2024

https://dev.to/jswhisperer/cross-compile-a-distributed-electron-app-3nfk

jonathgh Jan 29, 2025
Author

‼️The WhisperScript Windows version is now available! @thebigboss9018

Check it out here: Windows WhisperScript

jonathgh · 2024-11-14T22:31:42Z

jonathgh
Nov 14, 2024
Author

Exciting update! WhisperScript v2.0 is here, re-written from the ground up with React and Typescript, with a whole new architecture and improved features. Check out what’s new in this release!

Wavery Accounts:

Multi-Device Support 📲: Now you can install WhisperScript on up to 2 devices simultaneously, ensuring seamless access across multiple machines.
Simplified Account Management 🔑: All licenses are now managed directly through Wavery, making account setup and management easier than ever.
Super Fast and Responsive ⚡: We’ve completely rebuilt WhisperScript for speed and stability. With a fresh visual update, the app is now more responsive and intuitive to use.
Video Viewer Support 🎥: Now transcribe directly from video files in formats like MOV, MP4, and MKV, and view the files in the new video player, streamlining the workflow for video and subtitle creators.
Enhanced Toolbar 🛠️: Improved access to essential editing actions, making it easier to navigate and refine your transcripts.
Search & Replace (Beta) 🔍: A new feature for quick text adjustments that makes changes across the entire transcript faster.
Merge Segments by Sentence: Combine segments seamlessly by sentence, available in Edit > Merge Segments by Sentence, for a cleaner transcript flow.
Bookmarks 📌: Mark key points in your transcripts for easier reference and navigation.
Real-Time Transcription Progress: Monitor the progress of your transcription as it happens, enhancing visibility and control.
Abort Running Transcriptions: Easily stop a transcription in progress via the progress bar or by closing the active tab.
Future Transcriptions Control: Delete tabs to individually abort or remove queued transcriptions without disrupting active processes.
Filename Dropdown Actions 📝: New dropdown in the header to reveal the file in Finder, re-transcribe, or close the transcript. (Feedback on visibility is welcome—let us know if this action menu is too hidden!)
Large v3 Turbo Model Support 🚀: Experience the speed of Large V3 Turbo—twice as fast as the original Large V3 model and comparable to Distill Whisper speeds.
Expanded Export Options 📄: Export transcripts to HTML, JSON, PDF, Word, and RTF, enabling more flexibility in sharing and documentation.
Open Recent Projects: Quickly access recent projects without navigating through file directories.
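As an illustration of what merging segments by sentence involves, here is a minimal sketch that joins consecutive segments until the running text ends with sentence-final punctuation. It is an assumption about the general technique, not WhisperScript's actual code; the `Segment` class and `merge_by_sentence` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    start: float  # seconds
    end: float    # seconds
    text: str


SENTENCE_END = (".", "!", "?")


def merge_by_sentence(segments: List[Segment]) -> List[Segment]:
    """Join consecutive segments until the text ends a sentence."""
    merged: List[Segment] = []
    current: Optional[Segment] = None
    for seg in segments:
        if current is None:
            # Start a new sentence from a copy of this segment.
            current = Segment(seg.start, seg.end, seg.text.strip())
        else:
            # Extend the running sentence and push its end time forward.
            current.end = seg.end
            current.text = f"{current.text} {seg.text.strip()}"
        if current.text.endswith(SENTENCE_END):
            merged.append(current)
            current = None
    if current is not None:
        # Keep a trailing fragment that never reached closing punctuation.
        merged.append(current)
    return merged


merged = merge_by_sentence([
    Segment(0.0, 1.0, "Hello there,"),
    Segment(1.0, 2.0, "how are you?"),
    Segment(2.0, 3.0, "Fine."),
])
```

A punctuation check like this is naive (abbreviations such as "Dr." would end a sentence early), so a real implementation would likely need a smarter sentence splitter.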

What This Means for You:

Create Your Wavery Account: To continue using WhisperScript, simply create a Wavery account. It’s quick and easy, and unlocks all the new features.
Pro Trial and Lite Version Changes: We’re streamlining our offerings by retiring the Lite version. Now, all users can try the Pro version for free with a trial. Former Gumroad Pro license holders will always be able to keep using WhisperScript 1, and can also get access to WhisperScript 2 by migrating their license code during signup or on the account page.
New Pricing Structure: WhisperScript has been optimized for performance, enabling faster updates and additional features. Existing Pro users maintain lifetime access, while new users benefit from an improved value at our updated pricing. We’re offering a complimentary year of WhisperScript Pro to all Lite users to ease the transition.

Thank you for being part of the WhisperScript community! We’re thrilled to continue evolving the app with these improvements and more to come.

For any questions or assistance, reach out at [email protected]. We’re here to help!

WhisperScript v2.0 is designed to optimize your transcription workflow, whether you're working on interviews, media analysis, or multilingual transcription. As always, we appreciate your feedback and suggestions to help us keep improving!

Download the latest version here: https://getwavery.com

Have ideas or feedback? Reach out to us at [email protected] or join our community on Discord to connect with other WhisperScript users: https://discord.gg/b9TYCgC6.

Happy transcribing!

2 replies

jsmollin Nov 18, 2024

Let me know if I need to correct this. Many people, including me, have repeatedly asked you for a Linux version. In my case, you emailed that you expected to have a Linux version out in a matter of one week (as of 29 APR 2023)

My Message to You

Good afternoon, I emailed you two days ago and have not heard a reply.
I am curious if you have had a chance to read it? Your software looks
to fill a very handy niche. I have been unable to find anything that
matches the functionality that you describe. If I were more
experienced with writing software, I would attempt to create a program
that satisfied my software needs (I, in fact, did try, to a very poor
start). Is it prohibitively difficult to build WhisperScript for Linux
with Electron's building utilities?

Thank you in advance for your reply.

Kind Regards,
Juro...

Your Reply

Hi Juro,

Apologies for the late reply, I’ve been out of the office for a short holiday. I hear your request and it was brought up in our weekly standup meeting that we should prioritize building for other systems, including Linux. I would like to attempt to build a version of WhisperScript for Linux sometime this week, but I have limited access to Linux on bare metal devices, only through virtualization on a Mac. The difficulty of building it for Linux entirely depends on the compatibility of the myriad of dependencies working on Linux too, which is not a given. In the case of incompatibilities, we would need to find alternate solutions and that would be the biggest hurdle in my estimation. Electron itself shouldn’t present too much of a roadblock.

Best,
Jonathan
from Wavery

It has now been 81 weeks since your Linux release ETA. Further, you changed the pricing for fixed licenses from about $40 (I can't remember the exact price) to $250. You also instituted the brilliant idea of screwing over people by removing your free tier and replacing it with options that cost $100/year. Am I getting this right?

I still like the potential of your software, but your choices have alienated me in the severest way. I'm not a software developer. Even if I were, I would not have the time to write software with similar functionality due to my grad school schedule. Perhaps this critique can be brought up in your next standup meeting.

Kind Regards,
Juro

jonathgh Nov 18, 2024
Author

Hi @jsmollin,
Thanks for your message - I hear you loud and clear. As a developer and a fan of the Linux ethos, I would also like to support a Linux version, but we have not had the time to prioritize it. Sorry that it has slipped from my attention. We are currently working to support Windows and to make the build process more cross-platform friendly, so a Linux version could become a part of that effort. As I mentioned in my reply, I don't have access to a Linux machine, so I'd need to set one up to test out a build. I'll put this higher on the docket and keep you updated on my progress.
As for the pricing, we are still offering a free version - you can download it now on Gumroad, and that hasn't changed. We'll continue to offer that free version of WhisperScript going forward. The new version is offered with a free trial and a paid option that is comparable to other Whisper apps (like superwhisper). If I understand your message correctly, would it be preferable to offer a free 'lite' version as well? I'm very open to that option.
We built the app for people who aren't developers and are looking for an easy way to use Whisper on their machines, and we strive to build out the features that are useful to them.

Best Regards,
Jonathan

jonathgh · 2025-01-29T15:39:54Z

jonathgh
Jan 29, 2025
Author

Update Incoming ‼️

🚀 WhisperScript is Now on Windows! 🎉

We’re excited to announce that WhisperScript is finally available for Windows! 🖥️

We know many of you have been looking for a transcription tool on PC with the same features and intuitive UI as our Mac version. After extensive development and testing, we’re proud to say that feature parity between Mac and Windows is now here!

💡 What This Means:

The Windows version has all the same capabilities as the Mac version—transcription, translation, search & replace, video support, and more.
Moving forward, we’ll keep both platforms updated, with some features arriving first on Mac but quickly following on Windows.
PC users now have access to the same simple interface and easy transcript editing as on macOS.

We have also heard from some of you that a Linux version would be nice. However, we realized while building the Windows version that we will likely not have the capacity to build and maintain three different platforms/architectures. If you would like to help us work on the Linux version, reach out.

So, now’s the time to check it out. We’re looking forward to your feedback!

👉 Download the Windows version now on our site.

As always, let us know if you run into any issues or have suggestions—we’re committed to making WhisperScript the best transcription tool for all creators.

Happy transcribing! ✨

** Installation Note: some users may see the Windows SmartScreen window when running WhisperScript for the first time. You can click "More info" and then "Run anyway" to install.

0 replies

Replies: 25 comments · 50 replies
