WhisperScript, an Electron desktop app GUI for Whisper #1028
Replies: 25 comments 50 replies
-
@jonathgh Are you going to make it cross-platform for Linux + Windows? |
Beta Was this translation helpful? Give feedback.
-
👉 UPDATE: WhisperScript now added support for MKV, MP4 and MOV Video Import! |
Beta Was this translation helpful? Give feedback.
-
Wow this is great! Subbed the topic, and waiting for a windows client :) |
Beta Was this translation helpful? Give feedback.
-
It would be great if you could add speaker recognition! i would get it right away! |
Beta Was this translation helpful? Give feedback.
-
Why is the paid feature for better whisper models? It should be paid for features you've written yourself. |
Beta Was this translation helpful? Give feedback.
-
We're excited to announce WhisperScript v1.2.1, an update to our Electron desktop Whisper implementation that introduces a lot of new features to speed up your transcription workflow. This update adds a bunch of improvements to the visualization, playback, editing, and exporting of your transcripts. Here's what's new in v1.2.1:
We hope that these new features will speed up your workflow and make it easier to edit and navigate your transcripts. We’re actively developing more, and we're excited to see how you'll utilize these features in your projects! Download the latest version here. The features above are only in the Pro version: https://getwavery.com We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected]. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Might have to try it. BTW, I started playing around with Whisper in Docker on an Intel Mac, M1 Mac and maybe eventually a Dell R710 server (24 cores, but no GPU). Not sure you can help, but wondering about mutli-CPU and/or GPU support in Whisper with that hardware. It sounds like it might be partially possible, but NVIDIA GPU's are the only ones that are supported much. I want to integrate the thing into a medical IT application stack that I have, just using the Whisper API in the local build. I have an OpenAI API Key for testing also. |
Beta Was this translation helpful? Give feedback.
-
Totally agree. Find ways to speed up the larger models. That’s worth paying
for. Hey people hooked on the larger models, even though slow, and also
offer the smaller and faster models. Your service and application are your
selling points, not the language models.
On Sun, Mar 26, 2023 at 1:13 PM Oindril Dutta ***@***.***> wrote:
That's great, and I really appreciate those features.
But the larger models are still behind payment gates, it'd be better to
strip down the free experience but offer all models in the free version.
A free trial of a few days or # of transcriptions with the pro features
would also help you make sales. Along with an auto updater with constant
feature drip for pro users.
Pro feature ideas:
- try to find ways to improve the performance of the models, and sell
that in the free version - don't artificially slow down performance
- when dragging in a video, also show the video as you scrub through
the transcript and audio waveform
- after adding video add speaker diarization to try to automatically
label voices and faces if any in the UI
- after adding diarization make it easy to export all transcriptions
as time and person labelled subtitles to add them back to sites like
YouTube or movies or anything.
Point is, there're a bunch of features you can work toward putting
together to make this an increasingly valuable and enticing paid product -
I just don't think you should payment gate the larger models since it's
open source and not your work.
—
Reply to this email directly, view it on GitHub
<#1028 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAGW5A736VDBX25WQ4VFS43W6CILZANCNFSM6AAAAAAVPS7CDI>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
--
Jeffrey Duncan
|
Beta Was this translation helpful? Give feedback.
-
Hi! Just update to PRO version because I think there's value added for people like me without coding experience.. Just one feature I'm missing and that may be easy to implement (speaking with zero idea so enlighten me if otherwise) is to be able to decide where the models are downloaded. By default a new folder is created in Documents folder (I'm on Mac) so it's annoying to have a high level folder in there just for that. Thanks for your efforts with the app!! |
Beta Was this translation helpful? Give feedback.
-
Can this be updated to use https://github.com/guillaumekln/faster-whisper |
Beta Was this translation helpful? Give feedback.
-
I saw WAY faster speeds but also more hallucinations. I'm ok with slower,
if it means more accurate.
Jeffrey Duncan
…On Mon, Apr 24, 2023 at 12:07 PM becausereasons ***@***.***> wrote:
Up to 70x faster.
—
Reply to this email directly, view it on GitHub
<#1028 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAGW5A5I2MGI4AV5RHMZPC3XC253BANCNFSM6AAAAAAVPS7CDI>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Is model large-v2 implemented or going to be implemented? Here's what that page says about it: "The large-v2 model on average shows about 5% relative error reduction in English and about 10% in other languages, but please note that it may behave differently depending on the individual audio and in some cases perform worse than large-v1." Here it is on GitHub it's also on Hugging Face |
Beta Was this translation helpful? Give feedback.
-
A few questions:
|
Beta Was this translation helpful? Give feedback.
-
@Veneration1 If you can get Subtitle Edit to work on a Mac, with Wine or something, it has a file where you can add automatic corrections: |
Beta Was this translation helpful? Give feedback.
-
Looks interesting . Glad to see a project got updated continually 👍🏻 May I ask what's the main differences between this and MacWhisper if you don't mind? 😁 BTW when it's working on Windows, I do hope it could supports GPU for speed-up processing. Thank you. |
Beta Was this translation helpful? Give feedback.
-
@shruru As mentioned previously in this thread, for a Windows implementation that utilizes the GPU, you might want to look into this: https://github.com/Const-me/Whisper/releases/ |
Beta Was this translation helpful? Give feedback.
-
Another Windows app is whispercppGUI https://github.com/Topping1/whispercppGUI |
Beta Was this translation helpful? Give feedback.
-
![]() EDIT: We have now released a new version of WhisperScript, with several improvements, including a video player, batch processing and improved performance overall. We'd love to hear your thoughts on the new design! Join the discord to hear about our latest developments. Exciting news! We're back with an impactful update to improve your transcription experience, making it more efficient for language learners, subtitle creators, interview analysts, or those combing through their media libraries. We're introducing WhisperScript v1.3.4, and here's what's new: New in WhisperScript v1.3.4:
These features are available to Pro Users, but Lite Users will also get several UI improvements and bug fixes:
We’re still actively developing additional features, and we’d love to invite you to join the Discord to be a part of the process, suggest features and report bugs: https://discord.gg/b9TYCgC6 Download the latest version here: https://getwavery.com We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected]. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Greate job! |
Beta Was this translation helpful? Give feedback.
-
GJ,can it be used for ytb or twitch live streaming? |
Beta Was this translation helpful? Give feedback.
-
I made a open-source alternative with basic features here. Feel free to fork, contribute, or do whatever you need! |
Beta Was this translation helpful? Give feedback.
-
Hi there, does it only transcribe, or it can do "translate" as well? |
Beta Was this translation helpful? Give feedback.
-
When windows? T.T |
Beta Was this translation helpful? Give feedback.
-
whisperscript-05-search-replace.movExciting update! WhisperScript v2.0 is here, re-written from the ground up with React and Typescript, with a whole new architecture and improved features. Check out what’s new in this release! Wavery Accounts:
What This Means for You:
Thank you for being part of the WhisperScript community! We’re thrilled to continue evolving the app with these improvements and more to come. For any questions or assistance, reach out at [email protected]. We’re here to help! WhisperScript v2.0 is designed to optimize your transcription workflow, whether you're working on interviews, media analysis, or multilingual transcription. As always, we appreciate your feedback and suggestions to help us keep improving! Download the latest version here: https://getwavery.com Have ideas or feedback? Reach out to us at [email protected] or join our community on Discord to connect with other WhisperScript users: https://discord.gg/b9TYCgC6. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Update Incoming 🚀 WhisperScript is Now on Windows! 🎉 We’re excited to announce that WhisperScript is finally available for Windows! 🖥️ We know many of you have been looking for a transcription tool on PC with the same features and intuitive UI as our Mac version. After extensive development and testing, we’re proud to say that feature parity between Mac and Windows is now here! 💡 What This Means:
We have also heard from some that a Linux version would be nice, however, we realized when building the Windows version that we will likely not have the capacity to build and keep updated 3 different platforms/architectures. If you would like to help us work on the Linux version, reach out. So, now’s the time to check it out. We’re looking forward to your feedback! 👉 Download the Windows version now! Download for Windows on our site As always, let us know if you run into any issues or have suggestions—we’re committed to making WhisperScript the best transcription tool for all creators. Happy transcribing! ✨ ** Installation Note: some may see the Windows SmartScreen window when running WhisperScript for the first time, you can click "More Info" and then "Run Anyway" to install: |
Beta Was this translation helpful? Give feedback.
-
Thanks to the work of @ggerganov and with inspiration from @jordibruin, @kai-shimada and I were able to implement Whisper in a desktop app built with the Electron framework. The app runs on both Mac (Apple Silicon) and Windows. You can download it here: Whisperscript
Currently, our features in Version 2 include:
Super Fast and Responsive ⚡: We’ve completely rebuilt WhisperScript for speed and stability. With a fresh visual update, the app is now more responsive and intuitive to use.
Video Viewer Support 🎥: Now transcribe directly from video files in formats like MOV, MP4, and MKV, and view the files in the new video player, streamlining the workflow for video and subtitle creators.
👨🦰 Speaker Separation: (Beta on MacOS; coming soon on Windows)
Enhanced Toolbar 🛠️: Improved access to essential editing actions, making it easier to navigate and refine your transcripts.
Search & Replace 🔍: A new feature for quick text adjustments, although still in beta, making changes across the entire transcript faster.
Merge Segments by Sentence: Combine segments seamlessly by sentence, available in Edit > Merge Segments by Sentence, for a cleaner transcript flow.
Bookmarks 📌: Mark key points in your transcripts for easier reference and navigation.
Real-Time Transcription Progress: Monitor the progress of your transcription as it happens, enhancing visibility and control.
Abort Running Transcriptions: Easily stop a transcription in progress via the progress bar or by closing the active tab.
Transcription Queue Control: Delete tabs to individually abort or remove queued transcriptions without disrupting active processes.
Filename Dropdown Actions 📝: New dropdown in the header to reveal the file in Finder, re-transcribe, or close the transcript. (Feedback on visibility is welcome—let us know if this action menu is too hidden!)
Large v3 Turbo Model Support 🚀: Experience the speed of Large V3 Turbo—twice as fast as the original Large V3 model and comparable to Distill Whisper speeds.
Expanded Export Options 📄: Export transcripts to HTML, JSON, PDF, Word, and RTF, enabling more flexibility in sharing and documentation.
Open Recent Projects: Quickly access recent projects without navigating through file directories.
Multi-device support: with Wavery accounts, use your software on up to 2 devices.
We hope you enjoy it and let us know if there are any other features you would like to see. You can reach us at [email protected]
Beta Was this translation helpful? Give feedback.
All reactions