• 0 Posts
  • 15 Comments
Joined 1 year ago
Cake day: June 12th, 2023

  • Only transcription. It outputs to a few formats that amount to plain text with or without time codes, including SRT subtitles. It transcribes really well. One thing of note: with more technical discussions I sometimes get better results using the smaller models. My best theory is that with the smaller models, technical words are less likely to be assumed to be an accent/variation of some other word.


  • Not sure if you have tried or heard of Whisper. It automatically transcribes audio; I use it for meetings and lectures that don't come with closed captioning. It supports audio and video files and a few languages. I had tried a few solutions with mixed results (e.g. Google is slow, and many places limit lengths/sizes). IBM is supposed to be the best free/low-cost cloud model, but they would never approve my accounts. In the end, running Whisper locally in an Anaconda/Python environment was the best cheap option for me.
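
For anyone wanting to try the local setup described in these comments, here is a rough sketch of what it might look like in Python. It assumes the `openai-whisper` package is installed in the environment and that its `load_model` and `transcribe` calls return segments with `start`, `end`, and `text` keys. The helper names (`format_timestamp`, `segments_to_srt`, `transcribe_to_srt`) and the default model size are made up for illustration, not taken from the comments. The Whisper command-line tool can write SRT on its own, so this is only meant to show how the time-coded output maps to subtitle text.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Render a list of {'start', 'end', 'text'} dicts as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        text = seg["text"].strip()
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "small") -> str:
    """Hypothetical wrapper: transcribe a file locally and return SRT text.

    Assumes the openai-whisper package; it is imported lazily so the
    formatting helpers above work without it.
    """
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_srt(result["segments"])
```

Calling `transcribe_to_srt("lecture.mp4", model_name="base")` with a smaller model name is one way to try the observation about technical vocabulary; the file name here is a placeholder.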


  • What's funny is I keep hearing about bots and operatives on Lemmy that go around promoting things that aren't the status quo, like we are the next social media. And the kind of people saying it think everyone needs to stick to a party line or else! Like, or else what? If I do not love everything Big Dem is pushing, I am a Russian shill bot trying to destroy America. The downvotes you have are from people barely able to form thoughts past doing what they are told.