This looks like it'll slurp up all your data and upload it into a cloud. Thanks,...

FloatArtifact · on April 9, 2025

Local inference only is an absolute requirement. It's not even really all that accessible if it's online only. I can say this as someone that's used over 20000 hours worth of voice dictation and computer control.

canada_dry · on April 10, 2025

First thing I looked for and read: the FAQ.

No mention of privacy (or on prem) - so assumed it's 100% cloud.

Non-starter for me. Accuracy is important, but privacy is more so.

Hopefully a service with these capabilities will be available where the first step has the user complete a brief training session, sends that to the cloud to tailor the recognition parameters for their voice and mannerisms... then loads that locally.

oulipo · on April 11, 2025

A similar but offline tool is VoiceInk, it's also open-source so you can extend it

pokstad · on April 9, 2025

This should be on the FAQ. I was trying to find out if it was 100% processed locally.

jmcintire1 · on April 10, 2025

fair point. offline+local would be ideal, but as it stands we can't run asr and an llm locally at the speed that is required to provide the level of service we want to.

given that we need the cloud, we offer zero data retention -- you can see this in the app. your concern is as much about ux and communications as it is privacy

fxtentacle · on April 10, 2025

The problem if you actually need the cloud is that it kind of completely destroys your business model. OpenAI is bleeding money every month because they massively subsidize the hosting cost of their models. But eventually they will have to post a profit. And then if they know that your product is completely dependent on their API, they can milk you until there's no profits left for you.

And self-hosting real-time streaming LLMs will probably also come out at 50 cents per hour. Arguing a $120/month price for power users is probably going to be very difficult. Especially so if there is free open-source alternatives.

mrtesthah · on April 10, 2025

MacWhisper does realtime system-wide dictation on your local machine (among other things). Just a one-time fee for an app you download -- the way shareware is supposed to be. Of course it doesn't use MoE transcription with 6 models like Aqua Voice, but if you guys expect to be acquired by Apple (that is your exit strategy, right?), you're going to need better guarantees of privacy than "we don't log".

shinycode · on April 10, 2025

I downloaded the turbo whisper model optimized for Mac, created a python script that get the mic input and paste the result. The python script is LLM generated and it works with pushing a key. For 80% of the functionality for free and done locally.

toddmorey · on April 10, 2025

And man it's another monthly subscription. I'm not mad at them for finding a gap in the market and putting a business around it. I'm mad at Apple for leaving that gap... hopefully built in voice dictation improves quickly.

FireBeyond · on April 10, 2025

Is there a gap in the market? It's being rapidly filled with the likes of MacWhisper, etc., which offer local-only, one-off pricing.

pablopeniche · on April 11, 2025

"hopefully built in voice dictation improves quickly." I would not hold my breath on that one lol

jackthetab · on April 9, 2025

Agreed.

This is where I bounce (out of this discussion).

thmsmlr · on April 9, 2025

I totally agree, I created BetterDictation (.com) exactly because of that. Offline was a super important requirement for me.