Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Siri doesn't need to have conversations with you. ChatGPT can do that. But, it should be able to do actions you'd do on your phone.


Speech to text should work. I regularly have to manually edit the transcribed input. The more special words the more frequent. Completely disregards the context of the current input, for example, on Hacker news might involve special technical and IT vocabulary.


> Completely disregards the context of the current input, for example, on Hacker news might involve special technical and IT vocabulary.

Does any voice assistant do this right now? Genuine question, I don't actually know. It sounds useful as long as it's not invasive.


Any of the LLM-based ones should pull this* off - so that's to say.. none of the popular commercially available ones, yet?

Alexa+ does, but I don't use it for anything except kitchen timers and home automation triggers, so I can't speak to how well it works in a longer conversation.

Zoom's meeting notes excels at this, Google Meet is terrible at it. Meet mishears our company name about 90% of the time; various attendee names are a coin toss.

* "this" being: context consideration in speech-to-text/transcription.


Pretty straight forward on Android at least to wire up a harness that talks to Tasker[0] or another full automation app.

[0] https://tasker.joaoapps.com/


The iOS equivalent would be Shortcuts, which, while not as powerful as Tasker depending on the context, is an official Apple feature that most apps support. Claude and ChatGPT both have various Shortcuts hooks, including voice conversation.


The experience of having to tell Siri to "Ask ChatGPT <about something>" really sucks, though. It doesn't consistently do it, the handoff frequently just stalls out and you never get a response, the transcription that gets passed to ChatGPT is low quality, etc.

And though I have the feature enabled that should cause it to ask ChatGPT about things it can't answer, that works even less frequently.

But even if all of these things were true, the stuff on your phone you would expect to be exposed to the model as available tool calls, are not. So their efficacy is very limited.

(edit: iPhone 16 Pro Max, if anyone is curious)


Oh I was just thinking creating a shortcut that you'd tap on your Home Screen/control shade (whatever it's called) to activate ChatGPT, or wire up to the action button. I forgot you can have Siri do the "ask ChatGPT xyz" thing – I agree, that integration sucks.


I'd definitely do the former. I don't even think this is specific to ChatGPT or Claude's apps.

There seems to be something about how intents get triggered by Shortcuts on iOS that feels flaky to me. Whenever some app suggests a shortcut (most recently Starbucks promoted a shortcut that orders your "usual"), the success rate when I tap it is <50%.

It's possible it's uniquely worse on my device, since I haven't done a "clean install" (vs letting the device upgrade flow copy over) in like a decade. But I'm also not up for dealing with the pain of setting up from scratch just to find out it's bad on a fresh profile, either.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: