I’m starting to develop arthritis in my fingers, which makes typing an interesting challenge.

I’m wondering, what is the best self-hosted solution for speech-to-text generally? Dragon dictate use to be the thing, but is there anything open source, self-hostable that’s superseded it?

I would love to be able to have something that I can speak into that can interact with pretty much any app, be that notepad++, or my web browser when I’m entering stuff or even when I’m creating this Lemmy post (which I actually made using futo voice on my phone).

Windows and/or Linux ideally.

Any leads? Getting old sucks.

  • JustEnoughDucks@slrpnk.net
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 hours ago

    Hey there!

    This is a job for Handy. I recently started using it with the Parakeet recommended model. It works very very well. Whisper packaged with Wyoming that I run is pretty much unusable for dutch but Parakeet in Handy seems to work quite well. Direct input of text into whatever program you want. Downloads a small local model and works only on your computer. Push-to-talk or toggle, tons of customizations.

    Note that this just speech to text and the models aren’t made for post processing. So it will dictate exactly what you say, umms and everything. If you want cleanup, formatting, etc… Then you have to rely on a much more general model, but the makers are experimenting with an opt-in for that for various external AIs.