Spherical forums

Sphere Development => Sphere Support => Topic started by: JasonLewis on June 19, 2013, 09:39:36 am

Title: Speech Recognition
Post by: JasonLewis on June 19, 2013, 09:39:36 am
G'day folks.

It's been a long time since I've used Sphere and I'm curious if it can leverage any form of speech recognition or speech-to-text.

Cheers.
Title: Re: Speech Recognition
Post by: DaVince on June 19, 2013, 05:47:28 pm
Heya.

The short answer: no.

The longer answer:
Regular Sphere doesn't, since it doesn't have any sort of sound recording.

TurboSphere probably could, if someone actually decided to write a (C++) plugin that accesses a speech-to-text library. But TurboSphere is still in beta. It also would require extra effort to write that plugin, and it would probably require some more effort if you want to make it cross platform (different STT engines?).
Title: Re: Speech Recognition
Post by: Flying Jester on June 19, 2013, 08:11:39 pm
Same as DaVince said, short answer no; but it's not impossible for it to happen. TurboSphere's audio library of choice, BASS, has good capabilities for recording. But they aren't being used as it is.

So if you want to play with C or C++, learn a little bit about V8, and find another library to turn the recorded sound into text, then yes. But it would not be easy.

The TurboSphere plugin interface is cross platform, especially if you use the tools in 'plugins.h'. You could write a platform-specific plugin, but there's nothing built in that causes that at all.
Title: Re: Speech Recognition
Post by: DaVince on June 20, 2013, 07:20:15 am
Well, I was thinking it would mainly be the STT engine being the determining factor on how portable it is. I really have no clue if there are any out there, which is why I said that. Though maybe Google's has an API for that (though you'd have to send the speech data online if you do that)?
Title: Re: Speech Recognition
Post by: N E O on June 20, 2013, 04:55:56 pm
Kinda also begs the question "Which speech recognition is better: MS, Apple, Google, Nuance (Dragon Naturally Speaking), or some random 3rd-party's?"
Title: Re: Speech Recognition
Post by: JasonLewis on June 22, 2013, 03:19:00 am
I am by no means a C or C++ developer, but this sort of thing is interesting so I might have a look into it. But not sure on how portable it would be. This is just experimenting really.