Apple patent reveals a text-to-speech and speech-to-text system for the iPhone

Apple recently filed a patent application for a text-to-speech and a speech-to-text converter designed to work in noisy environments. The patent describes a system that uses a converter included on the logic board of the phone. This hardware-based conversion would have a distinct advantage over current text-to-speech systems, which use an internet-based service from a company like Nuance to translate conversions.
Unlike Android's text-to-speech system, which is used for searching and navigation, Apple's patent describes a system used for sending and receiving phone calls. In one embodiment, a microphone on the iPhone would detect the ambient noise level and prompt the user to answer a call using text-to-speech in a noisy environment when talking on the phone may be difficult. The person answering the call would type in their messages, and the phone would convert it to speech heard by the caller.
In another example, the user could choose to talk via a two-way texting system that uses both text-to-speech and a speech-to-text within the conversation. Basically, your caller's words would be converted to text that you could read, and you could input a text response that is converted to speech for the caller on the other end. It's an elegant system that would be useful for making phone calls at a loud sporting event or a crowded bar.
Apple is rumored to be in talks with Nuance that could bring an advanced speech recognition system to iOS. This above patent may describe a small part of what is to come for iPhone owners in the future.
Share
Apple recently filed a patent application for a text-to-speech and a speech-to-text converter designed to work in noisy environments....
Add a Comment
I believe the author is slightly confused on the definitions of the two terms. This is easy to do if you do not spend your life "having conversations with technology".
Text-to-Speech (TTS): is when the computer converts text on the screen to speech. The best example of this on the iPhone is VoiceOver. Everyone who uses an iPhone in a risky situation such a driving should learn how to use VoiceOver so you don't have to take your eyes off the road.
Speech-to-Text (aka Speech Recognition): There are two forms of this, speaker independent (limited vocabulary) and speaker dependent (requires some training). An example of the speaker independent type is the Voice Command (not very useful in current form), it knows few commands and almost anyone can say them.
Speaker Dependant would be Dragon Natrally Speaking which before you can use it you must proform extensive training.
Sounds like that'd be pretty stellar for deaf people as well!
May 13 2011 at 12:48 PM Report abuse Permalink rate up rate down ReplyI use dragon speech which saves me hours typing!!
May 13 2011 at 12:06 PM Report abuse Permalink rate up rate down ReplyThat. is. awesome. Not needing an internet connection is a huge plus. Also working in noisy environments, the various text-to-speech and vice versa. This could be a really great addition to iOS.
May 13 2011 at 11:45 AM Report abuse Permalink rate up rate down ReplyWell its about damn time!
May 13 2011 at 11:11 AM Report abuse Permalink rate up rate down ReplyDeals of the Day
more deals- Apple iPhone 4 8GB for Verizon, AT&T, or Sprint for $50 + pickup at Best Buy
- Unlocked iPhone 4S 16GB for GSM (AT&T, T-Mobile) for $619 + free shipping
- Apple iMac Core i7 Quad 3.4GHz 27" w/ 24GB RAM, 2TB HDD for $2,677 + $29 s&h
- Used Apple Magic Mouse for $36 + $4 s&h
- 9-Piece iPhone Bundle, includes 1,900mAh battery for $8 + free shipping
- Skullcandy Riot Earbud Headphones for $10 + free shipping
7 Comments