Dr. Dobb's is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Channels ▼
RSS

Google Patents Voice Search


Conversing with computers, long the preferred mode of interaction between man and machine in science fiction, just moved closer to reality. Google Inc. on Tuesday was granted a patent for a voice interface for search engines.

Then again, it might just be a pipedream. Google spokesperson Barry Schnitt issued this characteristically opaque comment on the company's plans: "Like many companies, we file patent applications on a variety of ideas that our employees may come up with. Some of those ideas later mature into real products or services, some don't. Prospective product announcements should not be inferred from our patent applications."

Filed over five years ago, the document at least confirms Google's longstanding commitment to extending its technology beyond the desktop to mobile devices, something seen in the company's recent series of mobile-oriented moves, such as its launch of mobile phone search ads in Japan last week.

So assume it's true that "there exists a need for a voice interface that is effective for search engines," as Google states in the patent filing. It follows then that Google, having developed such an interface, might just bring it to market.

"You can expect there will be more mobile search products coming out," says Dan Miller, an analyst with Opus Research Inc. "Speech as a mode for doing searches is getting more reliable." Miller suggests that Web search advertisers are salivating over the prospect of reaching hundreds of millions of prospective phone users.

That may be why both Google and Yahoo have a large number of people on staff working on speech-related issues. It's an area of such promise that both companies have engaged in aggressive recruiting.

Kai-Fu Lee, VP of engineering and president of Google China, used to work for Microsoft on speech technology. His defection to Google brought a lawsuit from Microsoft that played out during the latter half of 2005.

Last September, Nuance Communications, a speech and imaging technology company, sued Yahoo, alleging that the search company poached 13 engineers. That case has since been settled, but the terms weren't disclosed. "We don't confirm employee numbers in any context," Yahoo spokesperson Kiersten Hollars said in an E-mail.

The patent document suggests that Google will be leveraging its logs of stored text and audio queries to improve speech recognition and relevancy, as the company does currently with text keywords. It says, "The query logs may consist of audio data (i.e., a recorded query) and/or a textual transcription of the audio data. ...The query logs may also consist of typed query logs from, for example, a text-based search engine."

The existence of audio query logs will likely prove as tempting to the Department of Justice and other entities as text search data. But audio queries are likely to contain even more information than text queries in the form of background noise. Anything said in the area of someone speaking into his or her phone could potentially be recorded and stored with GPS coordinates and a time stamp.

It's arguable that such concerns border on paranoia. But as Google becomes the repository for ever more revealing data, tin foil hats don't seem so absurd, particularly given the recent accusations that AT&T has been diverting Internet traffic wholesale to the National Security Agency for data mining.


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.