Channels ▼
RSS

Embedded Systems

Can Robots See Like Humans?


Jetpac CTO Pete Warden has blogged news this week of his team's successful port of the Deep Belief image recognition SDK to the Raspberry Pi.

This is exciting (says Warden) because it shows (or at least it is one example of the fact) that even tiny, cheap devices are capable of performing sophisticated computer vision tasks.

The Deep Belief Teacher App works such that it helps users teach their phone (or small form factor handheld) to recognize an object by taking a short video of it — the user then teaches the application "what is not" the object by taking a short video around the target object and of everything except the object.

Subsequent to the above action the user is then able to scan their entire surroundings with the phone (or other) camera and the app will detect when the user is pointing at the target object.

With the DeepBelief SDK, developers can build object recognition into iOS/Android apps and now the Raspberry Pi, giving phones and the Raspberry Pi the ability to see.

Warden says that he has talked a lot about how "object detection is going to be commoditized and ubiquitous" in the future.

"I can process a frame in around three seconds, largely thanks to heavy use of the embedded GPU for heavy lifting on the math side. I had to spend quite a lot of time writing custom assembler programs for the Pi's 12 parallel 'QPU' processors, but I'm grateful I could get access at that low a level," he said.

The demo is a lot of fun to try out.

The company has been using deep belief networks extensively for image recognition across hundreds of millions of Instagram photos, and now it's excited to bring some of that functionality to the masses and see how developers take advantage of the phone and Raspberry Pi's newfound ability to "see."


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 

Comments:

Video