share this!
5
5
Share
Email

June 20, 2018

Stereo vision using computing architecture inspired by the brain

by Alexander Andreopoulos, IBM

The Brain-Inspired Computing group at IBM Research-Almaden will be presenting at the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018) our most recent paper titled "A Low Power, High Throughput, Fully Event-Based Stereo System." The paper describes an end-to-end stereo vision system that uses exclusively spiking neural network computation and can run on neuromorphic hardware with a live streaming spiking input. Inspired by the human vision system, it uses a cluster of IBM TrueNorth chips and a pair of digital retina sensors (also known as Dynamic Vision Sensors, DVS) to extract the depth of rapidly moving objects in a scene. Our system captures scenes in 3-D with low power, low latency and high throughput, which has the potential to advance the design of intelligent systems.

What is stereo vision?

Stereo vision is the perception of depth and 3-D structure. When you look at an object, for example, your eyes produce two disparate images of it because their positions are slightly different. The disparities between the two images are processed in the brain to generate information about the object's location and distance. Our system replicates this ability for computers. The relative positions of an object in images from the two sensors are compared, and the object's location in 3-D space is computed via triangulation of those data.

Stereo vision systems are used in intelligent systems for industrial automation (completing tasks such as bin picking, 3-D object localization, volume and automotive part measurement), autonomous driving, mobile robotics navigation, surveillance, augmented reality, and other purposes.

Neuromorphic technology

Our stereo vision system is unique because it is implemented fully on event-based digital hardware (TrueNorth neurosynaptic processors), using a fully graph-based non von-Neumann computation model, with no frames, arrays, or any other such common data structures. This is the first time that an end-to-end real-time stereo pipeline is implemented fully on event-based hardware connected to a vision sensor. Our work demonstrates how a diverse set of common sub-routines necessary for stereo vison (rectification, multi-scale spatio-temporal stereo correspondence, winner-take-all, and disparity regularization) can be implemented efficiently on a spiking neural network. This architecture uses much less power than conventional systems, which could benefit the design of autonomous mobile systems.

Furthermore, instead of conventional video cameras, which capture a scene as a series of frames, we use a pair of DVS cameras, which respond only to changes in the scene. This results in less data, lower energy consumption, high speed, low latency, and good dynamic range, all of which are also key to the design of real-time systems.

Both the processors and the sensors mimic human neural activity by representing data as asynchronous events, much like neuron spikes in the brain. Our system builds upon the early influential work of Misha Mahowald in the design of neuromorphic systems. The Brain-Inspired Computing group previously designed an event-based gesture-recognition system using similar technology.

Our end-to-end stereo system connects a pair of DVS event cameras (iniLabs DAVIS240C models) via USB to a laptop, which distributes the computation via ethernet to a cluster of nine TrueNorth processors. Each TrueNorth processor is responsible for the stereo disparity calculations on a subset of the input. In other words, this is a scale-out approach to the computation of stereo, since the system enables, in principle, the addition of many more TrueNorth processors in order to process larger inputs.

The DAVIS cameras provide two 3.5-mm audio jacks, enabling the events produced by the two sensors to be synchronized. This is critical to the system design. The disparity outputs of the TrueNorth chips are then sent back to the laptop, which converts the disparity values to actual 3-D coordinates. An openGL-based visualizer running on the laptop enables the user to visualize the reconstructed scene from any viewpoint. The live-feed version of the system running on nine TrueNorth chips is shown to calculate 400 disparity maps per second with up to 11-ms latency and a ~200X improvement in terms of power per pixel per disparity map compared to the closest state-of-the-art. Furthermore, the ability to increase this up to 2,000 disparities per second (subject to certain trade-offs) is discussed in the paper.

More information: A Low Power, High Throughput, Fully Event-Based Stereo System: researcher.watson.ibm.com/rese … aandreo/cvpr2018.pdf

Provided by IBM

Citation: Stereo vision using computing architecture inspired by the brain (2018, June 20) retrieved 24 April 2024 from https://phys.org/news/2018-06-stereo-vision-architecture-brain.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Brain-inspired supercomputing system takes spotlight in IBM, US Air Force Research Lab collab

10 shares

Feedback to editors

Scientists at the MAJORANA Collaboration look for rule-violating electrons

46 minutes ago

Lunar landforms indicate geologically recent seismic activity on the moon

1 hour ago

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

4 hours ago

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

12 hours ago

Star bars show universe's early galaxies evolved much faster than previously thought

13 hours ago

Scientists study lipids cell by cell, making new cancer research possible

13 hours ago

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

13 hours ago

Study finds rekindling old friendships as scary as making new ones

16 hours ago

How light can vaporize water without the need for heat

16 hours ago

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

17 hours ago

Load comments (0)

Stereo vision using computing architecture inspired by the brain

What is stereo vision?

Neuromorphic technology

Scientists at the MAJORANA Collaboration look for rule-violating electrons

Lunar landforms indicate geologically recent seismic activity on the moon

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

Star bars show universe's early galaxies evolved much faster than previously thought

Scientists study lipids cell by cell, making new cancer research possible

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

Study finds rekindling old friendships as scary as making new ones

How light can vaporize water without the need for heat

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

Relevant PhysicsForums posts

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Brain-inspired supercomputing system takes spotlight in IBM, US Air Force Research Lab collab

Dynamic Vision Sensor tech works like human retina

IBM going TrueNorth in system lookout for seizures

Visual perception: Vertical disparity corrects stereo correspondence

'Spectacular' finding: New 3-D vision discovered in praying mantis

As Moore's law ends, brain-like computers begin

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Stereo vision using computing architecture inspired by the brain

What is stereo vision?

Neuromorphic technology

Scientists at the MAJORANA Collaboration look for rule-violating electrons

Lunar landforms indicate geologically recent seismic activity on the moon

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

Star bars show universe's early galaxies evolved much faster than previously thought

Scientists study lipids cell by cell, making new cancer research possible

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

Study finds rekindling old friendships as scary as making new ones

How light can vaporize water without the need for heat

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

Relevant PhysicsForums posts

Related Stories

Brain-inspired supercomputing system takes spotlight in IBM, US Air Force Research Lab collab

Dynamic Vision Sensor tech works like human retina

IBM going TrueNorth in system lookout for seizures

Visual perception: Vertical disparity corrects stereo correspondence

'Spectacular' finding: New 3-D vision discovered in praying mantis

As Moore's law ends, brain-like computers begin

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience