Clever math could enable a high-quality 3-D camera for cellphones
January 6, 2012 by Larry Hardesty
Depth-sensing cameras can produce 'depth maps' like this one, in which distances are depicted as shades on a gray-scale spectrum (lighter objects are closer, darker ones farther away). Image: flickr/Dominic
When Microsofts Kinect -- a device that lets Xbox users control games with physical gestures -- hit the market, computer scientists immediately began hacking it. A black plastic bar about 11 inches wide with an infrared rangefinder and a camera built in, the Kinect produces a visual map of the scene before it, with information about the distance to individual objects. At MIT alone, researchers have used the Kinect to create a Minority Report-style computer interface, a navigation system for miniature robotic helicopters and a holographic-video transmitter, among other things.
Now imagine a device that provides more-accurate depth information than the Kinect, has a greater range and works under all lighting conditions but is so small, cheap and power-efficient that it could be incorporated into a cellphone at very little extra cost. Thats the promise of recent work by Vivek Goyal, the Esther and Harold E. Edgerton Associate Professor of Electrical Engineering, and his group at MITs Research Lab of Electronics.
3-D acquisition has become a really hot topic, Goyal says. In consumer electronics, people are very interested in 3-D for immersive communication, but then theyre also interested in 3-D for human-computer interaction.
Andrea Colaco, a graduate student at MITs Media Lab and one of Goyals co-authors on a paper that will be presented at the IEEEs International Conference on Acoustics, Speech, and Signal Processing in March, points out that gestural interfaces make it much easier for multiple people to interact with a computer at once as in the dance games the Kinect has popularized.
When youre talking about a single person and a machine, weve sort of optimized the way we do it, Colaco says. But when its a group, theres less flexibility.
Ahmed Kirmani, a graduate student in the Department of Electrical Engineering and Computer Science and another of the papers authors, adds, 3-D displays are way ahead in terms of technology as compared to 3-D cameras. You have these very high-resolution 3-D displays that are available that run at real-time frame rates.
Sensing is always hard, he says, and rendering it is easy.
Clocking in
Like other sophisticated depth-sensing devices, the MIT researchers system uses the time of flight of light particles to gauge depth: A pulse of infrared laser light is fired at a scene, and the camera measures the time it takes the light to return from objects at different distances.
Traditional time-of-flight systems use one of two approaches to build up a depth map of a scene. LIDAR (for light detection and ranging) uses a scanning laser beam that fires a series of pulses, each corresponding to a point in a grid, and separately measures their time of return. But that makes data acquisition slower, and it requires a mechanical system to continually redirect the laser. The alternative, employed by so-called time-of-flight cameras, is to illuminate the whole scene with laser pulses and use a bank of sensors to register the returned light. But sensors able to distinguish small groups of light particles photons are expensive: A typical time-of-flight camera costs thousands of dollars.
The MIT researchers system, by contrast, uses only a single light detector a one-pixel camera. But by using some clever mathematical tricks, it can get away with firing the laser a limited number of times.
The first trick is a common one in the field of compressed sensing: The light emitted by the laser passes through a series of randomly generated patterns of light and dark squares, like irregular checkerboards. Remarkably, this provides enough information that algorithms can reconstruct a two-dimensional visual image from the light intensities measured by a single pixel.
In experiments, the researchers found that the number of laser flashes and, roughly, the number of checkerboard patterns that they needed to build an adequate depth map was about 5 percent of the number of pixels in the final image. A LIDAR system, by contrast, would need to send out a separate laser pulse for every pixel.
To add the crucial third dimension to the depth map, the researchers use another technique, called parametric signal processing. Essentially, they assume that all of the surfaces in the scene, however theyre oriented toward the camera, are flat planes. Although thats not strictly true, the mathematics of light bouncing off flat planes is much simpler than that of light bouncing off curved surfaces. The researchers parametric algorithm fits the information about returning light to the flat-plane model that best fits it, creating a very accurate depth map from a minimum of visual information.
On the cheap
Indeed, the algorithm lets the researchers get away with relatively crude hardware. Their system measures the time of flight of photons using a cheap photodetector and an ordinary analog-to-digital converter an off-the-shelf component already found in all cellphones. The sensor takes about 0.7 nanoseconds to register a change to its input.
Thats enough time for light to travel 21 centimeters, Goyal says. So for an interval of depth of 10 and a half centimeters Im dividing by two because light has to go back and forth all the information is getting blurred together, he says. Because of the parametric algorithm, however, the researchers system can distinguish objects that are only two millimeters apart in depth. It doesnt look like you could possibly get so much information out of this signal when its blurred together, Goyal says.
The researchers algorithm is also simple enough to run on the type of processor ordinarily found in a smartphone. To interpret the data provided by the Kinect, by contrast, the Xbox requires the extra processing power of a graphics-processing unit, or GPU, a powerful special-purpose piece of hardware.
This is a brand-new way of acquiring depth information, says Yue M. Lu, an assistant professor of electrical engineering at Harvard University. Its a very clever way of getting this information. One obstacle to deployment of the system in a handheld device, Lu speculates, could be the difficulty of emitting light pulses of adequate intensity without draining the battery.
But the light intensity required to get accurate depth readings is proportional to the distance of the objects in the scene, Goyal explains, and the applications most likely to be useful on a portable device such as gestural interfaces deal with nearby objects. Moreover, he explains, the researchers system makes an initial estimate of objects distance and adjusts the intensity of subsequent light pulses accordingly.
The telecom giant Qualcomm, at any rate, sees enough promise in the technology that it selected a team consisting of Kirmani and Colaco as one of eight winners out of 146 applicants from a select group of universities of a $100,000 grant through its 2011 Innovation Fellowship program.
This video is not supported by your browser at this time.
Provided by
Massachusetts Institute of Technology
This story is republished courtesy of MIT News (http://web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.
-
From lemons to lemonade: Reaction uses carbon dioxide to make carbon-based semiconductor,
32 comments
-
Thioridazine kills cancer stem cells in human while avoiding toxic side-effects of conventional cancer treatments,
3 comments
-
SpaceX private rocket blasts off for space station (Update),
42 comments
-
Climate scientists say they have solved riddle of rising sea,
31 comments
-
SpaceX capsule has 'new car' smell, astronauts say (Update),
4 comments
-
Need a rigid insulation material???
22 hours ago
-
magnets or EMF in car bumpers to protect from fender bender
May 26, 2012
-
length of wire in a coil of known dimensions?
May 25, 2012
-
India Engineering Powerhouse
May 25, 2012
-
electromagnet core dereference between hard and soft iron
May 25, 2012
-
Measuring water pressure in an open tank
May 24, 2012
- More from Physics Forums - General Engineering
More news stories
Browser wars flare in mobile space
The browser wars are heating up again, but this time the fight is for dominance of the mobile Internet.
15 hours ago |
5 / 5 (2) |
3
Probability of contamination from severe nuclear reactor accidents is higher than expected: study
Catastrophic nuclear accidents such as the core meltdowns in Chernobyl and Fukushima are more likely to happen than previously assumed. Based on the operating hours of all civil nuclear reactors and the number ...
Technology / Energy & Green Tech
May 22, 2012 |
3.6 / 5 (25) |
56
|
HyperSolar shows dirty water no barrier to power world
(Phys.org) -- The Santa Barbara, California, company, HyperSolar, is set to transparently share the ups and downs of its research experiences toward the companys ultimate vision, successfully producing ...
SpotterRF debuts Radar Backpack Kit (w/ Video)
(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...
Tesla to launch electric sedan in US on June 22
Tesla Motors said Tuesday it would begin deliveries of "the world's first premium electric sedan" on June 22, slightly ahead of schedule.
Technology / Energy & Green Tech
May 22, 2012 |
4.5 / 5 (12) |
18
Stunning image of smallest possible five-ringed structure
Scientists have created and imaged the smallest possible five-ringed structure about 100,000 times thinner than a human hair and you'll probably recognise its shape.
'Unzipped' carbon nanotubes could help energize fuel cells, batteries
Multi-walled carbon nanotubes riddled with defects and impurities on the outside could replace some of the expensive platinum catalysts used in fuel cells and metal-air batteries, according to scientists at ...
Change in developmental timing was crucial in the evolutionary shift from dinosaurs to birds: study
At first glance, it's hard to see how a common house sparrow and a Tyrannosaurus Rex might have anything in common. After all, one is a bird that weighs less than an ounce, and the other is a dinosaur that ...
Computer model used to pinpoint prime materials for efficient carbon capture
When power plants begin capturing their carbon emissions to reduce greenhouse gases and to most in the electric power industry, it's a question of when, not if it will be an expensive undertaking.
T cells 'hunt' parasites like animal predators seek prey, study shows
By pairing an intimate knowledge of immune-system function with a deep understanding of statistical physics, a cross-disciplinary team at the University of Pennsylvania has arrived at a surprising finding: T cells use a movement ...
Land and sea species differ in climate change response: study
(Phys.org) -- Marine and terrestrial species will likely differ in their responses to climate warming, new research by Simon Fraser University and Australia’s University of Tasmania has found.
Jan 06, 2012
Rank: 1 / 5 (3)
After reading the article, I'm extremely skeptical that this system is more practical than using 2 nominally passive CCD sensors.
For imaging, the lower power requirements and low processing requirements render this new technology moot.
For gesture recognition you have, power requirement, questionable practibility for a phone/mobile device at all, and low resolution requirements that allow other technologies to take it's place.
The technology is clever and may have a place somewhere, but I think in the mobile arena, it is a nonstarter.
Jan 06, 2012
Rank: 1 / 5 (4)
Jan 06, 2012
Rank: 4.5 / 5 (6)
Depth perception in biological organisms developed LONG before humans or even apes existed.
Jan 06, 2012
Rank: 2.8 / 5 (4)
These techniques are decades old.
Jan 06, 2012
Rank: 4.7 / 5 (3)
It has much higher resolution than the Kinect.
They will have to expand their work to include curved surfaces before the algorithm is really complete. But this is a promising start on a new concept.
Jan 06, 2012
Rank: 1.5 / 5 (2)
Jan 07, 2012
Rank: 5 / 5 (3)
Jan 11, 2012
Rank: not rated yet
Those might not be today's mobile devices, but similar, high-volume devices will probably be enhancing our mobile and home experiences in less than a decade.
This is really great systems engineering. I hope that the extensions to handle curved surfaces don't trip up the algorithm, so that my future robotic servants and caretakers will be affordable.
(Btw, I had no idea that the spatial light modulator chipset had gotten so cheap! It's about $6 or $7 ea/qty 100 for 768x1024 pixels.)