From 2-D pictures to 3 dimensions

Mar 03, 2008
Image credit: Manmohan Chandraker / UC San Diego

Your pictures of the Grand Canyon, Times Square or other destinations may be pretty good, but wouldn’t it be nice to show them off in three dimensions?

An award-winning 3D reconstruction algorithm designed by a team of computer science researchers from UC San Diego brings this dream closer to reality.

This research gets at the heart of “autocalibration,” a well-studied, fundamental problem in computer vision. Autocalibration aims to recover the three-dimensional structure of a scene using only images of it, acquired from cameras whose internal settings and spatial orientations are unknown.

Autocalibration is part of a larger 3D image reconstruction challenge that has caught the attention of Google, Microsoft and others.

Manmohan Chandraker, a fifth-year Ph.D. student in the Department of Computer Science and Engineering at UCSD’s Jacobs School of Engineering, led the work. He and Sameer Agarwal, a UCSD computer science alumnus now at the University of Washington, along with their respective Ph.D. advisors, David Kriegman and Serge Belongie, presented their research at the International Conference on Computer Vision (ICCV), held in Rio de Janeiro, Brazil, in October 2007. ICCV is the premier conference in the field of computer vision. For this work, Chandraker took home one of three honorable mentions for ICCV’s prestigious David Marr Prize.

This technology could be put to use in a wide variety of applications. For example, someone selling shoes online could take pictures of their shoes and create 3D reconstructions of their inventory. Such reconstructions would provide more information about what the shoes actually look like than images or video footage can.

The algorithm could also be used to automatically align security camera networks used in casinos and airports. Coupled with existing technology for immersive media, the algorithm could be used to create augmented-reality walkthroughs of cities, supermarkets or any other places of interest.

In the ICCV paper, the UCSD computer scientists propose the first practically scalable algorithm for 3D reconstruction that provides “a theoretical certificate of optimality.” In other words, the technique computes the best possible 3D reconstruction obtainable from the input data and does not slow down drastically as the number of photographs grows.

“Our algorithm is guaranteed to provide the best 3D reconstruction,” said Chandraker. “It is very much a practical algorithm. In fact, the significance of the paper lies in our approaches for designing a theoretically correct algorithm that also works well in practice. Our approach utilizes modern convex optimization techniques to globally minimize the involved cost functions in a branch and bound framework.”
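To give a feel for what a branch and bound search with a certificate of optimality looks like, here is a toy sketch in Python. It is not the algorithm from the paper. It minimizes a made-up one-dimensional function, and in place of the convex relaxations the researchers describe it uses a much simpler Lipschitz lower bound. The function name `branch_and_bound_minimize`, the test function `f` and the constant 3.5 are all illustrative assumptions.

```python
import heapq
import math


def branch_and_bound_minimize(f, lipschitz, lo, hi, tol=1e-4):
    """Globally minimize f on [lo, hi], assuming |f'(x)| <= lipschitz.

    Returns (best_x, best_value, lower_bound), where lower_bound is a proven
    floor on the true minimum and best_value - lower_bound <= tol.
    """

    def bound_interval(a, b):
        # On [a, b], f cannot fall below f(mid) by more than L * half-width.
        mid = 0.5 * (a + b)
        val = f(mid)
        return val - lipschitz * 0.5 * (b - a), mid, val

    bound, best_x, best_val = bound_interval(lo, hi)
    heap = [(bound, lo, hi)]  # min-heap keyed on each interval's lower bound

    while heap:
        bound, a, b = heapq.heappop(heap)
        # The popped interval has the smallest bound of all live intervals,
        # so a small gap here certifies the incumbent for the whole domain.
        if best_val - bound <= tol:
            return best_x, best_val, min(bound, best_val)
        mid = 0.5 * (a + b)
        for sub_a, sub_b in ((a, mid), (mid, b)):  # branch
            sub_bound, sub_mid, sub_val = bound_interval(sub_a, sub_b)
            if sub_val < best_val:
                best_x, best_val = sub_mid, sub_val
            if sub_bound < best_val:  # otherwise prune: cannot beat incumbent
                heapq.heappush(heap, (sub_bound, sub_a, sub_b))

    # Every interval was pruned, so the incumbent is the exact minimum.
    return best_x, best_val, best_val


def f(x):
    # Hypothetical non-convex cost with several local minima; |f'| <= 3.5.
    return math.sin(3.0 * x) + 0.5 * x


best_x, best_val, lower = branch_and_bound_minimize(f, 3.5, -3.0, 3.0)
print(f"x = {best_x:.4f}, f(x) = {best_val:.6f}, certified gap <= {best_val - lower:.1e}")
```

The returned `lower` value plays the role of the certificate: no point in the interval can score below it, so the answer is provably within `tol` of the global optimum even though most of the interval was never examined closely.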

The paper, titled “Globally Optimal Affine and Metric Upgrades in Stratified Autocalibration,” is available at vision.ucsd.edu/kriegman-grp/papers/iccv07a.pdf. MATLAB prototype code for the implementation will be available online when it is ready.

Source: University of California - San Diego
