Researchers use Web images to add realism to edited photos

Jul 10, 2007

Computer graphics researchers at Carnegie Mellon University have developed systems for editing or altering photographs using segments of the millions of images available on the Web.

Whether adding people or objects to a photo, or filling holes in an edited photo, the systems automatically find images that match the context of the original photo so they blend realistically. Unlike traditional photo editing, these results can be achieved rapidly by users with minimal skills.

“We are able to leverage the huge amounts of visual information available on the Internet to find images that make the best fit,” said Alexei A. Efros, assistant professor of computer science and robotics. “It’s not applicable for all photo editing, such as when an image of a specific object or person is added to a photo. But it’s good enough in many cases,” he added. “Why Photoshop if you can ‘photoswap’ instead?”

Efros and his colleagues will present papers on two related systems at the Association for Computing Machinery’s Special Interest Group on Graphics and Interactive Techniques (SIGGRAPH) annual conference Aug. 5–9 in San Diego.

One system, called Photo Clip Art (graphics.cs.cmu.edu/projects/photoclipart/), was developed with graduate students Jean-François Lalonde and Derek Hoiem, and with Carsten Rother, John Winn and Antonio Criminisi of Microsoft Research Cambridge. It uses thousands of labeled images from a Web site called LabelMe as clip art that can be added to photos. A photo showing a vacant street, for instance, might be populated with images of people, vehicles and even parking meters derived from the LabelMe database (labelme.csail.mit.edu/).

To make the resulting image appear as realistic as possible, the system analyzes the original photo to estimate the camera angle and lighting conditions, and then looks in the clip art library for an object — a car, for instance — that matches those criteria. The user need only identify the horizon in the original photo to orient the system. Using previously developed Carnegie Mellon technology for analyzing the geometric context of a photo, the system can then place the object within the scene, adjusting its size as necessary to put it in proportion to other objects of equal distance from the camera.

“Matching an object with the original photo and placing that object within the 3-D landscape of the photo is a complex problem,” said Lalonde, who led development of the system. “But with our approach, and a lot of clip art data, we can hide the complexity from the user and make the process simple and intuitive.”

The other system, called Scene Completion (graphics.cs.cmu.edu/projects/scene-completion/), was developed by graduate student James Hays, another member of Efros’ research team. It draws upon millions of photos from the Flickr Web site to fill in holes in photos. Some of the holes might be from damage to a physical photograph, but more often they are created when an editor cuts out part of an image to eliminate an unsightly truck from a picturesque street scene, or removing a passerby from a group shot of friends. Photo editors often try to fill in those holes with sections derived elsewhere in the same image, but Efros said that a better match can often be found in a different photo.

The system looks for image segments that match the colors and textures that surround the hole on the original photo. It also looks for image segments that make sense contextually — in other words, it wouldn’t put an elephant in a suburban backyard or a boat in a desert.

In the case of well-photographed cities or popular tourist attractions, Efros said, the system might get lucky and find a photo of the same scene on the Web. In other cases, it might offer a number of possible images that could fill in the hole. A retaining wall edited out of one photo, for instance, might be replaced by the image of a building, a grassy slope or a rock outcropping. The system typically gives the user 20 different choices for filling in the hole.

The success of this approach depends on the number of photos available to the system, Hays said. “We saw a dramatic improvement when we moved from a database of 10,000 images to two million images,” he noted. “And that is just a tiny fraction of the hundreds of millions of images already available on sites like Picasa and Flickr. We have tons of photos from which to choose.”

Source: Carnegie Mellon University

Explore further: Microsoft Research project can interpret, caption photos

Related Stories

Say Freeze: Photogs do 365-gigapixel sweep of Mont Blanc

5 hours ago

Mont Blanc is the highest mountain in the Alps and has taken on an added distinction as the subject of the world's largest photograph. The Telegraph reported Monday that a photography team accomplished a worl ...

A solar eclipse sheds light on physics

1 hour ago

On 29 May 1919, a shadow dance took place over the Caribbean which was to make history: While the new moon covered the blazingly bright disk of the Sun, astronomers around Arthur Stanley Eddington measured ...

'Deep web search' may help scientists

May 25, 2015

When you do a simple Web search on a topic, the results that pop up aren't the whole story. The Internet contains a vast trove of information—sometimes called the "Deep Web"—that isn't indexed by search ...

Ceres bright spots sharpen but questions remain

May 25, 2015

The latest views of Ceres' enigmatic white spots are sharper and clearer, but it's obvious that Dawn will have to descend much lower before we'll see crucial details hidden in this overexposed splatter of ...

Recommended for you

Microsoft Research project can interpret, caption photos

10 hours ago

If you're surfing the web and you come across a photo of the Mariners' Felix Hernandez on the pitchers' mound at Safeco Field, chances are you'll quickly interpret that you are looking at a picture of a baseball ...

Rumor-detection software IDs disputed claims on Twitter

12 hours ago

A week after the Boston marathon bombing, hackers sent a bogus tweet from the official Twitter handle of the Associated Press. It read: "Breaking: Two Explosions in the White House and Barack Obama is injured."

User comments : 1

Adjust slider to filter visible comments by rank

Display comments: newest first

kellimaier
not rated yet Nov 01, 2007
These photos ( those on Flckr...can't speak for the others, since I am unfamiliar with that service/site) are copyright protected...seems a sticky situation to determine the permissions for each photo...or are they just simply not looking at that aspect? In that case they are breaking the law?

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.