share this!
2
5
Share
Email

November 30, 2018

Helping computers to see 3-D structures

If you can recognize structures around you while walking down a city street, you have your eyes to thank. Humans can automatically perceive 3-D structure in the world by identifying lines, shapes, symmetries and the patterns and relationships between them in things like buildings, sidewalks and everyday objects. But can a computer be taught to do the same?

Zihan Zhou, assistant professor of information sciences and technology at Penn State, is setting out to explore that question thanks to a recent grant from the National Science Foundation.

"We want a computer to see 3-D space as humans do," said Zhou. "This particular award and project is about structure perception, which has been largely ignored in 3-D vision. This is something that has not been done before."

Structure perception is the ability of a human's eyes to organize data or patterns and group them in certain ways. For example, a human can look at a line drawing of a building and visualize doors, windows and walls.

"There are many types of these relationships in the real world, and humans make use of those relationships to sense the 3-D space," he said. "Human eyes can easily perceive these kinds of things. The question now is: Can the computer have the ability to sense these things as a human does?"

To answer that question, Zhou plans to develop a new data-driven framework for structure discovery, leveraging the availability of massive visual data and recent advances in machine learning techniques.

These techniques could then be applied to a wide spectrum of real-world computer vision problems, including 3-D modeling of urban environments, virtual and augmented reality, and autonomous driving. The research could also impact cognitive sciences, by suggesting new computational mechanisms for image understanding; and human-robot interaction, by enabling robots to reason in terms of geometric shape, physics and dynamics.

"If a robot recognizes something as a specific type of structure, then it knows how to interact with it," said Zhou. "For example, if a robot is able to recognize a structure with a flat top, it would know that it could put an object like a cup on it."

Additionally, the framework may impact the work of architects, designers and engineers.

"If you think of those architects, they are working with 3-D models every day," said Zhou. "If they build something, they first create line drawings. So if a computer can understand doors and windows in the drawings, it would be very useful for architectural design and engineering."

Zhou developed an interest in this topic while a graduate intern at Adobe. In his internship, he studied the relationship between camera motion and the environment, which could help the movie industry to analyze scenes.

"I tried to extract some kinds of structures from the videos and the sequence of the camera," he said. "At that point it was to analyze camera trajectory for the movie industry, but later we realized it was more systematic."

Now, at Penn State, Zhou hopes to leverage the interdisciplinary network to advance his work.

"IST has people working in diverse areas, and many of them can be impacted by this kind of work," he said. "This has generated a lot of interest in different areas. We are looking to extend this beyond and to find applications to make this more collaborative."

"About 70 percent of information we obtain is from visual cues from our eyes," he concluded. "Obviously we have areas like natural language processing to help understand speaking and sounds, but human vision is the dominating factor in how we understand this world. To make the computer see the world as we do is one of the most exciting areas in artificial intelligence and computer science."

Provided by Pennsylvania State University

Citation: Helping computers to see 3-D structures (2018, November 30) retrieved 18 June 2024 from https://phys.org/news/2018-11-d.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers use AI to add 4-D effects to movies

7 shares

Feedback to editors

Lung-targeting lipid nanoparticles with CRISPR components successfully treat cystic fibrosis mouse models

8 minutes ago

Investigating nematode-microbe interactions in lab-simulated decomposed beetle environments

14 minutes ago

Starlings found to expend 25% less energy in follower position compared to flying solo

18 minutes ago

Physicists find a new way to represent π

20 minutes ago

Research investigates chemical composition of globular cluster Terzan 6

21 minutes ago

Study suggests at-camera gaze can increase scores in simulated interviews

27 minutes ago

Study proposes novel hypothesis to explain occupation of Brazil's southern coast 2,000 years ago

36 minutes ago

Scientists use tyrosine nanomedicine to halt melanoma growth

39 minutes ago

Ultra-high spectral purity revealed in exciton-polariton laser

45 minutes ago

Quantum computing trade-off problem addressed by new system

47 minutes ago

Load comments (0)

Helping computers to see 3-D structures

Lung-targeting lipid nanoparticles with CRISPR components successfully treat cystic fibrosis mouse models

Investigating nematode-microbe interactions in lab-simulated decomposed beetle environments

Starlings found to expend 25% less energy in follower position compared to flying solo

Physicists find a new way to represent π

Research investigates chemical composition of globular cluster Terzan 6

Study suggests at-camera gaze can increase scores in simulated interviews

Study proposes novel hypothesis to explain occupation of Brazil's southern coast 2,000 years ago

Scientists use tyrosine nanomedicine to halt melanoma growth

Ultra-high spectral purity revealed in exciton-polariton laser

Quantum computing trade-off problem addressed by new system

Relevant PhysicsForums posts

Math Major Trying to Learn CS

Parallelizing N-Queens

How to test locally hosted websites on mobile?

Question about learning programming

Why do emails from my contact form bounce?

Anyone with experience linking FFTW for C

Researchers use AI to add 4-D effects to movies

Want computers to see better in the real world? Train them in virtual reality

Recognizing the partially seen

Research identifies key weakness in modern computer vision systems

How do robots 'see' the world?

Teaching a computer to perceive the world without human input

Machine learning approach for low-dose CT imaging yields superior results

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Team breaks world record for fast, accurate AI training

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Medical Xpress

Tech Xplore

Science X

Helping computers to see 3-D structures

Lung-targeting lipid nanoparticles with CRISPR components successfully treat cystic fibrosis mouse models

Investigating nematode-microbe interactions in lab-simulated decomposed beetle environments

Starlings found to expend 25% less energy in follower position compared to flying solo

Physicists find a new way to represent π

Research investigates chemical composition of globular cluster Terzan 6

Study suggests at-camera gaze can increase scores in simulated interviews

Study proposes novel hypothesis to explain occupation of Brazil's southern coast 2,000 years ago

Scientists use tyrosine nanomedicine to halt melanoma growth

Ultra-high spectral purity revealed in exciton-polariton laser

Quantum computing trade-off problem addressed by new system

Relevant PhysicsForums posts

Related Stories

Researchers use AI to add 4-D effects to movies

Want computers to see better in the real world? Train them in virtual reality

Recognizing the partially seen

Research identifies key weakness in modern computer vision systems

How do robots 'see' the world?

Teaching a computer to perceive the world without human input

Recommended for you

Machine learning approach for low-dose CT imaging yields superior results

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Team breaks world record for fast, accurate AI training

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Newsletter sign up

Donate and enjoy an ad-free experience