February 8, 2016

New algorithm improves speed and accuracy of pedestrian detection

What if computers could recognize objects as well as the human brain could? Electrical engineers at the University of California, San Diego have taken an important step toward that goal by developing a pedestrian detection system that performs in near real-time (2-4 frames per second) and with higher accuracy (close to half the error) compared to existing systems. The technology, which incorporates deep learning models, could be used in "smart" vehicles, robotics and image and video search systems.

"We're aiming to build computer vision systems that will help computers better understand the world around them," said Nuno Vasconcelos, electrical engineering professor at the UC San Diego Jacobs School of Engineering who directed the research. A big goal is real-time vision, he says, especially for pedestrian detection systems in self-driving cars. Vasconcelos is a faculty affiliate of the Center for Visual Computing and the Contextual Robotics Institute, both at UC San Diego.

The new pedestrian detection algorithm developed by Vasconcelos and his team combines a traditional computer vision classification architecture, known as cascade detection, with deep learning models.

Pedestrian detection systems typically break down an image into small windows that are processed by a classifier that signals the presence or absence of a pedestrian. This approach is challenging because pedestrians appear in different sizes—depending on distance to the camera—and locations within an image. Typically, millions of windows must be inspected by video frame at speeds ranging from 5-30 frames per second.

In cascade detection, the detector operates throughout a series of stages. In the first stages, the algorithm quickly identifies and discards windows that it can easily recognize as not containing a person (such as the sky). The next stages process the windows that are harder for the algorithm to classify, such as those containing a tree, which the algorithm could recognize as having person-like features (shape, color, contours, etc.). In the final stages, the algorithm must distinguish between a pedestrian and very similar objects. However, because the final stages only process a few windows, the overall complexity is low.

Traditional cascade detection relies on "weak learners," which are simple classifiers, to do the job at each stage. The first stages use a small number of weak learners to reject the easy windows, while the later stages rely on larger numbers of weak learners to process the harder windows. While this method is fast, it isn't powerful enough when it reaches the final stages. That's because the weak learners used in all stages of the cascade are identical. So even though there are more classifiers in the last stages, they're not necessarily capable of performing highly complex classification.

Deep learning models

To address this problem, Vasconcelos and his team developed a novel algorithm that incorporates deep learning models in the final stages of a cascaded detector. Deep learning models are better suited for complex pattern recognition, which they can perform after being trained with hundreds or thousands of examples—in this case, images that either have or don't have a person. However, deep learning models are too complex for real-time implementation. While they work well for the final cascade stages, they are too complex to be used in the early ones.

The solution is a new cascade architecture that combines classifiers from different families: simple classifiers (weak learners) in the early stages complex classifiers (deep learning models) in the later stages. This is not trivial to accomplish, noted Vasconcelos, since the algorithm used to learn the cascade has to find the combination of weak learners that achieves the optimal trade-off between detection accuracy and complexity for each cascade stage. Accordingly, Vasconcelos and his team introduced a new mathematical formulation for this problem, which resulted in a new algorithm for cascade design.

"No previous algorithms have been capable of optimizing the trade-off between detection accuracy and speed for cascades with stages of such different complexities. In fact, these are the first cascades to include stages of deep learning. The results we're obtaining with this new algorithm are substantially better for real-time, accurate pedestrian detection," said Vasconcelos.

The algorithm currently only works for binary detection tasks, such as pedestrian detection, but the researchers are aiming to extend the cascade technology to detect many objects simultaneously.

"One approach to this problem is to train, for example, five different detectors to recognize five different objects. But we want to train just one detector to do this. Developing that algorithm is the next challenge," said Vasconcelos.

The work, titled "Learning Complexity-Aware Cascades for Deep Pedestrian Detection," was presented Dec. 15, 2015 at the International Conference on Computer Vision in Santiago, Chile.

Provided by University of California - San Diego

Citation: New algorithm improves speed and accuracy of pedestrian detection (2016, February 8) retrieved 1 May 2024 from https://phys.org/news/2016-02-algorithm-accuracy-pedestrian.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Google team rises to 2014 visual recognition challenge

967 shares

Feedback to editors

EPA underestimates methane emissions from landfills and urban areas, researchers find

18 minutes ago

This Texas veterinarian helped crack the mystery of bird flu in cows

36 minutes ago

Researchers discover key functions of therapeutically promising jumbo viruses

46 minutes ago

Marine sharks and rays 'use' urea to delay reproduction, finds study

1 hour ago

Researchers unlock potential of 2D magnetic devices for future computing

1 hour ago

Researchers build new device that is a foundation for quantum computing

1 hour ago

Satellite images of plants' fluorescence can predict crop yields

1 hour ago

New work reveals the 'quantumness' of gravity

1 hour ago

Mystery behind huge opening in Antarctic sea ice solved

2 hours ago

Do earthquake hazard maps predict higher shaking than actually occurred? Research finds discrepancy

2 hours ago

Load comments (0)

New algorithm improves speed and accuracy of pedestrian detection

EPA underestimates methane emissions from landfills and urban areas, researchers find

This Texas veterinarian helped crack the mystery of bird flu in cows

Researchers discover key functions of therapeutically promising jumbo viruses

Marine sharks and rays 'use' urea to delay reproduction, finds study

Researchers unlock potential of 2D magnetic devices for future computing

Researchers build new device that is a foundation for quantum computing

Satellite images of plants' fluorescence can predict crop yields

New work reveals the 'quantumness' of gravity

Mystery behind huge opening in Antarctic sea ice solved

Do earthquake hazard maps predict higher shaking than actually occurred? Research finds discrepancy

Relevant PhysicsForums posts

User-Defined Functions in Sql Server SSMS

Classifiers, threshold, and ROC curve

Parallel processing for loops and pointer defined outside the loop

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Google team rises to 2014 visual recognition challenge

Improving machine learning with an old approach

Teaching robots to see

New research outlines more effective diagnosis for people with heart conditions

Combining computer vision and brain computer interface for faster mine detection

Scientists teach machines to learn like humans

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

New algorithm improves speed and accuracy of pedestrian detection

EPA underestimates methane emissions from landfills and urban areas, researchers find

This Texas veterinarian helped crack the mystery of bird flu in cows

Researchers discover key functions of therapeutically promising jumbo viruses

Marine sharks and rays 'use' urea to delay reproduction, finds study

Researchers unlock potential of 2D magnetic devices for future computing

Researchers build new device that is a foundation for quantum computing

Satellite images of plants' fluorescence can predict crop yields

New work reveals the 'quantumness' of gravity

Mystery behind huge opening in Antarctic sea ice solved

Do earthquake hazard maps predict higher shaking than actually occurred? Research finds discrepancy

Relevant PhysicsForums posts

Related Stories

Google team rises to 2014 visual recognition challenge

Improving machine learning with an old approach

Teaching robots to see

New research outlines more effective diagnosis for people with heart conditions

Combining computer vision and brain computer interface for faster mine detection

Scientists teach machines to learn like humans

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience