July 11, 2017

Human pose estimation for care robots using deep learning

Expectations for care robots are growing against the backdrop of declining birthrates, an aging population, and a shortage of care staff. In nursing homes and similar facilities, for example, robots are expected to check on the condition of residents while patrolling the building. A first estimate of a person's pose (standing, sitting, fallen, etc.) is useful when evaluating their condition, but most methods to date rely on images. These methods raise privacy concerns and are difficult to apply in dimly lit spaces. The research group (Kaichiro Nishi, a 2016 master's program graduate, and Professor Miura) has therefore developed a method of pose recognition that uses depth data alone (Fig. 1).

For poses such as standing and sitting upright, where body parts can be recognized relatively easily, methods and devices that estimate poses with high precision are already available. In care settings, however, a variety of poses must be recognized, including recumbent (lying down) and crouching positions, and this has remained a challenge. With the recent progress of deep learning (a technique using multilayer neural networks), methods for estimating complex poses from images are advancing. Deep learning requires a large amount of training data. For image data, it is relatively easy for a person to see each body part in an image and label it, and some datasets have been made public. For depth data, however, the boundaries between parts are hard to see, which makes training data difficult to generate.

This research has therefore established a method to generate a large amount of training data by combining computer graphics (CG) technology and motion capture technology (Fig. 2). The method first creates CG models of various body shapes. Next, it attaches to each model part labels (11 parts, including the head, torso, and right upper arm) and skeleton information, including the position of each joint. The CG models can then be made to take arbitrary poses simply by supplying joint angles measured with a motion capture system. Fig. 3 shows an example of data generated for various sitting poses.
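As a rough illustration of how joint angles alone can pose a skeleton, the following minimal sketch computes joint positions by forward kinematics. It is not the authors' implementation: the three-bone planar chain, the bone names and lengths in `BONES`, the `scale` body-shape factor, and the function `forward_kinematics` are all assumptions made for this example, and a real CG model would use 3-D rotations and a full body skeleton.

```python
import math

# Hypothetical planar chain (torso -> upper arm -> forearm). Names and
# lengths in metres are illustrative and not taken from the paper.
BONES = [("torso", 0.50), ("right_upper_arm", 0.30), ("right_forearm", 0.25)]


def forward_kinematics(joint_angles, scale=1.0, root=(0.0, 0.0)):
    """Return the (x, y) position of the root and of the end of each bone.

    joint_angles: one angle in radians per bone, relative to its parent bone.
    scale: uniform factor applied to every bone length (a toy "body shape").
    """
    if len(joint_angles) != len(BONES):
        raise ValueError("need exactly one angle per bone")
    x, y = root
    heading = 0.0  # accumulated orientation of the current bone
    positions = [(x, y)]
    for (_name, length), angle in zip(BONES, joint_angles):
        heading += angle
        x += scale * length * math.cos(heading)
        y += scale * length * math.sin(heading)
        positions.append((x, y))
    return positions


# The same joint angles pose two different body sizes.
pose = [math.pi / 2, -math.pi / 4, -math.pi / 4]
small = forward_kinematics(pose, scale=0.9)
large = forward_kinematics(pose, scale=1.1)
```

Because the pose is described only by angles, one recorded motion can be replayed on models of any size, which is the property the data generation method relies on.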

With this method, training data can be generated for any combination of body shape and pose. So far, we have created and released a total of about 100,000 pieces of data, covering sitting positions (with and without occlusions) and several recumbent poses. This data is freely available for research purposes (http://www.aisl.cs.tut.ac.jp/database_HDIBPL.html). In the future, we will release the human models and detailed data generation procedures so that anyone can easily create data with them. We hope that this will contribute to progress in related fields.
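To make the idea of generating labelled depth data for combinations of body shape and pose concrete, here is a toy sketch. It is an assumption-laden stand-in rather than the released pipeline: body parts are approximated by two spheres (`BASE_PARTS`), the "pose" is a single hypothetical `lean` parameter, and `render` is a simple orthographic z-buffer written for this example. The point it shows is that a synthetic renderer knows which part produced each depth pixel, so part labels come for free.

```python
import itertools
import math

# Hypothetical body model: (label, centre x, centre y, centre z, radius) in
# metres, with z pointing away from the sensor. Values are illustrative only.
BASE_PARTS = [
    ("head", 0.0, 0.5, 2.0, 0.12),
    ("torso", 0.0, 0.2, 2.0, 0.25),
]


def make_parts(scale, lean):
    """Scale the body and move the head `lean` metres toward the sensor."""
    parts = []
    for name, cx, cy, cz, r in BASE_PARTS:
        dz = -lean if name == "head" else 0.0
        parts.append((name, cx * scale, cy * scale, cz + dz, r * scale))
    return parts


def render(parts, size=32, extent=1.0):
    """Orthographic z-buffer render returning (depth, labels) grids.

    Background pixels have depth math.inf and label None.
    """
    depth = [[math.inf] * size for _ in range(size)]
    labels = [[None] * size for _ in range(size)]
    for row in range(size):
        for col in range(size):
            # Map the pixel centre to world coordinates in [-extent, extent].
            px = (col + 0.5) / size * 2 * extent - extent
            py = extent - (row + 0.5) / size * 2 * extent
            for name, cx, cy, cz, r in parts:
                d2 = (px - cx) ** 2 + (py - cy) ** 2
                if d2 <= r * r:
                    z = cz - math.sqrt(r * r - d2)  # front surface of sphere
                    if z < depth[row][col]:  # keep the nearest surface
                        depth[row][col] = z
                        labels[row][col] = name
    return depth, labels


# One (depth, labels) training pair per combination of body shape and pose.
shapes = [0.9, 1.1]
poses = [0.0, 0.2]
dataset = [render(make_parts(s, p)) for s, p in itertools.product(shapes, poses)]
```

Adding more entries to `shapes` and `poses` grows the dataset multiplicatively, which is how a modest number of body models and recorded motions can yield a large training set.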

The results of this research were published in Pattern Recognition on Saturday, June 3, 2017.

More information: K. Nishi et al., Generation of human depth images with body part labels for complex human pose recognition, Pattern Recognition (2017). DOI: 10.1016/j.patcog.2017.06.006

Provided by Toyohashi University of Technology

Citation: Human pose estimation for care robots using deep learning (2017, July 11) retrieved 14 August 2024 from https://phys.org/news/2017-07-human-pose-robots-deep.html
