September 25, 2013

New speech recognition model: Hidden Conditional Neural Fields

Toyohashi Tech researchers propose the Hidden Conditional Neural Fields (HCNF) model for continuous speech recognition. The model is a combination of the Hidden Conditional Random Fields (HCRF) and a Multi-Layer Perceptron (MLP), that is, an extension of Hidden Markov Model (HMM).

This new speech recognition model has the discriminative property for sequences from HCRF and the ability to extract non-linear features from an MLP. Furthermore, the HCNF can incorporate many types of features from non-linear features can be extracted, and is trained by sequential criteria.

In this paper, the researchers describe the formulation of HCNF and examine three methods to further improve automatic speech recognition using HCNF, which was an objective function that explicitly considered training errors, provided a hierarchical tandem-style feature, and included a deep non-linear feature extractor for the observation function.

HCRF can use a deep feed forward neural network (DNN) in the observation function, and therefore, a sophisticated pre-training algorithm such as the deep belief network (DBN) can be used to provide a deep observation function.

The research shows that HCNF can be trained realistically without any initial model and outperform the HCRF and triphone hidden Markov model trained by the minimum phone error (MPE) manner using experimental results for continuous English phoneme recognition on the TIMIT core test and Japanese phoneme recognition on the IPA 100 test set.

More information: Fujii, Y., Yamamoto, K. and Nakagawa, S. Hidden Conditional Fields for Continuous Phoneme Speech Recognition, IEICE Trans. Inf.&Sys., E95-D,2094-2104 (2012). DOI: 10.1587/transinf.E95.D.2094

Provided by Toyohashi University of Technology

Citation: New speech recognition model: Hidden Conditional Neural Fields (2013, September 25) retrieved 3 July 2024 from https://phys.org/news/2013-09-speech-recognition-hidden-conditional-neural.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Speech recognition leaps forward

0 shares

Feedback to editors

Scientists pinpoint strategies that could stop cats from scratching your furniture

3 hours ago

Two new species of Psilocybe mushrooms discovered in southern Africa

10 hours ago

UV radiation damage leads to ribosome roadblocks, causing early skin cell death

11 hours ago

Dual-laser approach could lower cost of high-resolution 3D printing

12 hours ago

Novel method enhances size-controlled production of luminescent quantum dots

12 hours ago

Cosmic simulation reveals how black holes grow and evolve

13 hours ago

How climate change is affecting where species live

13 hours ago

Human presence shifts balance between leopards and hyenas in East Africa

13 hours ago

Physicists' laser experiment excites atom's nucleus, may enable new type of atomic clock

13 hours ago

Treatment with a mixture of antimicrobial peptides found to impede antibiotic resistance

13 hours ago

Load comments (0)

New speech recognition model: Hidden Conditional Neural Fields

Scientists pinpoint strategies that could stop cats from scratching your furniture

Two new species of Psilocybe mushrooms discovered in southern Africa

UV radiation damage leads to ribosome roadblocks, causing early skin cell death

Dual-laser approach could lower cost of high-resolution 3D printing

Novel method enhances size-controlled production of luminescent quantum dots

Cosmic simulation reveals how black holes grow and evolve

How climate change is affecting where species live

Human presence shifts balance between leopards and hyenas in East Africa

Physicists' laser experiment excites atom's nucleus, may enable new type of atomic clock

Treatment with a mixture of antimicrobial peptides found to impede antibiotic resistance

Relevant PhysicsForums posts

Number of Multiplications in the FFT Algorithm

Newbie question about deep learning

Who can find the largest prime number with their own programmed code?

Math Major Trying to Learn CS

Parallelizing N-Queens

How to test locally hosted websites on mobile?

Speech recognition leaps forward

Research aims to improve speech recognition software

Error rate higher in breast imaging reports generated by automatic speech recognition

Bionic speech recognition

Smart listeners and smooth talkers

Study evaluates transcription accuracy in men and women

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

New speech recognition model: Hidden Conditional Neural Fields

Scientists pinpoint strategies that could stop cats from scratching your furniture

Two new species of Psilocybe mushrooms discovered in southern Africa

UV radiation damage leads to ribosome roadblocks, causing early skin cell death

Dual-laser approach could lower cost of high-resolution 3D printing

Novel method enhances size-controlled production of luminescent quantum dots

Cosmic simulation reveals how black holes grow and evolve

How climate change is affecting where species live

Human presence shifts balance between leopards and hyenas in East Africa

Physicists' laser experiment excites atom's nucleus, may enable new type of atomic clock

Treatment with a mixture of antimicrobial peptides found to impede antibiotic resistance

Relevant PhysicsForums posts

Related Stories

Speech recognition leaps forward

Research aims to improve speech recognition software

Error rate higher in breast imaging reports generated by automatic speech recognition

Bionic speech recognition

Smart listeners and smooth talkers

Study evaluates transcription accuracy in men and women

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience