September 17, 2015

96.7% recognition rate for handwritten Chinese characters using AI that mimics the human brain

Fujitsu today announced the development of the world's first handwriting recognition technology to surpass the human-equivalent recognition rate of 96.1%, a figure established at a conference, by achieving a recognition rate of 96.7% with AI technology modeled on human brain processes. Fujitsu had previously achieved top-level accuracy in this field, taking first place with a recognition rate of 94.8% at a handwritten Chinese character recognition contest held at the International Conference on Document Analysis and Recognition (ICDAR), a top-level conference in the document image processing field.

However, further increasing recognition accuracy required a new mechanism for learning the diversity of character deformations. Now, focusing on a hierarchical model with expanded connections between neurons, a model based on the human brain that captures the features of characters, Fujitsu has developed a technology that automatically creates numerous patterns of character deformation from a character's base pattern and uses them to "train" this hierarchical neural model. Using this method, Fujitsu has achieved an accuracy rate of 96.7%, surpassing the human-equivalent recognition rate of 96.1% for handwritten Chinese characters. Fujitsu expects that this technology will enable further automation of computer input and recognition.

Humans can easily recognize media such as characters, images and sounds, but for computers this recognition is much more difficult, both because the object to be recognized varies widely in shape, brightness and so on, and because similar objects exist. This has become a central problem in artificial intelligence research. Fujitsu has decades of experience in character recognition, with commercialized technologies used for Japanese in areas such as Japan's finance and insurance fields, as well as a Chinese character recognition technology used by the Chinese government for 800 million handwritten census forms. Fujitsu started research using artificial intelligence based on deep learning for character recognition in 2010. In 2013, the character recognition technology developed on the basis of this artificial intelligence took first place (recognition rate of 94.8%) at a handwritten Chinese character recognition contest held at a top-level international conference in the document image processing field, achieving the highest accuracy in the field.

With character recognition technology, the goal is to learn and store the features of the many character patterns thought to be used by humans when recognizing characters, using a model of connected hierarchies based on human neurons. When a character image is input, the first layer of the model perceives the simple features of the character, and then the next layer perceives the complex features of the character. In this way, the features effective for differentiating characters are extracted in an automatic and hierarchical fashion, and then the results of the learning process, including which features (neurons) the model reacted to, are accumulated. When attempting to recognize a character, the features of the input character are extracted in the same way as in the learning process, and the character is identified and recognition results output on the basis of which features (neurons) reacted as determined by the learning process. In order to further increase the accuracy of recognition, there was a need for a new effort to study the diversity of character deformations. This is because while Fujitsu had achieved the top level of accuracy in the field, it was not at a level comparable to human recognition activity (a recognition rate of 96.1%).
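The layered flow described above (simple features first, then more complex features, then a decision based on which neurons reacted) can be sketched as a small forward pass. This is only an illustration under stated assumptions: the article does not specify Fujitsu's architecture, so the convolution-and-pooling layers, the filter counts, the 32x32 input size and the random (untrained) weights are all hypothetical. Only the figure of about 3,800 character classes comes from the article.

```python
import numpy as np

def conv2d_valid(image, kernels):
    """Slide each kernel over a (C, H, W) input; returns (K, H-k+1, W-k+1)."""
    num_k, _, k, _ = kernels.shape
    _, h, w = image.shape
    out = np.zeros((num_k, h - k + 1, w - k + 1))
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            patch = image[:, i:i + k, j:j + k]
            # Response of every kernel ("neuron") to this local patch.
            out[:, i, j] = np.tensordot(kernels, patch, axes=([1, 2, 3], [0, 1, 2]))
    return out

def relu(x):
    return np.maximum(x, 0.0)

def max_pool2(x):
    """2x2 max pooling on a (K, H, W) array; odd trailing rows/columns are dropped."""
    k, h, w = x.shape
    h2, w2 = h // 2, w // 2
    return x[:, :h2 * 2, :w2 * 2].reshape(k, h2, 2, w2, 2).max(axis=(2, 4))

def recognize(image, params):
    """Return (predicted class index, first-layer features, second-layer features)."""
    x = image[np.newaxis, :, :]                                   # add a channel axis
    simple = max_pool2(relu(conv2d_valid(x, params["layer1"])))   # simple features
    complex_ = max_pool2(relu(conv2d_valid(simple, params["layer2"])))  # complex features
    scores = params["classifier"] @ complex_.ravel()              # one score per character
    return int(np.argmax(scores)), simple, complex_

# Hypothetical sizes and untrained random weights, for illustration only.
rng = np.random.default_rng(0)
num_classes = 3800  # "about 3,800 Chinese characters" per the article
params = {
    "layer1": rng.normal(size=(8, 1, 5, 5)),
    "layer2": rng.normal(size=(16, 8, 3, 3)),
    "classifier": rng.normal(size=(num_classes, 16 * 6 * 6)),
}
image = rng.random((32, 32))
label, simple, complex_ = recognize(image, params)
print(label, simple.shape, complex_.shape)
```

In a trained model the weights would be learned from labeled character samples rather than drawn at random; the sketch only shows how features are extracted layer by layer and turned into a recognition result.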

Now, Fujitsu has increased the number of connections between the neurons in the hierarchical model by more than fifty times, and has developed a technology to automatically produce many varieties of deformed character patterns for learning. Using this method, the model is able to learn in finer detail and achieves a recognition rate of 96.7% in recognizing handwritten Chinese characters, surpassing the human-equivalent rate of 96.1%. The features of this newly developed technology are listed below.

1. Expanding the scale of the hierarchical model

Fujitsu has expanded the scale of the connections between neurons in the hierarchical model used in the character recognition process, raising recognition accuracy by increasing the number of connections from 2.8 million used in the previous technology (recognition rate 94.8%) to 150 million, in order to fine-tune the study of deformations (Figure 1, Figure 2).
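As a quick check on the figures quoted in this article, the arithmetic below works out the scale-up factor and restates the recognition rates as error rates. The variable names are illustrative, and the error-rate framing is an added convenience rather than something Fujitsu reports.

```python
# Figures quoted in the article.
prev_connections = 2.8e6   # previous technology
new_connections = 150e6    # new technology
prev_acc, new_acc, human_acc = 94.8, 96.7, 96.1  # recognition rates in percent

scale = new_connections / prev_connections
prev_err = 100.0 - prev_acc
new_err = 100.0 - new_acc
human_err = 100.0 - human_acc
relative_error_reduction = (prev_err - new_err) / prev_err

print(f"connections scaled by about {scale:.1f}x")                  # about 53.6x
print(f"error rate: {prev_err:.1f}% -> {new_err:.1f}% (human-equivalent {human_err:.1f}%)")
print(f"relative error reduction: {relative_error_reduction:.1%}")  # 36.5%
```

The roughly 53.6-fold increase is consistent with the "over fifty times" figure given earlier in the article.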

2. Generating diverse character samples based on three-dimensional random deformation

There are about 3,800 Chinese characters to be recognized, making it extremely difficult to collect real-world patterns of deformation for each character. Therefore, Fujitsu has developed a technology to randomly deform existing character samples to automatically create all sorts of character samples for learning. This made it possible to have the hierarchical model study a multitude of different types of deformed character patterns (Figure 3).

Previous methods randomized only the character's position in two dimensions, so they could not account for differences in brightness between parts of the background or parts of the character (strokes), or for other localized differences. To address this, Fujitsu devised a character sample generation technology based on random deformations in three dimensions. By adding the grey value of each image element as a Z-axis parameter to the existing X and Y axes of the character pattern image, Fujitsu was able to generate a variety of deformed patterns.
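The idea of deforming a sample along X, Y and a grey-value Z axis can be sketched as follows. The article does not describe Fujitsu's actual deformation model, so everything here is an assumption made for illustration: the function name `random_deform_3d`, the choice of a random rotation, scaling and shift for X and Y, the global gain plus linear brightness ramp for Z, and all parameter ranges.

```python
import numpy as np

def random_deform_3d(image, rng, max_shift=2.0, max_angle=0.15,
                     max_scale=0.1, max_gray=0.2):
    """Randomly deform a 2-D grey image with values in [0, 1] (hypothetical sketch)."""
    image = np.asarray(image, dtype=float)
    h, w = image.shape
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w].astype(float)

    # X/Y axes: random rotation, scaling and shift, applied by inverse mapping
    # (for each output pixel, look up where it came from in the source image).
    angle = rng.uniform(-max_angle, max_angle)
    sx, sy = 1.0 + rng.uniform(-max_scale, max_scale, size=2)
    tx, ty = rng.uniform(-max_shift, max_shift, size=2)
    xc, yc = xs - cx - tx, ys - cy - ty
    cos_a, sin_a = np.cos(angle), np.sin(angle)
    src_x = np.rint((cos_a * xc + sin_a * yc) / sx + cx).astype(int)
    src_y = np.rint((-sin_a * xc + cos_a * yc) / sy + cy).astype(int)
    inside = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    warped = np.zeros_like(image)   # pixels mapped from outside stay at 0
    warped[inside] = image[src_y[inside], src_x[inside]]

    # Z axis (grey value): random global gain plus a smooth brightness ramp,
    # so stroke and background brightness vary across the image.
    gain = 1.0 + rng.uniform(-max_gray, max_gray)
    gx, gy = rng.uniform(-max_gray, max_gray, size=2)
    ramp = gx * (xs - cx) / max(w - 1, 1) + gy * (ys - cy) / max(h - 1, 1)
    return np.clip(warped * gain + ramp, 0.0, 1.0)

# Usage: generate several training samples from one base pattern (a simple cross).
rng = np.random.default_rng(42)
base = np.zeros((32, 32))
base[8:24, 15:17] = 1.0
base[15:17, 8:24] = 1.0
samples = [random_deform_3d(base, rng) for _ in range(5)]
```

Each call yields a different variant of the same base pattern, which is the property needed to present a model with many deformations of a character without collecting each one from real handwriting.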

With this newly developed technology, Fujitsu achieved a recognition rate of 96.7% for handwritten Chinese characters, surpassing the human equivalent rate of 96.1%. Fujitsu anticipates that this technology will further automate all sorts of computer input and recognition tasks.

Fujitsu is aiming for the practical application of this technology in fiscal 2015, while also further increasing the accuracy of character recognition technology and expanding its use to the recognition of media other than written characters, such as pictures and voice. In addition, Fujitsu is studying applications of this character recognition technology to other writing, such as Japanese, alphabet-based languages, and numerals.

Provided by Fujitsu

Citation: 96.7% recognition rate for handwritten Chinese characters using AI that mimics the human brain (2015, September 17) retrieved 14 August 2024 from https://phys.org/news/2015-09-recognition-handwritten-chinese-characters-ai.html

