Development of an automatic system for translating biomedical patents in real time

Jul 17, 2014
Screenshot of MOLTO translation system.

The Center for Language and Speech Technologies and Applications (TALP) of the Universitat Politècnica de Catalunya · BarcelonaTech (UPC), a member of CIT UPC, has developed the prototype of an automatic translation system for patents in the biomedical area. The system can be used to create multilingual documents with the same structure as the original patents, including images, formulae and other kinds of annotations. In addition, the system works in real time and can be incorporated in web applications.

The TALP UPC researchers' work, which has been carried out over three years, is part of a collaborative project called MOLTO in the European Union's Seventh Framework Programme. MOLTO has involved the collaboration of research groups in Göteborg (Sweden), Helsinki (Finland), Utrecht (Holland), Sofía (Bulgaria) and Zurich (Switzerland).

With the general objective of obtaining an system in several languages that can produce high quality translations, MOLTO researchers have worked on three cases: the formulations of mathematical exercises, a description of objects in a museum and a model for translating patents, which is the case that TALP members have worked on directly.

As a general technique in the MOLTO project, the researchers used syntactic-semantic grammars created on the basis of specific domain ontologies (conceptual schemes that facilitate information exchange between systems). In turn, these components have been integrated into what is known as a Grammatical Framework (GF): the IT tool that makes automatic translations into different languages possible through a common abstract representation. To facilitate its use online, an Application Programming Interface (API) has been designed so that the tool can be included in any Web application.

Screenshot of MOLTO translation system.

For the patent translation, hybridization techniques were used that combine the Grammatical Framework and statistical methods. GF produces grammatically correct translations, whilst the inclusion of statistical techniques (similar to those used by machine translators such as Google Translate) can cover extensive domains such as biomedicine.

In addition, the patents are part of a document recovery system that initially could only search for documents in English. Therefore, special care has been taken to create a method that maintains the complex arrangement of tags and semantic annotations that are present in documents. Among other factors, this means that the structure of chemical compounds described in biotechnology registers can be maintained, and documents can be searched for in the translation language.

The result is the automatic of into English, French and German (the three official languages of the European Patent Office), with the added advantage that the translations can be carried out in . This is of great use in the task of multilingual database searches.

Explore further: UCLA addresses 'lost in translation' issues in Chinese medicine

Provided by Universitat Politècnica de Catalunya

5 /5 (1 vote)
add to favorites email to friend print save as pdf

Related Stories

Making online translation accurate, reliable and efficient

Jun 13, 2013

European cooperation is based on our ability to understand each other. Given that there are presently 23 official EU languages, the availability of online tools to facilitate accurate translation is fundamentally ...

Google signs deal to translate European patents

Nov 30, 2010

(AP) -- Google announced an agreement Tuesday to use its technology to translate patents into 29 European languages, a deal officials hope will smooth the way toward a simplified European patent system after ...

Google adds automatic translation to Gmail

May 20, 2009

Google added automatic translation technology to Gmail on Tuesday, allowing users of its email service to translate messages in another language with a single mouse click.

Recommended for you

Forging a photo is easy, but how do you spot a fake?

Nov 21, 2014

Faking photographs is not a new phenomenon. The Cottingley Fairies seemed convincing to some in 1917, just as the images recently broadcast on Russian television, purporting to be satellite images showin ...

Algorithm, not live committee, performs author ranking

Nov 21, 2014

Thousands of authors' works enter the public domain each year, but only a small number of them end up being widely available. So how to choose the ones taking center-stage? And how well can a machine-learning ...

Professor proposes alternative to 'Turing Test'

Nov 19, 2014

(Phys.org) —A Georgia Tech professor is offering an alternative to the celebrated "Turing Test" to determine whether a machine or computer program exhibits human-level intelligence. The Turing Test - originally ...

User comments : 0

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.