Novel voice recognition technology completes Interpol's legal arsenal

February 9, 2018, CORDIS
Credit: SIIP

Watching mainstream forensics-related TV shows could easily make us believe that there is no piece of evidence stronger than conclusive DNA samples or fingerprints. Yet, that would be forgetting the importance of voice recognition. Thanks to new Speaker-Identification technology and a large database of voices maintained by Interpol, the latter will now become much easier.

Imagine a criminal, face hidden, recorded by a security camera as he threatens one of his victims. Or a monitored phone conversation between a suspected drug trafficker and an unknown person who seems to be pulling the strings. In such scenarios, a 100 percent accurate voice recognition system would be a game changer.

Although voice recognition can already be presented as legal evidence, there is still scepticism about its scientific grounds. The EU-funded SIIP (Speaker Identification Integrated Project) aims to dispel these doubts with an innovative probabilistic, language-independent identification system. This system uses a novel Speaker-Identification (SID) engine and a Global Info Sharing Mechanism (GISM) to identify unknown speakers who are captured in lawfully intercepted calls, recorded crime or terror arenas, and any other type of speech source.

SIIP's strong point lies in the merger of multiple speech recognition algorithms, provided by different vendors, that cover speaker model, gender, age, language and accent. This fusion results in highly reliable and confident detection, keeping false positives and false negatives to a minimum.
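The article does not describe how SIIP actually combines its engines, so the following is only a minimal sketch of one common approach, score-level fusion. It assumes each vendor engine returns a log-likelihood ratio (positive values favour "same speaker") and that a weighted mean of those ratios can be treated as a fused ratio; the engine names, weights and threshold are hypothetical.

```python
import math


def fuse_scores(engine_scores, weights=None):
    """Weighted mean of per-engine log-likelihood ratios (LLRs).

    engine_scores: dict mapping a (hypothetical) engine name to its LLR.
    weights: optional dict of per-engine weights; equal weights by default.
    """
    if not engine_scores:
        raise ValueError("at least one engine score is required")
    if weights is None:
        weights = {name: 1.0 for name in engine_scores}
    total_weight = sum(weights[name] for name in engine_scores)
    weighted_sum = sum(weights[name] * score for name, score in engine_scores.items())
    return weighted_sum / total_weight


def to_probability(llr, prior=0.5):
    """Turn a fused LLR into a posterior probability for a given prior.

    Assumption: the fused score is a calibrated LLR, which a real
    system would have to verify on held-out data.
    """
    prior_log_odds = math.log(prior / (1.0 - prior))
    return 1.0 / (1.0 + math.exp(-(llr + prior_log_odds)))


def decide(engine_scores, threshold=0.9, weights=None):
    """Return (is_match, probability) for one candidate speaker."""
    probability = to_probability(fuse_scores(engine_scores, weights))
    return probability >= threshold, probability


if __name__ == "__main__":
    # Three hypothetical vendor engines scoring the same voice sample.
    scores = {"vendor_a": 2.0, "vendor_b": 3.0, "vendor_c": 4.0}
    is_match, probability = decide(scores)
    print(is_match, round(probability, 3))
```

A higher threshold reduces false positives at the cost of more false negatives, which is the trade-off a fused system has to tune.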

By using this technology, law enforcement agencies (LEAs) can overcome the two main challenges they have been facing up until now: the evasion problem, which consists of the use of hidden, fake and arbitrary identities by terrorists and criminals in phone or Internet-based conversations; and the difficulty in identifying an unknown speaker in a lawfully intercepted call of a known suspect.

Once the conversation has been recorded, SIIP will enable the identification of speakers by comparing their voices with rich metadata from various sources, and will allow information sharing with LEAs across the world via Interpol. This way, agents can gather valuable intelligence to prevent a crime or terrorist activity, solve it if it has already happened, and use identification as a pre-forensic tool to create evidence for judges.

The system has already been demonstrated by the project's end user partners themselves in real cases, including identification of speakers on social media and information sharing between users. The consortium indicated that the feedback was really positive, to the point where SIIP may actually join other Interpol central biometric databases such as fingerprint, face and DNA. This would not only enhance Interpol's global activity of information sharing between its 190 member states, but also improve and expedite investigation work.

Although the project ends in April 2018, SIIP's development phase is completed, and the consortium says that the system will be ready for commercialisation in a 'very short time'.

A budget should be allocated by the EU and Interpol to create a spin-off company that they would officially support and promote. This spin-off company will take care of marketing and sales, customisation to specific customers' needs, maintenance and future developments.
