April 24, 2006

Scientists devise means to test for phony technical papers

Authors of bogus technical articles beware. A team of researchers at the Indiana University School of Informatics has designed a tool that distinguishes between real and fake papers. It's called the Inauthentic Paper Detector -- one of the first of its kind anywhere -- and it uses compression to determine whether technical texts are generated by man or machine.

"This is a potential problem since no existing systems, the Web for example, can or do discriminate between content that is meaningful or bogus," says assistant professor Mehmet Dalkilic, a data mining expert. "We believe that there are subtle, short- and long-range word or even word string repetitions that exist in human texts, but not in many classes of computer-generated texts that can be used to discriminate based on meaning."

Joining Dalkilic on the IPD project are Assistant Professor Predrag Radivojac, informatics doctoral student James Costello, and Wyatt T. Clark, who will graduate in May with a bachelor's degree in informatics.

The IPD system is based on a combination of compression algorithms that reduce the amount of data to save space and speed transmission time.

To begin their study, the team identified two kinds of texts they would analyze. "Authentic text" (or document) is a collection of several hundreds or thousands of syntactically correct sentences that are wholly meaningful. "Inauthentic text" (or document) is a collection of several hundreds of thousands of syntactically correct sentences that, taken all together, have no meaning.

The researchers' work is documented in the very authentic paper, "Using Compression to Identify Classes of Inauthentic Texts," which they presented at the Society for Industrial and Applied Mathematics Conference on Data Mining in Bethesda, Md., this weekend.

The informatics study largely was inspired by a prank pulled by three Massachusetts Institute of Technology students, who in 2004 developed a computer program that churned out randomly generated fake computer science language, essentially a four-page compilation of gibberish. They submitted it as a research paper to an international conference on computer science and informatics – and it was accepted without review.

Radivojac, whose research expertise is machine learning, says the IPD easily detected numerous inauthentic technical papers tested, including the MIT students' spurious submission.

"We hypothesized we could build a reliable and fast model that recognizes fake papers automatically," says Radivojac. "We combined these with machine-learning methods to build a predictor of these kinds of papers."

In general, identifying meaning in a technical document is difficult, Dalkilic says. "We don't claim we have found a way to distinguish between meaning and nonsense, but we do emphasize that there are many nontrivial classes of inauthentic documents that can be easily distinguished based on compression algorithms."

Source: Indiana University School of Informatics

Citation: Scientists devise means to test for phony technical papers (2006, April 24) retrieved 27 July 2024 from https://phys.org/news/2006-04-scientists-phony-technical-papers.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

From takeoff to flight, the wiring of a fly's nervous system is mapped

0 shares

Feedback to editors

Spacecraft to swing by Earth, moon on path to Jupiter

3 hours ago

New process uses light and enzymes to create greener chemicals

3 hours ago

Outsourcing conservation in Africa: NGO management reduces poaching and boosts tourism, but raises risks for civilians

3 hours ago

Two shark species documented in Puget Sound for first time

3 hours ago

New study disputes Hunga Tonga volcano's role in 2023–24 global warm-up

18 hours ago

New self-powered electrostatic tweezer enhances object manipulation and microfluidics

18 hours ago

Climate is most important factor in where mammals choose to live, study finds

19 hours ago

Team develops novel hybrid scheme for compressible flow computations

19 hours ago

Twisted carbon nanotubes could achieve significantly better energy storage than advanced lithium-ion batteries

19 hours ago

3D models show dolphins already used narrow-band sound waves for orientation 5 million years ago

19 hours ago

Load comments (0)

Scientists devise means to test for phony technical papers

Spacecraft to swing by Earth, moon on path to Jupiter

New process uses light and enzymes to create greener chemicals

Outsourcing conservation in Africa: NGO management reduces poaching and boosts tourism, but raises risks for civilians

Two shark species documented in Puget Sound for first time

New study disputes Hunga Tonga volcano's role in 2023–24 global warm-up

New self-powered electrostatic tweezer enhances object manipulation and microfluidics

Climate is most important factor in where mammals choose to live, study finds

Team develops novel hybrid scheme for compressible flow computations

Twisted carbon nanotubes could achieve significantly better energy storage than advanced lithium-ion batteries

3D models show dolphins already used narrow-band sound waves for orientation 5 million years ago

From takeoff to flight, the wiring of a fly's nervous system is mapped

Wolves reintroduced to Isle Royale temporarily affect other carnivores, humans have influence as well

Scientists develop a new generation of DNA tests for a wide range of applications

Scientists discover next-generation system for programmable genome design

Experiment captures atoms in free fall to look for gravitational anomalies caused by dark energy

Scientists pioneer technique to visualize anti-ferroelectric materials

Spacecraft to swing by Earth, moon on path to Jupiter

New process uses light and enzymes to create greener chemicals

Outsourcing conservation in Africa: NGO management reduces poaching and boosts tourism, but raises risks for civilians

Two shark species documented in Puget Sound for first time

Exploring what happens when different spherical objects hit the water

New study disputes Hunga Tonga volcano's role in 2023–24 global warm-up

Medical Xpress

Tech Xplore

Science X

Scientists devise means to test for phony technical papers

Spacecraft to swing by Earth, moon on path to Jupiter

New process uses light and enzymes to create greener chemicals

Outsourcing conservation in Africa: NGO management reduces poaching and boosts tourism, but raises risks for civilians

Two shark species documented in Puget Sound for first time

New study disputes Hunga Tonga volcano's role in 2023–24 global warm-up

New self-powered electrostatic tweezer enhances object manipulation and microfluidics

Climate is most important factor in where mammals choose to live, study finds

Team develops novel hybrid scheme for compressible flow computations

Twisted carbon nanotubes could achieve significantly better energy storage than advanced lithium-ion batteries

3D models show dolphins already used narrow-band sound waves for orientation 5 million years ago

Related Stories

From takeoff to flight, the wiring of a fly's nervous system is mapped

Wolves reintroduced to Isle Royale temporarily affect other carnivores, humans have influence as well

Scientists develop a new generation of DNA tests for a wide range of applications

Scientists discover next-generation system for programmable genome design

Experiment captures atoms in free fall to look for gravitational anomalies caused by dark energy

Scientists pioneer technique to visualize anti-ferroelectric materials

Recommended for you

Spacecraft to swing by Earth, moon on path to Jupiter

New process uses light and enzymes to create greener chemicals

Outsourcing conservation in Africa: NGO management reduces poaching and boosts tourism, but raises risks for civilians

Two shark species documented in Puget Sound for first time

Exploring what happens when different spherical objects hit the water

New study disputes Hunga Tonga volcano's role in 2023–24 global warm-up

Newsletter sign up

Donate and enjoy an ad-free experience