August 3, 2007

New program color-codes text in Wikipedia entries to indicate trustworthiness

The online reference site Wikipedia enjoys immense popularity despite nagging doubts about the reliability of entries written by its all-volunteer team. A new program developed at the University of California, Santa Cruz, aims to help with the problem by color-coding an entry's individual phrases based on contributors' past performance.

The program analyzes Wikipedia's entire editing history--nearly two million pages and some 40 million edits for the English-language site alone--to estimate the trustworthiness of each page. It then shades the text in deepening hues of orange to signal dubious content. A 1,000-page demonstration version is already available on a web page operated by the program's creator, Luca de Alfaro, associate professor of computer engineering at UCSC.

Other sites already employ user ratings as a measure of reliability, but they typically depend on users' feedback about each other. This method makes the ratings vulnerable to grudges and subjectivity. The new program takes a radically different approach, using the longevity of the content itself to learn what information is useful and which contributors are the most reliable.

"The idea is very simple," de Alfaro said. "If your contribution lasts, you gain reputation. If your contribution is reverted [to the previous version], your reputation falls." De Alfaro will speak about his new program this Saturday, August 4, at the Wikimania conference in Taipei, Taiwan.

The program works from a user's history of edits to calculate his or her reputation score. The trustworthiness of newly inserted text is computed as a function of the reputation of its author. As subsequent contributors vet the text, their own reputations contribute to the text's trustworthiness score. So an entry created by an unknown author can quickly gain (or lose) trust after a few known users have reviewed the pages.

A benefit of calculating author reputation in this way is that de Alfaro can test how well his reliability scores work. He does so by comparing users' reliability scores with how long their subsequent edits last on the site. So far, the program flags as suspect more than 80 percent of edits that turn out to be poor. It's not overly accusatory, either: 60 to 70 percent of the edits it flags do end up being quickly corrected by the Wikipedia community.

The exhaustive analysis of Wikipedia's seven-year edit history takes de Alfaro's desktop PC about a week to complete. At present he is working from copies of the site that Wikipedia periodically distributes. Once the initial backlog of edits is calculated, however, de Alfaro said that updating reliability scores in real time should be fairly simple.

While the program prominently displays text trustworthiness, de Alfaro favors keeping hidden the reputation ratings of individual users. Displaying reputations could lead to competitiveness that would detract from Wikipedia's collaborative culture, he said, and could demoralize knowledgeable contributors whose scores remain low simply because they post infrequently and on few topics.

"We didn't want to modify the experience of a user going in to Wikipedia," de Alfaro said. "It is very relaxing right now and we didn't want to modify what has worked so well and is so welcoming to the new user."

Source: UC Santa Cruz

Citation: New program color-codes text in Wikipedia entries to indicate trustworthiness (2007, August 3) retrieved 24 April 2024 from https://phys.org/news/2007-08-color-codes-text-wikipedia-entries-trustworthiness.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Images: Moon, asteroids and new rockets topped the world's space news in 2023

0 shares

Feedback to editors

Lunar landforms indicate geologically recent seismic activity on the moon

40 minutes ago

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

3 hours ago

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

12 hours ago

Star bars show universe's early galaxies evolved much faster than previously thought

13 hours ago

Scientists study lipids cell by cell, making new cancer research possible

13 hours ago

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

13 hours ago

Study finds rekindling old friendships as scary as making new ones

15 hours ago

How light can vaporize water without the need for heat

16 hours ago

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

16 hours ago

Previous theory on how electrons move within protein nanocrystals might not apply in every case

17 hours ago

Load comments (0)

New program color-codes text in Wikipedia entries to indicate trustworthiness

Lunar landforms indicate geologically recent seismic activity on the moon

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

Star bars show universe's early galaxies evolved much faster than previously thought

Scientists study lipids cell by cell, making new cancer research possible

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

Study finds rekindling old friendships as scary as making new ones

How light can vaporize water without the need for heat

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

Previous theory on how electrons move within protein nanocrystals might not apply in every case

Relevant PhysicsForums posts

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Images: Moon, asteroids and new rockets topped the world's space news in 2023

New technology to assemble three-dimensional structures using gold nanoparticles confined in nanocapsules

New approach overcomes long-standing limitations in optics to enhance the efficiency of Mie scattering

Cyclone Jasper makes landfall in Australia

Researcher discovers new technique for photon detection

Understanding the key to predicting heat events in Central Europe

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

New program color-codes text in Wikipedia entries to indicate trustworthiness

Lunar landforms indicate geologically recent seismic activity on the moon

Japan's moon lander wasn't built to survive a weekslong lunar night. It's still going after 3

Bioluminescence first evolved in animals at least 540 million years ago, pushing back previous oldest dated example

Star bars show universe's early galaxies evolved much faster than previously thought

Scientists study lipids cell by cell, making new cancer research possible

Squids' birthday influences mating: Male spear squids shown to become 'sneakers' or 'consorts' depending on birth date

Study finds rekindling old friendships as scary as making new ones

How light can vaporize water without the need for heat

Researchers develop eggshell 'bioplastic' pellet as sustainable alternative to plastic

Previous theory on how electrons move within protein nanocrystals might not apply in every case

Relevant PhysicsForums posts

Related Stories

Images: Moon, asteroids and new rockets topped the world's space news in 2023

New technology to assemble three-dimensional structures using gold nanoparticles confined in nanocapsules

New approach overcomes long-standing limitations in optics to enhance the efficiency of Mie scattering

Cyclone Jasper makes landfall in Australia

Researcher discovers new technique for photon detection

Understanding the key to predicting heat events in Central Europe

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience