Researchers develop prototype to detect fake websites

July 26, 2011 By La Monica Everett-Haynes

Researchers develop prototype to detect fake websites

It seems logical that a more Internet-driven world would translate into a heightened awareness of fake websites. But it isn't so. The vast majority of people still are unable to determine the authenticity of websites, resulting in tremendous monetary loses. That is what is driving the work of UA Artificial Intelligence Lab members who, along with a UA alumnus, have earned a top honor from MIS Quarterly for their research.

(PhysOrg.com) -- Do you go online to pay bills, shop, transfer funds, sign up for classes, send email or instant messages or search for medical information? If so, then this pertains to you.

Members of a University of Arizona Eller College of Management team and a UA alumnus developed a to detect fake websites. When tested against other existing commercial systems, the team found that its system resulted in effective and more accurate detections of spoof sites – better than a human can. 

The team's subsequent article, “Detecting Fake Websites: The Contribution of Statistical Learning Theory" was published last year in an issue of MIS Quarterly, or MISQ. A preeminent peer-reviewed journal in the field of management information systems, MISQ has since been named the article its top paper for 2010.

"Even to get into MISQ is very difficult, and this is probably the first technical paper to receive the Best Paper award," said Hsinchun Chen, the UA Artificial Intelligence Lab director, one of the paper's five authors.

MISQ will formally honor the researchers in Shanghai, China later this year during the International Conference on Information Systems. 

"The topic of detecting fake websites and also our computational approach are both considered major contributions. This topic has great relevance to the industry, the society and the citizens in general," said Chen, also the McClelland Professor of Management Information Systems.

"This award is not something just for me, or my lab, but also for our department," he said, adding that the team's eventual goal is technology transfer. 

UA alumnus Ahmed Abbasi, now a University of Virginia assistant professor of information technology, is lead author on the paper. Chen served as his dissertation adviser. Other co-authors are UA Eller College's department of management information systems faculty members Zhu Zhang and Jay F. Nunamaker Jr.; and David Zimbra, a doctoral student in the Artificial Intelligence Lab.

For the research, the team used the prototype and several other detection systems to evaluate the authenticity of 900 websites.

It is easy to pick up on a site's authenticity by checking whether the URL contains "http" when it should read "https," when it was last updated, if a security key is missing or if images appear strangely pixelated.

The team found that its system – founded on statistical learning technology, which evaluates a large accumulation of data – was more apt to detect imitation sites and those that were entirely concocted, said Abbasi, who earned his doctoral degree in management information systems from the UA in 2008. 

The major difference between the authors’ prototype and the other systems? Their system relied on a tremendously rich set of fraud cues.

The team developed five categories with thousands of cues, finding that the best results were attained when utilizing thousands of highly visible and also deeply embedded cues, such as placement, URL length, the number of links, characters types on the site and how thorough the site's "frequently asked questions" section is detailed, among other features.  

The project's origins were born out of the Artificial Intelligence Lab, where Abbasi developed the mathematical formula the team eventually used while working as a project lead and research associate. He continued the work after having taken a faculty position at the University of Wisconsin-Milwaukee.

"It creates a greater awareness for a problem that has been around for a while yet still remains an issue as we increasingly move to the Internet for everything – online banking, online health initiatives and ," Abbasi said. 

Given the pervasive nature of online phishing scams, being able to readily and frequently detect a site's validity is crucial, Abbasi said, also noting research that indicates people are less than 60 percent accurate in detecting fake sites, and other security issues.

"The problem we're looking at is quite big. Fake websites constitute much of the Internet fraud's multi-billion dollar industry, and that is monetary loss…we can’t even quantify the social ramifications," Abbasi said. "That's the whole motivation. It is so profitable for fraudsters, and it is slipping through the cracks."

Today, Chen and more than one dozen of his collaborators are continuing to investigate fake sites. Meanwhile, Abbasi is undertaking an investigation of peoples' abilities to detect fake sites through a grant funded by the National Science Foundation.

Today, Chen and more than one dozen of his collaborators are continuing to investigate fake sites. Meanwhile, Abbasi is undertaking an investigation of users and peoples' abilities to detect fake sites.

Abbasi said developing better detection systems requires improved statistical learning technology that utilize larger quantities of cues. It also is important to dismiss long-held perceptions about how fake sites might and should appear. 

"The idea of protecting from the front level has been around for a while," Abbasi said, adding that companies have begun to employ software that better detects fake sites. "But we are not where we need to be, and there is a lot of potential in future development."

Provided by University of Arizona search and more info website

4.3 /5 (10 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

Star_Gazer
Jul 26, 2011

Rank: 5 / 5 (6)
In related news, fake website developers develop prototype to go around the fake website detection algorithm.
MadLintElf
Jul 26, 2011

Rank: not rated yet
zing high five!
that_guy
Jul 26, 2011

Rank: not rated yet
Or you could use a good antivirus program, and follow simple internet adage - never use an unknown link for an important site you do business with, type it in. (Then bookmark it).

It appears tho, that rnyspace.com did very well for a while...but consider the clientelle...

An ounce of prevention is better than a pound of cure. Why identify it if you can avoid it altogether.

Don't get me wrong, if someone takes over a legit site, you're basically screwed...but you'd be screwed either way.
Rank 4.3 /5 (10 votes)
Relevant PhysicsForums posts
  • Ideas to mitigate risk of 911 calls being misdirected
    createdMay 24, 2012
  • Live scribe pen?
    createdMay 10, 2012
  • Shallow water flow simulation
    createdMay 07, 2012
  • Tablet for taking notes?
    createdMay 05, 2012
  • Best fit tablet for me?
    createdMay 05, 2012
  • Measure of Informaton
    createdMay 04, 2012
  • More from Physics Forums - Computing & Technology

More news stories

Browser wars flare in mobile space

The browser wars are heating up again, but this time the fight is for dominance of the mobile Internet.

Technology / Software

created 5 hours ago | popularity 5 / 5 (1) | comments 2

Probability of contamination from severe nuclear reactor accidents is higher than expected: study

Catastrophic nuclear accidents such as the core meltdowns in Chernobyl and Fukushima are more likely to happen than previously assumed. Based on the operating hours of all civil nuclear reactors and the number ...

Technology / Energy & Green Tech

created May 22, 2012 | popularity 3.6 / 5 (22) | comments 56 | with audio podcast

SpotterRF debuts Radar Backpack Kit (w/ Video)

(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...

Technology / Hi Tech & Innovation

created May 26, 2012 | popularity 5 / 5 (5) | comments 12 | with audio podcast report

HyperSolar shows dirty water no barrier to power world

(Phys.org) -- The Santa Barbara, California, company, HyperSolar, is set to transparently share the ups and downs of its research experiences toward the company’s ultimate vision, successfully producing ...

Technology / Energy & Green Tech

created May 24, 2012 | popularity 4.8 / 5 (16) | comments 17 | with audio podcast report

Tesla to launch electric sedan in US on June 22

Tesla Motors said Tuesday it would begin deliveries of "the world's first premium electric sedan" on June 22, slightly ahead of schedule.

Technology / Energy & Green Tech

created May 22, 2012 | popularity 4.5 / 5 (11) | comments 18


Nvidia trumpets Tegra 3 phone design wins for 2012

(Phys.org) -- Nvidia’s competitive war paint has a name, Tegra 3. On the heels of Nvidia announcements about lowering costs of its Tegra 3 processors and Nvidia-enabled tablets running Android Ice Cream ...

Scientist: Evolution debate will soon be history

(AP) -- Richard Leakey predicts skepticism over evolution will soon be history. Not that the avowed atheist has any doubts himself.

Dell tablet leak: 10.1-inch display, two-battery choice

(Phys.org) -- Headline after headline talks about vendors’ tablets in the wings as likely number-one contenders for the iPad. Such claims have justifiably been taken with a grain of salt, considering ...

Keep food safety in mind this memorial day weekend

(HealthDay) -- Picnics, parades and cookouts are as much a part of Memorial Day weekend as tributes to the United States' war veterans.

Social welfare cuts ultimately come with heavy price, researchers say

(Phys.org) -- Slashing government funding for Medicaid, food stamps and other programs that serve the poor – while politically popular with some lawmakers and many conservatives – may do more harm ...

Is a classical electrodynamics law incompatible with special relativity?

(Phys.org) -- The laws of classical electromagnetism that were developed in the 19th century are the same laws that scientists use today. They include Maxwell’s four equations along with the Lorentz la ...