April 18, 2016

System predicts 85 percent of cyber-attacks using input from human experts

by Adam Conner-Simons

Today's security systems usually fall into one of two categories: human or machine. So-called "analyst-driven solutions" rely on rules created by living experts and therefore miss any attacks that don't match the rules. Meanwhile, today's machine-learning approaches rely on "anomaly detection," which tends to trigger false positives that both create distrust of the system and end up having to be investigated by humans, anyway.

But what if there were a solution that could merge those two worlds? What would it look like?

In a new paper, researchers from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) and the machine-learning startup PatternEx demonstrate an artificial intelligence platform called AI2 that predicts cyber-attacks significantly better than existing systems by continuously incorporating input from human experts. (The name comes from merging artificial intelligence with what the researchers call "analyst intuition.")

The team showed that AI2 can detect 85 percent of attacks, which is roughly three times better than previous benchmarks, while also reducing the number of false positives by a factor of 5. The system was tested on 3.6 billion pieces of data known as "log lines," which were generated by millions of users over a period of three months.

To predict attacks, AI2 combs through data and detects suspicious activity by clustering the data into meaningful patterns using unsupervised machine-learning. It then presents this activity to human analysts who confirm which events are actual attacks, and incorporates that feedback into its models for the next set of data.

"You can think about the system as a virtual analyst," says CSAIL research scientist Kalyan Veeramachaneni, who developed AI2 with Ignacio Arnaldo, a chief data scientist at PatternEx and a former CSAIL postdoc. "It continuously generates new models that it can refine in as little as a few hours, meaning it can improve its detection rates significantly and rapidly."

Veeramachaneni presented a paper about the system at last week's IEEE International Conference on Big Data Security in New York City.

Creating cybersecurity systems that merge human- and computer-based approaches is tricky, partly because of the challenge of manually labeling cybersecurity data for the algorithms.

For example, let's say you want to develop a computer-vision algorithm that can identify objects with high accuracy. Labeling data for that is simple: Just enlist a few human volunteers to label photos as either "objects" or "non-objects," and feed that data into the algorithm.

But for a cybersecurity task, the average person on a crowdsourcing site like Amazon Mechanical Turk simply doesn't have the skillset to apply labels like "DDOS" or "exfiltration attacks," says Veeramachaneni. "You need security experts."

That opens up another problem: Experts are busy, and they can't spend all day reviewing reams of data that have been flagged as suspicious. Companies have been known to give up on platforms that are too much work, so an effective machine-learning system has to be able to improve itself without overwhelming its human overlords.

AI2's secret weapon is that it fuses together three different unsupervised-learning methods, and then shows the top events to analysts for them to label. It then builds a supervised model that it can constantly refine through what the team calls a "continuous active learning system."

Specifically, on day one of its training, AI2 picks the 200 most abnormal events and gives them to the expert. As it improves over time, it identifies more and more of the events as actual attacks, meaning that in a matter of days the analyst may only be looking at 30 or 40 events a day.

"This paper brings together the strengths of analyst intuition and machine learning, and ultimately drives down both false positives and false negatives," says Nitesh Chawla, the Frank M. Freimann Professor of Computer Science at the University of Notre Dame. "This research has the potential to become a line of defense against attacks such as fraud, service abuse and account takeover, which are major challenges faced by consumer-facing systems."

The team says that AI2 can scale to billions of log lines per day, transforming the pieces of data on a minute-by-minute basis into different "features", or discrete types of behavior that are eventually deemed "normal" or "abnormal."

"The more attacks the system detects, the more analyst feedback it receives, which, in turn, improves the accuracy of future predictions," Veeramachaneni says. "That human-machine interaction creates a beautiful, cascading effect."

More information: Paper: "AI2: training a big data machine to defend" people.csail.mit.edu/kalyan/AI2_Paper.pdf

Citation: System predicts 85 percent of cyber-attacks using input from human experts (2016, April 18) retrieved 3 July 2024 from https://phys.org/news/2016-04-percent-cyber-attacks-human-experts.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

System that replaces human intuition with algorithms outperforms human teams

682 shares

Feedback to editors

'Acceleration beats' shine bright light on a novel universal modulation regime in a semiconductor-based laser

12 minutes ago

New possibilities for reservoir computing with topological magnetic and ferroelectric systems

20 minutes ago

Synthesis method for 1D segmented heteronanostructures uses stress-induced axial ordering

27 minutes ago

New models suggest Milky Way is not as packed with stars as previously thought

27 minutes ago

Flexible and durable bioelectrodes: The future of health care wearables

37 minutes ago

New mRNA technology turns cells into long-lasting drug factories

47 minutes ago

Discovering a new piranha species in the Amazon Basin

52 minutes ago

Astronomers observe a strong shock front in galaxy cluster SPT-CLJ 2031-4037

53 minutes ago

Researchers capture never-before-seen view of gene transcription

55 minutes ago

Genetic algorithm enables precise design of phononic crystals

2 hours ago

Load comments (0)

System predicts 85 percent of cyber-attacks using input from human experts

'Acceleration beats' shine bright light on a novel universal modulation regime in a semiconductor-based laser

New possibilities for reservoir computing with topological magnetic and ferroelectric systems

Synthesis method for 1D segmented heteronanostructures uses stress-induced axial ordering

New models suggest Milky Way is not as packed with stars as previously thought

Flexible and durable bioelectrodes: The future of health care wearables

New mRNA technology turns cells into long-lasting drug factories

Discovering a new piranha species in the Amazon Basin

Astronomers observe a strong shock front in galaxy cluster SPT-CLJ 2031-4037

Researchers capture never-before-seen view of gene transcription

Genetic algorithm enables precise design of phononic crystals

Relevant PhysicsForums posts

Number of Multiplications in the FFT Algorithm

Newbie question about deep learning

Who can find the largest prime number with their own programmed code?

Math Major Trying to Learn CS

Parallelizing N-Queens

How to test locally hosted websites on mobile?

System that replaces human intuition with algorithms outperforms human teams

AI system solves SAT geometry questions as well as average human test taker

Researcher tackles some of the biggest bottlenecks holding back the data science industry

Google Cloud Machine Learning is sailing into mainstream

New techniques could help identify students at risk for dropping out of online courses

Human eyes assist drones, teach machines to see

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

System predicts 85 percent of cyber-attacks using input from human experts

'Acceleration beats' shine bright light on a novel universal modulation regime in a semiconductor-based laser

New possibilities for reservoir computing with topological magnetic and ferroelectric systems

Synthesis method for 1D segmented heteronanostructures uses stress-induced axial ordering

New models suggest Milky Way is not as packed with stars as previously thought

Flexible and durable bioelectrodes: The future of health care wearables

New mRNA technology turns cells into long-lasting drug factories

Discovering a new piranha species in the Amazon Basin

Astronomers observe a strong shock front in galaxy cluster SPT-CLJ 2031-4037

Researchers capture never-before-seen view of gene transcription

Genetic algorithm enables precise design of phononic crystals

Relevant PhysicsForums posts

Related Stories

System that replaces human intuition with algorithms outperforms human teams

AI system solves SAT geometry questions as well as average human test taker

Researcher tackles some of the biggest bottlenecks holding back the data science industry

Google Cloud Machine Learning is sailing into mainstream

New techniques could help identify students at risk for dropping out of online courses

Human eyes assist drones, teach machines to see

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience