March 5, 2018

AI's dirty little secret: It's powered by people

by Ryan Nakashima

There's a dirty little secret about artificial intelligence: It's powered by hundreds of thousands of real people.

From makeup artists in Venezuela to women in conservative parts of India, people around the world are doing the digital equivalent of needlework —drawing boxes around cars in street photos, tagging images, and transcribing snatches of speech that computers can't quite make out.

Such data feeds directly into "machine learning" algorithms that help self-driving cars wind through traffic and let Alexa figure out that you want the lights on. Many such technologies wouldn't work without massive quantities of this human-labeled data.

These repetitive tasks pay pennies apiece. But in bulk, this work can offer a decent wage in many parts of the world—even in the U.S. This burgeoning but largely unseen cottage industry represents the foundation of a technology that could change humanity forever: AI that will drive us around, execute verbal commands without flaw, and, possibly, one day think on its own.

___

This human input industry has long been nurtured by search engines Google and Bing, who for more than a decade have used people to rate the accuracy of their results. Since 2005, Amazon's Mechanical Turk service, which matches freelance workers with temporary online jobs, has also made crowd-sourced data entry available to researchers worldwide.

More recently, investors have poured tens of millions of dollars into startups like Mighty AI and CrowdFlower, which are developing software that makes it easier to label photos and other data, even on smartphones.

Venture capitalist S. "Soma" Somasegar says he sees "billions of dollars of opportunity" in servicing the needs of machine learning algorithms. His firm, Madrona Venture Group, invested in Mighty AI. Humans will be in the loop "for a long, long, long time to come," he says.

Accurate labeling could make the difference between a self-driving car distinguishing between the sky and the side of a truck—a distinction Tesla's Model S failed in the first known fatality involving self-driving systems in 2016.

"We're not building a system to play a game, we're building a system to save lives," says Mighty AI CEO Daryn Nakhuda.

___

Marjorie Aguilar, a 31-year-old freelance makeup artist in Maracaibo, Venezuela, spends four to six hours a day drawing boxes around traffic objects to help train self-driving systems for Mighty AI.

She earns about 50 cents an hour, but in a crisis-wracked country with runaway inflation, just a few hours' work can pay a month's rent in bolivars.

"It doesn't sound like a lot of money, but for me it's pretty decent," she says. "You can imagine how important it is for me getting paid in U.S. dollars."

Aria Khrisna, a 36-year-old father of three in Tegal, Indonesia, says doing things like adding word tags to clothing pictures on websites such as eBay and Amazon pays him about $100 a month, roughly half his income.

And for 25-year-old Shamima Khatoon, her job annotating cars, lane markers and traffic lights at an all-female outpost of data-labeling company iMerit in Metiabruz, India, represents the only chance she has to work outside the home in her conservative Muslim community.

"It's a good platform to increase your skills and support your family," she says.

___

Major automakers like Toyota, Nissan and Ford, ride-hailing companies like Uber and other tech giants like Alphabet Inc.'s Waymo are paying reams of labelers, often through third-party vendors.

The benefits of greater accuracy can be immediate.

At InterContinental Hotels Group, every call that its digital assistant Amelia can take from a human saves $5 to $10, says information technology director Scot Whigham.

When Amelia fails, the program listens while a call is rerouted to one of about 60 service desk workers. It learns from their response and tries the technique out on the next call, freeing up human employees to do other things.

"We've transformed those jobs," Whigham says.

When a computer can't make out a customer call to the Hyatt Hotels chain, an audio snippet is sent to AI-powered call center Interactions in an old brick building in Franklin, Massachusetts.

There, while the customer waits on the phone, one of a roomful of headphone-wearing "intent analysts" transcribes everything from misheard numbers to profanities and quickly directs the computer how to respond.

That information feeds back into the system. "Next time through, we've got a better chance of being successful," says Robert Nagle, Interactions' chief technology officer.

___

Researchers have tried to find workarounds to human-labeled data, but the results are often inadequate.

In a project that used Google Street View images of parked cars to estimate the demographic makeup of neighborhoods, then-Stanford researcher Timnit Gebru tried to train her AI by scraping Craigslist photos of cars for sale that were labeled by their owners.

But the product shots didn't look anything like the car images in Street View, and the program couldn't recognize them. In the end, she says, she spent $35,000 to hire auto dealer experts to label her data.

The need for human labelers is "enormous" and "dynamic," says Robin Bordoli, CEO of labeling technology company CrowdFlower. "You can't trust the algorithm 100 percent."

___

At the moment, figuring out how to get computers to learn without so-called "ground truth" data provided by humans remains an open research question.

Trevor Darrell, a machine learning expert at the University of California Berkeley, says he expects it will be five to 10 years before computer algorithms can learn to perform without the need for human labeling.

His group alone spends hundreds of thousands of dollars a year paying people to annotate images. "Right now, if you're selling a product and you want perfection, it would be negligent not to invest the money in that kind of annotation," he says.

Several companies like Alphabet's Waymo and game-maker Unity Technologies are developing simulated worlds to train their algorithms in controlled scenarios where every object comes pre-defined.

For the most part, even companies trying to push humans out of the loop still rely on them.

CloudSight, for instance, offers website and app developers a handy tool for uploading a photo and getting a few words back describing it. The retailer Kohl's uses the service for a "Snap and Shop" visual search feature on its app.

But it's not just a fancy computer program spitting back responses. If the algorithm doesn't have a good answer, one of its 800 employees in places like India, Southeast Asia or Africa type in the answer in real time.

"We want to be the ones that can label any image without any human involvement," says Ian Parnes, CloudSight's head of business development. "How long that will take is anyone's guess."

Citation: AI's dirty little secret: It's powered by people (2018, March 5) retrieved 29 June 2024 from https://phys.org/news/2018-03-ai-dirty-secret-powered-people.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Artificial intelligence algorithm can determine a neighborhood's political leanings by its cars

81 shares

Feedback to editors

AI's dirty little secret: It's powered by people

NASA astronauts will stay at the space station longer for more troubleshooting of Boeing capsule

The beginnings of fashion: Paleolithic eyed needles and the evolution of dress

Analysis of NASA InSight data suggests Mars hit by meteoroids more often than thought

New computational microscopy technique provides more direct route to crisp images

A harmless asteroid will whiz past Earth Saturday. Here's how to spot it

Tiny bright objects discovered at dawn of universe baffle scientists

New method for generating monochromatic light in storage rings

Soft, stretchy electrode simulates touch sensations using electrical signals

Updating the textbook on polarization in gallium nitride to optimize wide bandgap semiconductors

Investigating newly discovered hydrothermal vents at depths of 3,000 meters off Svalbard

Relevant PhysicsForums posts

Cyber security in the modern/post-modern internet

AI In Actual Use

Help! Old PC dog has to learn new Mac tricks

How can you trade non integer values of Bitcoin?

Help with my buggy TV/Streaming Services

Looking for a reliable inkjet All-In-One printer for photos and docs

Artificial intelligence algorithm can determine a neighborhood's political leanings by its cars

Silicon Valley is winning the race to build the first driverless cars

Self-driving cars with no in-vehicle backup driver get OK for California public roads

Waymo ramps up self-driving fleet with 'thousands' of cars

Developing bots that talk more like people

Uber close to scrapping human backups in self-driving cars (Update)

European Parliament adopts copyright reform in blow to big tech

Facebook's messaging ambitions amount to much more than chat

Apps send intimate user data to Facebook: report

New bug prompts earlier end to Google+ social network

Twitter bots had 'disproportionate' role spreading misinformation in 2016 election: study

Web pioneer wants new 'contract' for internet

Medical Xpress

Tech Xplore

Science X

AI's dirty little secret: It's powered by people

NASA astronauts will stay at the space station longer for more troubleshooting of Boeing capsule

The beginnings of fashion: Paleolithic eyed needles and the evolution of dress

Analysis of NASA InSight data suggests Mars hit by meteoroids more often than thought

New computational microscopy technique provides more direct route to crisp images

A harmless asteroid will whiz past Earth Saturday. Here's how to spot it

Tiny bright objects discovered at dawn of universe baffle scientists

New method for generating monochromatic light in storage rings

Soft, stretchy electrode simulates touch sensations using electrical signals

Updating the textbook on polarization in gallium nitride to optimize wide bandgap semiconductors

Investigating newly discovered hydrothermal vents at depths of 3,000 meters off Svalbard

Relevant PhysicsForums posts

Related Stories

Artificial intelligence algorithm can determine a neighborhood's political leanings by its cars

Silicon Valley is winning the race to build the first driverless cars

Self-driving cars with no in-vehicle backup driver get OK for California public roads

Waymo ramps up self-driving fleet with 'thousands' of cars

Developing bots that talk more like people

Uber close to scrapping human backups in self-driving cars (Update)

Recommended for you

European Parliament adopts copyright reform in blow to big tech

Facebook's messaging ambitions amount to much more than chat

Apps send intimate user data to Facebook: report

New bug prompts earlier end to Google+ social network

Twitter bots had 'disproportionate' role spreading misinformation in 2016 election: study

Web pioneer wants new 'contract' for internet

Newsletter sign up

Donate and enjoy an ad-free experience