New reporter? Call him Al, for algorithm

Jul 11, 2012 by Rob Lever
The media covers President Barack Obama's press conference in June 2012. The new reporter on the US media scene takes no coffee breaks, churns out articles at lightning speed, and has no pension plan.

The new reporter on the US media scene takes no coffee breaks, churns out articles at lightning speed, and has no pension plan.

That's because the reporter is not a person, but a , honed to translate such as corporate earnings reports and previews or into readable prose.

Algorithms are producing a growing number of articles for newspapers and websites, such as this one produced by Narrative Science:

"Wall Street is high on Wells Fargo, expecting it to report earnings that are up 15.7 percent from a year ago when it reports its second quarter earnings on Friday, July 13, 2012," said the article on Forbes.com.

While computers cannot parse the subtleties of each story, they can take vast amounts of raw data and turn it into what passes for news, analysts say.

"This can work for anything that is basic and formulaic," says Ken Doctor, an analyst with the media research firm Outsell.

And with media companies under intense financial pressure, the move to automate some news production "does speak directly to the rebuilding of the cost economics of journalism," said Doctor.

Stephen Doig, a journalism professor at Arizona State University who has used computer systems to sift through data which is then provided to reporters, said the new computer-generated writing is a logical next step.

"I don't have a philosophical objection to that kind of writing being outsourced to a computer, if the reporter who would have been writing it could use the time for something more interesting," Doig said.

Scott Frederick, of Automated Insights, another firm in the sector, said he sees this as "the next generation of content creation."

The company got its start in 2007 as StatSheet, which generates news stories from raw feeds of play-by-play data from major sports events.

The company generates advertising on its own website and is now beginning to sell its services to other organizations for sports and real estate news.

"Over the next 12 to 24 months, every media property will need some automation strategy," Frederick told AFP.

To mimic the effect of the hometown newspaper, the company generates articles with a different "tonality" depending on the reader's preference or location.

For the 2012 Super Bowl, the article for New York Giants' fans read like this: "Hakeem Nicks had a big night, paving the way to a victory for the Giants over the Patriots, 21-17 in Indianapolis. With the victory, New York is the champion of Super Bowl XLVI."

For New England fans, the story was different: "Behind an average day from Tom Brady, the Patriots lost to the Giants, 21-17 at home. With the loss, New England falls short of a Super Bowl ring."

"Data becomes the seeds of the content trees. When you can create an entire story out of raw data, that is technologically impressive," Frederick said.

Kristian Hammond, chief technology officer at Chicago-based Narrative Science, said he had been involved in computer content generation for more than a decade.

Hammond is on leave from Northwestern University, where he was on the computer science faculty and headed a joint project generating content with the university's journalism school.

The company formed in 2010 has 40 clients including Forbes, and some corporate clients which use the technology to take spreadsheets or other data for internal reports that are more readable.

"We're about two-thirds engineering and one-third journalism," he said.

"We knew there were places in traditional journalism where raw data was used as the driver for telling stories, and we wanted to take that model and turn it into something a machine can do," he told AFP.

While some articles are reviewed by editors, others are automatically delivered without human intervention because of client preference or because the task is too voluminous: Narrative Science, he said, produced stories on 370,000 Little League baseball games in the past year.

The computers cannot pick up on certain things, such as if an injury or weather affects the game.

"If it's not in the data, we can't say anything about it. We're very aware of that, but more of what goes on is data-driven," Hammond said.

"The feedback has been very positive. We haven't done anything goofy or embarrassing so far."

One goof came from a company called Journatic, a partner of the Chicago Tribune, which uses a combination of human editors in the US and overseas and computer algorithms to generated "hyperlocal" news.

Some news organizations complained when they discovered the "bylines" generated were made-up names, not real journalists, in the Tribune, Houston Chronicle and San Francisco Chronicle, a violation of ethics policies for the dailies.

Journatic chief executive Brian Timpone said the flap stemmed from a misunderstanding with news clients and the fact that bylines were needed to be seen on Google News.

"We're taking them off," Timpone said, arguing that should not distract attention from the business model which can help media companies.

"The way news is produced has not changed in 50 years," he told AFP.

Timpone said his company can produce news more efficiently "with technology, lots of local news gathering, and a distributed writing team."

"It's not about algorithms. Algorithms only work if the data is structured. There's no way to automate everything."

Explore further: Google building fleet of package-delivering drones

add to favorites email to friend print save as pdf

Related Stories

US university coding future of news

Dec 16, 2009

Personalized newscasts culled from the Web and presented by digital avatars. Baseball stories written by computers using raw data.

News outlets losing ground to tech rivals: report

Mar 19, 2012

Mobile devices and social networks are boosting news consumption but media outlets are lagging behind technology companies in reaping the profits, according to a report published on Monday.

Report: Tablets helping improve news consumption

Mar 19, 2012

Mobile technology appears to be increasing the public appetite for news but it's far from clear whether the news industry will profit from that, a study issued Monday concluded.

Recommended for you

FIXD tells car drivers via smartphone what is wrong

2 hours ago

A key source of anxiety while driving solo, when even a bothersome back-seat driver's comments would have made you listen: the "check engine" light is on but you do not feel, smell or see anything wrong. ...

Watching others play video games is the new spectator sport

8 hours ago

As the UK's largest gaming festival, Insomnia, wrapped up its latest event on August 25, I watched a short piece of BBC Breakfast news reporting from the festival. The reporter and some of the interviewees appeared baff ...

User comments : 7

Adjust slider to filter visible comments by rank

Display comments: newest first

sstritt
Jul 11, 2012
This comment has been removed by a moderator.
Dr_toad
Jul 11, 2012
This comment has been removed by a moderator.
Sean_W
3.7 / 5 (6) Jul 11, 2012
Unless they can program in blatant political bias, slimy ethics and contempt for all things, the jobs of human journalists are safe for now. The forgoing constitutes 95% of their job.
JRDarby
5 / 5 (2) Jul 11, 2012
This sort of simple input-output prose "translation" will remain limited to reporting sports, weather, and finance information without a data-organizing system behind it. The system could parse data from a variety of sources into a manipulable format and relate the data according to your desire using a database of related concepts or ideas. Perhaps the system could ultimately parse social network and similar data, and perform similar semantic relation processes, to determine any person's news preferences based on conversation topics and stated interests and create personally-tailored news.

http://www.scribd...Proposal

Unfortunately the input would only be as good as the output, and it's hard to trust the news these days.
sennekuyl
4 / 5 (2) Jul 11, 2012
@JRDarby:

''Unfortunately the input would only be as good as the output, and it's hard to trust the news these days.''

Is that quantum programming?

nuge
1 / 5 (1) Jul 11, 2012
Yay! Maybe now we'll see less mistakes on PhysOrg.
TabulaMentis
1 / 5 (4) Jul 11, 2012
I am still wondering who is going to repay the National debts when these things go big time and put half of the workforce out of work?
Vendicar_Decarian
3.7 / 5 (3) Jul 12, 2012
Who cares about National debts as long as multinational corporations continue to rake in the profits?

Perhaps you think it is time for revolution?

What will be your economic model?
TabulaMentis
2.4 / 5 (5) Jul 12, 2012
What will be your economic model?
Balanced budget amendments that will prevent politicians from spending more than they receive, except during a real world war. If an amendment does not exist, then perhaps there may be a law(s) or bill(s) that the politician(s) may be violating that could get them impeached. If the politicians continue to disobey their people, then a coup would be in order.

If the people continue to allow their political system to run their country's economy or debt into the ground, then God help them, especially the U.S., and the huge debt that it (politicians/elite corporations) is creating.