More is less: Complex computer models can involve thousands of variables

June 2, 2010 by Larry Hardesty

(PhysOrg.com) -- The architect Mies van der Rohe is famous for promoting the slogan "less is more." But if Venkat Chandrasekaran, a graduate student in the Department of Electrical Engineering and Computer Science, had a slogan for his own work, it might be "more is less."

Science, engineering and other quantitative disciplines are largely concerned with uncovering the mathematical relationships between data points — such as energies of molecules, measurements of temperature or , or stock prices. In most cases, adding more data points just makes the math more complicated. But sometimes it makes it simpler. And for many types of calculations, if there are additional data points that will make them simpler, Chandrasekaran’s techniques will find them.

To see how adding data points can mean simpler calculations, suppose that you’re trying to understand the relationships between a bunch of stocks in the same industry sector — say, Apple, Gateway, Dell, Hewlett-Packard and other computer manufacturers. On the one hand, an increase in Apple’s share price could mean a decrease in, say, Dell’s, because Apple and Dell compete for a limited pool of computer buyers’ dollars; on the other hand, if large institutional investors are bullish about computer stocks in general, an increase in Apple’s stock could indicate an increase in Dell’s as well.

It might be possible to build a complicated mathematical model that, on the basis of considerations like the companies’ price-to-earnings ratios, trade volumes and revenues determines whether an increase in Apple’s share price will cause an increase or decrease in Dell’s — and Gateway’s, and Hewlett-Packard’s, and so on. But it might also turn out that a single extra variable — say, the average price of all the companies’ stock — provides a good indication of general trends in the sector. Since the new variable accounts for institutional investors’ enthusiasm or skittishness, the relationships between the individual stocks no longer have to. The overall calculation becomes much simpler.

Irrelevant referent

In this case, Chandrasekaran’s techniques would tell you only that adding another variable — the average stock price — simplifies the overall calculation. They wouldn’t tell you why. And indeed, the extra variable could turn out to be something more complicated than an average. It might factor in the price-to-earnings ratios of some companies, the revenues of others, the share prices of still others, and so on. A savvy analyst might be able to deduce that this new, more complex variable represents the trading strategies of a bunch of large hedge funds that concentrate on the computer industry. But then again, it could be that no one has any idea what the new variable refers to.

“There’s this temptation that I even had initially, that you can sort of discover hidden variables,” says Chandrasekaran. “And that’s true: You can discover hidden variables. But it’s not going to be easy to attribute meaning to these hidden variables.” For most purposes, however, that may not matter. “From the mathematical point of view, just putting these things in helps you simplify,” Chandrasekaran says. If the added variable helps you predict Dell’s from Apple’s, does it really matter what it refers to — or whether it refers to anything at all?

Getting to the bottom

At the most recent Symposium on System Identification, hosted by the International Federation of Automatic Control, Chandrasekaran and MIT Professors of Electrical Engineering Alan Willsky and Pablo Parrilo described their approach to finding hidden variables that simplify calculations.

Generally, computer science is concerned with questions of computational complexity: Given a particular algorithm, you want to know whether a computer can execute it quickly, slowly or never. So provides some standard methods for calculating the complexity of mathematical models.

If you have an equation that describes the complexity of a , you want to find its minimum values: where the complexity is lowest, the model is simplest, and thus easiest to work with. If you imagine the graph of the equation as a complex surface with lots of peaks and troughs, you want to find the bottom of the deepest trough.

But that in itself can be a prohibitively complex process. Computer scientists have developed a host of methods for analyzing such equations and finding solutions that are probably near the bottom of a trough in a particular region of the graph. For certain types of problems, however, the techniques developed by Chandrasekaran and his colleagues are mathematically guaranteed to find the bottom of the graph’s lowest trough.

According to Ben Recht, an assistant professor in the University of Wisconsin’s computer sciences department, “There are a lot of people who would be surprised if you told them that you could solve this particular hidden-variable problem using [Chandrasekaran’s] methods.” He adds, however, that “it’s not a general-purpose tool, even for these hidden-variable problems.” Chandrasekaran agrees. In fact, he prefers to describe his methods as “tricks” rather than “techniques,” because it might require some mathematical insight to determine how to apply them in any particular case.

Still, Recht says, “he’s shown that in a relatively large set of cases, you can actually use this. And it’s a first step to explore the space of what sorts of problems can be solved using this technology.”

Related Stories

Nonlinear thinker: Making sense of previously insoluble problems

January 29, 2010

If an airplane is cruising along and raises the flaps on its wings a degree or two, it will tilt upward. If it raises the flaps twice as much, it will tilt upward about twice as much. But if it tilts upward too far — generally ...

HP, Dell win Israeli government tender

May 23, 2006

The Israeli government has reached a supply agreement with the Israeli Dell and HP computer distributors, the business magazine TheMarker reported Tuesday.

Explained: Regression analysis

March 16, 2010

(PhysOrg.com) -- Regression analysis. It sounds like a part of Freudian psychology. In reality, a regression is a seemingly ubiquitous statistical tool appearing in legions of scientific papers, and regression analysis is ...

Research Reveals Why Some Stocks Keep Winning, While Others Keep Losing

January 9, 2006

A new study suggests that investor psychology plays a big role in why stock prices show strong momentum – the tendency for prices to continue in the same direction, either rising or falling.

Statistics Professor Hides Pictures, Messages in Problem Solutions

April 11, 2007

Say you’re an aspiring statistician who has just spent hours trying to figure out the answer to a particularly thorny problem. As you plug the final numbers into the computer program you’re running in order to confirm ...

Dell planning acquisition: WSJ

June 11, 2009

US computer giant Dell is planning to acquire a"significant-sized company" in the next few months, The Wall Street Journal reported on Thursday.

Recommended for you

Sexual harassment common among middle school children, study finds

December 9, 2016

The recent suicide of Brandy Vela, a teen in Texas City, Texas, was a potent reminder of the sometimes tragic consequences of bullying. According to Vela's parents, the teen fatally shot herself Nov. 29 following months of ...

Digitally reconstructed skull and face may reveal Robert the Bruce, king-hero of the Scots

December 9, 2016

Could this be the face of Robert the Bruce, as it has never been seen before?

Archeologist claims to have found proof that Hebrew was the first written alphabet

December 8, 2016

(Phys.org)—Douglas Petrovich, an archaeologist with Ontario's Wilfrid-Laurier University in Canada has sparked controversy in the ancient history scholarly community by making claims that he has found proof that Hebrew ...

Mobile money access lifted two percent of Kenyan households out of poverty: study

December 8, 2016

Since 2008, MIT economist Tavneet Suri has studied the financial and social impacts of Kenyan mobile-money services, which allow users to store and exchange monetary values via mobile phone. Her work has shown that these ...

Amber specimen offers rare glimpse of feathered dinosaur tail

December 8, 2016

Researchers have discovered a dinosaur tail complete with its feathers trapped in a piece of amber. The finding reported in Current Biology on December 8 helps to fill in details of the dinosaurs' feather structure and evolution, ...

Fossilized evidence of a tumor in a 255-million-year-old mammal forerunner

December 8, 2016

When paleontologists at the University of Washington cut into the fossilized jaw of a distant mammal relative, they got more than they bargained for—more teeth, to be specific.