What sort of stream networks do scientific ideas flow along?
When scientists have an interesting idea, the result is usually a joint publication. At the Institute of Nuclear Physics of the Polish Academy of Sciences in Cracow, it has been shown that tracking the dependencies between co-authors reveals not only see the paths along which scientific ideas flow, but also reconstructs the structure of scientific cooperation and detects emerging communities. Interestingly, the proposed method of analysis can be an effective tool to fight terrorists and even dishonest politicians.
A group of researchers from the Institute of Nuclear Physics of the Polish Academy of Sciences (IFJ PAN) in Cracow, headed by Prof. Stanislaw Drozdz (IFJ PAN, Cracow University of Technology), has analyzed the structures of links between the co-authors of scientific works forming around such eminent figures of modern science as Harry Eugene Stanley or Edward Witten. Illustrated in the form of graphs, the research results give a unique insight into the various forms of modern scientific cooperation.
"More and more people are now participating in scientific projects, and science itself is becoming more and more interdisciplinary. The scale of difficulty of the scientific issues under research is increasing, and we are also dealing with the rapid development of modern methods of communication. All this means that the links between scientists today have not only a significant complexity, but also constantly growing dynamics," says Prof. Drozdz.
When analyzing the structures of scientific cooperation, the Cracow researchers made use of an idea connected with Paul Erdos, one of the greatest mathematicians of the 20th century. Erdos was the author and co-author of around 1500 publications that included a total of over 500 people collaborating. Erdos' exceptional scientific activity provoked mathematicians to invent Erdos numbers, reflecting the scientific 'kinship' of a given person with Erdos himself. According to the definition, Erdos was assigned the number 0; a scientist who co-authored a publication with him was assigned 1; a scientist who co-authored a publication with an Erdos 1 collaborator was assigned Erdos 2, etc.
The simplest graph produced according to the above principles would be one for Paul Dirac, an English physicist-theoretician who always published his works as the sole author. Dirac's graph would therefore consist of just one node. There would be an equally uninteresting topology in graphs of modern scientific ventures in which hundreds of scientists participate, assigned to many subsequent publications in an often purely administrative manner (this is a common situation in the case of complex, long-term experiments, such as LHC and the LIGO gravitational wave detectors). These graphs would consist of a large number of nodes, most of them connecting with all the others. The network of connections here is so dense that it is difficult to find any interesting dependencies.
The graph of Erdos himself also turned out to be moderately interesting: Numerous links radiate from the central node to many neighboring nodes, and almost always end there. Only some of the nodes symbolizing Erdos' associates were connected to each other. The shape of the structure was probably influenced here by the fact that the communication tools existing in the 20th century were much more poorly developed than at present, and it was much more difficult for many scientists, especially the lesser known ones, to establish new contacts.
"When Erdos was dealing with the mathematical description of networks reflecting human interactions, he predicted that the connection structure would be quite democratic: most nodes would be in direct contact with a comparable, though not very large, number of other nodes. But at the turn of the century, for the first time, a large network was visualized: the Internet. Suddenly, it turned out that this network has a much less democratic structure: only 20 percent of nodes have access to 80 percent of connections. We discovered a self-organization resulting from the operation of a similar power law in the network of Professor Stanley," notes Prof. Drozdz.
Prof. H. Eugene Stanley is an interdisciplinary statistical physicist, co-author of several dozen publications a year. His Hirsch index, reflecting the number and response of scientific publications, is one of the highest in the natural and technical sciences. In the idea flow network surrounding Stanley there are many highlighted nodes, around which there are more nodes, representing scientists developing their own research, and inspired by Stanley's works. Some of them, corresponding to such figures of science as Marcel Ausloos, Shlomo Havlin or Sergey Buldyrev, influenced their colleagues in a similarly creative way, focusing their own communities around themselves. As a result, the Stanley network acquired hierarchical features, in many places demonstrating the self-similarity characteristic of fractal objects.
The hierarchy of the ideas network is neither a universal feature nor is it characteristic of the outstanding figures of today's science. The graph illustrating the scientific connections of Edward Witten, a well-known mathematical physicist developing string theory, turned out to have a general outline resembling a simple star, as was so clearly visible in Erdos's graph. In Witten's, however, the presence of several distinct sub-structures separated from each other is visible. They correspond to communities dealing with specific topics, e.g. the theory of everything or the physics of high energies.
"The mathematical procedure proposed by us makes it possible to detect links between people which are not always perceived at first glance. The existence of some communities only became obvious when, constructing the graphs, we took into account the fact that connections between nodes can have different strengths. Some authors may, after all, publish joint works many times," notes Prof. Drozdz.
In networks of interpersonal contacts, mathematics reveals of the presence of communities connected by common ideas. The methods of analysis proposed by the Cracow physicists could thus be used for other purposes, e.g. to follow the structures of terrorist organizations. But it is possible to imagine other, more peaceful applications. For example, modern democracies could perhaps function more effectively if, before elections, researchers produced a graph illustrating the network of the candidate's connections to other people.