Researchers use new tools of data science to capture single molecules in action
In high school chemistry, we all learned about chemical reactions. But what brings two reacting molecules together? As explained to us by Einstein, it is the random motion of inert molecules driven by the bombardment of solvent molecules. If brought close enough together, by random chance, these molecules may react.
Capturing the motion of single molecules is achieved by a method known as Fluorescence Correlation Spectroscopy (FCS). The catch? It takes very many detections of light particles, photons, emitted by single molecules to get a clear picture of molecular motion.
As an illustration, think of a political poll. At any given time in a campaign cycle, polls are used to predict the outcome of an upcoming election. But how many voters must we interrogate to get an accurate prediction and, given how time-sensitive polling information is, how quickly can we probe the nation's political leanings? Asking every voter in every state would yield accurate results but be too costly in time and dollars. For practical reasons, we need to take a sample of voters and efficiently exploit all information contained in that sample. The voters in this illustration are our proverbial photons here.
The long times needed to acquire data in FCS is just like the naïve polling strategy highlighted earlier. It takes too long, and the chemistry we care about learning might already be done. Furthermore, exposing samples to the laser for long periods of time may result in the photochemical damage of molecules under study, preventing the widespread use of FCS in biological research.
"Single-molecule fluorescence techniques have revolutionized our understanding of the dynamics of many critical molecular processes, but signals are inherently noisy and experiments require long acquisition times," explained Marcia Levitus, an associate professor in the School of Molecular Sciences and the Biodesign Institute.
"New mathematical tools make it possible to think about old but powerful experiments in a new light," said Steve Pressé, lead author on the study and joint professor in the Department of Physics and School of Molecular Sciences at ASU at Arizona State University.
A paper published in Nature Communications by Pressé and collaborators now addresses these issues using tools from data science and, more specifically, Bayesian nonparametrics—a type of statistical modeling tool so far largely used outside the natural sciences. Levitus adds "Old strategies limited our ability to probe anything but slow processes, leaving a vast number of interesting biological questions involving faster chemical reactions out of reach. Now we can begin asking questions on processes resolved in short order."