AI algorithm solves structural biology challenges

Determining the 3D shapes of biological molecules is one of the hardest problems in modern biology and medical discovery. Companies and research institutions often spend millions of dollars to determine a molecular structure - and even such massive efforts are frequently unsuccessful.

Using clever, new machine learning techniques, Stanford University PhD students Stephan Eismann and Raphael Townshend, under the guidance of Ron Dror, associate professor of computer science, have developed an approach that overcomes this problem by predicting accurate structures computationally.

Most notably, their approach succeeds even when learning from only a few known structures, making it applicable to the types of molecules whose structures are most difficult to determine experimentally.

Their work is demonstrated in two papers detailing applications for RNA molecules and multi-protein complexes, published in Science on Aug. 27, 2021, and in Proteins in December 2020, respectively. The paper in Science is a collaboration with the Stanford laboratory of Rhiju Das, associate professor of biochemistry.

"Structural biology, which is the study of the shapes of molecules, has this mantra that structure determines function," said Townshend.

The algorithm designed by the researchers predicts accurate molecular structures and, in doing so, can allow scientists to explain how different molecules work, with applications ranging from fundamental biological research to informed drug design practices.

"Proteins are molecular machines that perform all sorts of functions. To execute their functions, proteins often bind to other proteins," said Eismann. "If you know that a pair of proteins is implicated in a disease and you know how they interact in 3D, you can try to target this interaction very specifically with a drug."

Eismann and Townshend are co-lead authors of the Science paper with Stanford postdoctoral scholar Andrew Watkins of the Das lab, and also co-lead authors of the Proteins paper with former Stanford PhD student Nathaniel Thomas.

Designing the algorithm

Instead of specifying what makes a structural prediction more or less accurate, the researchers let the algorithm discover these molecular features for itself. They did this because they found that the conventional technique of providing such knowledge can sway an algorithm in favor of certain features, thus preventing it from finding other informative features.

"The problem with these hand-crafted features in an algorithm is that the algorithm becomes biased towards what the person who picks these features thinks is important, and you might miss some information that you would need to do better," said Eismann.

"The network learned to find fundamental concepts that are key to molecular structure formation, but without explicitly being told to," said Townshend. "The exciting aspect is that the algorithm has clearly recovered things that we knew were important, but it has also recovered characteristics that we didn’t know about before."

Having shown success with proteins, the researchers next applied their algorithm to another class of important biological molecules, RNAs. They tested their algorithm in a series of “RNA Puzzles” from a long-standing competition in their field, and in every case, the tool outperformed all the other puzzle participants and did so without being designed specifically for RNA structures.

Broader applications

The researchers are excited to see where else their approach can be applied, having already had success with protein complexes and RNA molecules.

"Most of the dramatic recent advances in machine learning have required a tremendous amount of data for training. The fact that this method succeeds given very little training data suggests that related methods could address unsolved problems in many fields where data is scarce," said Dror, who is senior author of the Proteins paper and, with Das, co-senior author of the Science paper.

Specifically for structural biology, the team says that they’re only just scratching the surface in terms of scientific progress to be made.

"Once you have this fundamental technology, then you’re increasing your level of understanding another step and can start asking the next set of questions," said Townshend. "For example, you can start designing new molecules and medicines with this kind of information, which is an area that people are very excited about."

Raphael J L Townshend, Stephan Eismann, Andrew M Watkins, Ramya Rangan, Maria Karelina, Rhiju Das, Ron O Dror.
Geometric deep learning of RNA structure.
Science, 2021. doi: 10.1126/science.abe5650

Most Popular Now

SK bioscience and GSK start Phase 3 trial of adjuv…

SK bioscience (SK) and GlaxoSmithKline plc (GSK) today announced the initiation of a Phase 3 clinical study of SK's COVID-19 vaccine candidate, GBP510, in combination wit...

No serious health effects linked to mRNA COVID-19 …

Federal and Kaiser Permanente researchers combing the health records of 6.2 million patients found no serious health effects that could be linked to the 2 mRNA COVID-19 v...

Blood vessels produce growth factor that promotes …

Blood vessels supply tumors with nutrients and, on the other hand, enable cancer cells to spread throughout the body. The settlement of circulating tumor cells in a dista...

First-in-human clinical trial for a vaccine to tre…

The first patients have been enrolled in a phase 1 randomized placebo-controlled clinical trial to study a therapeutic vaccine for opioid use disorder developed by resear...

Sandoz strengthens pipeline by entering into agree…

Sandoz, a Novartis division, today announced that it has entered into a commercialization agreement with Bio-Thera Solutions, Ltd. for biosimilar bevacizumab (BAT1706). B...

Pfizer and BioNTech submit a variation to EMA with…

Pfizer Inc. (NYSE: PFE) and BioNTech SE (Nasdaq: BNTX) announced that they submitted a variation to the European Medicines Agency (EMA) requesting to update the Condition...

A drug costing less than €2 a day helps in the tre…

Metoprolol, a drug widely used to treat cardiovascular disease, is beneficial when administered to COVID-19patients. This is the finding of a study by investigators at th...

Rheumatoid arthritis treated with implanted cells …

With a goal of developing rheumatoid arthritis therapies with minimal side effects, researchers at Washington University School of Medicine in St. Louis have genetically ...

NIH scientists build a cellular blueprint of multi…

Chronic lesions with inflamed rims, or "smoldering" plaques, in the brains of people with multiple sclerosis (MS) have been linked to more aggressive and disabling forms ...

Scientific evidence to date on COVID-19 vaccine ef…

An expert review by an international group of scientists, including some at the WHO and FDA, concludes that, even for the delta variant, vaccine efficacy against severe C...

How a plant virus could protect and save your lung…

Using a virus that grows in black-eyed pea plants, nanoengineers at the University of California San Diego developed a new treatment that could keep metastatic cancers at...

COVID-19 nasal vaccine candidate effective at prev…

Breathe in, breathe out. That’s how easy it is for SARS-CoV-2, the virus that causes COVID-19, to enter your nose. And though remarkable progress has been made in develop...