Deep learning algorithm could boost drug development

Artificially intelligent algorithms can learn to identify amazingly subtle information, enabling them to distinguish between people in photos or to screen medical images as well as a doctor. But in most cases their ability to perform such feats relies on training that involves thousands to trillions of data points. This means artificial intelligence doesn't work all that well in situations where there is very little data, such as drug development.

Vijay Pande, professor of chemistry at Stanford University, and his students thought that a fairly new kind of deep learning, called one-shot learning, that requires only a small number of data points might be a solution to that low-data problem.

"We're trying to use machine learning, especially deep learning, for the early stage of drug design," said Pande. "The issue is, once you have thousands of examples in drug design, you probably already have a successful drug."

The group admitted the idea of applying one-shot learning to drug design problems was farfetched -- the data was likely too limited. However, they'd had success in the past with machine learning methods requiring only hundreds of data points, and they had data available to test the one-shot approach. It seemed worth a try.

Much to their surprise, their results, published April 3 in ACS Central Science, show that one-shot learning methods have potential as a helpful tool for drug development and other areas of chemistry research.

Moving from images to molecules
Other researchers have successfully applied one-shot learning to image recognition and genomics, but applying it to problems relevant to drug development is a bit different. Whereas pixels and bases are fairly natural types of data to feed into an algorithm, properties of small molecules aren't.

To make molecular information more digestible, the researchers first represented each molecule in terms of the connections between atoms (what a mathematician would call a graph). This step highlighted intrinsic properties of the chemical in a form that an algorithm could process.

With these graphical representations, the group trained an algorithm on two different datasets - one with information about the toxicity of different chemicals and another that detailed side effects of approved medicines. From the first dataset, they trained the algorithm on six chemicals and had it make predictions about the toxicity of the other three. Using the second dataset, they trained it to associate drugs with side effects in 21 tasks, testing it on six more.

In both cases, the algorithm was better able to predict toxicity or side effects than would have been possible by chance.

"We worked on some prototype algorithms and found that, given a few data points, they were able to make predictions that were pretty accurate," said Bharath Ramsundar, who is a graduate student in the Pande lab and co-lead author of the study.

However, Ramsundar cautioned that this isn't a "magical" technique. It was built off of several recent advances in a particular style of one-shot learning and it works by relying on the closeness of different molecules, as indirectly indicated by their formula. For example, when the researchers trained their algorithm on the toxicity data and tested it on the side effect data, the algorithm completely collapsed.

An experimentalist's help
People concerned about AI taking jobs from humans have nothing to fear from this work. The researchers envision this as groundwork for a potential tool for chemists who are early in their research and trying to choose which molecule to pursue from a set of promising candidates.

"Right now, people make this kind of choice by hunch," Ramsundar said. "This might be a nice compliment to that: an experimentalist's helper."

Beyond giving insight into drug design, this tool would be broadly applicable to molecular chemistry. Already, the Pande lab is testing these methods on different chemical compositions for solar cells. They have also made all of the code they used for the experiment open source, available as part of the DeepChem library.

"This paper is the first time that one-shot has been applied to this space and it's exciting to see the field of machine learning move so quickly," Pande said. "This is not the end of this journey - it's the beginning."

This research was funded by the Fannie and John Hertz Foundation.

Han Altae-Tran, Bharath Ramsundar, Aneesh S. Pappu, Vijay Pande.
Low Data Drug Discovery with One-Shot Learning.
ACS Cent. Sci., doi: 10.1021/acscentsci.6b00367.

Most Popular Now

FDA approves Roche's OCREVUS™ (ocrelizumab) for re…

Roche (SIX: RO, ROG; OTCQX: RHHBY) announced today that the US Food and Drug Administration (FDA) approved OCREVUS™ (ocrelizumab) as the first and only medicine for both ...

Read more

First collaborative definition of patient centrici…

AstraZeneca announced that BMJ Innovations has published the first collaborative definition of patient centricity for the pharmaceutical industry. The definition was co-d...

Read more

High doses of vitamin C to improve cancer treatmen…

Clinical trials found that it is safe to regularly infuse brain and lung cancer patients with 800 - 1000 times the daily recommended amount of vitamin C as a potential st...

Read more

Deep learning algorithm could boost drug developme…

Artificially intelligent algorithms can learn to identify amazingly subtle information, enabling them to distinguish between people in photos or to screen medical images ...

Read more

Bristol-Myers Squibb enters into separate agreemen…

Bristol-Myers Squibb Company (NYSE:BMY) today announced that it has entered into two separate agreements to license BMS-986168, an anti-eTau compound in development for ...

Read more

Tagrisso (osimertinib) receives US FDA full approv…

AstraZeneca today announced that the US Food and Drug Administration (FDA) has granted full approval for Tagrisso (osimertinib) 80mg once-daily tablets, for the treatment...

Read more

A one-two punch hits pancreatic cancer where it hu…

Australian scientists have uncovered a promising new approach to treating pancreatic cancer, by targeting the tissue around the tumour to make it 'softer' and more respon...

Read more

FDA allows marketing of first direct-to-consumer t…

The U.S. Food and Drug Administration today allowed marketing of 23andMe Personal Genome Service Genetic Health Risk (GHR) tests for 10 diseases or conditions. These are ...

Read more

Novartis to strengthen R&D pipeline by in-lice…

Novartis has exercised an option to in-license ECF843 for ophthalmic indications worldwide (outside Europe). The closing of the deal is subject to customary closing condi...

Read more

New drug delivery system shows promise for fightin…

A new cancer-drug delivery system shows the ability to exploit the oxygen-poor areas of solid tumors that make the growths resistant to standard chemotherapy and radiatio...

Read more

Towards a safe and scalable cell therapy for type …

More than 36 million people globally are affected by type 1 diabetes (T1D), a lifelong disorder where insulin producing cells are attacked and destroyed by the immune sys...

Read more

XELJANZ® (tofacitinib citrate) receives marketing …

Pfizer Inc. (NYSE:PFE) announced today that the European Commission (EC) has approved XELJANZ® (tofacitinib citrate) 5 mg twice daily (BID) oral tablets in combination wi...

Read more

Pharmaceutical Companies

[ A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Z ]