IBM Contributes Data to the National Institutes of Health to Speed Drug Discovery and Cancer Research Innovation

IBM (NYSE: IBM) announced it is contributing a massive database of chemical data extracted from millions of patents and scientific literature to the National Institutes of Health. This contribution will allow researchers to more easily visualize important relationships among chemical compounds to aid in drug discovery and support advanced cancer research.

In collaboration with AstraZeneca, Bristol-Myers Squibb, DuPont and Pfizer, IBM is providing a database of more than 2.4 million chemical compounds extracted from about 4.7 million patents and 11 million biomedical journal abstracts from 1976 to 2000. The announcement was made at an IBM forum on U.S. economic competitiveness in the 21st century, exploring how private sector innovations and investment can be more easily shared in the public domain.

The publicly available chemical data can be used by researchers worldwide to gain new insights and enable new areas of research. It will also help researchers save time by more efficiently finding information buried in millions of pages of patent documents. Access to this data will allow researchers to analyze far larger sets of documents than the traditional manual process permits, adding a whole new dimension to the ability to search intellectual property.

The data was extracted using the IBM business analytics and optimization strategic IP insight platform (SIIP), a combination of data and analytics delivered via the IBM SmartCloud and developed by IBM Research in collaboration with several major life sciences organizations. SIIP is a new cloud-driven method for curating and analyzing massive amounts of patents, scientific content and molecular data. It uses techniques such as automated image analysis and enhanced optical recognition of chemical images and symbols to extract information from patents and literature upon publication. This task otherwise takes weeks or months to complete manually, but can be done rapidly using this new technology.

"Information overload continues to be a challenge in drug discovery and other areas of scientific research," said Steve Heller, project director for the InChI Trust, a non-profit which supports the InChI international standard to represent chemical structures. "Rich data and content is often buried in patents, drawings, figures and scholarly articles. This contribution by IBM and its collaborators will make it easier for researchers to use this data, link to other data using the InChI structure representation and derive new insight."

Over the past six years, several major life sciences organizations have worked on this project with IBM Research, gaining access to a comprehensive chemical library extracted from worldwide patents and scientific abstracts. Public structure extraction tools developed by researchers at the National Institutes of Health were also used successfully in this project.

"The scientific community will receive enormous benefit from this advancement," said Heller. "This is an important addition to the open chemistry data sets. The comprehensiveness of the data and the new ways researchers can look at these data and cross-link to other data associated with each chemical is expected to help with drug development to fight many forms of cancers and other human diseases, as well as the development of other chemical compounds."

The data will be contributed to the National Center for Biotechnology Information (NCBI), part of the National Library of Medicine (NLM), and the Computer-Aided Drug Design (CADD) Group of the National Cancer Institute (NCI) at the National Institutes of Health. It will be incorporated into NCBI's PubChem, a public resource for the scientific community that serves as an aggregator for scientific results, as well as into NCI CADD Group services such as the Chemical Structure Lookup Service and the Chemical Identifier Resolver.
