Elsevier opens data model for life sciences innovation

In the 1930s, if you were visiting London and wanted to find a specific street, you had two choices: get near your destination and ask a policeman, or rely on a cab driver to find it. Not until Phyllis Isobella Gross walked the city’s 23,000 streets and created the first indexed A to Z did planning a route become quick and easy.

For life science researchers, developing new research projects on therapies or materials can still feel like being a London visitor in the pre-A to Z days. Information on previous experiments exists, but it is recorded and stored in a wide variety of data formats and locations. Just assembling the background needed to justify funding can be lengthy, painstaking and often infuriating.

It is to lower such barriers to innovations in life science that The Pistoia Alliance was formed. Named for the Italian town where its first conference was held in 2009, it brings together representatives of AstraZeneca, GSK, Novartis, Pfizer and Roche, among over 80 member companies, to create a framework for pre-competitive collaboration by overcoming common research and development obstacles, especially around data, knowledge sharing and technology pilots.

That goal has just received a major boost from the decision by Elsevier, the science and health information analytics business, to donate its Unified Data Model (UDM) to the Alliance. UDM is an XML file format, originally developed by Elsevier in partnership with Roche, that helps to upload data sets into horizontal systems. Typically, it is the difficulty of integrating in-house Electronic Lab Notebooks (ELN) with vertical systems (like those used by academics and publishers) that adds costs to projects.

“A common data model is the answer to a critical need.”

“We are very much a data company and, as with any company dealing with data, we had to think carefully about the risks and benefits of exposing our intellectual property,” Tim Hoctor, VP of professional services at Elsevier, told DataIQ. “That said, implementing a common data model with major pharmaceutical companies that allows them to integrate their data is the answer to a critical need.”

By using a common data model across all of the systems involved in pharmaceutical and life sciences R&D, the discovery and analysis of information speeds up. It also opens the door to new approaches using machine learning, for example, which are ideal for this industry to exploit given the scale of the data involved. “That is what we are all trying to support so that data can be applied to studies that lead to breakthroughs in therapies and material sciences. That is to everybody’s benefit,” explained Hoctor.

When Elsevier surveyed its clients about the number of data sources they typically access for research projects, the average turned out to be three. “As we have five primary data sources, that was a cause for concern because it meant 40% of our data resource were going undiscovered. If a scientist doesn’t find the answer in those data sets, they may go on to duplicate an experiment which costs money and takes time,” he said.
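The arithmetic behind that 40% figure can be sketched in a few lines of Python. This is only an illustration of the calculation Hoctor describes; the function name and parameters are hypothetical and do not come from Elsevier or the survey itself.

```python
def undiscovered_share(total_sources: int, sources_accessed: float) -> float:
    """Return the fraction of available data sources that go unused.

    Hypothetical helper for illustration only: it assumes every source
    is counted equally, which is how the quoted figure appears to work.
    """
    if total_sources <= 0:
        raise ValueError("total_sources must be positive")
    # Sources that are available but never consulted (never negative).
    unused = max(total_sources - sources_accessed, 0)
    return unused / total_sources


# Five primary data sources, of which clients access three on average.
share = undiscovered_share(total_sources=5, sources_accessed=3)
print(f"{share:.0%} of data sources go undiscovered")
```

With five sources and three accessed, two of the five are left unused, which is the two-fifths share quoted above.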

Hoctor has been the driving force behind releasing the UDM to The Pistoia Alliance, of which he is an elected board director. While he had to make the argument that releasing it into the public domain would benefit everybody in the industry, including Elsevier, the company took little persuading. With thousands of competing standards in use, moving to a shared open standard is an obvious step that everybody could buy into. He noted, “There was some caution, but nobody red-flagged it.”

“If you don’t have common data models, you can’t access the science.”

“There is more and more interest around open standards and collaboration. If you look at big pharmaceutical mergers and acquisitions, if the companies involved had common data models, it would remove the issues of bringing together and integrating the data sets from both sides,” he said. “If you don’t have that, you can’t access the science.”

Releasing additional value from existing studies is one driver behind M&A, especially with pressures on the new product pipeline making the demand for more rapid innovation even greater. Hoctor notes that pharmaceutical and chemistry companies typically work with a wide range of partners where data sharing is critical. Until now, integration was challenging.

“We have put some things into the public domain on a smaller scale before, but we had not done anything at this level,” said Hoctor. “We hope it will enable a broader impact from research.”

DataIQ is a trading name of IQ Data Group Limited
10 York Road, London, SE1 7ND
