page loader

Pharma R&DPharma companies use science-based innovations, analytical tools, and services to give out answers to some of the most challenging healthcare problems

They help people live a long and healthy life. One such Fortune 500 pharma company wanted to analyze the biomarker data and unmined clinical trial. Modak Analytics, along with a consultant team, undertook this transformation. There were many challenges:

  1. A large amount of legacy trial data – The biopharmaceutical company had thousands of legacy datasets ranging from the past 10 years to a constantly growing set of new clinical trial data, to be integrated.

  2. No schema alignment across the trial data – Each study had a unique set of datasets and variables which varied in the therapeutic area and trial type. In addition, internal data standards had changed over time and had different implementations.

Modak developed a system for integrating new studies that have already been standardized to SDTM (Study Data Tabulation Model) format. This helps to scale the larger number of legacy studies that could not be addressed and where ETL and statistical programming could not keep up. Modak combined machine learning and expert analytical tools to map legacy clinical trials to the master schema by:

  • Automatically ingesting thousands of study datasets and associated metadata from SAS binary files
  • Applying machine learning guided mapping of source datasets to the custom standard implementation
  • Providing a point-and-click user interface for simple and complex transformations (e.g. pivoting)
  • Documentation of all the mapping and transformations performed in the system is generated automatically
  • Plugging data into an existing data loading pipeline after it had been aligned to a standard schema

Modak’s Novel approach of preparing data for downstream analytics enables scientists to access more data to make better, faster, and more accurate decisions. Moreover, it enables biopharmaceutical organizations to finally see a return on their enormous data investment.