Skip to main content

The OMOP Common Data Model explained: speaking the language of health data

The second training of the INDICATE Training Programme on Interoperability, OMOP and Vocabularies took place on April 9, 2026. The programme is designed to support data providers in using the INDICATE infrastructure effectively, securely, and in a fully standardised way. It helps participants, such as clinicians and data engineers, build both the conceptual understanding and practical skills needed to work with interoperable health data.

During this second training, led by Maxim Moinat (Data Engineer and OHDSI Collaborator, Erasmus MC) and moderated by Boris Delange (MD, Medical Informatics, Université de Rennes), participants learned that data from different hospitals and institutions must be combined and compared to enable research at a European level. However, this is only possible when data is structured in a way that makes comparison meaningful and reliable.

This is where standardisation becomes essential. Without a shared structure, data remains fragmented across systems, making large-scale analysis difficult or even impossible. By harmonising data into a common format, researchers can generate evidence that is consistent, reproducible, and scalable across countries.

The OMOP Common Data Model provides exactly this; a shared way of organising patient data and a shared vocabulary for describing clinical events, so that hospitals across Europe can describe the same reality in the same terms. Maxim walked participants through the main building blocks of the model and showed how they apply to ICU data, with concrete examples detailed during the session. He also presented the wider OHDSI community and European networks such as EHDEN and DARWIN EU, which already federate data on hundreds of millions of patients.

Maxim then walked participants through the full journey from raw hospital data to interoperable, OMOP-formatted data, step by step, from the initial exploration of the source system to the final validation of the mapped database. At each stage, he introduced the corresponding tools from the OHDSI ecosystem, a suite of open-source resources designed to support data providers throughout the process. He also showed how the INDICATE Data Dictionary, presented in Session 1, fits into this journey by guiding data providers on which clinical concepts to prioritise for mapping.

The session concluded with key take-home messages on the importance of clear mapping specifications, vocabulary alignment, and the value of a shared data model for enabling collaborative research across institutions and countries.

Overall, the training provided participants with both a conceptual and practical understanding of how the OMOP CDM and the surrounding OHDSI ecosystem support interoperable and scalable health data research within INDICATE.

The next training will focus on the ETL Workflow, data preparation requirements, and data quality expectations and is planned on May 7 2026. 

Read more about the first training.

INDICATE Training Programme – Legal Framework

In order to support all consortium members in using the INDICATE infrastructure effectively, correctly, and securely, we are organizing a three-session series of the INDICATE Training Programme on Legal Framework, running in parallel with and complementing the ongoing Data Models sessions. 

The programme will give participants a comprehensive understanding of the INDICATE legal framework, covering GDPR and EHDS principles, data protection and privacy-enhancing technologies, governance and rulebook structures, and practical skills to navigate data access processes, compliance requirements, and organizational implementation challenges within INDICATE.

Session dates

All sessions will be held from 14.00 – 16.00 (CEST) via Zoom.

  • May 4 – Session 1 | Understanding GDPR
  • June 24 – Session 2 | Understanding and using Data Access
  • September 10 – Session 3 |  Understanding the Rulebook and legal onboarding steps

Vacancy: Statistician / Applied Mathematician (INDICATE Project)

Position Overview

AP-HP Assistance publique – Hôpitaux de Paris, a valued partner for the INDICATE project, is seeking a highly motivated Statistician / Applied Mathematician / Data Scientist to contribute to the development and validation of predictive models of organ failure in critically ill patients. The position is part of the European INDICATE project and focuses on translational research at the interface between medicine, statistics, and artificial intelligence.

Scientific Scope

INDICATE focuses on predicting major organ failures in ICU patients using multimodal data (clinical, biological, and high-frequency physiological signals). The goal is to identify early predictive signatures of organ dysfunction (renal, respiratory and cardiovascular) and support personalized decision-making in critical care.

Methodological Framework

The candidate will implement and validate advanced statistical and machine learning models, including supervised learning, time-series modeling, and trajectory analysis. Key aspects include feature engineering from high-frequency data, handling missing data, model calibration and discrimination assessment, and external validation when available.

Required skills

  • Strong background in statistics, applied mathematics, or data science
  • Experience in predictive modeling and machine learning
  • Programming skills: Python (mandatory), SQL; Java/C++ is a plus
  • Interest in biomedical applications and clinical data

Contract and Conditions

  • Fixed-term contract (18 months)
  • Full-time (100%)
  • Location: INSERM U942, Paris (AP-HP / Université Paris Cité)
  • English required; French not mandatory

Application process

To apply for this position, please send your CV and motivational letter to contact Dr. Benjamin Deniau via benjamin.deniau@aphp.fr and Ms. Fatima Zunara via fatima.zunara@aphp.fr.

Dr. Benjamin Deniau
benjamin.deniau@aphp.fr

Fatima Zunara
fatima.zunara@aphp.fr

INDICATE Training session on Onboarding & Data Model: Unlocking ICU data across Europe without moving patient data

This week marked the first session of the INDICATE Training Programme, designed to support data providers in using the INDICATE infrastructure effectively, securely, and in a fully standardized way.

The session, guided by moderator Maxim Moinat (Data Engineer, Erasmus MC) and co-moderated by Maarten Ligtenberg (Co-founder Cradeq), provided a solid introduction to key building blocks on the INDICATE onboarding framework, interoperability in complex healthcare data environments and federated data infrastructure and secure data sharing principles.

Jan van den Brand (technical lead INDICATE) highlighted key challenges in ICU clinical decision-making and innovation, driven by fragmented data, a lack of standardized data-sharing agreements, and limited secure infrastructure. He illustrated this using a metaphor: hospitals today resemble a house with different types of power sockets, where every device requires its own adapter to function.

In this analogy, medical and AI software represent the appliances, while hospital systems such as electronic health records and laboratory databases represent the power sources. Without a shared standard, hospitals are often forced to build and maintain these “adapters” themselves, increasing complexity, cost, and operational risk. This underlines the need for shared standards and interoperable data models.

A central theme, introduced by our presenter Boris Delange (Doctor in Medical Informatics, Université de Rennes), was the reality of hospital data: each institution often uses its own “language” to describe the same clinical concepts. This creates significant challenges for interoperability and data integration, while also highlighting the importance of standardization for enabling meaningful reuse of healthcare data in research and innovation. 

Boris also addressed the broader context of Hospital Information Systems and Clinical Data Warehouses, focusing on challenges related to data quality, semantic alignment, and making heterogeneous data usable beyond clinical care. Despite its value, a large proportion of hospital data (97%!) remains underutilized for research purposes.

INDICATE addresses this challenge by developing a federated data infrastructure, where data remains securely stored within its original institution (the data never leaves the hospital) while becoming interoperable and accessible for analysis across organisations  through shared standards.

The training programme consists of five sessions. The next session will take place on April 9, 14:00–16:00 CEST.

The training sessions are organised by Maarten Ligtenberg, Melania Istrate, Elisa Vera, Jan van den Brand, Aliza Bos, Maaike van Zuilen, and Irene Gebuis, a collaboration between Work Packages 1 and 5 and the INDICATE Training and Education Workgroup.

Moving from Vision to Implementation in federated ICU Research

We’re gathered in Brussels for the INDICATE Design Workshop – a two-day event bringing together hospitals, technical experts, clinicians and communication advisors from across Europe. INDICATE is building a federated data infrastructure for intensive care data. That means: collaborating on better care and research, without patient data ever leaving a single hospital.

Day 1 ‘Understanding the data provider journey’

  • Jan van den Brand walked us through the INDICATE mission – why federated ICU data matters for European healthcare and also through the Onboarding Blueprint – the journey from commitment to production
  • Bert Cappelle gave a detailed and inspiring live demo on how to conduct a study with federated data within the INDICATE platform

The afternoon was hands-on: data providers and guests worked through a stakeholder mapping exercise, identifying who is needed to implement INDICATE within a hospital and whether those stakeholders can actually be named today. This was followed by a data provider gap analysis focused on for example identity and access management.

This year we are making the shift from concept to implementation in real hospital environments. That takes collaboration, honesty about challenges, and a willingness to learn from each other. And that’s what we did today!

Day 2 ‘What does it actually take to onboard a hospital as a data provider?’

During day 2 of the INDICATE Design Workshop we spent the day working through what it actually takes to onboard a hospital as a data provider: not in theory, but in practice. 

What does the onboarding journey look like? What do data providers need to implement INDICATE in their organisation, who to get involved? During the day we challenged ourselves to rethink the onboarding process and follow up steps from a data provider perspective – At the end of the day, all attending data providers had developed an actionable implementation roadmap for the next three months. 

One principle that kept coming back: patient data never leaves the hospital. That’s not just a technical design choice, it’s the foundation of trust that makes the federated data network possible.

Next meeting? On Monday March 30 the INDICATE Training Programme on Data Enablement & Data Model will start! We look forward to welcoming you to the session!

Thanks to all who actively participated in person and online!

Celia Alvarez-Romero (Servicio Andaluz de Salud), Marcel Giemsa (Universitätsklinikum Düsseldorf), Bert Cappelle (UZ Gent), Christian Jung (Universitätsklinikum Düsseldorf), Kirsten Colpaert (UZ Gent), Maurizio Cecconi (ESICM), Anouk Kruiswijk (KPMG), Maaike van Zuilen (Erasmus MC – philogirl), Maarten Ligtenberg (Cradeq), Daniel Laxar (Medical University of Vienna), Maria Theodorakopoulou (Hellenic Society of Intensive Care Medicine (HSICM), Maurice Walny (Charité – Universitätsmedizin Berlin), Joost Schotsman (UMC Utrecht), Rachit Gupta (KPMG), Maxim Moinat (Erasmus MC), Irene Gebuis (philogirl), Alexander Lang, Nils Woge, Kai Marten Vogl, Daniel Wetzler, Lorenz Kapral, Natalja Zilinski.

INDICATE Round Table: Federated Infrastructures in Medical Research

On 5 March 2026, INDICATE hosted a hybrid round table in Düsseldorf, bringing together experts from across Europe for an active exchange between clinicians, researchers, and technical experts working on data-driven healthcare. The day was moderated by INDICATE PI’s Michel van Genderen from Erasmus MC and Christian Jung from Universitätsklinikum Düsseldorf.

The event featured presentations from leading initiatives, including:

Each project shared its objectives, current progress, and key lessons learned. 

Key topics included:

  • Building federated platforms for cross-hospital research
  • Ensuring secure, privacy-by-design data sharing
  • Governance and legal frameworks for multi-institutional studies
  • Interoperability and data harmonization across systems

Insights

Federated infrastructures are increasingly viable, but challenges remain around data standards, governance, and coordination. Synthetic data environments and shared governance models are helping build trust while enabling large-scale research and AI development.

The round table was a valuable forum for sharing experiences, identifying challenges, and exploring future European collaborations in health data research.