Identification and enablement of existing health data for AI solutions
Service Description
Overview
Full-service data management: identifying and acquiring existing datasets.
This service identifies datasets and makes them available to SMEs for TEF project purposes, such as validation and testing of AI systems. Data can be collected from routine health records and from local, regional, and national databases, in collaboration with healthcare providers in Sweden.
We provide expertise and technical support in the following areas:
- Project design & management
- Ethical application support
- Data assembly
- Data cleaning, annotation, and anonymisation
How can the service help you?
Access to existing health data can help you validate your AI solution without needing to generate new data. We provide access to high-quality datasets for validating and testing your AI system, which facilitates placing it on the market and helps increase its market readiness level. The data can be used for independent performance evaluation of your AI system in a real-world or simulated environment.
How will the service be delivered?
The service will be delivered according to established ethical agreements and guidelines, in collaboration with the SME and researchers from the Swedish TEF-Health node. Data usage is facilitated through a secure virtual environment managed by Karolinska Institutet, ensuring the highest standards of data protection.
Additional information
Provider description
The Swedish TEF-Health node is a collaboration between Karolinska Institutet, SciLifeLab and RISE, and is led by Karolinska Institutet. Together, we offer world-leading services with our unique collection of core facilities. We provide expert consulting and virtual and physical testing across in vivo imaging, ex vivo omics, pharmaceutical development, simulated healthcare environments, AI-system validation and development, advanced data analysis, and other data-driven life science.
Technical description
Data access and usage are facilitated through a secure environment managed by Karolinska Institutet. Datasets will be identified and made available to suit your AI system and to support the testing and validation of your solution. The data will be tailored to your specific request to ensure compatibility with your AI system, with full support from Karolinska Institutet to ensure compliance with all relevant regulations. Upon completion of this service, you will receive an evaluation of the performance of your AI system, which can support placing your solution on the EU market.
Service customization
The service can be customised according to your specific needs, taking into account which type of data you require.
Use case example
Context
A biotech SME is developing an AI-based system designed to improve cancer diagnosis, treatment planning, and prognosis prediction. To gain regulatory approval and facilitate market entry, the SME requires access to diverse, high-quality datasets, including cancer imaging, electronic health records (EHR), and multi-omics data, for independent performance evaluation.
Objective
The goal is to validate the AI system's ability to analyse complex datasets and deliver accurate, reliable outputs in simulated and real-world scenarios.
Solution
The service "Identification and enablement of existing health data for AI solutions", provided by the Swedish TEF-Health node in collaboration with Karolinska Institutet, offers ethically sourced datasets tailored to the SME’s needs.
Implementation
Ethical Agreement
The SME enters into an ethical agreement with researchers from the Swedish TEF-Health node, ensuring all data collection and usage comply with GDPR and national Swedish regulations.
Data Collection
- Cancer Imaging Data: Retrospective datasets from imaging modalities like CT, MRI, and PET scans, annotated for various cancer types and stages.
- EHR Data: Pseudonymised clinical data, including patient histories, treatment outcomes, and longitudinal follow-up records.
- Omics Data: Retrospective genomic, transcriptomic, and proteomic datasets linked to cancer cases.
Secure Access
Usage of the collected data is facilitated through a secure virtual environment managed by Karolinska Institutet, ensuring the highest standards of data protection.
Validation and Testing
Through a collaboration between Karolinska Institutet and the SME, the dataset is used to test the AI system's ability to:
- Detect cancer accurately across imaging modalities.
- Predict treatment responses and outcomes using combined EHR and omics data.
- Identify biomarkers associated with different cancer subtypes and prognostic outcomes.
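As an illustration of what an independent performance evaluation of the detection task might compute, the following is a minimal sketch only. The service description does not specify which metrics are reported, so the choice of sensitivity, specificity, and accuracy is an assumption, as are the names `BinaryMetrics` and `evaluate_binary` and the convention that label 1 means "cancer present". The labels in the usage example are made up for demonstration and are not real data.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class BinaryMetrics:
    """Hypothetical container for basic detection metrics."""
    sensitivity: float  # share of true cancer cases that were detected
    specificity: float  # share of cancer-free cases correctly cleared
    accuracy: float     # share of all cases classified correctly


def evaluate_binary(y_true: Sequence[int], y_pred: Sequence[int]) -> BinaryMetrics:
    """Compare reference labels with model predictions (1 = cancer, 0 = no cancer)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one case is required")

    # Count the four cells of the confusion matrix.
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    # Guard against division by zero when a class is absent from the data.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    accuracy = (tp + tn) / len(pairs)
    return BinaryMetrics(sensitivity, specificity, accuracy)


if __name__ == "__main__":
    # Toy labels for demonstration only.
    reference = [1, 1, 1, 0, 0, 0, 0, 1]
    predicted = [1, 0, 1, 0, 0, 1, 0, 1]
    print(evaluate_binary(reference, predicted))
```

In practice, an evaluation of this kind would likely be run separately per imaging modality and cancer type, and would use whichever metrics are agreed between the SME and the Swedish TEF-Health node.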
Outcome
Following validation and testing, the AI system's robustness, accuracy, and reliability can be demonstrated, supporting regulatory approval and increasing confidence among healthcare providers and stakeholders.
Benefits
- Comprehensive Dataset: Integration of imaging, EHR, and omics data ensures the AI system is tested across real-world complexities.
- Ethical and Secure: Compliance with ethical guidelines builds trust and supports regulatory approval.
- Accelerated Innovation: Access to retrospective data saves time, allowing the SME to focus on model optimisation and deployment.
Impact
The validated AI system enhances early cancer detection and personalised treatment planning, improving patient outcomes while streamlining healthcare workflows.
Provider & Contact
Pricing is available to registered users. SMEs receive significant state-aid reductions (GBER) or, depending on the call, free services during the funded project. Sign in or register to see the price for your organisation.