Medical Imaging AI Training Data Without Export
Medical imaging AI is blocked on data access
The bottleneck for radiology AI startups is not model architecture. It is access to diverse, high-quality DICOM imaging data from real clinical workflows. Public datasets (MIMIC, ChestX-ray14, RSNA) are useful starting points but cannot match the breadth, modality mix, and clinical variety of real hospital PACS data.
Hospitals hold millions of MRI, CT, X-ray, ultrasound, and PET-CT scans with corresponding reports. Every radiology AI company wants access. Every hospital governance team says no — correctly — because the data cannot leave the institution.
DICOM-aware compute-to-data
Rapha Protocol integrates directly with hospital DICOMweb endpoints and PACS systems. The edge appliance queries study metadata through scoped QIDO-RS requests — never broad PACS scans. Patient identifiers, StudyInstanceUIDs, and AccessionNumbers are keyed-hashed with a hospital-held HMAC key before any metadata reaches the training runtime. Raw DICOM pixels never cross the public network boundary.
The training pipeline runs locally against the DICOM data mount. The researcher receives trained model weights, validation metrics, and a cryptographic proof receipt. No DICOM files, no pixel data, no PHI-bearing metadata leaves the hospital.
Supported imaging workflows
- Classification — chest X-ray pathology detection, mammography screening, skin lesion classification.
- Segmentation — organ-at-risk contouring, tumour boundary delineation, cardiac chamber segmentation.
- Detection — lung nodule detection, breast lesion detection, brain haemorrhage identification.
- Report generation — fine-tune vision-language models on paired imaging and radiology report data.
- Quality assurance — train models to detect positioning errors, motion artefacts, and protocol deviations.
Enterprise imaging vendor compatibility
The platform includes adapters for major imaging vendors and PACS systems:
- Fujifilm Synapse — AI Orchestrator adapter with study notification routing.
- Siemens Healthineers — syngo.via and teamplay integration paths.
- GE Healthcare — Centricity PACS and Edison platform integration.
- Canon Medical — Vitrea and Aquilion integration.
- Generic DICOMweb — QIDO-RS metadata client for any DICOMweb-compliant PACS.
Production deployment requires: real hospital DICOMweb endpoints, scoped study queries, hospital-held PHI HMAC keys, OPA policy approval, SGX/DCAP + TPM attestation, and a configured enterprise-node trainer command. Demo-only clients are isolated from production clients.