- Stage
-
Plan and Refine Project
- Subtopic
-
Determine compute and data storage solutions
Need assistance with compute or storage solutions?
Selecting appropriate compute and storage platforms is vital for efficiency and security, and is based on the type of data (e.g., size, format, and classification) and the researchers analytic needs. Refer to the Duke data classification standard guide for more information about what is considered Sensitive or Restricted.
Get personalized assistance through the Data Front Door
This is a service for all researchers at Duke seeking research data support services of any kind. You may also wish to work with your Departmental IT contact for local options or Research Computing for help with no-cost and custom computing solutions.
Get help with specialized datasets
There are specific processes and support for accessing, securing, and complying with requirements for various regulated or protected data sets. Support can include assistance with navigating data agreements, IRB submissions, interpreting data requirements, and determining the best storage/compute solutions. Key resources:
- dbGaP data
- Controlled Unclassified Information (CUI)
- Centers for Medicare and Medicaid Services (CMS) data
- FISMA regulated data
- For all other specialized/protected data, contact the Data Front Door
- Helpful decision tools:
Discover and compare options yourself using SecureIt
SecureIt is a tool developed by the Office of Information Technology (OIT) to determine appropriate computational and data storage solutions based on classification and other needs and requirements of research data. Researchers can compare solutions based on responses to questions about the data, and select the best option for a specific project.
Duke's Secure Enclaves
Data storage options
Environment | Contact | Fee | Information |
---|---|---|---|
Protected Analytics Computing Environment (PACE) | Request for PACE Services | Yes. (Fee schedule here) | A protected virtual network space where approved users can work with identifiable protected Duke Health patient information.
|
Duke Protected Network For Research (PNR) | Duke Research and Innovation Intake Form | No cost for the standard configuration, storage and analytical tools. Additional computation and storage resources available for a fee | Allows for the storage and analysis of sensitive and regulated research data for researchers who have appointments outside the Schools of Medicine and Nursing or who are not DUHS employees:
|
Duke Compute Cluster | rescomputing@duke.edu | No | A generalized high-performance computing (HPC) resource.
|
School of Medicine Azure Cloud Subscription | ServiceNow Request for SoM HPC Services | Yes | Allows for compute on sensitive and restricted data types, such as Protected Health Information, including Duke Health patient data.
|
Research Compute Cluster (RCC, on prem) | ServiceNow Request for SoM HPC Services | Yes (Rate sheet here) | High-performance computing (HPC) resource for use with sensitive data.
|
Duke PopHealth DataShare: PACE | DPHS Resource Request Form or contact Pophealthdatashare@duke.edu | Yes | A protected enclave within Pace.
Suitable for:
|
Data query options
Environment | Contact | Fee | Information |
---|---|---|---|
Federated Clinical Analytics Platform (FCAP) | Data Front Door | No, for explore access; Yes, for premium access | An integrated database and advanced analytics platform that aggregates multi-modal, de-identified health information from Duke Health patients.
|
Cosmos | Data Front Door | No, for Cosmos SlicerDicer level access (cohort generation); Yes, for Data Science and analysis. |
|
Truveta |
Request Truveta access (if using internal funding) Contact Truveta for access (EXTERNAL funding) | Yes; based on the amount of data required, usage and storage needed | Aggregated de-identified data from the Electronic Health Record (EHR) from 30+ health systems for research & analytics. |
Covered entity: You are in the covered entity if your position or primary appointment is in the School of Medicine, School of Nursing, or any other entity within the Duke University Affiliated Covered Entity.
Duke Health Patient Data: Any information (including genetic information) created or received by the Duke Health Enterprise regardless of form or medium (e.g., oral, written, graphic, electronic) that relates to the physical or mental health of an individual; the provision of health care services to an individual; or the payment of health care services.
Note:
All data requiring FISMA security standards goes into SOM GovCloud. None of the environments listed above can house data requiring FISMA security standards.