Small datasets for testing features
Presentation
eds-scikit was built to work seamlessly on a pre-existing OMOP database. However, the library also provides some toy datasets so that you can try out some features without access to a database.
Usage
First, you can display all available synthetic datasets:
from eds_scikit import datasets
datasets.list_all_synthetics()
# Out: ['load_ccam', 'load_consultation_dates', 'load_hierarchy', 'load_icd10', 'load_visit_merging', 'load_stay_duration', 'load_suicide_attempt', 'load_tagging', 'load_biology_data', 'load_event_sequences']
To load a specific dataset, simply run:
data = datasets.load_icd10()
data
# Out: ICD10Dataset(condition_occurrence, visit_occurrence)
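The names returned by list_all_synthetics() appear to match the loader functions exposed on the datasets module (for example 'load_icd10' and datasets.load_icd10()), so a dataset can presumably also be loaded from its name. The following is a minimal sketch under that assumption: load_by_name is a hypothetical helper, not part of eds-scikit, and a stand-in module is used so the snippet runs without the library installed.

```python
from types import SimpleNamespace


def load_by_name(module, name):
    """Call the loader called `name` on `module` and return its result.

    Hypothetical helper, not part of eds-scikit. It assumes every listed
    dataset name is a callable attribute of the module passed in.
    """
    if not name.startswith("load_"):
        raise ValueError(f"not a loader name: {name!r}")
    loader = getattr(module, name, None)
    if not callable(loader):
        raise AttributeError(f"{name!r} is not a callable loader on this module")
    return loader()


# Stand-in module so the sketch runs without eds-scikit installed.
fake_datasets = SimpleNamespace(load_icd10=lambda: "ICD10Dataset(...)")
result = load_by_name(fake_datasets, "load_icd10")

# With the real library (assuming it is installed):
# from eds_scikit import datasets
# data = load_by_name(datasets, "load_icd10")
```

Looking the loader up with getattr keeps the helper independent of the exact list of datasets, which may change between versions.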
The data object is similar to the objects available in eds_scikit.io. For instance, tables are available as attributes:
data.condition_occurrence
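Since tables are plain attributes, they can also be looked up by name, for instance to get a quick overview of a dataset. The sketch below assumes each table exposes a pandas-style shape attribute, which is not confirmed here. table_shapes is a hypothetical helper, and the stand-in objects and their shapes are made up so that the snippet runs without eds-scikit.

```python
from types import SimpleNamespace


def table_shapes(data, table_names):
    """Map each table name to the `shape` of the matching attribute of `data`.

    Hypothetical helper; assumes every table has a `.shape` attribute,
    as pandas DataFrames do.
    """
    return {name: getattr(data, name).shape for name in table_names}


# Stand-in dataset: the table names follow the ICD10Dataset repr,
# the shapes are invented for illustration.
fake_data = SimpleNamespace(
    condition_occurrence=SimpleNamespace(shape=(3, 5)),
    visit_occurrence=SimpleNamespace(shape=(2, 4)),
)
shapes = table_shapes(fake_data, ["condition_occurrence", "visit_occurrence"])
```

With a real dataset, the same call would take the loaded object in place of fake_data.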
As shown in the tutorial, you can now try out the corresponding conditions_from_icd10() function.