Skip to content

Visualization

This library provides a visualization tool that aggregates the data to provide two interactive dashboards and a summary table describing various statistical properties of your biological data.

Input

It expects a Data object containing the following OMOP format tables:

Measurement

The function can be used on raw data or on the transformed data returned by the bioclean function .

from eds_scikit.io import HiveData
from eds_scikit.biology import plot_biology_summary

db_name = "cse_xxxxxxx_xxxxxxx"
tables =  ["measurement", "concept", "concept_relationship", "visit_occurrence", "care_site"]

data = HiveData(db_name, tables_to_load=tables)

plot_biology_summary(data)

Output

plot_biology_summary() creates a folder for each concepts-set. For instance, let us see what you will find in the folder Glucose_Blood_Quantitative`.

Summary table

A statistical summary table as below:

LOINC_concept_code AnaBio_concept_code LOINC_concept_name AnaBio_concept_name unit_source_value count mean std min 25% 50% 75% max MAD max_threshold min_threshold care_site_short_name
395136991232 14749-6 A0141 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose_Serum/Plasma_mmol/L mmol/l 691358.0 6.63218 6.55476 0.0 4.9 5.69 7.16 4500.0 1.4652 11.5508 0.0 ALL
704374636544 14749-6 F2406 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose Fibromax_Serum/Plasma_mmol/L mmol/l 108.0 6.761852 3.534022 2.0 4.99 5.8 7.09 29.37 1.4504 11.6016 0.0 ALL
824633720832 14749-6 A7339 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose h?_Serum/Plasma_mmol/L mmol/l 600.0 5.690167 2.524383 0.7 4.5 5.1 6.1 24.2 1.036 9.244 0.956 ALL
893353197568 14749-6 B9553 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose_Plasma_mmol/L mmol/l 26635.0 6.731067 3.334219 1.3 4.8 5.6 7.4 66.4 1.48 11.52 0.0 ALL
1082331758592 2345-7 E7312 Glucose SerPl-mCnc Glucose_Serum/Plasma_mg/dL mg/dl 119144.0 117.097998 242.990109 3.6 84.6 99.0 124.2 81045.0 24.124 195.496 2.504 ALL
1511828488192 14749-6 A8424 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose_Plasma FNA_mmol/L mmol/l 103334.0 6.667987 3.790141 0.1 4.8 5.6 7.3 160.9 1.48 11.52 0.0 ALL
8589934592 14749-6 A0141 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose_Serum/Plasma_mmol/L mmol/l 83756.0 6.650058 3.138156 0.01 5.05 5.78 7.17 93.35 None None None CARE SITE 10
34359738368 2345-7 E7312 Glucose SerPl-mCnc Glucose_Serum/Plasma_mg/dL mg/dl 8.0 109.325 46.192261 72.0 72.0 95.4 106.0 216.0 None None None CARE SITE 2
103079215104 14749-6 A0141 Glucose [Moles/Volume] Serum/Plasma - Numerique Glucose_Serum/Plasma_mmol/L mmol/l 99030.0 6.725521 3.319078 0.2 5.0 5.8 7.3 89.9 None None None CARE SITE 11
154618822656 ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

Volumetry dashboard

An interactive dashboard describing the volumety properties over time.

An example is available here.

Distribution dashboard

An interactive dashboard describing the distribution properties.

An example is available here.