Skip to content

Tutorials

We provide step-by-step guides to get you started. We cover the following use-cases:

Rationale

In a typical medical NLP pipeline, a group of clinicians would define a list of synonyms for a given concept of interest (say, for example, diabetes), and look for that terminology in a corpus of documents.

Now, consider the following example:

Le patient n'est pas diabétique.
Le patient est peut-être diabétique.
Le père du patient est diabétique.
The patient is not diabetic.
The patient could be diabetic.
The patient's father is diabetic.

There is an obvious problem: none of these examples should lead us to include this particular patient into the cohort.

Warning

We show an English example just to explain the issue. EDS-NLP remains a French-language medical NLP library.

To curb this issue, EDS-NLP proposes rule-based pipelines that qualify entities to help the user make an informed decision about which patient should be included in a real-world data cohort.

To sum up, a typical medical NLP project consists in:

  1. Editing a terminology
  2. "Matching" this terminology on a corpus, ie extract phrases that belong to that terminology
  3. "Qualifying" entities to avoid false positives

Once the pipeline is ready, we need to deploy it efficiently.

Back to top