edsnlp.pipelines.ner.umls.patterns
get_patterns(config)
Load the UMLS terminology patterns.
PARAMETER | DESCRIPTION |
---|---|
config | Languages and sources to select from the whole terminology. For both keys, None will select all values. |
Return
patterns : dict[list] The mapping between CUI codes and their synonyms.
Notes
When run for the first time, this function downloads the entire UMLS file and stores it at ~/.data/bio/umls/2022AA/. Subsequent runs reuse the stored file and are therefore significantly faster than the first one.
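The download-once behaviour described in this note follows a common "load from cache, otherwise build and store" pattern. The following is a minimal sketch of that pattern using only the standard library. It is not the edsnlp implementation: `load_or_build`, `expensive_build`, the cache location, and the placeholder CUI are all hypothetical names chosen for illustration.

```python
import pickle
import tempfile
from pathlib import Path


def load_or_build(cache_file: Path, build):
    """Return the cached object if the file exists, otherwise build and store it."""
    if cache_file.exists():
        with cache_file.open("rb") as f:
            return pickle.load(f)
    result = build()
    # Create the cache directory on first use, then persist the result.
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    with cache_file.open("wb") as f:
        pickle.dump(result, f)
    return result


calls = []


def expensive_build():
    """Stand-in for the slow download and aggregation step (hypothetical)."""
    calls.append(1)
    return {"C0000001": ["synonym a", "synonym b"]}  # placeholder CUI and synonyms


with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "umls" / "patterns.pkl"
    first = load_or_build(cache, expensive_build)   # builds and writes the cache
    second = load_or_build(cache, expensive_build)  # reads the cache, no rebuild
```

In this sketch the second call never invokes `expensive_build`, which is the reason a second run can be much faster than the first.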
Source code in edsnlp/pipelines/ner/umls/patterns.py
get_path(config)
Get the path, module and filename of the UMLS file.
PARAMETER | DESCRIPTION |
---|---|
config | Languages and sources to select from the whole terminology. For both keys, None will select all values. |
Return
path, module, filename : pathlib.Path, pystow.module, str
Notes
get_path will convert the config dict into a pretty filename.
Examples:
>>> config = {"languages": ["FRE", "ENG"], "sources": None}
>>> print(get_path(config))
.data/bio/umls/2022AA/languagesFRE-ENG_sourcesNone.pkl
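One way to derive such a filename from a config dict is sketched below. This is an assumption about the naming scheme inferred from the example output, not the actual body of get_path; `config_to_filename` is a hypothetical helper, and it only produces the filename part, not the full path or the pystow module.

```python
def config_to_filename(config: dict) -> str:
    """Build a readable cache filename from a config dict (hypothetical helper)."""
    parts = []
    for key, value in config.items():
        # Lists are joined with "-"; None is kept as the literal text "None".
        if isinstance(value, (list, tuple)):
            value = "-".join(value)
        parts.append(f"{key}{value}")
    return "_".join(parts) + ".pkl"


config = {"languages": ["FRE", "ENG"], "sources": None}
filename = config_to_filename(config)
print(filename)  # languagesFRE-ENG_sourcesNone.pkl
```

Because the filename encodes the selection, two different configs map to two different cache files in this sketch.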
Source code in edsnlp/pipelines/ner/umls/patterns.py
download_and_agg_umls(config)
Download the UMLS if it does not already exist locally, and create a mapping between CUI codes and their synonyms.
PARAMETER | DESCRIPTION |
---|---|
config | Languages and sources to select from the whole terminology. For both keys, None will select all values. |
Return
patterns : dict[list] The mapping between CUI codes and their synonyms.
Notes
Performs filtering on the returned mapping only, not the downloaded resource.
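To illustrate what filtering "on the returned mapping only" can look like, here is a small sketch that aggregates synonyms per CUI from in-memory rows. The row layout (CUI, language, source, synonym), the `aggregate` function, and the sample data are assumptions made for illustration, not the library's code or real UMLS content.

```python
from collections import defaultdict

# Hypothetical rows: (CUI, language, source, synonym). Illustrative data only.
ROWS = [
    ("C0000001", "ENG", "MSH", "example term"),
    ("C0000001", "FRE", "MSHFRE", "terme exemple"),
    ("C0000002", "ENG", "SNOMEDCT_US", "other term"),
]


def aggregate(rows, languages=None, sources=None):
    """Map each CUI to its synonyms, keeping only the selected languages and sources.

    A value of None for either filter selects everything, mirroring the config
    convention described above.
    """
    patterns = defaultdict(list)
    for cui, lang, source, synonym in rows:
        if languages is not None and lang not in languages:
            continue
        if sources is not None and source not in sources:
            continue
        patterns[cui].append(synonym)
    return dict(patterns)


all_patterns = aggregate(ROWS)
french_only = aggregate(ROWS, languages=["FRE"])
```

The input rows are left untouched and only the returned dict is narrowed, so the same stored resource can serve any other language or source selection later.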
Source code in edsnlp/pipelines/ner/umls/patterns.py