edsnlp.pipelines.ner.scores.tnm
patterns
modifier_pattern = '(?P<modifier>[cpyraum])'
module-attribute
tumour_pattern = 't\\s?(?P<tumour>([0-4o]|is|x))x?'
module-attribute
node_pattern = 'n\\s?(?P<node>[0-3o]|x)x?'
module-attribute
metastasis_pattern = 'm\\s?(?P<metastasis>[01o]|x)x?'
module-attribute
version_pattern = '\\(?(?P<version>uicc|accj|tnm)\\s+([ée]ditions|[ée]d\\.?)?\\s*(?P<version_year>\\d{4}|\\d{2})\\)?'
module-attribute
spacer = '(.|\\n){1,5}'
module-attribute
tnm_pattern = '(?<={version_pattern}{spacer})?'
module-attribute
models
TnmEnum
Bases: Enum
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
7 8 9 |
|
__str__()
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
8 9 |
|
Unknown
Bases: TnmEnum
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
12 13 |
|
unknown = 'x'
class-attribute
Modifier
Bases: TnmEnum
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
16 17 18 19 20 21 22 23 |
|
clinical = 'c'
class-attribute
histopathology = 'p'
class-attribute
neoadjuvant_therapy = 'y'
class-attribute
recurrent = 'r'
class-attribute
autopsy = 'a'
class-attribute
ultrasonography = 'u'
class-attribute
multifocal = 'm'
class-attribute
Tumour
Bases: TnmEnum
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
26 27 28 |
|
unknown = 'x'
class-attribute
in_situ = 'is'
class-attribute
TNM
Bases: BaseModel
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 |
|
modifier: Optional[Union[int, Modifier]] = None
class-attribute
tumour: Optional[Union[int, Tumour]] = None
class-attribute
node: Optional[Union[int, Unknown]] = None
class-attribute
metastasis: Optional[Union[int, Unknown]] = None
class-attribute
version: Optional[str] = None
class-attribute
version_year: Optional[int] = None
class-attribute
coerce_o(v)
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
41 42 43 44 45 |
|
validate_year(v)
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
47 48 49 50 51 52 53 54 55 56 57 |
|
norm()
Source code in edsnlp/pipelines/ner/scores/tnm/models.py
59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 |
|
factory
DEFAULT_CONFIG = dict(pattern=None, attr='LOWER')
module-attribute
create_component(nlp, name, pattern, attr)
Source code in edsnlp/pipelines/ner/scores/tnm/factory.py
13 14 15 16 17 18 19 20 21 22 23 24 |
|
tnm
eds.tnm
pipeline.
PERIOD_PROXIMITY_THRESHOLD = 3
module-attribute
TNM
Bases: BaseComponent
Tags and normalizes TNM mentions.
PARAMETER | DESCRIPTION |
---|---|
nlp |
Language pipeline object
TYPE:
|
pattern |
List of regular expressions for TNM mentions.
TYPE:
|
attr |
spaCy attribute to use
TYPE:
|
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 |
|
nlp = nlp
instance-attribute
regex_matcher = RegexMatcher(attr=attr, alignment_mode='strict')
instance-attribute
__init__(nlp, pattern, attr)
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 |
|
set_extensions()
Set extensions for the dates pipeline.
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
51 52 53 54 55 56 57 58 |
|
process(doc)
Find TNM mentions in doc.
PARAMETER | DESCRIPTION |
---|---|
doc |
spaCy Doc object
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
spans
|
list of tnm spans |
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 |
|
parse(spans)
Parse dates using the groupdict returned by the matcher.
PARAMETER | DESCRIPTION |
---|---|
spans |
List of tuples containing the spans and groupdict returned by the matcher.
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
List[Span]
|
List of processed spans, with the date parsed. |
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 |
|
__call__(doc)
Tags TNM mentions.
PARAMETER | DESCRIPTION |
---|---|
doc |
spaCy Doc object
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
doc
|
spaCy Doc object, annotated for TNM
TYPE:
|
Source code in edsnlp/pipelines/ner/scores/tnm/tnm.py
108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 |
|