edsnlp.pipelines.core.contextual_matcher.factory

create_component(nlp, name, patterns, assign_as_span, alignment_mode, attr, ignore_excluded, ignore_space_tokens, regex_flags, include_assigned)

Creates a ContextualMatcher pipe, which performs additional matching in the context surrounding the main match group in order to qualify or filter it.

PARAMETER DESCRIPTION
nlp

spaCy Language object.

TYPE: Language

name

The name of the pipe.

TYPE: str

patterns

The configuration dictionary, or a list of such dictionaries.

TYPE: Union[Dict[str, Any], List[Dict[str, Any]]]

assign_as_span

Whether to store any extractions defined via the assign key as Spans or as strings.

TYPE: bool

attr

Attribute to match on, e.g. TEXT or NORM.

TYPE: str

ignore_excluded

Whether to skip excluded tokens during matching.

TYPE: bool

ignore_space_tokens

Whether to skip space tokens during matching.

TYPE: bool

alignment_mode

Overrides the alignment mode.

TYPE: str

regex_flags

RegExp flags to use when matching, filtering and assigning (see https://docs.python.org/3/library/re.html#flags).

TYPE: Union[re.RegexFlag, int]

include_assigned

Whether to include any assign matches in the final entity.

TYPE: bool
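
The regex_flags parameter accepts either an re.RegexFlag or a plain int. The sketch below uses only the standard library to illustrate why the two forms are interchangeable; the pattern and sample text are made up for illustration and are not taken from the library.

```python
import re

# Flags combine with bitwise OR; the result is still an re.RegexFlag.
flags = re.IGNORECASE | re.MULTILINE

# re.RegexFlag is an int subclass, so the plain integer value is equivalent.
flags_as_int = int(flags)

# Illustrative pattern and text (assumptions, not part of the library).
pattern_from_flag = re.compile(r"^stage\s+(\d)", flags)
pattern_from_int = re.compile(r"^stage\s+(\d)", flags_as_int)

text = "Diagnosis\nStage 2 tumour"
match_flag = pattern_from_flag.search(text)
match_int = pattern_from_int.search(text)

print(match_flag.group(1), match_int.group(1))
```

Either value could presumably be supplied as regex_flags, since the annotation is Union[re.RegexFlag, int].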

Source code in edsnlp/pipelines/core/contextual_matcher/factory.py
@deprecated_factory(
    "contextual-matcher", "eds.contextual-matcher", default_config=DEFAULT_CONFIG
)
@Language.factory("eds.contextual-matcher", default_config=DEFAULT_CONFIG)
def create_component(
    nlp: Language,
    name: str,
    patterns: Union[Dict[str, Any], List[Dict[str, Any]]],
    assign_as_span: bool,
    alignment_mode: str,
    attr: str,
    ignore_excluded: bool,
    ignore_space_tokens: bool,
    regex_flags: Union[re.RegexFlag, int],
    include_assigned: bool,
):
    """
    Allows additional matching in the surrounding context of the main match group,
    for qualification/filtering.

    Parameters
    ----------
    nlp : Language
        spaCy `Language` object.
    name : str
        The name of the pipe.
    patterns : Union[Dict[str, Any], List[Dict[str, Any]]]
        The configuration dictionary, or a list of such dictionaries.
    assign_as_span : bool
        Whether to store any extractions defined via the `assign` key as Spans
        or as strings.
    attr : str
        Attribute to match on, e.g. `TEXT` or `NORM`.
    ignore_excluded : bool
        Whether to skip excluded tokens during matching.
    ignore_space_tokens : bool
        Whether to skip space tokens during matching.
    alignment_mode : str
        Overrides the alignment mode.
    regex_flags : Union[re.RegexFlag, int]
        RegExp flags to use when matching, filtering and assigning (see the
        [`re` flags](https://docs.python.org/3/library/re.html#flags)).
    include_assigned : bool
        Whether to include any assign matches in the final entity.

    """

    return ContextualMatcher(
        nlp,
        name,
        patterns,
        assign_as_span,
        alignment_mode,
        attr=attr,
        ignore_excluded=ignore_excluded,
        ignore_space_tokens=ignore_space_tokens,
        regex_flags=regex_flags,
        include_assigned=include_assigned,
    )
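
Because the factory is registered as "eds.contextual-matcher", its keyword arguments would normally be supplied through the pipe configuration rather than by calling create_component directly. The following is a minimal sketch of assembling such a configuration. The helper as_pattern_list is hypothetical, the pattern keys ("source", "regex") and all values are illustrative assumptions, and the real defaults live in DEFAULT_CONFIG, which is not shown here.

```python
import re
from typing import Any, Dict, List, Union

PatternConfig = Union[Dict[str, Any], List[Dict[str, Any]]]


def as_pattern_list(patterns: PatternConfig) -> List[Dict[str, Any]]:
    """Hypothetical helper: wrap a single pattern dictionary in a list."""
    return [patterns] if isinstance(patterns, dict) else list(patterns)


# Illustrative pattern; the key names are assumptions about the expected schema.
pattern = {"source": "cancer", "regex": [r"cancer", r"tumou?r"]}

# One entry per factory argument other than `nlp` and `name`.
# All values are placeholders chosen for the example.
config = {
    "patterns": as_pattern_list(pattern),
    "assign_as_span": False,
    "alignment_mode": "expand",
    "attr": "NORM",
    "ignore_excluded": False,
    "ignore_space_tokens": False,
    "regex_flags": int(re.IGNORECASE),
    "include_assigned": False,
}

# With spaCy and edsnlp installed, the pipe would then be added along the lines of:
#     nlp.add_pipe("eds.contextual-matcher", config=config)
print(sorted(config))
```

Normalising patterns to a list is optional here, since the factory signature accepts either form; it simply keeps downstream handling of the configuration uniform.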