edsnlp.pipes.base
BaseComponent
The BaseComponent adds a set_extensions method, called at the creation of the object.
It helps decouple the initialisation of the pipeline from the creation of extensions, and is particularly useful when distributing EDSNLP on a cluster, since the serialisation mechanism requires that the extensions be reset.
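The following is a minimal, library-free sketch of why re-running set_extensions matters after serialisation. It is not the EDSNLP implementation: the EXTENSIONS dictionary is a hypothetical stand-in for the process-wide extension registry, SketchComponent is an illustrative class, and the use of __setstate__ as the re-registration hook is an assumption.

```python
# Hypothetical stand-in for a process-wide registry of custom attributes.
# On a cluster, each worker process starts with its own empty registry.
EXTENSIONS = {}


class SketchComponent:
    """Illustrative component, not the real BaseComponent."""

    def __init__(self, name):
        self.name = name
        # Extensions are declared when the object is created.
        self.set_extensions()

    def set_extensions(self):
        # Idempotent registration: calling it twice is harmless.
        EXTENSIONS.setdefault(self.name, None)

    def __setstate__(self, state):
        # Assumed hook: restore attributes, then re-declare extensions,
        # because __init__ is not called when an object is deserialised.
        self.__dict__.update(state)
        self.set_extensions()


component = SketchComponent("score")
state = dict(component.__dict__)

# Simulate a fresh worker process: the registry is empty there.
EXTENSIONS.clear()

# Mimic the receiving side of deserialisation: build the object without
# calling __init__, then restore its state.
restored = SketchComponent.__new__(SketchComponent)
restored.__setstate__(state)

registered_after_restore = "score" in EXTENSIONS
```

Keeping the registration in a dedicated method means the same code path serves both object creation and restoration on a worker.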
set_extensions
Set Doc, Span and Token extensions.
get_spans
Returns sorted spans of interest, depending on the value of on_ents_only. Includes doc.ents by default, and adds any applicable SpanGroups.
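As a rough illustration of the behaviour described above, the sketch below merges entities with optional span groups and sorts the result. It is written under stated assumptions rather than taken from the library: spans are modelled as plain (start, end, label) tuples instead of spaCy objects, get_spans_sketch is a hypothetical function, and the reading of on_ents_only (a boolean keeps entities only, a string or list of strings names extra span groups to include) is assumed.

```python
from typing import Dict, Iterable, List, Tuple, Union

# Stand-in for a span: (start token index, end token index, label).
SpanTuple = Tuple[int, int, str]


def get_spans_sketch(
    ents: Iterable[SpanTuple],
    span_groups: Dict[str, List[SpanTuple]],
    on_ents_only: Union[bool, str, Iterable[str]] = True,
) -> List[SpanTuple]:
    # Entities are always included.
    spans = set(ents)

    # Assumption: a single string names one span group.
    if isinstance(on_ents_only, str):
        on_ents_only = [on_ents_only]

    # Assumption: a collection of keys adds the matching span groups,
    # silently skipping keys that do not exist.
    if not isinstance(on_ents_only, bool):
        for key in on_ents_only:
            spans.update(span_groups.get(key, []))

    # The set removes duplicates; sort by start, then end.
    return sorted(spans, key=lambda span: (span[0], span[1]))


ents = [(5, 7, "disease"), (0, 2, "drug")]
groups = {"dates": [(3, 4, "date"), (0, 2, "drug")]}

default_spans = get_spans_sketch(ents, groups)
with_dates = get_spans_sketch(ents, groups, on_ents_only="dates")
```

Deduplicating before sorting avoids processing a span twice when it appears both in the entities and in a span group.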