Skip to content

edsnlp.pipes.core.normalizer.accents.factory

create_component = registry.factory.register('eds.accents', assigns=['token.norm'], deprecated=['accents'])(AccentsConverter) module-attribute [source]

Normalises accents, using a same-length strategy.

Parameters

PARAMETER DESCRIPTION
nlp

The pipeline object.

TYPE: Optional[PipelineProtocol]

name

The component name.

TYPE: Optional[str]

accents

List of accentuated characters and their transcription.

TYPE: List[Tuple[str, str]] DEFAULT: [('ç', 'c'), ('àáâä', 'a'), ('èéêë', 'e'), ('ìí...