Skip to content

edsnlp.pipes.core.normalizer.quotes.factory

create_component = registry.factory.register('eds.quotes', assigns=['token.norm'], deprecated=['quotes'])(QuotesConverter) module-attribute

We normalise quotes, following this source <https://www.cl.cam.ac.uk/~mgk25/ucs/quotes.html>_.

Parameters

PARAMETER DESCRIPTION
nlp

The pipeline object.

TYPE: Optional[PipelineProtocol] DEFAULT: None

name

The component name.

TYPE: Optional[str] DEFAULT: 'spaces'

quotes

List of quotation characters and their transcription.

TYPE: List[Tuple[str, str]] DEFAULT: [('"〃ײ᳓″״‶˶ʺ“”˝‟', '"'), ('`΄'ˈˊᑊˋꞌᛌ𖽒𖽑‘’י՚‛՝``′...