edsnlp.processing.simple
nlp = spacy.blank('eds')
module-attribute
ExtensionSchema = Union[str, List[str], Dict[str, Any]]
module-attribute
pipe(note, nlp, context=[], results_extractor=None, additional_spans=[], extensions=[], batch_size=1000, progress_bar=True)
Function to apply a spaCy pipe to a pandas DataFrame note For a large DataFrame, prefer the parallel version.
PARAMETER | DESCRIPTION |
---|---|
note |
A pandas DataFrame with a
TYPE:
|
nlp |
A spaCy pipe
TYPE:
|
context |
A list of column to add to the generated SpaCy document as an extension.
For instance, if
TYPE:
|
additional_spans |
A name (or list of names) of SpanGroup on which to apply the pipe too:
SpanGroup are available as
TYPE:
|
extensions |
Spans extensions to add to the extracted results:
For instance, if
TYPE:
|
batch_size |
Batch size used by spaCy's pipe
TYPE:
|
progress_bar |
Whether to display a progress bar or not
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
DataFrame
|
A pandas DataFrame with one line per extraction |
Source code in edsnlp/processing/simple.py
182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 |
|