minimel.mentions module
Find entity mentions
- minimel.mentions.get_matches(matcher, text, offsets=True, stem=None)
- minimel.mentions.setup_matcher(countfile, names=None)
- minimel.mentions.spacy_matcher(model)
- minimel.mentions.count_name_lines(lines, countfile, stem=None, head=None)
- minimel.mentions.count_names(paragraphlinks: Path, countfile: Path, *, outfile: Path | None = None, stem: str | None = None, head: int | None = None)
Count anchor texts in Wikipedia paragraphs.
- Parameters:
- Keyword Arguments:
outfile – Output file or directory (default: name{countfile}[.stem-{LANG}].json)
stem – ISO 639-1 (2-letter) code of the stemming language
head – Use only the first N lines of each partition
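The sketch below illustrates the general idea of counting anchor texts and the effect of a `head`-style limit. It is not minimel's implementation: the function name `count_anchor_texts`, the tab-separated "anchor text, link target" line format, and the target labels are all assumptions made for illustration, and the real input format of `paragraphlinks` may differ.

```python
from collections import Counter
from itertools import islice


def count_anchor_texts(lines, head=None):
    """Count how often each anchor text links to each target.

    Hypothetical input: one "anchor<TAB>target" pair per line.
    If head is given, only the first `head` lines are read.
    """
    counts = {}
    # islice(lines, None) reads the whole iterable; islice(lines, N) stops after N lines
    for line in islice(lines, head):
        anchor, target = line.rstrip("\n").split("\t")
        counts.setdefault(anchor, Counter())[target] += 1
    return counts


# Made-up example data; the target labels are placeholders, not real identifiers
example_lines = [
    "Paris\tID_A",
    "Paris\tID_A",
    "Paris\tID_B",
    "Berlin\tID_C",
]
all_counts = count_anchor_texts(example_lines)
first_two = count_anchor_texts(example_lines, head=2)
```

Here `all_counts` holds per-target counts for every anchor text, while `first_two` reflects only the first two lines, which is roughly what a `head` limit does when sampling a large partition.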