minimel.get_disambig module
Extract list-hyperlinks from Wikipedia disambiguation pages
- minimel.get_disambig.writer(fn)
- minimel.get_disambig.query_pages(langcode: str, *, query_listpages: bool = False, outfile: Path | None = None)
Query the Wikidata API to get disambiguation (& list pages if indicated)
Returns Wikidata Qids, one per line
- minimel.get_disambig.get_list_links(page, disambig_template=None)
- minimel.get_disambig.get_disambig_links(lines, dawgfile, disambig_ent_file=None, disambig_template=None)
- minimel.get_disambig.get_disambig(wikidump: Path, dawgfile: Path, disambig_ent_file: Path | None = None, *, disambig_template: str | None = None, nparts: int = 1000)
Get disambiguation links.
Writes disambig.json.
- Parameters:
- Keyword Arguments:
nparts – Number of chunks to read
disambig_template – Use disambiguation pages that contain a template with this name instead of disambig_ent_file (if disambig_ent_file is provided, create it)