minimel.index module

Convert a Wikimapper index to an IntDAWG.

minimel.index.make_dawg(db_fname)
minimel.index.index(db_fname: Path)

Make an efficient DAWG trie index from a Wikimapper SQLite file.

Parameters:

db_fname (Path) – Wikimapper SQLite3 index file
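
The following is a minimal sketch of the kind of data such an index is built from, not the module's actual implementation. It assumes the Wikimapper file holds a `mapping` table with `wikipedia_title`, `wikipedia_id` and `wikidata_id` columns, and that Wikidata IDs such as `Q727` are stored as the integer `727`. The helper name `title_to_int_pairs` and the sample rows are hypothetical.

```python
import sqlite3


def title_to_int_pairs(conn):
    """Yield (title, integer id) pairs, e.g. ("Amsterdam", 727) for "Q727"."""
    # Assumed schema: mapping(wikipedia_title, wikipedia_id, wikidata_id)
    query = "SELECT wikipedia_title, wikidata_id FROM mapping"
    for title, qid in conn.execute(query):
        # Skip rows without a well-formed Wikidata ID
        if qid and qid.startswith("Q") and qid[1:].isdigit():
            yield title, int(qid[1:])


# Tiny in-memory stand-in for a Wikimapper index file (illustrative rows only)
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE mapping "
    "(wikipedia_title TEXT, wikipedia_id INT, wikidata_id TEXT)"
)
conn.executemany(
    "INSERT INTO mapping VALUES (?, ?, ?)",
    [("Amsterdam", 844, "Q727"), ("Mokum", 1234, "Q727"), ("Orphan", 5, None)],
)

pairs = sorted(title_to_int_pairs(conn))
print(pairs)
```

Pairs of this shape (string key, integer value) are what the `dawg` package's `IntDAWG` constructor accepts, so with that package installed something like `dawg.IntDAWG(pairs)` would presumably produce the compact index.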

minimel.index.xml_db(wikidump: Path, *, ns: int = 0, nparts: int = 100)

Make a name database from the page IDs in a Wikipedia XML dump.

Parameters:
  • wikidump (Path) – Wikipedia XML dump file

  • ns (int, keyword-only) – Page namespace

  • nparts (int, keyword-only) – Number of chunks to read
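
To illustrate what the `ns` filter might select, here is a hedged sketch that streams `<page>` elements from a dump-like XML document and keeps only pages in the requested namespace. It is not the module's own code: the function name `page_names`, the inline sample and the returned id-to-title mapping are assumptions, and the chunked reading controlled by `nparts` is not modelled.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical, heavily trimmed stand-in for a Wikipedia XML dump
SAMPLE = """<mediawiki>
  <page><title>Amsterdam</title><ns>0</ns><id>844</id></page>
  <page><title>Talk:Amsterdam</title><ns>1</ns><id>845</id></page>
  <page><title>Rotterdam</title><ns>0</ns><id>2413</id></page>
</mediawiki>"""


def local_name(tag):
    """Strip an XML namespace prefix such as '{http://...}page'."""
    return tag.rsplit("}", 1)[-1]


def page_names(xml_file, ns=0):
    """Yield (page id, title) for every <page> in namespace `ns`."""
    for _, elem in ET.iterparse(xml_file, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        # Only direct children, so revision ids nested deeper are ignored
        fields = {local_name(child.tag): child.text for child in elem}
        if int(fields["ns"]) == ns:
            yield int(fields["id"]), fields["title"]
        elem.clear()  # release the page's subtree; real dumps are very large


names = dict(page_names(io.BytesIO(SAMPLE.encode("utf-8")), ns=0))
print(names)
```

Streaming with `iterparse` keeps memory use flat regardless of dump size, which is one plausible reason to read the dump in parts rather than loading it whole.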