minimel.prepare module

minimel.prepare.prepare(wikiname: str, version: str, *, rootdir: Path | None = None, mirror: str = 'https://dumps.wikimedia.org', overwrite: bool = False, nparts: int = 100, index_only: bool = False, custom_langcode: str | None = None)

Download the required files and build the indices.

Parameters:
  • wikiname (str) – Wikipedia edition name (e.g. “simplewiki”)

  • version (str) – Wikipedia version (e.g. “latest”)

Keyword Arguments:
  • rootdir (Optional[Path]) – Root directory (default: None)

  • mirror (str) – Wikimedia mirror (default: “https://dumps.wikimedia.org”)

  • overwrite (bool) – Whether to overwrite existing files (default: False)

  • nparts (int) – Number of chunks to read (default: 100)

  • index_only (bool) – Whether to only create the DAWG index (default: False)

  • custom_langcode (Optional[str]) – Custom language code (if different from wikiname, e.g. “en-simple”; default: None)
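
Example:
A minimal usage sketch based only on the signature above. The directory name "wiki-data" and the helper run_prepare are hypothetical, chosen for illustration. The other keyword values repeat the documented defaults, except custom_langcode, which uses the example value given above. The return value of prepare is not documented here, so the sketch simply passes it through.

```python
from pathlib import Path

# Keyword arguments mirroring the documented signature.
prepare_kwargs = dict(
    rootdir=Path("wiki-data"),             # hypothetical output directory
    mirror="https://dumps.wikimedia.org",  # documented default
    overwrite=False,                       # keep files that already exist
    nparts=100,                            # documented default
    index_only=False,                      # documented default
    custom_langcode="en-simple",           # example value from the parameter list
)


def run_prepare():
    """Hypothetical wrapper: call prepare for the "simplewiki" edition."""
    # Imported lazily so this snippet loads even where minimel is not installed.
    from minimel.prepare import prepare

    # Positional arguments are wikiname and version; the rest are keyword-only.
    return prepare("simplewiki", "latest", **prepare_kwargs)
```

Calling run_prepare() starts the download, so it needs network access to the chosen mirror. Because everything after version is keyword-only, passing rootdir or mirror positionally raises a TypeError.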