minimel.ent_feats module

Extract entity features from parquet triples

minimel.ent_feats.ent_feats(spo_parquet: Path, anchor_json: Path, *, part: float = 1)

Extract entity features from parquet triples

Parameters:
  • spo_parquet (Path) – Parquet triple file

  • anchor_json (Path) – Anchor counts

  • part (float) – Filter features based on their count. A value < 1 is a quantile of the feature count; a value > 1 is a minimum feature count.
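
The two readings of part can be illustrated with a minimal sketch. This is not the minimel implementation: filter_by_part is a hypothetical helper, and three points are assumptions of the sketch rather than documented behaviour: features at or above the threshold are kept, the quantile is taken by simple index lookup on the sorted counts, and the default part = 1 applies no filtering.

```python
def filter_by_part(counts, part=1):
    """Hypothetical helper (not part of minimel): filter a feature-count mapping.

    part < 1: treat `part` as a quantile of the feature counts.
    part > 1: treat `part` as a minimum feature count.
    part == 1: assumed here to mean "no filtering".
    """
    if part < 1:
        ordered = sorted(counts.values())
        if not ordered:
            return {}
        # Assumption: simple index-based quantile, no interpolation.
        threshold = ordered[int(part * (len(ordered) - 1))]
    elif part > 1:
        threshold = part
    else:
        return dict(counts)
    # Assumption: features with a count at or above the threshold are kept.
    return {feat: n for feat, n in counts.items() if n >= threshold}


counts = {"a": 1, "b": 2, "c": 5, "d": 10}
print(filter_by_part(counts, 0.5))  # quantile reading: {'b': 2, 'c': 5, 'd': 10}
print(filter_by_part(counts, 5))    # minimum-count reading: {'c': 5, 'd': 10}
```

The actual function may compute the quantile differently or filter in the other direction; the sketch only shows how a single float can switch between a relative and an absolute cutoff.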