Software

  • biographR: an R package to extract structured biographical data from unstructured text and to clean data using ChatGPT’s API.
  • namespace: a Python package for fuzzy name merging using fine-tuned embeddings.

Model Weights

  • namespace: RoBERTa weights fine-tuned for fuzzy merging names