Mu dataset, Ink Restauration Studio, Google Colab notebook

This page contains material used for the paper “Comparing Shapes, Not Noise: Human–Machine Clustering of Handwritten Greek Characters from Papyri”, submitted to the IEEE Workshop on Historical Handwriting Analysis, Natural Language Processing and Knowledge Graphs (HHA-NLP-KG), to be held in Venice, Italy (hybrid event), September 7–9, 2026. The following materials will be published here as soon as the article is accepted:

  • The “Mu Dataset”, composed of small images of individual letters (“cliplets”), divided into two subsets (original and redrawn).
  • Description of the “Mu Dataset” in CSV format.
  • “Ink Restauration Studio”, a standalone web application, to be opened with Google Chrome.
  • Description of how to use the “Ink Restauration Studio”.
  • Google Colab notebook that allows the user to run the same experiments as described in the article (with the provided dataset), or similar ones.
  • Notes on fine-tuning the code.
  • Other resources (heatmaps, similarity scores, visualization tools).