M8 – Analysis, part two

Analysis, part 2: analysing the dataset yourself

 

Technical requirements: option 1

If you are an advanced user who wishes to use our variant analyser or NEXUS converter scripts yourself, please see Option 2 below.

Our most user-friendly approach is our customised Google Colab series. By going through our Colab pages, you will learn how to run analyses, how to read the data, and how to tweak the data for your own research interests. The only requirement is a Google account – everything else is available in the Colab pages!

To learn how to analyse the Texting Scarlatti dataset for your own purposes, please complete the following steps:

  1. Open our Google Colab series and follow the instructions.
  2. Our interactive environment consists of four parts:

Technical requirements: option 2 (advanced users only)

The following requirements are all available on our public GitHub repository (link available soon).

  • Our dataset file (summary.json) which holds all the data on the 3,176 witnesses we analysed;
  • The variant analyser and NEXUS converter scripts, as well as their dependencies (variant_regex.py, variants_to_nexus.csv, and the variant_patterns directory) and any modules specified in the files themselves.

Finally, if you are using the NEXUS converter script, you will need to install a phylogenetic software package to open and interpret the NEXUS files such as PAUP* or SplitsTree 6.

Jasper van der Klis, October 2025