Notes and materials for the workshop "Explainable Stylometry: How to Find Distinctive Textual Features", run by Jeremi Ochab at the seminar Digital Approaches to Pre-Modern Texts and Manuscripts (10-12 June 2025, ENC-PSL, Paris).
You might consider reading:
- Megan S. Kane, "Corpus Analysis with spaCy," Programming Historian 12 (2023), https://doi.org/10.46430/phen0113, to understand the data structures and capabilities of spaCy.
- A Google Colab tutorial or a similar resource, to make yourself comfortable with Google Colab.
The workshop will be run primarily through a Google Colab notebook.
Click the notebook's Colab badge to open the interactive notebook in Google Colab, where you can:
- Clone, import, and run the `cl_explainable_stylo` package
- Load text files from `example1/`, `example2/`
- Experiment...
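The file-loading step in the list above can be sketched without the package itself. This is a minimal illustration, assuming the example folders contain UTF-8 plain-text `.txt` files. The helper `load_texts` is hypothetical and is not part of `cl_explainable_stylo`; the notebook may load files differently.

```python
from pathlib import Path


def load_texts(folder, pattern="*.txt", encoding="utf-8"):
    """Return {file stem: text} for every matching file in `folder`.

    Hypothetical helper for illustration only; not part of cl_explainable_stylo.
    Assumes plain-text files in the given encoding.
    """
    folder = Path(folder)
    if not folder.is_dir():
        return {}  # missing folder: nothing to load
    # Sort the paths so the order of documents is reproducible across runs.
    return {
        path.stem: path.read_text(encoding=encoding)
        for path in sorted(folder.glob(pattern))
    }


if __name__ == "__main__":
    # Folder names taken from the list above; adjust to where the data lives.
    for name in ("example1", "example2"):
        texts = load_texts(name)
        print(f"{name}: {len(texts)} file(s) loaded")
```

Keying the texts by file stem keeps each document traceable to its source file, which helps when inspecting which features distinguish which texts.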
If you prefer to work offline on your local machine, you can instead clone or download the package and follow its installation tips.
If you find this package useful in your research, please cite one of these papers when referring to it:
Ochab, J. K., & Walkowiak, T. (2024). Implementing interpretable models in stylometric analysis. In Digital Humanities 2024: Conference Abstracts. Washington, D.C.: George Mason University (GMU).
Argasiński, J. K., Grabska-Gradzińska, I., Przystalski, K., Ochab, J. K., & Walkowiak, T. (2024). Stylometric analysis of large language model-generated commentaries in the context of medical neuroscience. In L. Franco, C. de Mulatier, M. Paszynski, V. V. Krzhizhanovskaya, J. J. Dongarra, & P. M. A. Sloot (Eds.), Computational Science – ICCS 2024 (pp. 281–295). Cham: Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-63775-9_20
Jeremi Ochab jeremi.ochab@uj.edu.pl
Tomasz Walkowiak tomasz.walkowiak@pwr.edu.pl