Vive la différence: Tracing the (Authorial) Gender Signal by Multivariate Analysis of Word Frequencies

Authors

Jan Rybicki

DOI:

https://doi.org/10.31400/dh-hun.2021.5.3143

Keywords:

authorship attribution, gender of author, Bootstrap Consensus Network, Burrows’s Zeta

Abstract

Multivariate analysis of word frequencies is used to identify the gender of authors in a corpus of 18th- and early 19th-century English sentimentalist and Gothic fiction. Results obtained with the most frequent words are compared with those obtained with medium-frequency words that Burrows’s Zeta identifies as characteristic of each gender. Gender-sensitive words from two periods (18th/19th c. and 19th/20th c.) are compared in terms of their usefulness for gender identification in literary texts.

Published

2021-12-31

How to Cite

Rybicki, Jan. 2021. “Vive la différence: Tracing the (Authorial) Gender Signal by Multivariate Analysis of Word Frequencies”. Digitális Bölcsészet / Digital Humanities, no. 5 (December):T:19-T:38. https://doi.org/10.31400/dh-hun.2021.5.3143.
