Stylometric analysis of Early Modern English plays

Date

2017-12-08

Advisors

Journal Title

Journal ISSN

ISSN

2055-7671

Volume Title

Publisher

Digital Scholarship in the Humanities

Type

Article

Peer reviewed

Yes

Abstract

Function word adjacency networks (WANs) are used to study the authorship of plays from the Early Modern English period. In these networks, nodes are function words and directed edges between two nodes represent the likelihood of ordered co-appearance of the two words. For every analyzed play a WAN is constructed and these are aggregated to generate author profile networks. We first study the similarity of writing styles between Early English playwrights by comparing the profile WANs. The accuracy of using WANs for authorship attribution is then demonstrated by attributing known plays among six popular playwrights. The WAN method is shown to additionally outperform other frequency-based methods on attributing Early English plays. This high classification power is then used to investigate the authorship of anonymous plays. Moreover, WANs are shown to be reliable classifiers even when attributing collaborative plays. For several plays of disputed coauthorship, a deeper analysis is performed by attributing every act and scene separately, in which we both corroborate existing breakdowns and provide evidence of new assignments. Finally, the impact of genre on attribution accuracy is examined revealing that the genre of a play partially conditions the choice of the function words used in it.

Description

The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.

Keywords

computational stylistics, authorship attribution, William Shakespeare, word adjacency networks

Citation

Segarra, S., Eisen, M., Egan, G. and Ribeiro, A. (2018) Stylometric analysis of Early Modern English plays. Digital Scholarship in the Humanities, 33 (1), pp. 500-528

Rights

Research Institute

Institute of English