Journal Article

An algorithm for automated authorship attribution using neural networks

Matt Tearle, Kye Taylor and Howard Demuth

in Literary and Linguistic Computing

Published on behalf of ALLC: The European Association for Digital Humanities

Volume 23, issue 4, pages 425-442
Published in print December 2008 | ISSN: 0268-1145
Published online October 2008 | e-ISSN: 1477-4615 | DOI: https://dx.doi.org/10.1093/llc/fqn022
An algorithm for automated authorship attribution using neural networks

More Like This

Show all results sharing these subjects:

  • Language Teaching and Learning
  • Computational Linguistics
  • Bibliography
  • Digital Lifestyle
  • Information and Communication Technologies

GO

Show Summary Details

Preview

We present an algorithm as evidence of the possibility of a truly automated stylometric authorship attribution tool, based on committees of artificial neural networks. Neural networks have an advantage over traditional statistical stylometry in that they are inherently nonlinear, and therefore can consider nonlinear interactions between stylometric variables. The algorithm presented (1) is intended to demonstrate the feasibility of an automated approach using neural networks and (2) highlights important areas for further research. We present results of two separate test experiments—Shakespeare and Marlowe, and the Federalist Papers—as a demonstration of the method's; generality. In both cases, our algorithm produces committees that correctly predict the test works, without requiring the usual precursory statistical study to determine efficacious stylometric measures.

Journal Article.  8816 words.  Illustrated.

Subjects: Language Teaching and Learning ; Computational Linguistics ; Bibliography ; Digital Lifestyle ; Information and Communication Technologies

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content. subscribe or login to access all content.