Article

Corpus Linguistics In Authorship Identification

Krzysztof Kredens and Malcolm Coulthard

in The Oxford Handbook of Language and Law

Published in print March 2012 | ISBN: 9780199572120
Published online November 2012 | | DOI: http://dx.doi.org/10.1093/oxfordhb/9780199572120.013.0037

Series: Oxford Handbooks in Linguistics

 Corpus Linguistics In Authorship Identification

More Like This

Show all results sharing these subjects:

  • Linguistics
  • Forensic Linguistics

GO

Show Summary Details

Preview

Corpus linguistics is basically ‘an empirical approach to studying language, which uses observations of attested data in order to make generalisations about lexis, grammar, and semantics’, and which, in the context of forensic linguistics, offers much more than explanatory possibilities. It provides methods for processing naturally occurring language data with a view to describing the nature of particular instances of language use and the behavior of particular (groups of) language users. Language corpora can thus be used for a variety of forensic linguistic tasks. This article explores how corpora and corpus methodology can aid the forensic linguist: in authorship identification; to analyze texts comparatively in order to comment on the authorship of questioned documents; to interpret the meaning of disputed utterances; and to investigate and describe language use in legal and forensic settings. After discussing authorship attribution, it looks at disputed meanings, corpora in language and law research, corpora for forensic applications, and the Internet as a corpus.

Keywords: corpus linguistics; forensic linguistics; corpora; authorship attribution; authorship identification; language use; law; disputed meanings

Article.  5822 words. 

Subjects: Linguistics ; Forensic Linguistics

Full text: subscription required

How to subscribe Recommend to my Librarian

Buy this work at Oxford University Press »

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.