Investigating hybrids of evolutionary search and linear discriminant analysis for authorship attribution

Kareem Shaker, David Corne, Richard Everson

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)


Authorship Attribution is the problem of determining who is (or was) the author of one or more texts, in cases where authorship is disputed. There are many well-known cases of disputed authorship; in this paper we consider the Federalist Papers, and the 15th Book of Oz. We treat the problem as a supervised classification problem, and use evolutionary algorithms to search through subsets of function words, which in turn form the basis of predicting authorship via linear discriminant analysis. We compare two approaches (due to the size of the text corpora in dispute, extensive experimentation is difficult), both centred around the optimization of ROC curves. On both datasets, the hybrid EA approach was able to classify the disputed works with 100% accuracy, using small sets of function words comparable to or better than previous works on these cases. ©2007 IEEE.

Original languageEnglish
Title of host publication2007 IEEE Congress on Evolutionary Computation, CEC 2007
Number of pages7
Publication statusPublished - 2007
Event2007 IEEE Congress on Evolutionary Computation - , Singapore
Duration: 25 Sept 200728 Sept 2007


Conference2007 IEEE Congress on Evolutionary Computation
Abbreviated titleCEC 2007


Dive into the research topics of 'Investigating hybrids of evolutionary search and linear discriminant analysis for authorship attribution'. Together they form a unique fingerprint.

Cite this