Abstract
Authorship Attribution is the problem of determining who is (or was) the author of one or more texts, in cases where authorship is disputed. There are many well-known cases of disputed authorship; in this paper we consider the Federalist Papers, and the 15th Book of Oz. We treat the problem as a supervised classification problem, and use evolutionary algorithms to search through subsets of function words, which in turn form the basis of predicting authorship via linear discriminant analysis. We compare two approaches (due to the size of the text corpora in dispute, extensive experimentation is difficult), both centred around the optimization of ROC curves. On both datasets, the hybrid EA approach was able to classify the disputed works with 100% accuracy, using small sets of function words comparable to or better than previous works on these cases. ©2007 IEEE.
Original language | English |
---|---|
Title of host publication | 2007 IEEE Congress on Evolutionary Computation, CEC 2007 |
Pages | 2071-2077 |
Number of pages | 7 |
DOIs | |
Publication status | Published - 2007 |
Event | 2007 IEEE Congress on Evolutionary Computation - , Singapore Duration: 25 Sept 2007 → 28 Sept 2007 |
Conference
Conference | 2007 IEEE Congress on Evolutionary Computation |
---|---|
Abbreviated title | CEC 2007 |
Country/Territory | Singapore |
Period | 25/09/07 → 28/09/07 |