Authorship Attribution is the problem of determining the authorship of one or more texts. Applications include disputed authorship, or deciding which of a collection of pieces of text were by the same author. A popular and successful approach is to characterize a specific author in terms of the usage pattern of function words. These are common words that are unrelated to subject matter, and tend to be used in specific ways by different authors. In English, a well-known collection of 70 function words is often used for this purpose. Previously, using a hybrid of evolutionary search and linear-discriminant analysis (LDA), we have shown excellent performance in authorship attribution in English based on a function word approach. Here, for the first time, we propose and test a set of Arabic function words for use in Arabic authorship attribution. Tests indicate that the chosen collection forms an effective basis for authorship attribution in Arabic.
|Title of host publication||2010 UK Workshop on Computational Intelligence, UKCI 2010|
|Publication status||Published - 2010|
|Event||2010 UK Workshop on Computational Intelligence - Colchester, United Kingdom|
Duration: 8 Sep 2010 → 10 Sep 2010
|Conference||2010 UK Workshop on Computational Intelligence|
|Abbreviated title||UKCI 2010|
|Period||8/09/10 → 10/09/10|