Ablation Study on Feature Group Importance for Automated Essay Scoring

Jih Soong Tan, Ian K. T. Tan

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Grading written academic essays by hand requires significant effort: it is time-consuming and vulnerable to human bias. Automating this task has been explored ever since the introduction of modern computing. Research in automated essay scoring is ongoing, and most work in recent years extracts multiple linguistic features and uses them to build a classification model. The three main types of features used are lexical, grammatical, and semantic. In this work, we conducted an ablation study to identify the engineered features that have the weakest influence. We used a generic feature engineering and classification approach that was used by the winners of the Automated Student Assessment Prize (ASAP), in order to mitigate biases tied to specific feature engineering choices or models. Our results show that a semantic feature called the prompt has the weakest influence on the models. Further investigation indicated that this was due to the feature being over-fitted in the classification model.
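As a rough illustration of the ablation idea described in the abstract, the sketch below removes one feature group at a time and records how much an evaluation score drops. It is not the authors' pipeline: the function names, the `evaluate` callback, and the toy contribution values are all hypothetical placeholders, and a real study would retrain and score a classifier on essay data for each subset.

```python
from typing import Callable, Dict, FrozenSet, Iterable

# Hypothetical type: takes the set of feature groups to keep, returns a score
# (higher is better). In a real study this would train and evaluate a model.
Evaluator = Callable[[FrozenSet[str]], int]


def ablation_study(groups: Iterable[str], evaluate: Evaluator) -> Dict[str, int]:
    """Return, for each feature group, the score lost when that group is removed."""
    all_groups = frozenset(groups)
    baseline = evaluate(all_groups)  # score with every group present
    return {
        group: baseline - evaluate(all_groups - {group})
        for group in sorted(all_groups)
    }


def weakest_group(drops: Dict[str, int]) -> str:
    """Pick the group whose removal costs the least score."""
    return min(drops, key=lambda group: drops[group])


# Toy stand-in for model training: arbitrary placeholder values,
# NOT results from the paper.
TOY_CONTRIBUTIONS = {"lexical": 30, "grammatical": 20, "semantic": 5}


def toy_evaluate(kept: FrozenSet[str]) -> int:
    """Pretend score: the sum of the kept groups' placeholder contributions."""
    return sum(TOY_CONTRIBUTIONS[group] for group in kept)


if __name__ == "__main__":
    drops = ablation_study(TOY_CONTRIBUTIONS, toy_evaluate)
    for group, drop in drops.items():
        print(f"removing {group}: score drops by {drop}")
    print("weakest group:", weakest_group(drops))
```

With a real evaluator, the same loop could be applied to individual features within a group as well as to whole groups; a small or negative drop suggests the removed feature contributes little or may be hurting generalization.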

Original language: English
Pages (from-to): 90-101
Number of pages: 12
Journal: Asia-Pacific Journal of Information Technology and Multimedia
Volume: 11
Issue number: 1
Publication status: Published - 30 Jun 2022

Keywords

  • Ablation Study
  • ASAP
  • Automated Essay Scoring
  • Feature Engineering
  • Semantic

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
