Comparative Analysis of Machine Learning Models on Student Performance Data: Insights from Test Scores and Survey Data

Maheen Hasib, Sanjana Sundararaman

Research output: Contribution to journal › Article › peer-review


Abstract

With the increasing use of digital learning platforms, large volumes of student data have become available for analysis. This paper investigates how machine learning, learning analytics, and educational data mining can be used to gain insights into student performance. Various predictive modeling techniques, including Random Forest (RF), K-Nearest Neighbor (KNN), and Decision Trees (DT), are evaluated for their ability to forecast student test scores. Clustering algorithms such as K-means are employed to identify patterns within the data. The study integrates these predictive models with survey data collected from undergraduate students at Heriot-Watt University Dubai, aiming to identify factors that influence academic outcomes. The research applies a comparative analysis of different machine learning models to both the survey data and the Kaggle test score data. The analysis reveals that linear regression is the most effective model for the Kaggle test score dataset, while K-means clustering provides the best insights from the survey data. The survey model is determined to be more comprehensive because it includes more predictors. Key metrics, such as accuracy, precision, recall, F1 score, and mean squared error, were calculated for both datasets to provide a quantitative overview, enabling a comparative evaluation of model performance and predictor effectiveness. The findings contribute to understanding how data-driven approaches can support educational decisions and interventions while addressing ethical considerations and inclusivity in educational settings.
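For readers unfamiliar with the evaluation metrics named in the abstract, the following is a minimal sketch of how accuracy, precision, recall, F1 score, and mean squared error are conventionally computed. It is not the authors' code: the function names and the toy labels and scores are hypothetical, chosen only for illustration, and a binary pass/fail labelling is assumed for the classification metrics.

```python
from typing import Dict, Sequence


def classification_metrics(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1
) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for a binary labelling (assumed setup)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if precision + recall
        else 0.0
    )
    return {
        "accuracy": correct / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def mean_squared_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average squared difference between observed and predicted scores."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up illustration data, not taken from the paper.
toy_labels = [1, 0, 1, 1, 0, 1]       # hypothetical pass (1) / fail (0) outcomes
toy_predicted = [1, 0, 0, 1, 1, 1]    # hypothetical classifier output
toy_scores = [70.0, 85.0, 90.0]       # hypothetical observed test scores
toy_forecast = [72.0, 85.0, 87.0]     # hypothetical regression forecasts

metrics = classification_metrics(toy_labels, toy_predicted)
mse = mean_squared_error(toy_scores, toy_forecast)
```

The classification metrics suit models that predict a category, while mean squared error suits models that forecast a numeric test score, which is presumably why the abstract reports both kinds.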
Original language: English
Pages (from-to): 61-76
Number of pages: 16
Journal: European Journal of Teaching and Education
Volume: 7
Issue number: 1
Early online date: 16 Feb 2025
Publication status: Published - 18 Feb 2025

Keywords

  • AI in education
  • educational data mining
  • machine learning in education
  • predictive modeling
  • test scores
