Evaluating the Performance of Tree-Based Predictive Models as Programme Recommenders for University Entrants in Kenya.

Thumbnail Image

Date

2024-10

Authors

Kabiru, Kibuthi J.
Makiya, Ratemo C.
Anduvare, E. M.

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Enrolling for the wrong programme by university students has, to an extent, contributed to the high rates of discontinuation on academic grounds, repeat year cases, change of programme after registration, interuniversity transfers, deferments to change programme, drop out cases, suspension over exam irregularities as well as to strikes. This study focused on finding a technological solution for reducing these cases by evaluating three tree-based predictive models and recommending the most predictive model to implement as a programme recommender. Data was collected in five selected public universities in Kenya using Google Forms. The respondents were 308 translating to 308 rows of data with 36 columns. Numpy, Pandas, Matplotlib, Sklearn, Seaborn, Scipy, Plotly python analytics libraries were deployed using Jupyter Notebook for Anaconda. The cleaned and processed dataset features had categorical variables thus one-hot-encoding technique was employed. Data was split for training and testing with the random_state set to 42. Gini index criteria was implemented. The three models were evaluated on their performance from the optimally split data for training and test with a 80:20 ratio. Random Forest (RF) came out the most predictive at 99.3% followed by Gradient Boosting (XG Boost) at 90% then Decision Tree (DT) at 80.93%. The testing accuracy score for RF was 81.72%, XGBoost was at 75.72% and DT was at 76.34%. Confusion matrix criterion was implemented to evaluate the performance of the three models. The results of this study have demonstrated the high accuracy level of RF as the most predictive tree-based model for this real-world University crisis. The model is recommended for development as a system to be integrated into the KUCCPS portal. The integrated system is dubbed Programme Recommender which if launched would highly predict the best programme of study for application by university entrants.

Description

Abstract

Keywords

Random Forest, Gradient Boosting, Python, KUCCPS, Programme, Recommender

Citation

African Journal of Education Science and Technology (AJEST) Vol. 8 No.1

Endorsement

Review

Supplemented By

Referenced By