A Comparison of Supervised Machine Learning Models

  • Goal: To explore supervised machine learning models through education data

    • Research Question: Can a supervised machine learning model classify degree-granting postsecondary institutions as being located in the Northeast vs. in the West?

      • Leveraged domain knowledge of regional differences so that project focus could be on learning about supervised machine learning.

  • Models Tested

    • Baseline

    • K-Nearest Neighbors

    • Decision Tree

    • Random Forest & Optimized Random Forest

    • AdaBoost

    • XGBoost

  • Final Model

    • Optimized Random Forest

  • Process

    • Train-Test Split (70:30)

    • Optimize & Evaluate

  • Evaluation Metrics

    • Accuracy

    • Precision

    • Recall

    • F1 Score

Previous
Previous

Freelance Writing (Data Science and Computer Science)

Next
Next

An NLP Topic Model & Recommender