Abstract
This research project used machine learning to predict the outcomes of Premier League soccer matches. The dataset, containing match data and odds from multiple seasons, was preprocessed to handle missing values, select features, and reduce dimensionality using Principal Component Analysis (PCA). To address class imbalance in the target variable, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. Several machine learning models were evaluated, including RandomForest, DecisionTree, SVM, XGBoost, and LightGBM.