A Bayesian approach for predicting match outcomes: FIFA World Cup 2026
Main Article Content
Abstract
One of the biggest international football competitions, the FIFA World Cup provides teams with an exciting and unpredictable stage on which to display their skills. Predicting match outcomes isn't easy due to the numerous factors involved, like team strategy, player performance, and even unpredictable elements like weather or injuries. Traditional statistical methods in the frequentist framework (such as regression model, machine learning and Monte Carlo simulation) might not fully capture these complexities. This study applied the Bayesian logistics regression and gradient boosting model. to predict possible match outcomes in the forthcoming FIFA World Cup 2026. The Bayesian framework provides a probabilistic and adaptable base that adjusts to tournament dynamics and incorporates prior knowledge, while gradient boosting captures complex non-linear correlations. Key variables include player form, team dynamics, and strategic differences. Data were collected from FIFA's official site and Kaggle, covering historical match data, player statistics and team rankings. Data preprocessing, including median imputation for missing values and feature engineering were carried out. The dataset is split into train-test-validate sets, and the two models evaluated exhibited high predictive accuracy. The study identified top contenders, highlighted offensive and defensive strengths, noted feature importance. The findings emphasize the potential of machine learning in sports analytics. The results identified the leading contenders for the 2026 FIFA World Cup, listing them in order of superiority. Results aim to contribute to the field of sports analytics, offering valuable insights into the complex dynamics influencing success in high-stakes football tournaments. From the literatures, this study on the application of Bayesian logistics regression and gradient boosting model is one of the rare applications to sport analytic.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
- Authors keep the rights and guarantee the Journal of Innovative Applied Mathematics and Computational Sciences the right to be the first publication of the document, licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License that allows others to share the work with an acknowledgement of authorship and publication in the journal.
- Authors are allowed and encouraged to spread their work through electronic means using personal or institutional websites (institutional open archives, personal websites or professional and academic networks profiles) once the text has been published.
References
[1]G. Baio andM. A. Blangiardo,Bayesian Hierarchical Model for the Prediction of Football Results,Journal of Applied Statistics,35(2) (2010), 253–264.
[2]R. P. Bunker andF. Thabtah,A Machine Learning Framework for Sport Result Prediction, Applied Computing and Informatics,15(1) (2019), 27–33.
[3]A. C. Constantinou andN. E. Fenton,Pi-Football: A Bayesian Network Model for Fore casting ssociation Football Match Outcomes,Knowledge-Based Systems,36(2012), 322–339.
[4]A. C. Constantinou andN. E. Fenton,Determining the Level of Ability of Football Teams by Dynamic Ratings Based on the Relative Discrepancies in Scores Between Adversaries, Journal of Quantitative Analysis in Sports,9(1) (2013), 37–50.
[5]A. C. Constantinou,Dolores: A Model that Predicts Football Match Outcomes from All Over the World,Machine Learning,108(10) (2019), 1007.
[6]N. Danisik, P. Lacko andM. Farkas,Football Match Prediction Using Players Attributes, (2018), 201–206.
[7]R. Giulianotti andR. Stebbins,Football: A Sociology of the Global Game,Contemporary Sociology,29(2000), 842.
[8]M. Gifford andT. Bayrak,A Predictive Analytics Model for Forecasting Outcomes in the National Football League Games Using Decision Tree and Logistic Regression,Decision Analytics Journal,8(2023), 100296.
[9]D. Goldblatt,The Ball is Round: A Global History of Soccer,Penguin Publishing Group, (2008), 1008.
[10]T. Horvat andJ. Job,The Use of Machine Learning in Sport Outcome Prediction: A Review, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, (2020).
[11]M. Hughes andJ. Franks,Analysis of Passing Sequences, Shots and Goals in Soccer, Journal of Sports Sciences,23(5) (2005), 509–514.
[12]C. Igiri,An Improved Prediction System for Football Match Result,IOSR Journal of Engineering, 4(2014), 12–20.
[13]M. Ingram,A Point-Based Bayesian Hierarchical Model to Predict the Outcome of Tennis Matches, Journal of Quantitative Analysis in Sports,15(2019).
[14]D. Johansen, C. Gurrin andD. Havard,Towards Consent-Based Lifelogging in Sport Analytic,(2015), 335–344.
[15]J. Ma, F. Stingo,andB. Hobbs,Bayesian Predictive Modeling for Genomic Based Personalized Treatment Selection,Journal of Biometrics,72(2015).
[16]V. Matheson,Mega-Events: The Effect of the World’s Biggest Sporting Events on Local, Regional and National Economies, International Association of Sports Economists, Working Papers (2006).
[17]B. Min, J. Kim andH. Eom,A Compound Framework for Sports Results Prediction: A Football Case Study,Knowledge-Based Systems,21(2008), 551–562.
[18]B. Olawale andM. Oladapo,Bayesian Analysis of 2014 FIFA World Cup Matches Played and Goals Scored,International Journal of Modern Mathematical Sciences,16(1) (2018), 25–36.
[19]F. Owramipur,Football Result Prediction with Bayesian Network in Spanish League, International Journal of Computer Theory and Engineering,5(2013), 812–815.
[20]N. Razili andA. Mustapha,Predicting Football Matches Result Using Bayesian Networks for English Premier League (EPL),IOP Conference Series: Materials Science and Engineering, 226(1) (2017), 012099.
[21]C. Sandvoss,A Game of Two Halves: Football Fandom, Television and Globalization,(2003).
[22]M. Stein,Bring it to the Pitch: Combining Video and Movement Data to Enhance Team Sport
Analysis, IEEE Transactions on Visualization and Computer Graphics, (2017).
[23] J. Sugden andA. Tomlinson,FIFA and The Contest for World Football: Who Rules the
People’s Game?,(1998).
[24]A. Suzuki,A Bayesian Approach for Predicting Match Outcomes: The 2006 (Association) football world cup,Journal of the Operational Research Society,61(10) (2010), 1530–1539.
[25]S. Vaidya,Football Match Winner Prediction,International Journal of Computer Applications, 154 (2016), 31–33.
[26]M. Veenman, A. M. Stefan andJ. Haaf,Bayesian Hierarchical Modeling: An Introduction and Reassessment, Behavior Research Methods,56(1) (2023).
[27]F. Wunderlich andD. Memmert,Forecasting the Outcomes of Sports Events: A Review, European Journal of Sport Science, 21(7) (2020), 944–957.