So using logistic regression and more data, my model is predicting 87% of game outcomes (using all the data from round 1 to the end of rounds), and 100% of games starting on round 5 (eg once the game has progressed for a couple rounds - this model is very accurately predicting the winner!).
Now my problem is I’m not quite sure how to give a simple explanation of how logistic regression works. In fact I’m somewhat confused myself.
Variables
Both measured at the end of Russian turn
UnitDif: AXIS IPC Units - Ally IPC units
IPCDif: AXIS IPC Territory - Ally IPC territory
The Model
116 rounds of data - roughly 15 games
              Predicted
             Allied Win   Axis Win  Percent Correct
Observed Allied Win 28        10       73.7
       Axis Win  5        73       93.6
Overall Percent Correct - 87.1
(It isn’t predicting Allied wins as well, because 2/3 of my data was axis wins)
       B     SE      Sig      Exp(B)
UnitDif  .038   .011    .000      1.039
IPCDif   .110    .035    .002      1.116
Constant 6.644  1.501   .000    768.146
Cox and Snell R Squared: .520
Nagelkerke R Squared:Â Â .724
I think this means that if UnitDIF changes by 1, your chances of winning change by 3.9%. If IPCDif changes by 1, your chances of winning change by 11.6%. But both of those values seem kind of high. So is that right?
Also if the IPCDif is zero then it means for the game to be even the UnitDif should be 177 (6.644/.038) - is that right?
The logistic model is complex because it has something to do with a ratio of two exponents (e to the power of something).