As an example of early results, I first excluded the round 1 data (I’m taking data after the Russian turn) because there is a series of slaughter moves that typically happen in the first round. Eg. the fact that the UK has a battleship in the Mediterranean is worth a fraction of its 24 IPCs. Results are improved by excluding round 1 data.
My main variable is the total value of Axis units in IPC minus that of the Allied units.
Just to show that you can find out stuff with very little data, with a lousy 30 data points, this one variable explains 43% of the outcomes.
Adj R^2 = 0.433
Constant=.793
AxTotDif (the difference in IPCs)
B= 0.0034
T=4.89 (significant level is better than 0.001, or there is a greater than 99.9% chance that this variable is statiscally significant)
Thus, if you have zero IPC difference, the Axis has a 79.3% chance of winning (this makes a lot of sense - the Axis has better supply lines). For each IPC difference, the probability of winning increases by .34%. Note: these numbers are going to change a LOT, once I add some more data points.
In other news, this is pretty crazy, the difference in IPC value for land units has no significant impact, all of the difference comes from the naval IPC difference. That’s what happens when you only use 30 data points =)