The new ELO-based ranking system


  • @MrRoboto so I just opened the spreadsheets. That is very impressive. I won’t have time to help now but didn’t realize you had already done so much work.

  • @MrRoboto sent you a PM as well, look for an email with a link to my spreadsheet for copying data over. I rearranged the columns on my end for faster entry, to match the way my brain reads the posts.

    I’ve gotten back to early November 2021 (page 93 fully completed).


  • Nice, a group project to get more years recorded. After personally and manually entering every game result for years, I have quite a strong grasp on how good each player is, in a way that can’t really be quantified with bare numbers or a formula.

    So I am hoping to see how the numbers will fall with 1/1/22 to date, 1/1/21 to date, or however far we can go back. Going back to the beginning of G40 is the dream. It shouldn’t matter much that the rule sets changed dramatically (especially balanced mod) at some points, since it was even competition between many of the same players anyway.

    I hope the lifetime ratings will pretty much line up with my experience and memory, and it will be fascinating to see players from years ago stack up in the same rankings against contemporaries.

  • So with a lifetime… no K factor adjustment for sensitivity, right?

  • I would keep the K factor. It will help new players joining the community find their correct spot in the ranking faster.

    If someone really weak comes, we need that player to fall quickly, otherwise the first couple of wins against them are overrated.
    And if the next, I don’t know, Napoleon or Sun Tzu suddenly joins, we need that player to climb super fast, otherwise the losses will hurt the respective players more than they should.
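
A minimal sketch of the idea, assuming the standard Elo formula. The K values (40 for a player's first 10 games, 20 afterwards) are placeholders picked for illustration; the thread does not state the values the spreadsheet uses.

```python
def expected_score(rating: float, opponent: float) -> float:
    """Standard Elo expected score for `rating` against `opponent`."""
    return 1.0 / (1.0 + 10 ** ((opponent - rating) / 400.0))


def k_factor(games_played: int) -> float:
    """Hypothetical schedule: a larger K during a player's first 10 games."""
    return 40.0 if games_played < 10 else 20.0


def updated_rating(rating: float, opponent: float, score: float,
                   games_played: int) -> float:
    """New rating after one game; score is 1.0 for a win, 0.0 for a loss."""
    k = k_factor(games_played)
    return rating + k * (score - expected_score(rating, opponent))


# The same win against an equally rated opponent moves a newcomer
# twice as far as an established player under this assumed schedule.
newcomer = updated_rating(1500.0, 1500.0, 1.0, games_played=0)
veteran = updated_rating(1500.0, 1500.0, 1.0, games_played=50)
print(newcomer, veteran)
```

With a schedule like this, a mis-rated newcomer moves toward their real level in fewer games, which is the effect described above.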


  • @MrRoboto if the K factor is considered over a lifetime (and is too sensitive), one issue might be that players who are strong now but were weaker in the past will have their past games weigh down their current ELO. If I’m right on that, instead of it being a factor in one’s first games, can it be more sensitive in one’s most recent games? That might allow new players to move up more quickly without penalizing players that have been around for a while.


  • @MrRoboto

    Mega! At first I thought something like “may it be fun to play the league games whatever the ranking (system)” - but at this moment I find the project even more thrilling than going on with my games (:) mainly because you gently propose it as a matter of community! And by this you are doing great in keeping @gamerman01’s style!! It looks to me as if what you, @gamerman01, have fostered dearly is coming of age rather than plotting


  • @MrRoboto said in Proposal for a new, ELO-based, ranking system:

    I would keep the K factor. It will help new players joining the community find their correct spot in the ranking faster.

    Ah, of course this is right.


  • @farmboy said in Proposal for a new, ELO-based, ranking system:

    If I’m right on that, instead of it being a factor in one’s first games, can it be more sensitive in one’s most recent games? That might allow new players to move up more quickly without penalizing players that have been around for a while.

    Or both?? Sensitive at the beginning and also recently?
    Or a K factor that adjusts to the total number of games, where the adjustment is high in the first few games, but if that player goes on to play 15 or 50 or 500, it automatically drops the K factor on those first few games and applies it to the later games instead? MrRoboto, we think you can do anything now. :D

    I suppose after 50 or 500 whatever games, the k factor being applied to the first few would be practically nil, so maybe could be applied to both beginning and more recent.

    Look at those formulas. Look at that automation. We’ll figure it out.

  • @farmboy and @gamerman01
    This is not possible, since it would mean retroactively adjusting the ELO change of past games whenever you play more recent games.
    That contradicts the whole idea and is not even possible to implement, since it creates circular references again - the biggest problem the old system had.

    It’s also not necessary at all. As you, gamerman, already stated: At a certain point the first few games are completely irrelevant. That point is FAR earlier than 50 games.

    Just an extreme example:
    Dawgoneit is currently 5-45, with an ELO of 1059
    With only 4 wins against some of the current top 5 players, he can increase his rating to ~1600 even though he would still be only 9-45 at that point.
    The system accurately shows the current strength.

    Thanks to Mr_Stucifer we now have the data of 2022 too. I think the system already looks extremely solid.

    Everybody can create a copy for themselves with
    File -> Make a Copy.

    You can then play around and add some results as you like to see how the system behaves.
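
To illustrate the point about circular references, here is a rough sketch of a single chronological pass over the results, assuming standard Elo with a fixed K of 32 and a starting rating of 1500. Both numbers, the `replay` function and the player names are assumptions for illustration, not the spreadsheet's actual settings. Each game only reads the ratings as they stood before that game, so adding a new result never changes how an older game was scored.

```python
def replay(games, k=32.0, start=1500.0):
    """Process (winner, loser) pairs in chronological order.

    Each update uses only the ratings that exist before the game,
    so no game ever depends on a later one.
    """
    ratings = {}
    for winner, loser in games:
        rw = ratings.get(winner, start)
        rl = ratings.get(loser, start)
        expected_win = 1.0 / (1.0 + 10 ** ((rl - rw) / 400.0))
        delta = k * (1.0 - expected_win)
        ratings[winner] = rw + delta
        ratings[loser] = rl - delta
    return ratings


# Hypothetical results, oldest first.
history = [("Alice", "Bob"), ("Bob", "Carol")]
before = replay(history)

# A newer game between Alice and Bob leaves Carol's rating untouched,
# because her only game was already scored with the ratings of its time.
after = replay(history + [("Alice", "Bob")])
print(before["Carol"], after["Carol"])
```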


  • Um, guys?

    The Hubble telescope just got the corrected lens. With another year or 15 months whatever of data (2022, some of 2021) and whatever adjustment Roboto just made,

    I am super excited and happy to see this standings board. Like I said, with my experience of entering every game result and, indeed, reading comments and what all is involved with moderating the league for years,

    I can tell you THIS outcome is excellent.

    Screenshot 2023-10-31 10.52.24.png

    Screenshot 2023-10-31 10.57.37.png


  • Now that is what it would look like if there was a “lifetime” rating starting in late 2021. This is not what 2023 would look like. And of course our past rankings spreadsheet will fill out 2023 so that playoffs are unaffected and comparability across years will be there.

    But something like this, which will be better after conversations and tweaking, is coming to a computer near you on 1/1/24.

    More than half the credit goes to Roboto for enthusiasm, computer ability, and pushing for improvement. I was hard to get through, but today with this spreadsheet pictured below, I am a believer and this is the future.


  • Interesting ideas… from the new ELO spreadsheet my overall rating is 1673, my OOB is 1546 and BM is 1552

    How can my overall rating be higher than either of the two individual game version ratings?


  • @oysteilo said in Proposal for a new, ELO-based, ranking system:

    Interesting ideas… from the new ELO spreadsheet my overall rating is 1673, my OOB is 1546 and BM is 1552

    How can my overall rating be higher than either of the two individual game version ratings?

    One quick explanation/example is if you defeated someone in BM who normally plays PtV or OOB and is more successful there.
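
A toy example of that effect, assuming standard Elo with a fixed K of 32 and a starting rating of 1500 (both assumed, as are the `replay` function and all player names). Player Y builds a rating through OOB wins, then loses to X in BM. In the overall list X is credited with beating a strong opponent; in the BM-only list Y is still at the starting rating, so the same win is worth less.

```python
def replay(games, k=32.0, start=1500.0):
    """Rate (winner, loser) pairs in order with a plain Elo update."""
    ratings = {}
    for winner, loser in games:
        rw = ratings.get(winner, start)
        rl = ratings.get(loser, start)
        expected_win = 1.0 / (1.0 + 10 ** ((rl - rw) / 400.0))
        delta = k * (1.0 - expected_win)
        ratings[winner] = rw + delta
        ratings[loser] = rl - delta
    return ratings


# Hypothetical results as (version, winner, loser), oldest first.
results = [
    ("OOB", "Y", "P1"),
    ("OOB", "Y", "P2"),
    ("OOB", "Y", "P3"),
    ("BM", "X", "Y"),
]

overall = replay([(w, l) for _, w, l in results])
bm_only = replay([(w, l) for v, w, l in results if v == "BM"])
oob_only = replay([(w, l) for v, w, l in results if v == "OOB"])

x_overall = overall["X"]
x_bm = bm_only["X"]
x_oob = oob_only.get("X", 1500.0)  # X has no OOB games in this toy data
print(round(x_overall, 1), round(x_bm, 1), round(x_oob, 1))
```

Here X ends up higher overall than in either single-version list, simply because the overall list knows more about how strong Y is.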


  • also look at Jkeller, he is number one in overall, but I only find his name in the OOB bracket where he is 7th.

    Maybe I am missing something?


  • @oysteilo this also might be because results are still being put in. I see him having an overall ranking based on 11 games and an OOB ranking based on 4 games. And 0 games with BM or PTV. So I suspect there is either a clerical error or 7 games (all wins so that would push up his ELO) that were counted in overall that have yet to be counted elsewhere.


  • Right, only 3-1 in OOB but is 10-1 overall

    jkeller was 5-0 in BM in 2022 but the new spreadsheet has him with 0 BM games. Results tab has BM games completed by him but none in the BM standings tab, so something isn’t working there.

    MrRoboto will weigh in.
    Good point-out, oysteilo


  • I love this idea. My personal feedback:

    • No need to have ELO ratings decay, but perhaps put an asterisk after the number if fewer than 3 games were played in the last 12 months, or a similar concept.
    • The versions of the game are similar enough that we don’t need separate ratings for each version. You can already see that the best OOB players are also at the top in BM.
    • I would like to see playoffs using this for bracketing, assuming the player has met the minimum number of games for the season.
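
The asterisk idea from the first point could look something like this. It is only a sketch: the function name, the 365-day window and the way game dates are stored are all assumptions, and only the "fewer than 3 games in 12 months" threshold comes from the suggestion above.

```python
from datetime import date, timedelta


def display_rating(rating, game_dates, today, min_games=3, window_days=365):
    """Return the rating as text, with '*' if the player was recently inactive."""
    cutoff = today - timedelta(days=window_days)
    recent_games = sum(1 for played in game_dates if played >= cutoff)
    label = str(round(rating))
    return label + "*" if recent_games < min_games else label


today = date(2023, 10, 31)
# One game in the window: flagged as inactive.
print(display_rating(1673, [date(2023, 10, 1)], today))
# Three games in the window: shown without the asterisk.
print(display_rating(1673, [date(2023, 3, 1), date(2023, 6, 1), date(2023, 10, 1)], today))
```

The rating itself is left alone, so a returning player picks up exactly where they stopped.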

  • @Arthur-Bomber-Harris At first I agree with the sentiment, but after a minute I definitely do not.

    Most players here prefer BM and have honed their skills for it.
    I think more of the better players are playing BM.

    If the data is accurate, and it may not be (see the jkeller issue above)
    Anecdotal evidence:
    Pejon is #4 overall, 20-7

    He is 6-0 PtV, possibly a weaker field
    13-7 BM
    Maybe he plays higher competition in BM, I don’t know offhand. Those records add up to 19-7, may be some data entry errors - we probably need someone to double check. I don’t have time these days.

    Myygames is #1 in OOB with 7-0
    Is #10 overall when put together with everyone else.

    Again, could be data entry errors, could be strength of schedule differences.
    But many players, and probably Myygames, would like to know they’re #1 in OOB and not just #10 overall, especially when that’s the version they’re into.

    We also split the versions 3 years ago because we had the issue of what version to play in the playoffs.

    I don’t have time - I just slapped this response together but I hope it helps and stimulates your brain. Keep those ideas coming, and let me know what you think about my response if you want


  • @gamerman01 strength of schedule is way weaker in OOB which is why I can make the playoff finals in this division but would get crushed in BM with more top players.

    ELO gives people an incentive to cross over to other versions if some players have inappropriately low or high ratings. Knock down a few people who are most out of line with reality and the reduced ratings cascade through the rest of the group as the consequences reverberate. It might not be absolutely perfect, but I doubt people will end up too far away from where they should be.
