Skip to main content

Special: SOS UKGE



Special: SOS UKGE 2019

DISCLAIMER: 

This article is an examination of how the finals of UKGE would have looked if they had cut to top 2 using a Strength of Schedule ("SOS" from now on) instead of GW's idiotic tiebreaker system ("GWITS").  It is not meant as a condemnation of any player.  It is not meant to imply that any player did not deserve their spot in the tournament.  It is only meant to point out the failures of GW's system.

The best solution to this would be to change the final cut to include all players with perfect match records.  There were six of these at UKGE, so four players who never lost a match didn't even get a chance to play for 1st.  The two who did get to play were chosen by GWITS - a system that is deeply flawed in that it rewards players for playing against opponents who are much less successful than they are.  SOS, on the other hand, rewards players for playing against stronger opponents, and is widely used in most competitive game/sport formats.

Once again, big congratulations to all the undefeated players for kicking ass in a very large tournament, you should have all been included in the final cut!  However, with a top-2 cut, this isn't going to happen most of the time.  If GW won't fix their top-2 cut (due to time restrictions, a reasonable objection), they really should use SOS.

On the upside, it looks like GW is experimenting with some different tournament formats (including a two-day format that has a larger final cut and more rounds).  Hopefully, this will result in fewer inconsistencies and injustices due to their truly terrible tiebreak and matchmaking systems. 

(For a detailed discussion of why SOS is a better system than GWITS, please see the discussion here.)

The GWITS final round standings:

According to the GWITS, the players were ranked thusly after the final round (player names will not be used, in an attempt to reduce the perception that I am slamming on any given person).

  1. Ylthari's Guardians A
  2. Ylthari's Guardians B
  3. Stormsire's Cursebreakers A
  4. Mollog's Mob 
  5. Stormsire's Cursebreakers B
  6. Godsworn Hunt (what!?)

Tiebreakers for each player - the factors that determined who among the 12 point players would be in 1st-6th place after swiss - are listed below, in this format: Warband - Game Losses - Total Glory Differential.

WarbandLossesGlory Diff
Guardians A184
Guardians B183
Cursebreakers A182
Mollog172
Cursebreakers B149
Godsworn2not calculated

At the end of Round 4, Guardians A and Guardians B played in the final.

A few things of note:  1st through 5th place were decided by glory differential only, and 1st-3rd were separated by only 2 glory.  3rd place missed the cut for the finals by only a single point of glory.  While I can't speak for other players, this would be extremely frustrating for me.

Secondly, while not noted on this chart, it is worth pointing out that of 109 total players, 3 dropped after first round losses.  One might assume that these players would have then finished 107th, 108th, and 109th.  Not so.  Because GW has decided that the first tie breaker is games lost, those players who stayed in and lost all their matches (0-4) finished lower than the 0-1 drop players, because they lost more games.  Seven players finished below the 0-1 drop players.  One of the 0-1 drop players also recorded a loss first round, but recorded zero game losses, so that player finished highest of all the 0-win players.  That sucks out loud.

The SOS final round standings:

If the tournament had used SOS instead of GWITS, the tiebreakers would have been:
  1. Opponents' match record percentages
  2. Player's game record percentage
  3. Opponents' game record percentages
A full explanation of the DCI's SOS system - used by Magic the Gathering, can be found here.

In that case, the tiebreakers for each player would have looked more like this:

WarbandOMW%PGW%OGW%
Guardians A0.7075not calculatednot calculated
Guardians B0.58250.890.5025
Cursebreakers A0.65400.89not calculated
Mollog0.5200not calculatednot calculated
Cursebreakers B0.58250.890.5725
Godsworn0.64500.80not calculated

This would have resulted in the following standings after swiss:
  1. Guardians A
  2. Cursebreakers A
  3. Godsworn (WHAT!?)
  4. Cursebreakers B
  5. Guardians B
  6. Mollog
In this case, the final would have been between Guardians A and Cursebreakers A.

There's some good stuff to be gleaned from here.  First, the Guardians A player would have been first after swiss either way!  That's good, it shows that even GW's poorly designed system can end up rewarding players for beating more difficult opponents - sometimes.  If you've seen my other articles on the subject, you'll note that this wasn't the case at LVO or Adepticon.  Additionally, the 2nd place player would have changed, resulting in a different final match.  Finally, some of the other top 6 got jumbled around.  While this wouldn't have changed the outcome of the final match, it might have mattered to some folks.

Second, Godsworn did even better in SOS than they did in GWITS.  I don't even know what to say about that, except I'm fucking impressedGood job, man.

Coda: The Inevitable Counterargument



Predictably, after reading (or not reading) this article, someone will definitely make an argument something along the lines of "If SOS had been used during the whole tournament, it's impossible to predict how things would have gone, because everything would have been different."  Let's address that.

First, SOS is a tiebreaker, not a matchmaking system.  It's entirely possible to use GW's (broken, stupid) matchmaking system and still use SOS as tiebreakers.  So in fact, the matches could have been exactly the same.

Second, even if the matches wouldn't have been the same, there's value in analyzing data through a different lens, even if it can't account for all variables.  Economists use this technique all the time; it's called controlling for a variable.  You can find a detailed explanation of how this works, written by someone much smarter than me, in the works of Steven Levitt.

However, you may want a simpler, less whole-book-long look at how controlling for a variable works.  For that, let's take a look at one of my favorite sayings: "If my aunt had balls, she'd be my uncle."  I used to say this a lot, mostly to illustrate that a particular argument was pointless because it was examining a hypothetical in a way that couldn't account for all the possible outcomes.  In other words, I was making the same argument that I illustrated above.  Because I was young, and kind of dumb.

Since then, I've grown up in a lot of ways, and I look at things much differently.  Most significantly, I realized that just because my aunts' possession or lack of testicles doesn't necessarily determine their genders.  In studies from 2016 and 2011 respectively, it is estimated that 0.6% of the population identifies as trans, and 0.4% identify as non-binary.  So, even if my aunt did have balls, there's still only a 99% probability that she would identify as being male, and therefore, my uncle.  (And I would, of course, respect that person's preference in naming conventions, because I'm not a piece of shit).  I have a lot of aunts, so if they all had balls, there would be a pretty good chance that at least one of them would still be my aunt.  But most of them wouldn't.

So now, I think about that saying very differently.  Now, I think: "If my aunt had balls, she'd probably be my uncle."  You can apply the same thinking to the SOS/GWITS conversion.  Certainly there is a mode of thinking that dictates that changing to SOS would change every thing about the tournament, and the results would be impossible to predict.  But looking at a controlled-for variable (+/- balls) allows us to view a probable outcome (uncle vs. aunt). 

Whereas if we simply decide that a change to a single variable will automatically trigger an Ashton-Kutcher sized ripple in probability every time and result in a totally unpredictable outcome, you've learned nothing, examined nothing, and taken the easy way out.  You have no idea what to call your parent's sibling.  You have brought shame to your whole family.  Meiwaku.

Comments

  1. Howdy

    Awesome article, loving your work. Come join us on the discord?

    https://discord.gg/breyyyz

    Peace,

    ReplyDelete

Post a Comment

Popular posts from this blog

Hex and the City: Extreme Flank

Decision: How should I place my board to optimize Extreme Flank? TL;DR: This one for when you lose the rolloff or if you like your boards in the rectangular (non-offset) short board layout:

Otherwise, this one:
Prelude: Understanding Extreme Flank The first obstacle that needs to be overcome in order to properly set up for scoring Extreme Flank is to understand how the card actually works.  It's quite poorly worded and the resulting methods of scoring can be counter-intuitive.   Luckily, someone made this excellent little diagram to help us understand how to score it:

In the above diagram, if your fighter is on a blue edge, they can only score Extreme Flank if your other fighter is on the green edge.  Note that the bottom layouts are mirrors of the top layouts.  This is important because the order in which you choose fighter matters.  For example, using the left diagrams, if you have a fighter on p4 and a fighter on p1 you can only score extreme flank if you choose the fighter on …

Cardiology: Double Feature! Branching Fate/One Fate

Decision: Should You Play Branching Fate/One Fate? TL;DR: Do you stride far? Then you should probably be playing Branching Fate.  Other warbands with high dice counts should consider playing Branching as well.  If you can manage to regularly roll 3 magic dice on spell attacks, One Fate is quite good.  Otherwise, pass on it.
Prelude:Double Feature! This week we're taking a look at two different cards for Cardiology; rest assured this departure from standard operating procedure is for good reason.  Namely, the math required to analyze One Fate is very similar to the math required to analyze Branching Fate.  We couldn't see a reason to stretch the same work into two articles, so this week you're getting a two-for-one special.
Factor: Warband In order to score either of the Fate objectives, you've got to be rolling three or more dice on attack or defense.  Rolling three dice on attack is pretty common (defense - less so), but some warbands definitely come naturally better eq…

Special: Las Vegas Open Recap

SPECIAL: LVO RECAP
So the Call It Shadespire playgroup headed out to play in the Las Vegas Open last weekend, and this article will focus on our experience there!

Factor: The People I want to start out by saying that the people playing Shadespire at the convention were - universally - awesome.  Everyone there seemed out to have a good time, and all of my matches were fantastic, no exaggeration.  I'd like to give a special shout out to the Canadians there - especially Sam, Justin, and Kaptain Murder - who were all friendly to the Albuquerque crew, even putting up with our appropriation of Canadian culture in our team names (on Friday we were LETTERKENNY, on Sunday we were DIRTY DANGLES). 

My opponents were all super cool, and I honestly had a blast every single game.  It's been a long time since I played a tournament and didn't have a single game that felt bad.
Factor: Las Vegas
For real, hanging out in Las Vegas involves a lot of walking - and this is coming from a nurse.  …