"What is the power level of your deck?" It's a common phrase you hear. A question asked by both casual and competitive players alike. No rational person wants to walk into a commander game with a deck that isn't on the same level as everyone else. The common metric is to give the deck a rating from 1 to 10 on how powerful it is. The problem; everyone's opinion is different. I've seen people who think their deck is a 10, because they've never seen a
Flash Hulk or storm deck before. I've seen people give a 7 to decks that are much more powerful than they think. How well does this rating system actually work? And is there a way to unify the rating system?
You know what my favorite thing about Deckstats is? The stats.
Over the last month or two, I've been collecting data from the people in my playgroup. The task was simple: I gave them a sheet of paper that listed all of the active decks in our meta and asked them to give each deck a rating from 1 to 10. If they didn't know a deck well enough, they left the entry blank.
Here are the decks, ordered from highest average rating to lowest. Names have been changed in order to keep identities a secret. A (c) next to a commander marks a deck at competitive commander (cEDH) power.
Commander | Player | Self-rating | Average rating | Std. dev. | Number of votes |
Urza, Lord High Artificer (c) | Gohan | 10 | 8.89 | 0.928 | 9 |
Rashmi, Eternities Crafter (c) | Krillin | 9 | 8.79 | 0.699 | 7 |
Krenko, Mob Boss | Morganator 2.0 | 8 | 8.75 | 0.849 | 14 |
Edric, Spymaster of Trest (c) | Morganator 2.0 | 10 | 8.61 | 1.147 | 14 |
Brago, King Eternal | Piccolo | 9 | 8.33 | 0.816 | 6 |
Brago, King Eternal (c) | Android 18 | 8 | 8.14 | 1.464 | 7 |
Marath, Will of the Wild | Piccolo | 9 | 8.00 | 0.000 | 3 |
Marwyn, the Nurturer (c) | Gohan | 10 | 8.00 | 0.866 | 9 |
Oona, Queen of the Fae (c) | Frieza | 9 | 7.83 | 0.764 | 3 |
Vilis, Broker of Blood | Krillin | 7 | 7.80 | 0.837 | 5 |
Atla Palani, Nest Tender (c) | Morganator 2.0 | 9 | 7.78 | 1.856 | 9 |
K'rrik, Son of Yawgmoth (c) | Cell | 8 | 7.75 | 1.500 | 4 |
Kruphix, God of Horizons | Krillin | 8 | 7.63 | 0.479 | 4 |
Selvala, Heart of the Wilds (c) | Goku | 8 | 7.58 | 1.114 | 6 |
Prime Speaker Vannifar (c) | Gohan | 8 | 7.57 | 1.134 | 7 |
Jodah, Archmage Eternal | Bulma | 8 | 7.46 | 1.030 | 13 |
Najeela, the Blade-Blossom (c) | Gohan | 7 | 7.45 | 1.066 | 10 |
The Scarab God | Morganator 2.0 | 7 | 7.43 | 1.072 | 14 |
Alesha, Who Smiles at Death | Piccolo | 6.5 | 7.33 | 0.577 | 3 |
Jhoira, Weatherlight Captain | Tien | 7 | 7.25 | 0.957 | 4 |
Meren of Clan Nel Toth | Goku | 7 | 7.10 | 0.894 | 5 |
Gaddock Teeg | Goku | 7 | 7.08 | 1.114 | 6 |
Ayula, Queen Among Bears | Bulma | 7 | 7.07 | 1.158 | 14 |
Karlov of the Ghost Council | Frieza | 7 | 7.00 | 0.816 | 4 |
The First Sliver | Frieza | 8 | 7.00 | 1.785 | 9 |
Marrow-Gnawer | Yamcha | 6 | 7.00 | 2.160 | 4 |
Elsha of the Infinite (c) | Gohan | 8 | 6.94 | 1.321 | 8 |
Tasigur, the Golden Fang | Piccolo | 8 | 6.83 | 0.764 | 3 |
Marchesa, the Black Rose | Goku | 7 | 6.79 | 1.075 | 7 |
Elenda, the Dusk Rose | Piccolo | 7 | 6.75 | 0.500 | 4 |
Garna, the Bloodflame | Trunks | 8 | 6.75 | 1.500 | 4 |
Yarok, the Desecrated | Gohan | 8 | 6.73 | 0.876 | 11 |
Tuvasa the Sunlit | Gohan | 7 | 6.63 | 1.188 | 8 |
Gonti, Lord of Luxury | Piccolo | 6 | 6.60 | 0.548 | 5 |
Rhys the Redeemed | Android 17 | 7.5 | 6.60 | 1.673 | 5 |
Grenzo, Dungeon Warden | Goku | 5 | 6.58 | 1.497 | 6 |
Kadena, Slinking Sorcerer | Gohan | 6 | 6.57 | 0.787 | 7 |
Gargos, Vicious Watcher | Gohan | 7 | 6.50 | 1.000 | 4 |
Marchesa, the Black Rose | Yamcha | 7.5 | 6.50 | 1.291 | 4 |
Hapatra, Vizier of Poisons | Android 18 | 7 | 6.50 | 1.323 | 7 |
Marwyn, the Nurturer | Trunks | 7 | 6.50 | 1.378 | 6 |
Krav+Regna | Chi-Chi | 5 | 6.44 | 1.116 | 8 |
Nekusar, the Mindrazer | Android 17 | 6 | 6.40 | 0.894 | 5 |
Golos, Tireless Pilgrim | Chi-Chi | 4 | 6.36 | 1.180 | 7 |
Breya, Etherium Shaper | Piccolo | 9.5 | 6.33 | 0.577 | 3 |
The Scorpion God | Piccolo | 5.5 | 6.33 | 0.577 | 3 |
Greven, Predator Captain | Gohan | 6 | 6.29 | 1.380 | 7 |
Arcades, the Strategist | Chi-Chi | 5 | 6.25 | 1.541 | 6 |
Edgar Markov | Vegeta | 4 | 6.14 | 0.690 | 7 |
Anje Falkenrath (c) | Cell | 9 | 6.00 | 2.449 | 4 |
Gahiji, Honored One | Yamcha | 7.5 | 5.75 | 0.500 | 4 |
Momir Vig, Simic Visionary | Trunks | 4 | 5.33 | 1.528 | 3 |
Meren of Clan Nel Toth | Tien | 5 | 5.00 | 2.000 | 3 |
Gisela, Blade of Goldnight | Chiaotzu | 6.67 | 4.92 | 1.379 | 12 |
Golos, Tireless Pilgrim | Piccolo | 2 | 4.40 | 2.074 | 5 |
Yargle, Glutton of Urborg | Bulma | 2 | 4.09 | 2.023 | 11 |
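For anyone who wants to repeat this with their own playgroup, here is a minimal Python sketch of how each row's average, spread, and vote count could be computed. The raw ballots aren't included in this post, so the two vote lists below are hypothetical: they were picked only because they reproduce the Marath and Alesha rows. The `summarize` helper is an illustrative name, and the use of the sample standard deviation (n - 1 in the denominator) is an assumption that happens to match those rows.

```python
import statistics

def summarize(votes):
    """Return (mean, sample standard deviation, vote count) for one deck."""
    n = len(votes)
    mean = statistics.mean(votes)
    # Sample standard deviation (n - 1 in the denominator); needs >= 2 votes.
    spread = statistics.stdev(votes) if n > 1 else 0.0
    return mean, spread, n

# Hypothetical ballots, NOT the real ones: chosen to reproduce two table rows.
marath_votes = [8, 8, 8]
alesha_votes = [7, 7, 8]

for name, votes in [("Marath", marath_votes), ("Alesha", alesha_votes)]:
    mean, spread, n = summarize(votes)
    print(f"{name}: {mean:.2f} | {spread:.3f} | {n}")
```

Blank entries are simply left out of a deck's vote list, which is why the vote counts differ from deck to deck.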
Pre-analysis
I did some asking around about why people voted the way they did. A lot of these outcomes conflicted with how I perceived the strength of the decks. In particular, Krenko, Mob Boss versus Edric, Spymaster of Trest. As the creator of both these decks, I am certain that Edric is more powerful, no contest. Edric can take on powerful cEDH decks, while Krenko is limited to high-power decks. So this prompted me to ask how people rate decks. Two answers in particular caught my attention. Chi-Chi rated decks entirely based on how fast they could win. She says she used to be in a cEDH league, so she's seen much faster decks before. The next one was Android 17. He rated decks based on how much of a threat they were to him. For example, he considered Yamcha's Marrow-Gnawer deck to be a 10, because his Rhys the Redeemed deck can't deal with an army of Rat Colony with fear.
The difference of opinions is something I've noticed. The two Brago, King Eternal decks both made it to the top 10, but only one of them is a cEDH deck... and it was rated lower than the non-cEDH deck. Now if you know how standard deviations work, you'll know that there isn't a statistically significant difference between the two Brago decks, and there also isn't a significant difference between my Edric and Krenko decks. But what is important is that some people did rate Krenko higher than Edric, and some people did rate the cEDH Brago lower than the casual Brago.
I'm not the only one that noticed this. When I told Gohan that his Urza, Lord High Artificer was the strongest deck, he quite comically said "What? No it isn't."
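To put a rough number on the "no statistically significant difference" point for Krenko versus Edric and for the two Brago decks, here is a sketch of Welch's t-test computed from the summary numbers in the table (average, standard deviation, number of votes). This is a swapped-in technique, not necessarily how the comparison was originally reasoned about. It also assumes the two sets of votes are independent, which isn't really true since mostly the same people rated both decks; a paired test on the raw ballots would be the better tool, but those aren't in this post. The function name `welch_t` is just illustrative.

```python
import math

def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom,
    computed from summary statistics only."""
    a = sd1 ** 2 / n1  # squared standard error of the first mean
    b = sd2 ** 2 / n2  # squared standard error of the second mean
    t = (mean1 - mean2) / math.sqrt(a + b)
    df = (a + b) ** 2 / (a ** 2 / (n1 - 1) + b ** 2 / (n2 - 1))
    return t, df

# Values copied from the deck table above.
krenko_vs_edric = welch_t(8.75, 0.849, 14, 8.61, 1.147, 14)
brago_vs_brago = welch_t(8.33, 0.816, 6, 8.14, 1.464, 7)

for label, (t, df) in [("Krenko vs Edric", krenko_vs_edric),
                       ("Brago vs Brago (c)", brago_vs_brago)]:
    print(f"{label}: t = {t:.2f}, df = {df:.1f}")
```

Both t values come out well below 1, while the usual two-sided 5% cutoff is around 2 or more at these sample sizes, so the gaps between those pairs are comfortably within the noise.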
Error ratings
I made sure that people would rate their own decks, for two reasons. One, it gave them a baseline to compare deck powers to. Two, I wanted to see how well they could guess the power of their decks compared to everyone else. Here's the data for that. Names with (c) next to them denote players who have experience with cEDH deck-building.
Player | Number of decks | Average deck power | Self-rate avg | Highest power | Lowest power | Deck range | Self-guess error | Other deck guess error | # of ratings |
Android 17 | 2 | 6.50 | 6.75 | 6.60 | 6.40 | 0.20 | 10% | 16% | 10 |
Android 18 (c) | 4 | 7.32 | 7.50 | 8.14 | 6.50 | 1.64 | 5% | 11% | 28 |
Bulma | 3 | 6.21 | 5.67 | 7.46 | 4.09 | 3.37 | 20% | 22% | 10 |
Cell (c) | 4 | 6.88 | 8.50 | 7.75 | 6.00 | 1.75 | 27% | 16% | 4 |
Chiaotzu | 1 | 4.92 | 6.67 | 4.92 | 4.92 | 0.00 | 36% | 19% | 32 |
Chi-Chi | 3 | 6.35 | 4.67 | 6.44 | 6.25 | 0.19 | 26% | 24% | 24 |
Frieza (c) | 4 | 7.28 | 8.00 | 7.83 | 7.00 | 0.83 | 10% | 15% | 11 |
Gohan (c) | 10 | 7.16 | 7.70 | 8.89 | 6.29 | 2.60 | 11% | 18% | 47 |
Goku (c) | 8 | 7.03 | 6.80 | 7.58 | 6.58 | 1.00 | 7% | 17% | 29 |
Krillin (c) | 3 | 8.07 | 8.00 | 8.79 | 7.63 | 1.16 | 6% | 10% | 19 |
Piccolo | 10 | 6.77 | 6.94 | 8.33 | 4.40 | 3.93 | 20% | 8% | 10 |
Tien | 3 | 6.13 | 6.00 | 7.25 | 5.00 | 2.25 | 2% | 13% | 26 |
Morganator 2.0 (c) | 4 | 8.14 | 8.50 | 8.75 | 7.43 | 1.32 | 12% | 16% | 48 |
Trunks | 4 | 6.19 | 6.33 | 6.75 | 5.33 | 1.42 | 17% | 8% | 10 |
Vegeta | 1 | 6.14 | 4.00 | 6.14 | 6.14 | 0.00 | 35% | 22% | 2 |
Yamcha | 3 | 6.42 | 7.00 | 7.00 | 5.75 | 1.25 | 20% | 16% | 12 |
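The self-guess error column isn't defined anywhere in this post. The numbers appear consistent with a mean absolute percentage error: for each of a player's decks, take the gap between the self-rating and the group's average rating, divide by the group's average, then average over that player's decks. Here is a minimal sketch under that assumption (the function name `self_guess_error` is illustrative), checked against three players whose decks all appear in the first table.

```python
def self_guess_error(decks):
    """Mean absolute percentage error of self-ratings against group ratings.

    `decks` is a list of (self_rating, group_average) pairs for one player.
    The formula is an assumption inferred from the table, not a stated rule.
    """
    errors = [abs(self_rating - group_avg) / group_avg
              for self_rating, group_avg in decks]
    return sum(errors) / len(errors)

# (self-rating, group average) pairs copied from the deck table.
players = {
    "Android 17": [(7.5, 6.60), (6, 6.40)],  # Rhys, Nekusar
    "Chiaotzu": [(6.67, 4.92)],              # Gisela
    "Vegeta": [(4, 6.14)],                   # Edgar Markov
}

for name, decks in players.items():
    print(f"{name}: {self_guess_error(decks):.0%}")
```

Dividing by the group average means a one-point miss counts for more on a weak deck than on a strong one, which is worth keeping in mind when comparing players with very different deck ranges.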
Damn. I'm getting close to Deckstats' character limit. It's a little much to ask you guys to draw conclusions from this, so instead ask me questions. Ask for details about the group, and suggest tests that I could run on this data. I know for sure that I'm going to test the predictive power of cEDH players versus casual players.
What I do know right now: the 1 to 10 system for rating commander decks is very inaccurate. It is a metric based entirely on personal experience, and who am I to say that I know better than everyone else? I really want to find a better way for people to rate their commander decks, similar to what Judaspriester and Dexflux were doing a while back.
https://deckstats.net/forum/index.php/topic,49777.0.html