Balancing Games |
February 24th, 2024 |
games |
- Try to win
- Win about 1/N of the time
With many games and groups of participants these are in conflict: if I play bridge against my kids I'm going to win all the time, but I'm not very good at the game so if I play against people who are serious about it I'm going to lose ~all the time.
One way some games handle this is by including a lot of luck. The more random the outcomes are, the more you'll approach 1/N regardless of player skill. Kid games where you make no choices, like Candyland or War, take this to the extreme.
Instead, I think handicapping is a much better approach. For example in Go the weaker player can start with several stones already on the board, which gives them an advantage while still keeping it interesting and without turning it into a different-feeling game. When I was little and playing Go with my dad I remember slowly reducing the number of handicaps I needed over months, which was really rewarding: each game was fun and challenging, and I could see my progress.
Other examples:
In Dominion, changing the ratio of coppers to estates that each player starts with.
In Settlers of Catan, allowing weaker players to place both of their settlements before stronger ones.
In Power Grid, Monopoly, Modern Art, or anything else financial, letting weaker players start with more money.
In Ticket to Ride, Thurn und Taxis, Settlers of Catan, or anything else with resource cards, letting weaker players start with more cards.
I like it when games are designed in a way that makes this kind of adjustment easy and granular. You can calibrate by removing a handicap after the weaker player wins some number of games in a row (I think three is about right though it depends on granularity) and vice versa.
I'm curious, though: why isn't this more common? It's very normal in Go, mostly of historical interest in chess, and in most game cultures I'm around it seems like the expectation is just that weaker players will just lose a lot or or stronger players will "go easy" on them? Is it that acknowleging that some players are stronger than others is awkward? Too hard to calculate for games with more than two players?
Interestingly handicapping is pretty common in Shogi, suggesting that it's partly a cultural thing rather than about the features of the game.
So imo ticket to ride is just a poorly balanced game.
I'm thinking people that like games are people that like making new games. If you're really into games you want to win. Those that are looking for an equal playing field regardless just aren't excited enough to make a good game if at all. I've settled in on playing solitaire on my phone. I can stop a game whenever. Sometimes I lose and know full well where the mistake was but don't bother going back. Sometimes I'm sure that a hand is impossible (is it really?) Sometimes I can tell that I'm tired and my focus is poor so I lose. At this point, I've got a 21% win rate and that's just fine. The alternative is to play games with others and I lose almost all the time while they're dancing around the table. Mainly because I just don't care about winning enough to remember the play, each round is "new to me". There must be others like this. Are there a lot? Or is this a really small group?
Or you just play co-op boardgames
I think a reason we don't see more if this is because of the context in which we most often play games. In your context of playing games with your kids, there's a very obvious disparity in skill, experience, and age split. They're not likely to be offended by you suggesting they start with extra money. When peers play together in groups (all kids or all adults), there's a less objective clear difference to point to for making game modifications. Then it becomes an evaluation of skill or performance by your peers, which for many feels like judging their identity, and people get hurt feelings. I think exceptions to this are 1) when someone is new to the game, so they have a "reason" for starting off at a lower skill level, not just not being as good. And 2) if the group plays together often and everyone knows each other really well, you sometimes see the kind of acknowledgement of "well you're good at pattern recognition visuals, but I'm going to mop the floor with you later at Scrabble" and the appreciation of diversity of intelligences balances out sensitivity to failing at one category.
Adding to this, Chess and Go have rating systems, so the evaluation is already done for serious players.
Small comment that the term handicapping is kind of outdated and has weird connotations, maybe just saying giving yourself a disadvantage or evening the playing field? Also I am a fan of this way of playing, with kids it makes a lot of sense. With other adults I sometimes do it covertly, most commonly with word games, like if I’m playing Boggle, we will play 3 letter words, but I’ll only write down 4+ or with bananagrams I’ll take a few extra breaths before saying Go once I’ve used all my letters. This feels like it makes it a more enjoyable all around experience. I would also be fine if someone I was playing with more explicitly doing this, I think I do it covertly sometimes to avoid social dynamics of people feeling bad.
I havent found a good way to handicap Dominion. Just too much of an advantage knowing how to build a good strategy.
Sweet I've usually played it with allowing my opponents 1-2 turns first
Linchuan Yeah this seems better than changing the estate/copper distribution, b/c changing the distribution will probably actually cost you more than 1-2 turns.
I'm not typically in the position of being a heavy underdog in a board game, but when I am, I'm unlikely to be interested in a handicap.
Hypotheses re why handicapping is not more popular: (a) The weaker player(s) oppose handicapping more often than the stronger player(s). (b) People are competitive or have pride in their abilities, etc, and so can dislike the idea of acknowledging so explicitly that the other person is so much better than them that the game needs to be modified to accomodate a skill gap. (c) Spencer Greenberg posted something relevant recently that I'm going to look up and add here.
Re (c): I think weaker players often dislike handicapping for the same reason they often feel bad about losing often: https://www.facebook.com/105736/posts/10106980334455592/
People can tie their identity up in their performance with games. If a person can point to a huge experience gap between players as the reason for a handicap I think they're more likely to accept the handicap. However, if both players have played the g…
In my boardgaming group the weaker player (due to less epxerience) would just be expected to lose a lot and the weaker player (due to a mismatch in cognitive capacity) wouldn't really want to be at our table nor would we want them to be there. In a different setting, as you describe, where social circle or family assembles a wildly mismatched group of players I envision people largely accepting handicap due to less experience, but much less so due to cognitive differences. Most people don't want to get those shoved down their throats, like, at all. It isn't such a taboo for no reason.
I don't play golf, but the idea of "handicap" is a part of at least pop-culture renditions of the game. In my boardgame and videogame groups, explicit in-game support for varying player skill levels is pretty well absent and not really considered. Competitive sports tend to have "divisions" rather than individual compensations.
It's pretty obvious why it's not common in money games like poker or betting pools.
In my experience for casual games, the expectation of equality of outcome just isn't there, and isn't all that important. People still have fun in the optimization and game decisions, even when some of us are really known to be more likely to win. Having a formal handicap system would BOTH be too much effort, AND make some players have less fun because it feels like they're not "really" playing the same game. Maybe. Maybe it's just the effort thing.
Also, for many multiplayer games, there are social dynamics that make current leaders have a bit of a disadvantage in trading or interactions.
This may be besides the point of your post, but: you can do even better than that, and without a need for handicapping, by playing co-op board games instead. Versus-style board games are just one type of game, and while you can modify their rules to come closer to equality of outcomes, that seems like a rather convoluted way of getting there. Like, in this situation, why play a zero-sum game when you could play a positive-sum game instead?[1]
Or if entirely co-op games don't seem appealing, another option along this axis is to play team-based games; then you can balance team strengths by which and how many people you assign to each team.
Some co-op board game recommendations suitable even for groups of widely disparate skill levels: Letter Jam, Just One.
A co-op game for groups that want a challenge: Hanabi.
Some team-based board game recommendations: Codenames, Decrypto. I wrote about these two games here.
Speaking from my own experience, when I grew up I only knew versus board games, stuff like Monopoly or Settlers of Catan. But once I discovered co-op board games, I eventually realized that I had a lot more fun playing those with my siblings.
One of the reasons I tend to like playing zero-sum games rather than co-op games is that most other people seem to prefer:
While I instead tend to prefer:
Many cooperative board games run into a problem where if there are people of differing skill levels on the same team than the strongest player ends up doing most of the playing. Hanabi is the only multiplayer game I've tried that successfully avoids this, where every player needs to be engaged and trying their best.
I know what you mean, and it used to absolutely be an issue in our group, especially with games like Eldritch Horror or Pandemic Legacy, i.e. multi-hour games where you have full information about everything every player is doing. That said, an obvious design which circumvents this problem is co-op games where every player has some private information: then other players can't play for you and vice versa.
Incidentally, all the non-team co-op games I suggested above have this design.
Just One is a co-op party game where the active player must guess a word and each other player independently provides a word hint. Then the hint givers compare hints and eliminate all hints that were given multiple times (hence the title, "Just One").
Resulting game flow: If everyone tries to give an "obvious" hint (e.g. giving the hint "metal" for the word "steel"), then multiple people will likely give the same hint, and as such this hint will be unavailable to the active player. Whereas if nobody gives obvious hints, there's a higher chance that there are no duplicate hints to eliminate, so the active player can work with a lot of hints but might get misled by all hints being non-obvious. This makes it an interesting challenge for what kinds of hints to give and how to interpret the hints one receives.
Meanwhile Letter Jam is a bit like Hanabi: Every player has one letter card facing away from themselves, so everyone but themselves knows what it is. The goal is for everyone to guess their 4-7 letter cards in as few rounds as possible. Every round one player (chosen by the group) gives a word hint to the other players based on the letters they see.
E.g. suppose there are four players. Then I would see the letter cards of the three other players, plus 1-2 letters visible to everyone, plus finally a joker which can substitute for any one letter. And suppose I see the player letters P L A, and an open letter T. Then I could make the word hint PLANT (by using the joker for the N). This hint is given silently by placing numbered poker chips next to the letters I want to use, e.g. the 1-chip in front of the player with the letter P. Here's how these hints look like to the other players: player 1 sees ?LA*T, player 2 P?A*T, player 3 PL?*T. Based on such hints, players try to narrow down what their own letter is.
The hint I gave involved the joker and thus doesn't provide much info on the hidden letters, whereas one great hint can directly help multiple players guess their current letter and proceed to the next one. But even if one player is much better at giving hints, they still rely on others to also provide hints, since you cannot identify your own letters when you give a hint. And even if you could give 5 perfect hints and would then need 5 perfect hints yourself, that's still much less efficient (i.e. it requires more rounds) than if each player can contribute a perfect hint.
Morphie's law does this.
https://store.steampowered.com/app/948960/Morphies_Law_Remorphed/
Doesn't seem to be a particularly successful implementation, but it's an FPS game where players with more kills grow bigger (and are easier to hit), while players with more deaths grow smaller (and are harder to hit and can hide in places the larger players cannot access)
This algorithm is supposed to make the KDR 1/1 over infinite time.
I think one reason I don't like that sort of thing is there's more ambiguity in "what it took to win the game"
It's hard to know whether an artificial advantage is proportional to the skill gap. If I win, I won't know the extent to which I should attribute that win to good play (that I ought to be proud of, and that will impress others), VS attributing the win to a potentially greater than 1/N chance of winning(that I came by artificially).
If the greater skill is the absolute advantage that leads me to a win , I will discount the achievement on account of having an absolute advantage, but I'll still feel satisfied that I have achieved a relatively higher skill level.
If an improperly calibrated handicap is the absolute advantage that leads me to a win, it's a win I'd discount on account of there being an absolute advantage, but in this case I'd garner no satisfaction from having an (artificial) absolute advantage.
Morestill the win might feel insulting or condescending if I was given a disproportionately large advantage due to my friends/competitors underestimation of my expected quality of play.
My win will also not necessarily give my competitors an update as to whether they underestimated my expected quality of play.
If the expectation is that I will win 1/N times, they won't update on my skill level if I win. (Maybe very slightly, and eventually as you play more games)
If I win when the odds are against me, people update significantly on my expected quality of play.
It feels good to know people are updating favourably on my expected quality of play.
In chess, I think there are a few reasons why handicaps are not more broadly used:
That said, chess does use handicaps in some settings, but they are not material handicaps. In informal blitz play, time handicaps are sometimes used, often in a format where players start at five minutes for the game and lose a minute if they win, until one of the players arrives at zero minutes. Simultaneous exhibitions and blindfold play are also handicaps that are practiced relatively widely. Judging just by the number of games played in each handicap mode, I'd say though that time handicap is by far the most popular variant at the club player level.
Is the gap only 2 stones between best professionals and best computers? A reddit thread from 2 years ago said Shin Jinseo has a losing record getting 2 stones from FineArt, and computers have probably improved since then.
For chess in particular the piece-trading nature of the game also makes piece handicaps pretty huge in impact. Compare to shogi: in shogi having multiple non-pawn pieces handicapped can still be a moderate handicap, whereas multiple non-pawns in chess is basically a predestined loss unless there is a truly gargantuan skill difference.
I haven’t played many handicapped chess games, but my rough feel for it is that each successive “step” of handicap in chess is something like 3 times as impactful as the comparable shogi handicap. This makes chess handicaps harder to use as there’s much more risk of over- or under-shooting the appropriate handicap level and ending up with one side being highly likely to win.
In multiplayer games, one balancing factor is that other players can gang up on the person who is ahead. Depending on the game dynamic, this can even things out a lot. In some games, this even creates the dynamic where you don't want to look too strong, so that others don't focus their attention on you.
Playing games against my kids when they were young, rather than just slack off and let them win, it was more fun for me to figure out the best way to handicap myself: What algorithm for sub-optimal play would keep the game close? Solving that puzzle effectively became my victory condition, rather than the game's victory condition, and I was effectively competing against myself, a more balanced opponent.
In Drawback Chess, each player gets a hidden random drawback, and the drawbacks themselves have ELOs (just like the players). As players' ratings converge, they'll end up winning about half the time, since they'll get a less stringent drawback than their opponent's.
The game is pretty different from ordinary chess, and has a heavy dose of hidden information, but it's a modern example of fluid handicaps in the context of chess.
Aren't time handicaps still common in chess?
You're correct, time handicaps (e.g. 2m vs. 5m) are more common than pawn/piece handicaps. Mostly for in-person play.
Master vs. Amateur handicaps can look crazy: 2m vs. 15m and -QRR is a slight advantage for the master simply because most amateurs are not used to playing with the clock. Another M v. A handicap is 'capped pawn': amateur picks a pawn, checkmate must be delivered with that pawn (pre-promotion). It's a bit like having two Kings, as if that pawn is captured the game is lost.
In a game where you play a higher number of shorter games, you can ideally have a handicap that adjusts after every game. For example, in Super Smash Bros, if you turn handicap to "auto" then the stronger player starts with damage, which (in two player) goes up 10% every time they win, and down 10% every time they lose. It gets a little more complicated in 3+ player games, and I'm not sure the exact algorithm, but it works reasonably well. Maybe something to emulate in a game where handicaps can be reasonably granular?
Small edges are why there's so much money gambled in poker.
It's hard to reach a skill level where you make money 50% of the night, but it's not that hard to reach a point where you're "only" losing 60% of the time. (That's still significantly worse than playing roulette, but compared to chess competitions where hobbyists never win any sort of prize, you've at least got chances.)
Bridge is a slightly odd choice of example in your opening section. A single hand of Bridge has very high randomness; it's quite likely the weaker partnership will "win", assuming they have at least basic competence in the game. The advantage of a stronger pair only really becomes apparent over a large number of hands.
The same is true is Poker, even more so. In fact stronger players may not "win" very many more hands than weaker players at all; it's just that when they win they win more and when they lose they lose less.
This isn't true at all in Chess, of course.
Despite the randomness, bridge is an excellent example, as "people who are serious about it" play duplicate bridge. Duplicate poker exists, but doesn't seem as popular.