Epistemic Status: True story

This is intended as post two of the sequence Zbybpu’f Nezl.

Previously (required): The Darwin Game

I

This is my reconstruction of my thoughts at the time.

The Darwin Game requires surviving the early, middle and late games.

In the opening, you need to maximize scoring against whatever randomness people submit. Survival probably isn’t enough. The more copies of yourself you bring to the middle game, the more you face yourself, which snowballs. Get as many points as you can.

In the middle game, you face whatever succeeded in the opening. Strategies that survived the opening in bad shape can make a comeback here, if they are better against this new pool. What strategies do well against you matters.

In the end game, you’ll need to beat the successful middle game strategies, all of which have substantial percentages of the pool. Eventually you’ll be heads up against one opponent. Not letting opponents outscore you in a pairing becomes vital.

How would the game play out? What types of strategies would thrive?

I  divided the types as follows:

There were attackers, who would attempt to get the opponent to accept a 3/2 or 4/1 split. They might or might not give up on that if you refused, and presumably most would use a signal to self-cooperate, but not all. One person did submit “return 3.”

Then there were cooperators, who attempt to split the pot evenly. I assumed that meant alternating 3/2 splits. This then divided into those who would fold if attacked, allowing you to score above 2.5 per turn, those that would let themselves be outscored but would make sure you scored less than 2.5 per turn, and those that would not allow themselves to be outscored. The last group might or might not forgive an early attempt to attack them.

There would also be bad programs. People do dumb things. Someone might play all 2s, or pick numbers fully at random, or who knows what else.

As a list (attackers from here on means both AttackBot and BullyBot):

AttackBot. Attackers who don’t give up.

BullyBot. Attackers who give up.

CarefulBot. Cooperators who harshly punish attackers.

DefenseBot. Cooperators who don’t let you outscore them but don’t otherwise punish.

EquityBot. Cooperators who let you outscore them, but make sure you don’t benefit.

FoldBot. Cooperators who accept full unfavorable 3/2 splits.

GoofBot. Weird stuff.

My prior was we’d see all seven, with most looking to cooperate.

Was attacking a good strategy?

Attacking only works against FoldBots. When attacking fails, even DefenseBots might take a while to re-establish cooperation. CarefulBots could wipe you out. It was also impossible to know how long to keep attacking before concluding opponents weren’t going to fold.

With a pool of bots chosen by humans, attacking strategies (AttackBot or BullyBot) likely would fail hard in the opening.

The endgame was a different story. All GoofBots would be dead. Unless FoldBots fold too quickly to a BullyBot, in a given round they strictly outscore CarefulBots, DefenseBots and EquityBots. Each round, provided they exist, FoldBots would become a bigger portion of the cooperative pool. If you were an AttackBot or BullyBot, and survived long enough, you would kill off the CarefulBots, then the DefenseBots and finally the EquityBots as the FoldBots out-competed them, leaving a world of AttackBots, BullyBots and FoldBots. If all but one attacker was gone, the last attacker to survive would win if it cooperated efficiently against itself, since it would score above average each round. In theory a steady state could exist with multiple attackers keeping each other in check, but that isn’t stable since advantages in size snowball.

CarefulBots are strictly worse than DefenseBots, so those were out. GoofBots are terrible.

This meant there were five choices:

I could submit an AttackBot that cooperates with itself, and hope to survive into the endgame. I quickly dismissed this as unlikely to work.

I could submit a BullyBot that cooperates with itself, attacks but accepts an even split against stubborn opponents. But this rewards stubborn opponents while wiping out non-stubborn opponents in the mid-game, which means your endgame trump card stops working. I dismissed this as well.

DefenseBots don’t lose heads-up by non-tiny amounts, and punish anyone who tries to outscore them, wiping them out in the mid-game. But you score nothing against AttackBots in the opening, before you can shape the pool much. At best you take a smaller pool into the mid-game, where efficient cooperation with your own copies starts to snowball.

I saw the emotional appeal of DefenseBots, but using one didn’t made sense. Its defenses were too robust and expensive, and you still lose to a smart AttackBot heads-up if you’re outnumbered. I’d need to take more risk.

That was the problem with being a FoldBot. FoldBots feed attackers. You are free riding on the rest of the cooperative pool. You hope they kill attackers despite that. The problem is that if even one copy of an attacker survives, as you and other FoldBots grow strong, attacking becomes a better and better strategy. I decided this wasn’t worth that risk.

I would submit an EquityBot. I wouldn’t protect against them outscoring me. I would protect against them outscoring what cooperation would have gotten them. If at any point they wanted to split the remaining pie, I would accept. Even if they refused, I’d give them some points on a 3/2 split, so long as they were punished for it, and I wasn’t growing their portion of the pool.

This raised the threshold percentage of the pool I needed to win heads-up against an attacker, but with a size disadvantage I’d lose no matter what, and I’d still win if I had a sufficiently large size edge, which was more likely if I did better early on.

Too much folding and you strengthen someone who beats you. Too little and you fall behind letting others snowball.

I decided to alternate 3/2 even if my opponent was going 3/3. This said both ‘I’m not going to give up’ and ‘you are welcome to cooperate at any time,’ and still punished the opponent reasonably hard. After long enough I even risked throwing in a few more 2s.

I considered sending a signal to recognize myself, but realized there was no point. Better to start coordinating right away. I’d randomize my first turn to 2 or 3, and once my opponent didn’t match me I would alternate. I figured opponents would start 2 more often than 3, so I decided to do a 50/50 split to take advantage of that, coordinating faster and with a slight edge, at the expense of doing slightly worse against myself, but this was probably just a mistake and I should have done an uneven split (but not quite the fully maximizing-for-self-play ratio). However, in an endgame against a similar program, you can definitely get an edge by being slightly more willing to play 3s early than your opponent.

Opponents that wanted to cooperate would have a very easy time recognizing my offer and cooperating. That left special case logic.

If my opponent was alternating on the same schedule as me (somehow we started 2/3, but then we’d 2/2 then 3/3 then 2/2), then I’d play 2 twice in a row to break that up. Ideally, if the opponent was offering a different cycle that was fair, I’d match that (so if they went 1/4/1/4, I’d submit 4 next time, and if they did 1 I’d start alternating), but I didn’t expect such cases so I didn’t make that logic robust, as the professor had already thrown out part of a previous submission for being too complex, and I wanted to preserve the more important parts.

If my opponent was playing all 2s even after I started alternating, I put in logic to play all 3s. If they played even one 3, I’d back down permanently. I also put in logic against a few other bizarre simple bots (like all 1s, all 4s, seems to be completely random, etc) but didn’t worry about it too much since they’d be wiped out very quickly and complexity is bad.

If my opponent was playing all 3s without a starting signal, and kept it up long enough, that meant he’d defect against himself, which meant he couldn’t win an endgame, and also meant that he was highly unlikely to ever give up, so I’d eventually fold. If they were going to lose in the long run, better to get what I could. Letting them survive longer would only help me.

II

David took a different approach.

David knew about the class mailing list.

David assembled a large group. They agreed to submit 2-0-2 as their first three moves. If both sides sent the signal, they’d cooperate using a reasonable randomization system. If they didn’t get the signal back, they’d play all 3s. They’d be pure CliqueBots, cooperating with each other and defecting against everyone else. With a large enough group, they’d wipe out the other players and share the victory. David would win The Golden Shark and his guaranteed A+.

I would find out about the coalition after round one.

III

We were all set for game night. We had each chosen the logical output of our decision functions. The professor set up a website where we could see the game played out in real time over the course of several hours (due to a combination of that’s more fun and the game was slow to run), with a discussion board for him to offer observations and us to comment.

Next time I’ll reveal what happened on game night. Predictions are encouraged.

New to LessWrong?

New Comment
6 comments, sorted by Click to highlight new comments since: Today at 12:18 AM

Just wanted to express appreciation for the post being submitted in installments that encourage you to think.

The obvious choice in this environment is a Clique-y defensebot; send the clique signal and cooperate with them, but instead of being an attackbot otherwise be a defensebot.

Since you wouldn't use this logic against other cliquebots it would be hard for them to punish you without giving up the ability to cooperate with themselves. You'll outperform other cliquebots if the other cliquebots dominate by dominating sooner and riding off their punishments. In the mid game either cliquebots dominate with you slightly ahead and you win, or cliquebots die off but you've survived the early game and cooperate with them so you get the benefits of attackers in the pool without the costs.

If enough cliquebots defect like this, then I'm not sure what would happen, and it'll depend a lot on what the initial distribution is I guess. If there are very few cliquebots this one is also vulnerable to other attackbots, but I think enough other possibilities (cliquey equitybot or cliquey foldbot) make me strongly suspect that someone will win by defecting from the clique.

The outcome does seem to depend on proportions.

For simplicity, I assumed that there were 3 types of bots in the pool: cliquebots, fairbots with capitulation (who quickly reach a steady state of "all 2" against cliquebot), and fairbots with payback (who quickly reach a steady state of "all 3" against cliquebot). The fairbots rapidly reach mutual cooperation with themselves and with each other.

Depending on initial proportions, either the cliquebots eliminate everyone else, or the cliquebots get eliminated (while the fairbots who capitulate gain some share relative to the fairbots with payback). For example, with the numbers I used the cliquebots win if the initial distribution is 28% cliquebot, 31% capitulate, 41% payback (or 18% cliquebot, 46% capitulate, 36% payback) but lose if it's 28% cliquebot, 30% capitulate, 42% payback (or 18% cliquebot, 45% capitulate, 37% payback).

Actual outcome will depend on implementation details, plus the field of other bots, plus the details of the bots which do something in between capitulate and payback (including your EquityBot).

What is a DealBot?

Anyway, I don't quite get the benefit of an equity bot. "I wouldn’t protect against them outscoring me. I would protect against them outscoring what cooperation would have gotten them." - Is the idea to make bully bots back down, whilst not losing too many points against an attack bot, in the hope that the attack bots would be wiped out by the end by the other bots?

[-]PDV6y30

I thin the idea is simply not to feed the attackers enough for them to grow relative to the fair players, while also minimizing your losses so you don't fall behind.

[-]Zvi6y20

PDV is correct. The idea is to avoid rewarding the attacker and letting its share of the pool grow. In particular, in a heads-up fight, AttackBot vs. EquityBot depends on relative size, whereas AttackBot vs. FoldBot goes to AttackBot no matter the ratio.

DealBot is an old name for DefenseBot, I'll edit.