Why I'm Skeptical About Unproven Causes (And You Should Be Too)

Since living in Oxford, one of the centers of the "effective altruism" movement, I've been spending a lot of time discussing the classic “effective altruism” topic -- where it would be best to focus our time and money.

Some people here seem to think that the most important thing we should be focusing our time and money on are speculative projects, or projects that promise a very high impact, but involve a lot of uncertainty.  One such very common example is "existential risk reduction", or attempts to make a long-term far future for humans more likely, say by reducing the chance of things that would cause human extinction.

I do agree that the far future is the most important thing to consider, by far (see papers by Nick Bostrom and Nick Beckstead).  And I do think we can influence the far future.  I just don't think we can do it in a reliable way.  All we have are guesses about what the far future will be like and guesses about how we can affect it. All of these ideas are unproven, speculative projects, and I don't think they deserve the main focus of our funding.

While I waffled in cause indecision for a while, I'm now going to resume donating to GiveWell's top charities, except when I have an opportunity to use a donation to learn more about impact.  Why?  My case is that speculative causes, or any cause with high uncertainty (reducing nonhuman animal suffering, reducing existential risk, etc.) requires that we rely on our commonsense to evaluate them with naīve cost-effectiveness calculations, and this is (1) demonstrably unreliable with a bad track record, (2) plays right into common biases, and (3) doesn’t make sense based on how we ideally make decisions.  While it’s unclear what long-term impact a donation to a GiveWell top charity will have, the near-term benefit is quite clear and worth investing in.

 

Focusing on Speculative Causes Requires Unreliable Commonsense

How can we reduce the chance of human extinction? It just makes sense that if we fund cultural exchange programs between the US and China, there will be more goodwill for the other within each country, and therefore the countries will be less likely to nuke each other. Since nuclear war would likely be very bad, it's of high value to fund cultural exchange programs, right?

Let's try another. The Machine Intelligence Research Institute (MIRI) thinks that someday artificial intelligent agents will become better than humans at making AIs. At this point, AI will build a smarter AI which will build an even smarter AI, and -- FOOM! -- we have a superintelligence. It's important that this superintelligence be programmed to be benevolent, or things will likely be very bad. And we can stop this bad event by funding MIRI to write more papers about AI, right?

Or how about this one? It seems like there will be challenges in the far future that will be very daunting, and if humanity handles them wrong, things will be very bad. But if people were better educated and had more resources, surely they'd be better at handling those problems, whatever they may be. Therefore we should focus on speeding up economic development, right?

These three examples are very common appeals to commonsense.  But commonsense hasn't worked very well in the domain of finding optimal causes.

 

Can You Pick the Winning Social Program?

Benjamin Todd makes this point well in "Social Interventions Gone Wrong", where he provides a quiz with eight social programs and asks readers to guess whether they succeeded or failed.

I'll wait for you to take the quiz first... doo doo doo... la la la...

Ok, welcome back. I don't know how well you did, but success on this quiz is very rare, and this poses problems for commonsense.  Sure, I'll grant you that Scared Straight sounds pretty suspicious. But the Even Start Family Literacy Program? It just makes sense that providing education to boost literacy skills and promote parent-child literacy activities should boost literacy rates, right? Unfortunately, it was wrong. Wrong in a very counter-intuitive way. There wasn't an effect.  

 

GiveWell and Commonsense's Track Record of Failure

Commonsense actually has a track record of failure. GiveWell has been talking about this for ages.  Every time GiveWell has found an intervention hyped by commonsense notions of high-impact and they've looked at it further, they've ended up disappointed.

The first was the Fred Hollows Foundation. A lot of people had been repeating the figure that the Fred Hollows Foundation could cure blindness for $50. But GiveWell found that number suspect.

The second was VillageReach. GiveWell originally put them as their top charity and estimated them as saving a life for under $1000. But further investigation kept leading them to revise their estimate until ultimately they weren't even sure if VillageReach had an impact at all.

Third, there is deworming. Originally, deworming was announced as saving a year of healthy life (DALY) for every $3.41 spent. But when GiveWell dove into the spreadsheets that resulted in that number, they found five errors. When the dust settled, the $3.41 figure was found to actually be off by a factor of 100. It was revised to $326.43.

Why shouldn't we expect this trend to not be the case in other areas where calculations are even looser and numbers are even less settled, like efforts devoted to speculative causes? Our only recourse is to fall back on interventions that are actually studied.

 

People Are Notoriously Bad At Predicting the (Far) Future

Cost-effectiveness estimates also frequently require making predictions about the future. Existential risk reduction, for example, requires making predictions about what will happen in the far future, and how your actions are likely to effect events hundreds of years down the road. Yet, experts are notoriously bad at making these kinds of predictions.

James Shanteau found in "Competence in Experts: The Role of Task Characteristics" (see also Kahneman and Klein's "Conditions for Intuitive Expertise: A Failure to Disagree") that experts perform well when thinking about static stimuli, thinking about things, and when there is feedback and objective analysis available. Furthermore, experts perform pretty badly when thinking about dynamic stimuli, thinking about behavior, and feedback and objective analysis are unavailable.

Predictions about existential risk reduction and the far future are firmly in the second category. So how can we trust our predictions about our impact on the far future? Our only recourse is to fall back on interventions that we can reliably predict, until we get better at prediction (or invest money in getting better at making predictions).

 

Even Broad Effects Require Specific Attempts

One potential resolution to this problem is to argue for “broad effects” rather than “specific attempts”.  Perhaps it’s difficult to know whether a particular intervention will go well or mistaken to focus entirely on Friendly AI, but surely if we improved incentives and norms in academic work to better advance human knowledge (meta-research), improved education, or advocated for effective altruism, the far future would be much better equipped to handle threats.

I agree that these broad effects would make the far future better and I agree that it’s possible to implement these broad effects and change the far future.  The problem, however, is it can’t be done in an easy or well understood way.  Any attempt to implement a broad effect would require a specific action that has an unknown expectation of success and unknown cost-effectiveness.  It’s definitely beneficial to advocate for effective altruism, but could this be done in a cost-effective way?  A way that’s more cost-effective at producing welfare than AMF?  How would you know?

In order to accomplish these broad effects, you’d need specific organizations and interventions to channel your time and money into.  And by picking these specific organizations and interventions, you’re losing the advantage of broad effects and tying yourself to particular things with poorly understood impact and no track record to evaluate. 

 

Focusing on Speculative Causes Plays Into Our Biases

We've now known for quite a long time that people are not all that rational. Instead, human thinking fails in very predictable and systematic ways.  Some of these ways make us less likely to take speculative causes seriously, such as ambiguity aversion, the absurdity heuristic, scope neglect, and overconfidence bias.

But there’s also a different side of the coin, with biases that might make people think badly about existential risk:

Optimism bias. People generally think things will turn out better than they actually will. This could lead people to think that their projects will have a higher impact than they actually will, which would lead to higher estimates of cost-effectiveness than is reasonable.

Control bias. People like to think they have more control over things than they actually do. This plausibly also includes control over the far future. Therefore, people are probably biased into thinking they have more control over the far future than they actually do, leading to higher estimates of ability to influence the future than is reasonable.

"Wow factor" bias. People seem attracted to more impressive claims. Saving a life for $2500 through a malaria bed net seems much more boring compared to the chance of saving the entire world by averting a global catastrophe. Within the Effective Altruist / LessWrong community, existential risk reduction is cool and high status, whereas averting global poverty is not. This might lead to more endorsement of existential risk reduction than is reasonable.

Conjunction fallacy.  People have a problem assessing probability properly when there are many steps involved, each of which has a chance of not happening. Ten steps, each with an independent 90% success rate, has only a 35% chance of success.  Focusing on the far future seems to involve that a lot of largely independent events happen the way that is predicted. This would mean people are worse at estimating their chances of helping the far future, creating higher cost-effectiveness estimates than is reasonable.

Selection bias.  When trying to find trends in history that are favorable for affecting the far future, some examples can be provided.  However, this is because we usually hear about the interventions that end up working, whereas all the failed attempts to influence the far future are never heard of again.  This creates a very skewed sample that can negatively bias our thinking about our success of influencing the far future.

 

It’s concerning there are numerous biases both weighted in favor and weighted against speculative causes, and this means we must tread carefully when assessing their merits.  However, I would strongly expect biases to be even worse in favor of speculative causes rather than against them, because speculative causes lack the available feedback and objective evidence needed to help insulate against bias, whereas a focus on global health does not.

 

Focusing on Speculative Causes Uses Bad Decision Theory

Furthermore, not only is the case for speculative causes undermined by a bad track record and possible cognitive biases, but the underlying decision theory seems suspect in a way that's difficult to place.         

 

Would you play a lottery with no stated odds?

Imagine another thought experiment -- you're asked to play a lottery. You have to pay $2 to play, but you have a chance at winning $100. Do you play?

Of course, you don't know, because you're not given odds. Rationally, it makes sense to play any lottery where you expect to come out ahead more often than not. If the lottery is a coin flip, it makes sense to pay $2 to have a 50/50 shot to win $100, since you'd expect to win $50 on average, and come ahead $48 each time. With a sufficiently high reward, even a one in a million chance is worth it. Pay $2 for a 1/1M chance of winning $1B, and you'd expect to come out ahead by $998 each time.

But $2 for the chance to win $100, without knowing what the chance is? Even if you had some sort of bounds, like you knew the odds had to be at least 1/150 and at most 1/10, though you could be off by a little bit. Would you accept that bet?

Such a bet seems intuitively uninviting to me, yet this is the bet that speculative causes offer me.

 

"Conservative Orders of Magnitude" Arguments

In response to these considerations, I've seen people endorsing speculative causes look at their calculations and remark that even if their estimate were off by 1000x, or three orders of magnitude, they still would be on solid ground for high impact, and there's no way they're actually off by three orders of magnitude. However, Nate Silver's The Signal and the Noise: Why So Many Predictions Fail — but Some Don't offers a cautionary tale:

Moody’s, for instance, went through a period of making ad hoc adjustments to its model in which it increased the default probability assigned to AAA-rated securities by 50 percent. That might seem like a very prudent attitude: surely a 50 percent buffer will suffice to account for any slack in one’s assumptions? It might have been fine had the potential for error in their forecasts been linear and arithmetic. But leverage, or investments financed by debt, can make the error in a forecast compound many times over, and introduces the potential of highly geometric and nonlinear mistakes.

Moody’s 50 percent adjustment was like applying sunscreen and claiming it protected you from a nuclear meltdown—wholly inadequate to the scale of the problem. It wasn’t just a possibility that their estimates of default risk could be 50 percent too low: they might just as easily have underestimated it by 500 percent or 5,000 percent. In practice, defaults were two hundred times more likely than the ratings agencies claimed, meaning that their model was off by a mere 20,000 percent.

Silver points out that when estimating how safe mortgage backed securities were, the difference between assuming defaults are perfectly uncorrelated and defaults are perfectly correlated is a difference of 160,000x in your risk estimate -- or five orders of magnitude.

If these kinds of five-orders-of-magnitude errors are possible in a realm that has actual feedback and is moderately understood, how do we know the estimates for cost-effectiveness are safe for speculative causes that are poorly understood and offer no feedback?  Again, our only recourse is to fall back on interventions that we can reliably predict, until we get better at prediction.

 

Value of Information, Exploring, and Exploiting

Of course, there still is one important aspect of this problem that has not been discussed -- value of information -- or the idea that sometimes it’s worth doing something just to learn more about how the world works.  This is important in effective altruism too, where we focus specifically on “giving to learn”, or using our resources to figure out more about the impact of various causes.

I think this is actually really important and is not a victim to any of my previous arguments, because we’re not talking about impact, but rather learning value.  Perhaps one could look to an "explore-exploit model", or the idea that we achieve the best outcome when we spend a lot of time exploring first (learning more about how to achieve better outcomes) before exploiting (focusing resources on achieving the best outcome we can).  Therefore, whenever we have an opportunity to “explore” further or learn more about what causes have high impact, we should take it.

 

Learning in Practice

Unfortunately, in practice, I think these opportunities are very rare.  Many organizations that I think are “promising” and worth funding further to see what their impact looks like do not have sufficiently good self-measurement in place to actually assess their impact or sufficient transparency to provide that information, therefore making it difficult to actually learn from them.  And on the other side of things, many very promising opportunities to learn more are already fully funded.  One must be careful to ensure that it’s actually one’s marginal dollar that is getting marginal information.

 

The Typical Donor

Additionally, I don’t think the typical donor is in a very good position to assess where there is high value of information or have the time and knowledge to act upon this information once it is acquired.  I think there’s a good argument for people in the “effective altruist” movement to perhaps make small investments in EA organizations and encourage transparency and good measurement in their operations to see if they’re successfully doing what they claim (or potentially create an EA startup themselves to see if it would work, though this carries large risks of further splitting the resources of the movement).

But even that would take a very savvy and involved effective altruist to pull off.  Assessing the value of information on more massive investments like large-scale research or innovation efforts would be significantly more difficult, beyond the talent and resources of nearly all effective altruists, and are probably left to full-time foundations or subject-matter experts.

 

GiveWell’s Top Charities Also Have High Value of Information

As Luke Muehlhauser mentions in "Start Under the Streetlight, Then Push Into the Shadows", lots of lessons can be learned only by focusing on the easiest causes first, even if we have strong theoretical reasons to expect that they won’t end up being the highest impact causes once we have more complete knowledge.

We can use global health cost-effectiveness considerations as practice for slowly and carefully moving into the more complex and less understood domains.  There even are some very natural transitions, such as beginning to look at "flow through effects" of reducing disease in the third-world and beginning to look at how more esoteric things affect the disease burden, like climate change.  Therefore, even additional funding for GiveWell’s top charities has high value of information.  And notably, GiveWell is beginning this "push" through GiveWell Labs.

 

Conclusion

The bottom line is that sometimes things look too good to be true.  Therefore, I should expect that the actual impact of speculative causes that make large promises, upon a thorough investigation, will be much lower.

And this has been true in other domains. People are notoriously bad at estimating the effects of causes in both the developed world and developing world, and those are the causes that are near to us, provide us with feedback, and are easy to predict. Yet, from the Even Start Family Literacy Program to deworming estimates, our commonsense has failed us.

Add to that the fact that we should expect ourselves to perform even worse at predicting the far future. Add to that optimism bias, control bias, "wow factor" bias, and the conjunction fallacy, which make it difficult for us to think realistically about speculative causes. And then add to that considerations in decision theory, and whether we would bet on a lottery with no stated odds.

When all is said and done, I'm very skeptical of speculative projects.  Therefore, I think we should be focused on exploring and exploiting.  We should do whatever we can to fund projects aimed at learning more, when those are available, but be careful to make sure they actually have learning value.  And when exploring isn’t available, we should exploit what opportunities we have and fund proven interventions.

But don’t confuse these two concepts and fund causes intended for learning because of their actual impact value.  I’m skeptical about these causes actually being high impact, though I’m open to the idea that they might be and look forward to funding them in the future when they become better proven.     

-

Followed up in: "What Would It Take To 'Prove' A Skeptical Cause" and "Where I've Changed My Mind on My Approach to Speculative Causes".

This was also cross-posted to my blog and to effective-altruism.com.

I'd like to thank Nick Beckstead, Joey Savoie, Xio Kikauka, Carl Shulman, Ryan Carey,  Tom Ash, Pablo Stafforini, Eliezer Yudkowsky, and Ben Hoskin for providing feedback on this essay, even if some of them might strongly disagree with it's conclusion.

Comments

sorted by
magical algorithm
Highlighting new comments since Today at 4:23 AM
Select new highlight date
Rendering 50/102 comments  show more

This post had an odd effect on me. I agreed with almost everything in it, as it matches my own logic and intuitions. Then I realized that I strongly disliked the logic in your anti-meat post, because it appeared so severely biased toward a predefined conclusion "eating meat is ethically bad". So, given the common authorship, I must face the possibility that the quality of the two posts is not significantly different, and it's my personal biases which make me think that it is. As a result, I am now slightly more inclined to consider the anti-meat arguments seriously and slightly less inclined to agree with the arguments from this post, even though the foggy future and the lack of feedback arguments make a lot of sense.

EDIT: Hmm, whatever shall I do with 1 Eliezer point and 1 Luke point...

+1 for correct, evenhanded use of the genetic heuristic (what we call the genetic fallacy when we agree with its usage).

what we call the genetic fallacy when we agree with its usage

I was about to ask whether you actually meant to say “disagree”, then I noticed that English has an ambiguity with predicative nominals over the object in relative clauses I hadn't noticed before. :-/

I'm having trouble parsing the version with "agree" to anything simultaneously non-tautologous (i.e. when we use a name, we generally agree with our own usage) and reasonable; what reading did you notice?

My first reading: ‘We call the genetic heuristic “the genetic fallacy” when we agree with its usage.’

The intended reading: ‘We call the genetic fallacy “the genetic heuristic” when we agree with its usage.’

+1 for sharing your cognitive process in detail.

Existential risk reduction is cool and high status, whereas averting global poverty is not.

What?? If this is true, please pass along the message to the Gates Foundation, the United Nations, the World Economic Forum, and... almost everyone else on the planet.

Yes, I was going to say... How can one possibly argue that certain speculative causes are too popular and this is because they play into common cognitive biases when the examples are the fringest of the fringe and funded approximately not at all?

Let's try another. The Machine Intelligence Research Institute (MIRI) thinks that someday artificial intelligent agents will become better than humans at making AIs. At this point, AI will build a smarter AI which will build an even smarter AI, and -- FOOM! -- we have a superintelligence. It's important that this superintelligence be programmed to be benevolent, or things will likely be very bad. And we can stop this bad event by funding MIRI to write more papers about AI, right?

Or how about this one? It seems like there will be challenges in the far future that will be very daunting, and if humanity handles them wrong, things will be very bad. But if people were better educated and had more resources, surely they'd be better at handling those problems, whatever they may be. Therefore we should focus on speeding up economic development, right?

These three examples are very common appeals to commonsense. But commonsense hasn't worked very well in the domain of finding optimal causes.

I wish I lived on a planet where these were 'very common appeals to commonsense'. I wonder how much a ticket there would cost?

I think it might be more for a select group of people. In the LW community, I have gotten the impression that existential risk is higher status than global poverty reduction - that's definitely the opinion of the high status people in this community. And maybe for the specific kind of nonconformist nerd who reads Less Wrong and is likely to come across this post, transhumanism and existential risk reduction has a "coolness factor" that global poverty reduction doesn't have.

You're definitely right about the wider world, but many people might only care about the opinions of the 100 or so members of their in-group.

This. Status matters within one's in-group or a group one wants to be accepted as an in-group member by.

I feel like you're just sneering at a very small point I made rather than actually engaging with it.

What I meant to say was (1) x-risk reduction is cooler and higher status in the effective altruist / LessWrong community and (2) this biases people at least a little bit. I'll edit the essay to reflect that.

Would you agree with (1)? What about (2)?

If you meant to say x-risk reduction is high-status in the EA/LW community, then yes, that makes a lot more sense than what you originally said.

But I'm not actually sure how true this is in the broader EA community. E.g. GiveWell and Peter Singer are two huge players in the EA community, each with larger communities than LW (by my estimate), and they haven't publicly advocated x-risk reduction. So my guess is that x-risk reduction is basically just high status in the LW/MIRI/FHI world, and maybe around CEA as well due to their closeness to FHI. To the extent that x-risk reduction is high-status in that world, we should expect a bias toward x-risk reduction, but that's a pretty small world. There's a much larger and more wealthy world outside that group which is strongly biased against caring about x-risk reduction, and for this and other reasons we should expect on net for Earth to pay way, way less attention to x-risk than is warranted.

GiveWell and Peter Singer are two huge players in the EA community, each with larger communities than LW (by my estimate), and they haven't publicly advocated x-risk reduction.

GiveWell is doing shallow analyses of catastrophic risks, and Peter Singer has written favorably on reducing x-risk, although not endorsing particular charities or interventions, and it's not a regular theme in his presentations.

A few comments and requests for clarification:

  1. Are you only talking about donations? Do you think it would also be a mistake to work on speculative causes? (That seems much different in that there are many more opportunities for learning from working on a cause than from donating to it.) I think you are on much stronger ground with claims about donations, though I think it can make sense for other people with the right kind of information to fund other opportunities. E.g., think about early GiveWell. I wouldn't want to buy into something which said giving to early GiveWell was a bad idea, even for people who had a lot of information about the project. I think some people may be in a similar position for funding some early EA orgs, and they shouldn't be discouraged from funding them.

  2. What counts as a "speculative cause"? Meta-research? Political advocacy? Education for talented youth? Climate change? Prioritization work? Funding early GiveWell? Funding 80K? Anyone who says the thing they are doing is somehow improving very long-run outcomes? Anything that hasn't been recommended by GiveWell? Anything that hasn't been proven to work with RCT-quality evidence? Asking for a definition is probably unfair, but maybe a few illustrative examples of non-speculative causes and speculative causes would be helpful.

  3. If what you care about is the far future, why does AMF get to count as a "non-speculative" giving opportunity? We have very little idea how much AMF improves very long-run outcomes in comparison with other things, little effort has gone into studying this, and it seems many of your arguments that we are bad at understanding long-run effects apply.

On this last point, I think it may make sense to say something like, "We are very bad at saying how much various changes (saving a life in the US, saving a life in Africa, boosting GDP in the US by X, boosting GDP in an African country by X, etc.) improve long-run outcomes. But we know that AMF stands out for creating these changes in a proven, cost-effective way. And I think the intermediate changes AMF is making are good for long-run outcomes, and not clearly less important than other intermediate changes we could be making. So that makes it the best donation target even by far future standards." I just think this is a highly tentative position, and one that could easily change with further thought and research, and should be acknowledged as such.

Yeah, #3 is my own biggest question for the OP. If you care about the far future, then it seems like the case for MIRI+FHI's positive effect on the far future has been made more robustly than the case for AMF's effect on the far future has been made, even though it's still true that in general we are pretty ignorant about how to affect the far future in positive ways.

Are you only talking about donations? Do you think it would also be a mistake to work on speculative causes? (That seems much different in that there are many more opportunities for learning from working on a cause than from donating to it.)

I agree with that point. I'm not talking only about donations per se, though that's a much more important consideration for the "typical person". I think in so far as one should be purchasing either impact through proven causes or value of information through speculative causes, value of information is better purchased by work rather than money.

~

I think it can make sense for other people with the right kind of information to fund other opportunities. E.g., think about early GiveWell.

I agree. This is why I want to see certain "promising" causes funded, provided we will get to see if they succeed or fail.

~

What counts as a "speculative cause"? Meta-research? Political advocacy? Education for talented youth? Climate change? Prioritization work? Funding early GiveWell? Funding 80K? Anyone who says the thing they are doing is somehow improving very long-run outcomes? Anything that hasn't been recommended by GiveWell? Anything that hasn't been proven to work with RCT-quality evidence?

I'd say a "speculative cause" is any cause who's impact has not been demonstrated with empirical evidence of sufficient quality. It doesn't have to be an RCT. For example, vegetarianism advocacy could demonstrate impact, in my opinion, with just a few studies with actual control groups run by independent people that still show the conversion rate of ~2%.

I think Giving What We Can or 80K could sufficiently demonstrate their impact with just slightly better member surveys and slightly more robust analyses.

For another example, I think GiveWell itself has proven itself to be a good cause (albeit not one with more funding) through their documentation of money moved and track record of good research and there aren't any RCTs done on GiveWell.

~

If what you care about is the far future, why does AMF get to count as a "non-speculative" giving opportunity? We have very little idea how much AMF improves very long-run outcomes in comparison with other things, little effort has gone into studying this, and it seems many of your arguments that we are bad at understanding long-run effects apply.

I think this is a key thing to clear up. AMF is non-speculative in terms of producing near-term impact (or impact generically), but it is still speculative in terms of long-term impact. I think if you want to purchase direct impact (instead of information), you should be doing so through a proven cause like AMF.

Your last point doesn't make much sense to me. I agree that we should be concerned about purchasing as much impact as we can, but the amount of impact you're purchasing from AMF is minuscule compared to the far future. It seems like your concern for something being 'proven' is skewing your choices.

It's like walking into a shop and buying the first TV (or whatever) that you see, despite it likely being expensive and not nearly as good as other ones, because you can at least see it, rather than doing a bit of looking on Amazon.

I'd reject your analogy.

If you look for TVs among Amazon, they give you specific prices and you can reliably buy them. Now imagine you walked into a TV shop and saw some that were a good deal (say $100), but others where the price was set randomly?

I don't think we can make a case that we're actually purchasing some known, larger-than-AMF amount of impact from far future work, because so much far future work is likely to not actually do well at affecting the far future. I just don't think we have that much control over how the future unfolds.

About donating vs. working: I don't agree that the two are fundamentally different. Basically what I'm saying is time is money and money is time. I think there is a conversion rate between the two and that rate is certainly not static. But you have to consider the counterfactual, what you could be doing if you didn't spend your time or money on a cause. For example if you are a very good cost-effectiveness researcher your time may be worth 2 or 3 times as much as the money you could make earning to give. But that money could pay for at least one additional researcher. Similarly, if you spend your time writing web posts that time could be spent working a student job or better, investing in your career which will yield money in the long run. Somehow you can always convert the two.

Although we can't quantify the conversion rate I think it exists. And generally I would expect it to be around 1, maybe between 0,33 and 3 in some cases. That is sometimes significant, but with the different causes we're talking about we don't expect the impact to be even in the same order of magnitude. So for example it wouldn't make sense to say working on MIRI is a good idea, but donating to them is not a good idea.

So if I say I would not donate to an organisation it makes sense to ask if I would not work for that organisation either. Money and time are somehow convertible into each other. So I wouldn't agree that value of information is always better purchased by time than by money. And if it is, the difference may not be all that great.

And a question: Maybe this has been answered somewhere, but I think it would be useful if you straightforwardly said which causes you consider speculative. Makes this quite vague discussion a little less vague hopefully. So say 80k, EAA, GWWC, MIRI, FHI, CFAR, Effective Fundraising, Animal Ethics, all x-risk related, all research related?? This would really help me. I think the issue you raised with your post is very important to discuss and get right ;)

Would you play a lottery with no stated odds?

Imagine another thought experiment -- you're asked to play a lottery. You have to pay $2 to play, but you have a chance at winning $100. Do you play?

Of course, you don't know, because you're not given odds. Rationally, it makes sense to play any lottery where you expect to come out ahead more often than not. If the lottery is a coin flip, it makes sense to pay $2 to have a 50/50 shot to win $100, since you'd expect to win $50 on average, and come ahead $48 each time. With a sufficiently high reward, even a one in a million chance is worth it. Pay $2 for a 1/1M chance of winning $1B, and you'd expect to come out ahead by $998 each time.

But $2 for the chance to win $100, without knowing what the chance is? Even if you had some sort of bounds, like you knew the odds had to be at least 1/150 and at most 1/10, though you could be off by a little bit. Would you accept that bet?

Such a bet seems intuitively uninviting to me, yet this is the bet that speculative causes offer me.

The reason not to play a lottery is because it is a zero-sum game in which the rules are set by the other agent; since you know that the other player's goal is to make a profit, you should expect the rules to be set up to ensure that you lose money. Obviously, reality is not playing a zero-sum game with humanity; if one chooses a different expected payout structure- say, you have no idea what the specific odds are, but you know that your crazy uncle Bill Gates is giving away potentially all his money to family members in a lottery with 2$ tickets- then obviously it makes sense to play.

These were my thoughts when I read this.

A better analogy might be buying stock in a technology startup which is making a product completely unlike anything on the market now. It is certainly more risky than the sure thing, with lots of potential for losing your investment, but also has a much much higher potential payoff. This is generally the case in any sort of investing, whether it be investing in a charity or in a business -- the higher the risk, the higher the potential gain. The sure stuff generally has plenty of funding already -- the low hanging fruit has already been taken.

That being said, one should be on the lookout for good investing opportunities of both kinds -- charging more (in terms of expected payoff) for the riskier ones but not shunning either completely.

I agree with the most of the points points here. Short-term cost-effective charities are often more worthy of donation than speculative long-term high-uncertainty ones. I would prefer donating to GiveWell's top charities to funding US/China exchange programs.

Yet I still donate to MIRI. Why? Because it's a completely different beast.

I don't view MIRI as addressing far-future concerns. I view MIRI as addressing one very specific problem: we are on the path to AI, and we seem to be a lot closer to developing AI than we are to developing a perfect reflective preservable logical encoding of all human values.

There's a timer. That timer is getting uncomfortably low. And when it gets to zero, there's not a lot of death and a bad economy -- there's an extinction event.

If we had good reason to believe that the US and China will cross a threshold this century causing them to either blow up the world or collaborate and travel to the stars, based solely on the sentiment of each population towards the other, then you're damn right I'd fund exchange programs.

We don't have any evidence along those lines. There are a plethora of potential political catastrophes and uncountable factors that could cause them. A China/US nuclear war would be very bad, but it's one small possibility in a sea of many. It's far future, and it's very hard to predict what will help and what won't.

MIRI isn't trying to vaguely prod the unknown future into a state that's maybe better. It's racing against a timer that's known to be ticking. This argument hinges on a belief that we'll get strong AI in the next 30-150 years, that human value is complex and fragile, and that decision theory is nowhere near ready -- all arguments that have been made to my satisfaction.

Strong AI this century is a moderately likely probability, and if we don't have the right decision theory by the time it gets here then humanity loses. End of story, game over, no savepoints.

MIRI isn't hoping that it will nudge us towards a better future. It isn't trying to tweak factors that just maybe might push us towards a better future. Rather, MIRI is addressing a single-point-of-failure with an expiration date. There's a good likelihood that the fate of humanity will hinge upon the existence of this one bit of mathematics.

Yours is the kind of response I have to posts like the OP and also Holden's "Empowerment and Catastrophic Risk," though I wouldn't place so much specific emphasis on e.g. decision theory.

It is important to analyze general features of the world and build up many outside views, but one must also exercise the Be Specific skill. If several outside views tell me to eat Osha Thai, but then I snack on some Synsepalum dulcificum and I know it makes sour food taste sweet, then I should update heavily against the results of my original model combination, even if the Osha Thai recommendation was a robust result of 20+ models under model combination. Similarly, even if you have a very strong outside view that your lottery ticket is not the winner, a simple observation that the number on your ticket matches the announced winner on live TV should allow you to update all the way to belief that you won.

To consider a case somewhat more analogous to risks and AI, there were lots of outside views in 1938 suggesting that one shouldn't invest billions in an unprecedented technology that would increase our bombing power by several orders of magnitude, based on then-theoretical physics. Definitely an "unproven cause." And yet there were strong reasons to suggest it would be possible, and could be a determining factor in WWII, even though lots of the initial research would end up being on the wrong path and so on.

Also see Eliezer's comment here, and its supporting post here.

Benjamin Todd makes this point well in "Social Interventions Gone Wrong", where he provides a quiz with eight social programs and asks readers to guess whether they succeeded or failed.

Were the social interventions sampled randomly, or were they chosen for their counterintuitive outcomes?

Anyway, maybe if one wants to get good at reducing existential risk, the first thing to do is to start using PredictionBook and continue until one is good enough to have a reliable track record, then proceed from there.

Were the social interventions sampled randomly, or were they chosen for their counterintuitive outcomes?

They were chosen, but I'm not sure how one could even sample randomly - where would one look, how would one compile a list, based on what criteria? Most programs haven't been studied at all in any kind of rigorous fashion.

The best I can say is that Rossi, well-respected in the area of social interventions, takes an extremely pessimistic overall view on the effectiveness of interventions.

James Shanteau found in "Competence in Experts: The Role of Task Characteristics"...

Good to see that paper being given an airing. But one important thing that must be done is to decompose the problems we're working on: some results may be more solid that others. I've shown that using expert opinion to establish AI timelines is nearly worthless. However you can still get some results about the properties of AIs (see for instance Omahundro's AI-drives paper), and these are far more solid (for one, they depend much more on arguments than on expertise). So we're in the situation of having no clue when and how AIs could emerge, but being fairly confident that there's a high risk if they do.

Compare for instance the economics of the iPhone. We failed to predict the iPhone ahead of time (continually predicting that these kinds of thing were just around the corner or in the far future), but the iPhone didn't escape the laws of economics and copying and competition. We can often say something about things, even if we must fail to say everything.

"Conservative Orders of Magnitude" Arguments

I wanted to highlight this one, because as someone who isn't an expert in any of these fields it's an easy one for me to fall for.

My rule of thumb is whenever I find myself applying a fudge-factor of N to a system "to be safe," I should ask myself "why not 10N instead? why not 100N?"

If my only answer is incredulity, I should immediately halt and dump core; my processing has become corrupted.

"Wow factor" bias.

That's worth keeping in mind - that's certainly what pushed me into working for the FHI.

How do we choose between different proposals for "exploration"? For example the workshops that MIRI is currently hosting on a regular basis seem to be designed largely to learn how to most efficiently produce FAI-related research as well as how productive such research efforts can currently be. On the other hand, I suggest that at this stage we should devote more resources into what I call "Singularity Strategies" research, to better understand for example whether pushing for "FAI first" has positive or negative expected impact. I think both of these activities can plausibly be called "exploration". Peter, do you agree and if so, have you thought about how to choose between them?

(BTW, I'm planning to attend one of the MIRI workshops in September, which is mainly due to my intellectual curiosity about the topic and not meant to indicate that I endorse MIRI's approach as far as getting the best long term outcome.)

Peter, do you agree and if so, have you thought about how to choose between them?

I know absolutely nothing about these workshops or the details about how one ideally would get better at learning about FAI, so I wouldn't know the answer.

Selection bias. When trying to find trends in history that are favorable for affecting the far future, some examples can be provided. However, this is because we usually hear about the interventions that end up working, whereas all the failed attempts to influence the far future are never heard of again. This creates a very skewed sample that can negatively bias our thinking about our success of influencing the far future.

When I was talking about trends in history, I was saying that certain factors could be identified which would systematically lead to better outcomes rather than worse outcomes if those factors were in place to a greater extent when humanity faced future challenges and opportunities. (Note that I did not say I knew of specific, ready-to-fund interventions for making these factors be in place to a greater extent when humanity faces future challenges and opportunities. We may be talking past each other to some extent since you are talking about where to give now and I am mostly talking about where to look for opportunities later.)

I don't think what you've said here effectively addresses this claim, and I don't think there is selection bias pushing this claim. Consider a list of challenges lukeprog gave elsewhere:

nuclear weapons, climate change, recombinant DNA, nanotechnology, chloroflourocarbons, asteroids, cyberterrorism, Spanish flu, the 2008 financial crisis, and large wars.

Now consider the things I said in this talk would help people meet future challenges better: improved coordination between key actors, improved information access, improved motives, and improved individual capabilities (higher intelligence and technology). (I'm sorry for the vague terms but we don't have great frameworks for this right now.) Now ask: if people had more of these things when they faced aspects of these challenges which we've dealt with so far, would that be expected to lead to better or worse outcomes? I think it is clear that, in each case, it would be more likely to lead to better outcomes than to worse outcomes. Maybe you can think of cases where these factors make us deal with challenges worse, but that is not the typical case.

We may be talking past each other to some extent since you are talking about where to give now and I am mostly talking about where to look for opportunities later.

That sounds pretty plausible. But the "what are you actually going to do to make these broad things happen?" question is an important one. These things -- systematically making the population smarter, more coordinated, more benevolent, etc. -- are hella hard to pull off.

~

Now consider the things I said in this talk would help people meet future challenges better: improved coordination between key actors, improved information access, improved motives, and improved individual capabilities (higher intelligence and technology).

I agree that these things will generally make the future go better, but they might be too broad.

Take the example of "higher intelligence". This raises the question -- intelligence in what? Better English literature skills certainly won't help us deal with x-risks. It seems quite plausible that a particular x-risk we're dealing with will require a pretty particular set of skills, to which most intelligence amplification will not have been helpful. ...Perhaps you could argue that we need a diversified portfolio of education because we can't know what x-risk we'll be hit with, though.

Question: Suppose MIRI was like one of the 8 charities listed above (i.e. intuitively plausible, but empirically useless). How would we know? How would this MIRI' be different from MIRI today?

I think this question is too vague. MIRI could turn out to be useless for any number of reasons, leading to different empirical disconfirmations. (A lot of these will look like the end of human life.) E.g.:

  1. MIRI is useless because FAI research is very useful, but MIRI's basic methodology or research orientation is completely and irredeemably the wrong approach to FAI. Expected evidence: MIRI's research starts seeing diminishing returns, even as they attempt a wide variety of strategies; non-MIRI researchers make surprising amounts of progress into the issues MIRI is interested in; reputable third parties that assess MIRI's results consistently disagree with its foundational assumptions or methodology; the researchers MIRI attracts come to be increasingly seen by the academic establishment as irrelevant to FAI research or even as cranks, for substantive, mathy reasons.

  2. MIRI is useless because FAI is impossible -- we simply lack the resources to engineer a benign singularity, no matter how hard we try. Expected evidence: Demonstrations that a self-modifying AGI can't have stable, predictable values; demonstrations that coding anything like indirect normativity is unfeasible; increasingly slow progress, e.g., due to fixed hardware limitations.

  3. MIRI is useless because some defeater will inevitably kill us all (or otherwise halt technological progress) before we have time to produce self-enhancing AGI. (Alternatively: UFAI is completely impossible to prevent, by any means.) Expected evidence: Discovery of some weapon (e.g., a supervirus or bomb) that can reliably kill everyone and has widely known and easy-to-follow engineering specifications.

  4. MIRI is useless because some defeater will kill us all before we build an AGI, with a probability high enough to suck all the expected utility out of FAI research. I.e., our survival until the intelligence explosion singularity is a possibility, but not a large enough possibility to make the value of surviving past that sieve worth paying attention to. Expected evidence: Discovery that near-term existential threats are vastly more threatening than currently thought.

  5. MIRI is useless because FAI is inevitable. Expected evidence: Black-swan disproof of fragility-of-value and complexity-of-value, revolutionizing human psychology.

Note that these scenarios not only would be indicated by different evidence, but also call for very different responses.

  • If 1 is true: Instead of funding MIRI, we should fund some radically different FAI research program.
  • If 2 is true: Instead of funding MIRI, we should pour all our resources into permanently stopping AI research, by any available means.
  • If 3 is true: Instead of funding MIRI, we should just enjoy what little time we have left before the apocalypse.
  • If 4 is true: Instead of funding MIRI, we should combat other existential risks, or act as in 3.
  • If 5 is true: Instead of funding MIRI, we should speed along AI research and fund cryonic and life-extension technologies in the hopes of surviving into the intelligence explosion.

I don't think it's good practice to mix in "wow factor" bias into that list. That list is mostly made up of terms drawn from the psychology literature of empirically demonstrated deviations from rational behaviour that are predicted my some mathematical model, but for that one, this article is the top hit and no formal meaning has been assigned to it, never mind empirically demonstrated.

You're right that there's no identifiable literature on the "wow factor" bias (I made up the name myself), but I do think it's plausible that people are skewed toward seeking out opportunities that are flashy and high status. I could very well be wrong, but I have a lot of personal experience that this is the case. Do you disagree?

If existential risks are hugely important but we suck at predicting them, why not invest in schemes which improve our predictions?