Sequences

Linguistic Freedom: Map and Territory Revisited
INVESTIGATIONS INTO INFINITY

Comments

I believe this is likely a smaller model rather than a bigger model, so I wouldn't take this as evidence that gains from scaling have plateaued.

Answer by Chris_Leong

Developing skills related to AI puts you in a better position to make AI go well. At least for me, this outweighs the other concerns that you've mentioned.

Note: This doesn't mean that you should take a job that advances fundamental AI capabilities. That would probably be net-negative, as things are already moving far too fast for society to adapt. But it sounds like you're considering jobs related to AI applications instead, so I'd say go for it.

You mention that society may do too little of the safer types of RL. Can you clarify what you mean by this?

This fails to account for one very important psychological fact: the population of startup founders who get a company off the ground is very heavily biased toward people who strongly believe in their ability to succeed. So it'll take quite a while for "it'll be hard to make money" to flow through and slow down training. And, in the meantime, it'll accelerate things by pushing companies to stay ahead.

I've heard people suggest that they have arguments related to RL being particularly dangerous, although I have to admit that I'm struggling to find these arguments at the moment. I don't know, perhaps that helps clarify why I've framed the question the way that I've framed it?

I think it's still valid to ask in the abstract whether RL is a particularly dangerous approach to training an AI system.

Oh, this is a fascinating perspective.

So most uses of RL already apply only a small amount of RL.

So if the goal was "only use a little bit of RL", that's already happening.

Hmm... I still wonder if using even less RL would be safer still.

  1. "LLMs are self limiting": I strongly disagree with LLM's being limited point. If you follow ML discussion online, you'll see that people are constantly finding new ways to draw extra performance out of these models and that it's happening so fast it's almost impossible to keep up. Many of these will only provide small boosts or be exclusive with other techniques, but at least some of these will be scalable.
  2. "LLMs are decent at human values": I agree on your second point. We used to be worried that we'd tell an AI to get coffee and that it would push a kid out of the way. That doesn't seem to be very likely to be an issue these days.
  3. "Playing human roles is pretty human": This is a reasonable point. It seems easier to get an AI that is role-playing a human to actually act human than an AI that is completely alien.

"Under the current version of the interactive model, its median prediction is just two decades earlier than that from Cotra’s forecast."


Just?

There's a lot of overlap between alignment researchers and the EA community, so I'm wondering how that was handled.

It feels like it would be hard to find a good way of handling it: if you include everyone who indicated an affiliation with EA on the alignment survey, it'd tilt the survey towards alignment people; if you exclude them, it seems likely it'd tilt the survey away from alignment people, since people are unlikely to fill in both surveys.

Regarding the support for various cause areas, I'm pretty sure you'll find that support for AI Safety/Long-Termism/X-risk is higher among those most involved in EA than among those least involved. Part of this may be because of the number of jobs available in this cause area.
