Looking into AI Risk

June 26th, 2017
airisk, ea
In considering what to work on, I wrote that recently many people I respect have started to think that AI risk is the most valuable place to focus EA efforts. For example, 80,000 Hours ranks it first on their "list of global issues", taking into account "scale, neglectedness, and solvability". On the other hand, I have a lot of friends working in machine learning, and none of them think AI risk is worth working on now. This level of disagreement is very strange, and kind of worrying.

What I'm planning to spend the next few days on is getting a better understanding of where this difference comes from. I think I'm in a good position to do this: I'm close to both groups, have some technical background as a programmer, and have some time. I see two ways this could go:

  • If, after looking into it more, I still think AI risk is not a valuable place to be working, I may be able to convince others of this. Since 80,000 Hours and other EAs are currently suggesting a lot of people go into this field, if it turns out we're overvaluing it then those people could work on other things.

  • If I change my mind and start thinking AI risk is something we should be working on, I may convince some of my friends in machine learning. It's also likely that something in this direction would be close enough to my skills to be a good career fit, in which case I should consider working on it myself.

Of course it's also possible that I won't get to the root of the disagreement, or that I won't convince anyone except myself, but I do think it's worth trying.

Rough plan: read a bunch of stuff to get background, talk to a lot of people, write things up. Things I'm planning to read:

The list above is entirely people who think AI risk should be prioritized, aside from the Ceglowski post at the end, so I'm especially interested to read (if they exist) pieces where machine learning experts talk about why they don't think AI risk is a high priority. I'm also interested in other general AI risk background reading, and suggestions of people to talk to.
