Superintelligence Risk Project

July 3rd, 2017
airisk, ea
I've decided to make a larger project out of looking into AI risk. I don't think we really know why there's such a disconnect between people who think we should strongly prioritize it and the mainstream ML perspective that it's not useful to work on (at least not currently). I've applied for an EA grant, and am thinking I'll spend about a month on this.

Here's where I currently am:

  • I've read nearly all of the reading people suggested (by number of words) or about two thirds of it (by individual pieces). This is mostly an effect of Superintelligence being very long.

  • I've had conversations with one person in each camp, have a few more scheduled, and am working on lining up more.

Here are some very preliminary thoughts on where I think the disagreement might be:

  • How likely is it that current approaches are all we need for AGI, with relatively straightforward extensions and a lot of scaling?

  • How valuable is it to work on solving problems that are probably not the right ones? For example, even if we think AGI will not look like current systems, might trying to solve the control problem for current systems teach us enough about the underlying problem and how to do this kind of work that we'll be in a better position once we see more what AGI will actually look like?

  • How useful is it to have a strong theoretical foundation, vs just understanding the technology enough from an engineering perspective that we can make it do things for us?

  • How similar is this to normal engineering? How much should we expect companies' desires that their AI systems do what they want them to do to work out?

  • As we get closer to AGI, how likely is the ML community to take superintelligence risk seriously? Is it just that they don't think it can be productively worked on now or do they not think it will ever be a real problem?

Referenced in:

Comment via: google plus, facebook, substack

Recent posts on blogs I like:

Tuberculosis Considered As Dating Strategy

Against some evopsych

via Thing of Things July 8, 2025

Retrospective on life tracking and effectiveness systems

I’ve been doing life tracking for around 10 years, and this post is looking back at some things I learned from the data (since my previous retrospective in 2017). Highlights include what I get out of the Oura ring, correlations between sleep and deep work…

via Victoria Krakovna July 4, 2025

Elixir's Last Dance

On May 18th, the contra dance band Elixir had their last gig ever. The dance was packed: there were three hundred people. It was the only dance BIDA has ever done where they sold tickets. People flew from across the country just to hear Elixir play one la…

via Lily Wise's Blog Posts June 5, 2025

more     (via openring)