aelydens

Free Resources for Understanding AI Safety


My time at the Recurse Center has sparked a deep interest in AI safety. I believe that how we handle both the opportunities and risks of AI systems will largely shape our future, and I want to share resources that can help myself and others learn more about the field of AI safety.

Below you’ll find materials I’ve personally used and recommendations I’ve gathered along the way. This is an evolving list that I will edit over time. Consider this post a roadmap for building foundational knowledge in AI Safety, an area that has captured my curiosity and where I hope to work in the future.

This roadmap is organized into two main parts:

  1. Core Path: Essential prerequisites, fundamentals, and must-read papers for understanding AI safety. My goal with this section is to identify core pre-requisites for an engineer interested in understanding and working in AI safety.
  2. Expansion Areas: Additional resources for diving deeper into specific areas of interest. Once you’ve completed the core section, you will be able to go further with the resources in this area.

Since I’m creating the roadmap for myself, this assumes a software engineering background, but does not assume any previous AI knowledge. All resources are free.

Core Path

1. Technical Foundation

1a. Mathematics

1b. Programming

1c. Machine Learning Basics

2. Essential AI Safety Material

2a. Must-Read Papers

2b. Key Lectures

Congratulations! You have completed the Core section of the roadmap. Now you get to hone in on areas that are interesting to you within the field of AI safety.

Expansion Areas

AI Safety is an extremely broad field, and there are many areas to explore. The following topics have been borrowed from the Wikipedia entry on Research areas in AI safety as an initial stab at categorizing interest areas. I hope to add specific papers and resources here for each focus area as I go.

Additional Resources

AI safety classes and long-form lectures

Shorter videos and podcasts

Hands-on ML courses

Andrew Ng’s coursera courses, like the Deep Learning specialization, have also been recommended to me. I’m not including them here as they are not free as of this blog post’s publication date.

Organizations

Newsletters, Communities and other references

Published on: 2024-02-10