[#Script #Coding] Machine Learning Safety – Full Course from the Center for AI Safety

Posted on Wednesday August 2, 2023 by Eric Brooks

By freeCodeCamp.org
Published: Aug 02, 2023

“
ML systems are rapidly increasing in size, are acquiring new capabilities, and are increasingly deployed in high-stakes settings. As with other powerful technologies, safety for ML should be a leading research priority. In this course we’ll discuss how researchers can shape the process that will lead to strong AI systems and steer that process in a safer direction. We’ll cover various technical topics to reduce existential risks (X-Risks) from strong AI, namely withstanding hazards (“Robustness”), identifying hazards (“Monitoring”), reducing inherent ML system hazards (“Alignment”), and reducing systemic hazards (“Systemic Safety”). At the end, we will zoom out and discuss additional abstract existential hazards and discuss how to increase safety without unintended side effects.

✏️ See course.mlsafety.org for more.

Contents
(0:00:00) Introduction
(0:11:09) Deep Learning Review
(0:52:41) Risk Decomposition
(1:06:57) Accident Models
(1:39:22) Black Swans
(1:58:45) Adversarial Robustness
(2:29:40) Black Swan Robustness
(2:52:56) Anomaly Detection
(3:35:32) Interpretable Uncertainty
(3:59:09) Transparency
(4:12:22) Trojans
(4:22:52) Detecting Emergent Behavior
(4:43:07) Honest Models
(5:00:06) Machine Ethics
(5:52:08) ML for Improved Decision-Making
(6:04:40) ML for Cyberdefense
(6:25:00) Cooperative AI
(6:58:33) X-Risk Overview
(7:05:23) Possible Existential Hazards
(7:13:16) AI and Evolution
(8:03:08) Safety-Capabilities Balance
(8:21:07) Review and Conclusion

🎉 Thanks to our Champion and Sponsor supporters:
👾 davthecoder
👾 jedi-or-sith
👾 南宮千影
👾 Agustín Kussrow
👾 Nattira Maneerat
👾 Heather Wcislo
👾 Serhiy Kalinets
👾 Justin Hual
👾 Otis Morgan

—

Learn to code for free and get a developer job: https://www.freecodecamp.org

Read hundreds of articles on programming: https://freecodecamp.org/news

”

EricBrooks.Com® is licensed under a Creative Commons License.

Disclaimer: The views expressed herein are solely those of Eric Brooks. They do not necessarily reflect those of his employers, friends, contacts, family, or even his pets (though my cat, Puddy, seems to agree with me on many key issues). In accordance with my terms of use, you hereby acknowledge my right to psychoanalyze you, practice acupuncture, and mock you incessantly with every visit. As the user, you also acknowledge that the author has been legally declared a "Problem Adult" by the Commonwealth of Pennsylvania, and is therefore not responsible for any of his actions. ALSO, the political views and products advertised on this site may or may not reflect the views of Puddy or myself, so please don't take them as an endorsement. We just need to eat.
