Human Compatible: Artificial Intelligence and the Problem of Control
暫譯: 人類相容:人工智慧與控制問題

Russell, Stuart

商品描述

A leading artificial intelligence researcher lays out a new approach to AI that will enable us to coexist successfully with increasingly intelligent machines

In the popular imagination, superhuman artificial intelligence is an approaching tidal wave that threatens not just jobs and human relationships, but civilization itself. Conflict between humans and machines is seen as inevitable and its outcome all too predictable.

In this groundbreaking book, distinguished AI researcher Stuart Russell argues that this scenario can be avoided, but only if we rethink AI from the ground up. Russell begins by exploring the idea of intelligence in humans and in machines. He describes the near-term benefits we can expect, from intelligent personal assistants to vastly accelerated scientific research, and outlines the AI breakthroughs that still have to happen before we reach superhuman AI. He also spells out the ways humans are already finding to misuse AI, from lethal autonomous weapons to viral sabotage.

If the predicted breakthroughs occur and superhuman AI emerges, we will have created entities far more powerful than ourselves. How can we ensure they never, ever, have power over us? Russell suggests that we can rebuild AI on a new foundation, according to which machines are designed to be inherently uncertain about the human preferences they are required to satisfy. Such machines would be humble, altruistic, and committed to pursue our objectives, not theirs. This new foundation would allow us to create machines that are provably deferential and provably beneficial.

商品描述(中文翻譯)

一位領先的人工智慧研究者提出了一種新的人工智慧方法,使我們能夠與日益智能的機器成功共存

在大眾的想像中,超人類的人工智慧是一股即將來臨的潮流,威脅著不僅是工作和人際關係,還有整個文明。人類與機器之間的衝突被視為不可避免,其結果也過於可預測。

在這本開創性的書中,著名的人工智慧研究者斯圖亞特·拉塞爾(Stuart Russell)主張,這種情境是可以避免的,但前提是我們必須從根本上重新思考人工智慧。拉塞爾首先探討了人類和機器中的智慧概念。他描述了我們可以期待的短期好處,從智能個人助理到大幅加速的科學研究,並概述了在達到超人類人工智慧之前仍需實現的人工智慧突破。他還指出了人類已經找到的濫用人工智慧的方法,從致命的自主武器到病毒式的破壞行為。

如果預測的突破發生,超人類人工智慧將出現,我們將創造出比我們自己更強大的實體。我們如何確保它們永遠不會對我們擁有權力?拉塞爾建議,我們可以在一個新的基礎上重建人工智慧,根據這個基礎,機器被設計為對它們需要滿足的人類偏好本質上是充滿不確定性的。這樣的機器將是謙遜的、利他的,並致力於追求我們的目標,而不是它們自己的。這個新的基礎將使我們能夠創造出可證明是尊重的和可證明是有益的機器。

作者簡介

Stuart Russell is a professor of Computer Science and holder of the Smith-Zadeh Chair in Engineering at the University of California, Berkeley. He has served as the Vice-Chair of the World Economic Forum's Council on AI and Robotics and as an advisor to the United Nations on arms control. He is the author (with Peter Norvig) of the definitive and universally acclaimed textbook on AI, Artificial Intelligence: A Modern Approach.

作者簡介(中文翻譯)

斯圖亞特·拉塞爾(Stuart Russell)是加州大學伯克利分校的計算機科學教授及史密斯-扎德工程學講座教授。他曾擔任世界經濟論壇人工智慧與機器人委員會的副主席,並擔任聯合國武器管制的顧問。他與彼得·諾維格(Peter Norvig)共同撰寫了公認為人工智慧領域權威的教科書《人工智慧:現代方法》(Artificial Intelligence: A Modern Approach)。