Results for 'ai alignment problem'

982 found
  1. AI, alignment, and the categorical imperative. Fritz McDonald - 2023 - AI and Ethics 3:337-344.
    Tae Wan Kim, John Hooker, and Thomas Donaldson make an attempt, in recent articles, to solve the alignment problem. As they define the alignment problem, it is the issue of how to give AI systems moral intelligence. They contend that one might program machines with a version of Kantian ethics cast in deontic modal logic. On their view, machines can be aligned with human values if such machines obey principles of universalization and autonomy, as well as (...)
  2. Aligning artificial intelligence with moral intuitions: an intuitionist approach to the alignment problem. Dario Cecchini, Michael Pflanzer & Veljko Dubljevic - 2024 - AI and Ethics:1-11.
    As artificial intelligence (AI) continues to advance, one key challenge is ensuring that AI aligns with certain values. However, in the current diverse and democratic society, reaching a normative consensus is complex. This paper delves into the methodological aspect of how AI ethicists can effectively determine which values AI should uphold. After reviewing the most influential methodologies, we detail an intuitionist research agenda that offers guidelines for aligning AI applications with a limited set of reliable moral intuitions, each underlying a (...)
  3. Calibrating machine behavior: a challenge for AI alignment. Erez Firt - 2023 - Ethics and Information Technology 25 (3):1-8.
    When discussing AI alignment, we usually refer to the problem of teaching or training advanced autonomous AI systems to make decisions that are aligned with human values or preferences. Proponents of this approach believe it can be employed as means to stay in control over sophisticated intelligent systems, thus avoiding certain existential risks. We identify three general obstacles on the path to implementation of value alignment: a technological/technical obstacle, a normative obstacle, and a calibration problem. Presupposing, (...)
  4. Can the predictive processing model of the mind ameliorate the value-alignment problem? William Ratoff - 2021 - Ethics and Information Technology 23 (4):739-750.
    How do we ensure that future generally intelligent AI share our values? This is the value-alignment problem. It is a weighty matter. After all, if AI are neutral with respect to our wellbeing, or worse, actively hostile toward us, then they pose an existential threat to humanity. Some philosophers have argued that one important way in which we can mitigate this threat is to develop only AI that shares our values or that has values that ‘align with’ ours. (...)
  5. Human-aligned artificial intelligence is a multiobjective problem. Peter Vamplew, Richard Dazeley, Cameron Foale, Sally Firmin & Jane Mummery - 2018 - Ethics and Information Technology 20 (1):27-40.
    As the capabilities of artificial intelligence systems improve, it becomes important to constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of ethical, legal and safety-based frameworks have been proposed as a basis for designing these constraints. Despite their variations, these frameworks share the common characteristic that decision-making must consider multiple potentially conflicting factors. We demonstrate that these alignment frameworks can be represented as utility functions, but that the widely used Maximum Expected Utility paradigm provides (...)
    10 citations
  6. Applying AI for social good: Aligning academic journal ratings with the United Nations Sustainable Development Goals (SDGs). David Steingard, Marcello Balduccini & Akanksha Sinha - 2023 - AI and Society 38 (2):613-629.
    This paper offers three contributions to the burgeoning movements of AI for Social Good (AI4SG) and AI and the United Nations Sustainable Development Goals (SDGs). First, we introduce the SDG-Intense Evaluation framework (SDGIE) that aims to situate variegated automated/AI models in a larger ecosystem of computational approaches to advance the SDGs. To foster knowledge collaboration for solving complex social and environmental problems encompassed by the SDGs, the SDGIE framework details a benchmark structure of data-algorithm-output to effectively standardize AI approaches to (...)
  7. Saliva Ontology: An ontology-based framework for a Salivaomics Knowledge Base. Jiye Ai, Barry Smith & David Wong - 2010 - BMC Bioinformatics 11 (1):302.
    The Salivaomics Knowledge Base (SKB) is designed to serve as a computational infrastructure that can permit global exploration and utilization of data and information relevant to salivaomics. SKB is created by aligning (1) the saliva biomarker discovery and validation resources at UCLA with (2) the ontology resources developed by the OBO (Open Biomedical Ontologies) Foundry, including a new Saliva Ontology (SALO). We define the Saliva Ontology (SALO; http://www.skb.ucla.edu/SALO/) as a consensus-based controlled vocabulary of terms and relations dedicated to the salivaomics (...)
    4 citations
  8. Some Problems of Historical Research on the 1911 Revolution. Chang K'ai-yüan - 1980 - Chinese Studies in History 13 (4):37-53.
  9. L'analyse logique du problème des conséquences négatives et la classification des méthodes de sa résolution. Ai Ouemov - 1971 - Revue Internationale de Philosophie 98 (4):528.
    No categories
     
  10. Toleration and Justice in the Laozi: Engaging with Tao Jiang's Origins of Moral-Political Philosophy in Early China. Ai Yuan - 2023 - Philosophy East and West 73 (2):466-475.
    In lieu of an abstract, here is a brief excerpt of the content: This review article engages with Tao Jiang's ground-breaking monograph on the Origins of Moral-Political Philosophy in Early China with particular focus on the articulation of toleration and justice in the Laozi (otherwise called the Daodejing). Jiang discusses a naturalistic turn and the re-alignment of values in the Laozi, resulting in a (...)
  11. An explanation space to align user studies with the technical development of Explainable AI. Garrick Cabour, Andrés Morales-Forero, Élise Ledoux & Samuel Bassetto - 2023 - AI and Society 38 (2):869-887.
    Providing meaningful and actionable explanations for end-users is a situated problem requiring the intersection of multiple disciplines to address social, operational, and technical challenges. However, the explainable artificial intelligence community has not commonly adopted or created tangible design tools that allow interdisciplinary work to develop reliable AI-powered solutions. This paper proposes a formative architecture that defines the explanation space from a user-inspired perspective. The architecture comprises five intertwined components to outline explanation requirements for a task: (1) the end-users’ mental (...)
  12. A finite model property for RMImin. Ai-ni Hsieh & James G. Raftery - 2006 - Mathematical Logic Quarterly 52 (6):602-612.
    It is proved that the variety of relevant disjunction lattices has the finite embeddability property. It follows that Avron's relevance logic RMImin has a strong form of the finite model property, so it has a solvable deducibility problem. This strengthens Avron's result that RMImin is decidable.
    3 citations
  13. A Defect Detection Method for the Surface of Metal Materials Based on an Adaptive Ultrasound Pulse Excitation Device and Infrared Thermal Imaging Technology. Yibo Ai, Yingjie Zhang, Xingzhao Cao & Weidong Zhang - 2021 - Complexity 2021:1-9.
    Ultrasonic excitation has been widely used in the detection of microcracks on metal surfaces, but there are problems such as poor excitation effect of ultrasonic pulse, long time to reach the best excitation, and difficult to find microcracks. In this paper, an adaptive ultrasonic pulse excitation device and infrared thermal imaging technology have been combined, as well as their control method, to solve the problem. The adaptive ultrasonic pulse excitation device adds intelligent modules to realize automatic adjustment of detection (...)
  14. Artificial Intelligence, Values, and Alignment. Iason Gabriel - 2020 - Minds and Machines 30 (3):411-437.
    This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive engagement between people working in both domains. Second, it is important to be clear about the goal of alignment. There are significant differences between AI that aligns with instructions, intentions, revealed preferences, ideal preferences, interests and values. A principle-based approach to AI alignment, which combines these elements in a systematic way, has considerable advantages in this context. Third, the central challenge for theorists is not to identify ‘true’ moral principles for AI; rather, it is to identify fair principles for alignment that receive reflective endorsement despite widespread variation in people’s moral beliefs. The final part of the paper explores three ways in which fair principles for AI alignment could potentially be identified.
    46 citations
  15. Robustness to Fundamental Uncertainty in AGI Alignment. G. G. Worley III - 2020 - Journal of Consciousness Studies 27 (1-2):225-241.
    The AGI alignment problem has a bimodal distribution of outcomes with most outcomes clustering around the poles of total success and existential, catastrophic failure. Consequently, attempts to solve AGI alignment should, all else equal, prefer false negatives (ignoring research programs that would have been successful) to false positives (pursuing research programs that will unexpectedly fail). Thus, we propose adopting a policy of responding to points of philosophical and practical uncertainty associated with the alignment problem by (...)
  16. From Confucius to Coding and Avicenna to Algorithms: Cultivating Ethical AI Development through Cross-Cultural Ancient Wisdom. Ammar Younas & Yi Zeng - manuscript
    This paper explores the potential of integrating ancient educational principles from diverse eastern cultures into modern AI ethics curricula. It draws on the rich educational traditions of ancient China, India, Arabia, Persia, Japan, Tibet, Mongolia, and Korea, highlighting their emphasis on philosophy, ethics, holistic development, and critical thinking. By examining these historical educational systems, the paper establishes a correlation with modern AI ethics principles, advocating for the inclusion of these ancient teachings in current AI development and education. The proposed integration (...)
  17. Current cases of AI misalignment and their implications for future risks. Leonard Dung - 2023 - Synthese 202 (5):1-23.
    How can one build AI systems such that they pursue the goals their designers want them to pursue? This is the alignment problem. Numerous authors have raised concerns that, as research advances and systems become more powerful over time, misalignment might lead to catastrophic outcomes, perhaps even to the extinction or permanent disempowerment of humanity. In this paper, I analyze the severity of this risk based on current instances of misalignment. More specifically, I argue that contemporary large language (...)
    2 citations
  18. Possibilities and ethical issues of entrusting nursing tasks to robots and artificial intelligence. Tomohide Ibuki, Ai Ibuki & Eisuke Nakazawa - forthcoming - Nursing Ethics.
    In recent years, research in robotics and artificial intelligence (AI) has made rapid progress. It is expected that robots and AI will play a part in the field of nursing and their role might broaden in the future. However, there are areas of nursing practice that cannot or should not be entrusted to robots and AI, because nursing is a highly humane practice, and therefore, there would, perhaps, be some practices that should not be replicated by robots or AI. Therefore, (...)
  19. Some Opinions on "the Problem of Inheriting the Legacy of Chinese Philosophy". Ai Ssu-ch'i - 1968 - Chinese Studies in History 2 (2):92-97.
  20. Aligning artificial intelligence with human values: reflections from a phenomenological perspective. Shengnan Han, Eugene Kelly, Shahrokh Nikou & Eric-Oluf Svee - 2022 - AI and Society 37 (4):1383-1395.
    Artificial Intelligence (AI) must be directed at humane ends. The development of AI has produced great uncertainties of ensuring AI alignment with human values (AI value alignment) through AI operations from design to use. For the purposes of addressing this problem, we adopt the phenomenological theories of material values and technological mediation to be that beginning step. In this paper, we first discuss the AI value alignment from the relevant AI studies. Second, we briefly present what (...)
    1 citation
  21. Textile Diagrams. Florian Pumhösl's Abstraction as Method. T'ai Smith - 2015 - Zeitschrift für Medien- Und Kulturforschung 2015 (1):101-116.
    For Viennese artist Florian Pumhösl »abstraction is a method«, not a category. Or rather, if abstraction is the defining category of modernism, the objective is to reproduce modernism's problems and limits and exploit relationships among its parts. Considering what Pumhösl calls the »textile complex« of modernism, this essay examines the artist's work in parallel with Charles Sanders Peirce's diagram concept and Gottfried Semper's use of textile diagrams throughout Style in the Technical and Tectonic Arts.
    No categories
  22. Textile Diagrams. Florian Pumhösl's Abstraction as Method. T'ai Smith - 2015 - Zeitschrift für Medien- Und Kulturforschung 6 (1):101-116.
    For Viennese artist Florian Pumhösl »abstraction is a method«, not a category. Or rather, if abstraction is the defining category of modernism, the objective is to reproduce modernism's problems and limits and exploit relationships among its parts. Considering what Pumhösl calls the »textile complex« of modernism, this essay examines the artist's work in parallel with Charles Sanders Peirce's diagram concept and Gottfried Semper's use of textile diagrams throughout Style in the Technical and Tectonic Arts.
    No categories
  23. An Enactive Approach to Value Alignment in Artificial Intelligence: A Matter of Relevance. Michael Cannon - 2022 - In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence 2021. pp. 119-135.
    The “Value Alignment Problem” is the challenge of how to align the values of artificial intelligence with human values, whatever they may be, such that AI does not pose a risk to the existence of humans. A fundamental feature of how the problem is currently understood is that AI systems do not take the same things to be relevant as humans, whether turning humans into paperclips in order to “make more paperclips” or eradicating the human race to (...)
    No categories
  24. Value alignment, human enhancement, and moral revolutions. Ariela Tubert & Justin Tiehen - forthcoming - Inquiry: An Interdisciplinary Journal of Philosophy.
    Human beings are internally inconsistent in various ways. One way to develop this thought involves using the language of value alignment: the values we hold are not always aligned with our behavior, and are not always aligned with each other. Because of this self-misalignment, there is room for potential projects of human enhancement that involve achieving a greater degree of value alignment than we presently have. Relatedly, discussions of AI ethics sometimes focus on what is known as the (...)
  25. An Enactive Approach to Value Alignment in Artificial Intelligence: A Matter of Relevance. Michael Cannon - 2021 - In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence 2021. Springer Cham. pp. 119-135.
    The “Value Alignment Problem” is the challenge of how to align the values of artificial intelligence with human values, whatever they may be, such that AI does not pose a risk to the existence of humans. Existing approaches appear to conceive of the problem as "how do we ensure that AI solves the problem in the right way", in order to avoid the possibility of AI turning humans into paperclips in order to “make more paperclips” or (...)
    No categories
  26. The Blood Ontology: An ontology in the domain of hematology. Almeida Mauricio Barcellos, Proietti Anna Barbara de Freitas Carneiro, Ai Jiye & Barry Smith - 2011 - In Proceedings of the Second International Conference on Biomedical Ontology, Buffalo, NY, July 28-30, 2011 (CEUR 883). pp. (CEUR Workshop Proceedings, 833).
    Despite the importance of human blood to clinical practice and research, hematology and blood transfusion data remain scattered throughout a range of disparate sources. This lack of systematization concerning the use and definition of terms poses problems for physicians and biomedical professionals. We are introducing here the Blood Ontology, an ongoing initiative designed to serve as a controlled vocabulary for use in organizing information about blood. The paper describes the scope of the Blood Ontology, its stage of development and some (...)
  27. The argument for near-term human disempowerment through AI. Leonard Dung - 2024 - AI and Society:1-14.
    Many researchers and intellectuals warn about extreme risks from artificial intelligence. However, these warnings typically came without systematic arguments in support. This paper provides an argument that AI will lead to the permanent disempowerment of humanity, e.g. human extinction, by 2100. It rests on four substantive premises which it motivates and defends: first, the speed of advances in AI capability, as well as the capability level current systems have already reached, suggest that it is practically possible to build AI systems (...)
    1 citation
  28. Minangkabaunese matrilineal: The correlation between the Qur’an and gender. Halimatussa’Diyah Halimatussa’Diyah, Kusnadi Kusnadi, Ai Y. Yuliyanti, Deddy Ilyas & Eko Zulfikar - 2024 - HTS Theological Studies 80 (1):7.
    Upon previous research, the matrilineal system seems to oppose Islamic teaching. However, the matrilineal system practiced by the Minangkabau society in West Sumatra, Indonesia has its uniqueness. Thus, this study aims to examine the correlation between the Qur’an and gender roles within the context of Minangkabau customs, specifically focusing on the matrilineal aspect. The present study employs qualitative methods for conducting library research through critical analysis. This study discovered that the matrilineal system practiced by the Minangkabau society aligns with Qur’anic (...)
  29. ChatGPT: towards AI subjectivity. Kristian D’Amato - 2024 - AI and Society 39:1-15.
    Motivated by the question of responsible AI and value alignment, I seek to offer a uniquely Foucauldian reconstruction of the problem as the emergence of an ethical subject in a disciplinary setting. This reconstruction contrasts with the strictly human-oriented programme typical to current scholarship that often views technology in instrumental terms. With this in mind, I problematise the concept of a technological subjectivity through an exploration of various aspects of ChatGPT in light of Foucault’s work, arguing that current (...)
  30. Challenges of Aligning Artificial Intelligence with Human Values. Margit Sutrop - 2020 - Acta Baltica Historiae Et Philosophiae Scientiarum 8 (2):54-72.
    As artificial intelligence systems are becoming increasingly autonomous and will soon be able to make decisions on their own about what to do, AI researchers have started to talk about the need to align AI with human values. The AI ‘value alignment problem’ faces two kinds of challenges—a technical and a normative one—which are interrelated. The technical challenge deals with the question of how to encode human values in artificial intelligence. The normative challenge is associated with two questions: (...)
    4 citations
  31. Varieties of Artificial Moral Agency and the New Control Problem. Marcus Arvan - 2022 - Humana.Mente - Journal of Philosophical Studies 15 (42):225-256.
    This paper presents a new trilemma with respect to resolving the control and alignment problems in machine ethics. Section 1 outlines three possible types of artificial moral agents (AMAs): (1) 'Inhuman AMAs' programmed to learn or execute moral rules or principles without understanding them in anything like the way that we do; (2) 'Better-Human AMAs' programmed to learn, execute, and understand moral rules or principles somewhat like we do, but correcting for various sources of human moral error; and (3) (...)
  32. Identify and Assess Hydropower Project’s Multidimensional Social Impacts with Rough Set and Projection Pursuit Model. Hui An, Wenjing Yang, Jin Huang, Ai Huang, Zhongchi Wan & Min An - 2020 - Complexity 2020:1-16.
    To realize the coordinated and sustainable development of hydropower projects and regional society, comprehensively evaluating hydropower projects’ influence is critical. Usually, hydropower project development has an impact on environmental geology and social and regional cultural development. Based on comprehensive consideration of complicated geological conditions, fragile ecological environment, resettlement of reservoir area, and other factors of future hydropower development in each country, we have constructed a comprehensive evaluation index system of hydropower projects, including 4 first-level indicators of social economy, environment, safety, (...)
    1 citation
  33. Instilling moral value alignment by means of multi-objective reinforcement learning. Juan Antonio Rodriguez-Aguilar, Maite Lopez-Sanchez, Marc Serramia & Manel Rodriguez-Soto - 2022 - Ethics and Information Technology 24 (1).
    AI research is being challenged with ensuring that autonomous agents learn to behave ethically, namely in alignment with moral values. Here, we propose a novel way of tackling the value alignment problem as a two-step process. The first step consists in formalising moral values and value-aligned behaviour based on philosophical foundations. Our formalisation is compatible with the framework of (Multi-Objective) Reinforcement Learning, to ease the handling of an agent’s individual and ethical objectives. The second step consists (...)
    No categories
  34. Comparative Analysis of Food Related Sustainable Development Goals in the North Asia Pacific Region. Charles V. Trappey, Amy J. C. Trappey, Hsin-Jung Lin & Ai-Che Chang - 2023 - Food Ethics 8 (2):1-24.
    Member States of the United Nations proposed Seventeen Sustainable Development Goals (SDGs) in 2015, emphasizing the well-being of people, planet, prosperity, peace, and partnership. Countries are expected to work diligently to achieve these goals by the year 2030. The paths chosen to achieve the SDGs depend on each country’s specific needs, challenges, and opportunities. This contribution conducts a bibliometric study of selected SDG research related to hunger and climate change among countries of the North Asia Pacific region. A review of (...)
    No categories
  35. Research on the application of search algorithm in computer communication network. Kayhan Zrar Ghafoor, Shaweta Khanna, Jilei Zhang, Jianwei Chai & Hua Ai - 2022 - Journal of Intelligent Systems 31 (1):1150-1159.
    This article mitigates the challenges of previously reported literature by reducing the operating cost and improving the performance of network. A genetic algorithm-based tabu search methodology is proposed to solve the link capacity and traffic allocation problem in a computer communication network. An efficient modern super-heuristic search method is used to influence the fixed cost, delay cost, and variable cost of a link on the total operating cost in the computer communication network are discussed. The article analyses a large (...)
    No categories
  36. Topology optimization of computer communication network based on improved genetic algorithm. Kayhan Zrar Ghafoor, Jilei Zhang, Yuhong Fan & Hua Ai - 2022 - Journal of Intelligent Systems 31 (1):651-659.
    The topology optimization of computer communication network is studied based on improved genetic algorithm, a network optimization design model based on the establishment of network reliability maximization under given cost constraints, and the corresponding improved GA is proposed. In this method, the corresponding computer communication network cost model and computer communication network reliability model are established through a specific project, and the genetic intelligence algorithm is used to solve the cost model and computer communication network reliability model, respectively. It has (...)
  37. Human-Centered AI: The Aristotelian Approach. Jacob Sparks & Ava Wright - 2023 - Divus Thomas 126 (2):200-218.
    As we build increasingly intelligent machines, we confront difficult questions about how to specify their objectives. One approach, which we call human-centered, tasks the machine with the objective of learning and satisfying human objectives by observing our behavior. This paper considers how human-centered AI should conceive the humans it is trying to help. We argue that an Aristotelian model of human agency has certain advantages over the currently dominant theory drawn from economics.
  38. Where lies the grail? AI, common sense, and human practical intelligence. William Hasselberger & Micah Lott - forthcoming - Phenomenology and the Cognitive Sciences:1-22.
    The creation of machines with intelligence comparable to human beings—so-called "human-level” and “general” intelligence—is often regarded as the Holy Grail of Artificial Intelligence (AI) research. However, many prominent discussions of AI lean heavily on the notion of human-level intelligence to frame AI research, but then rely on conceptions of human cognitive capacities, including “common sense,” that are sketchy, one-sided, philosophically loaded, and highly contestable. Our goal in this essay is to bring into view some underappreciated features of the practical intelligence (...)
  39. Social Choice for AI Alignment: Dealing with Diverse Human Feedback. Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mosse, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde & William S. Zwicker - manuscript
    Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, so that, for example, they refuse to comply with requests for help with committing crimes or with producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How (...)
  40. Deontology and Safe Artificial Intelligence. William D'Alessandro - forthcoming - Philosophical Studies.
    The field of AI safety aims to prevent increasingly capable artificially intelligent systems from causing humans harm. Research on moral alignment is widely thought to offer a promising safety strategy: if we can equip AI systems with appropriate ethical rules, according to this line of thought, they'll be unlikely to disempower, destroy or otherwise seriously harm us. Deontological morality looks like a particularly attractive candidate for an alignment target, given its popularity, relative technical tractability and commitment to harm-avoidance (...)
  41. The value alignment problem: a geometric approach. Martin Peterson - 2019 - Ethics and Information Technology 21 (1):19-28.
    Stuart Russell defines the value alignment problem as follows: How can we build autonomous systems with values that “are aligned with those of the human race”? In this article I outline some distinctions that are useful for understanding the value alignment problem and then propose a solution: I argue that the methods currently applied by computer scientists for embedding moral values in autonomous systems can be improved by representing moral principles as conceptual spaces, i.e. as Voronoi (...)
    3 citations
  42. Epistemic (in)justice, social identity and the Black Box problem in patient care. Muneerah Khan & Cornelius Ewuoso - 2024 - Medicine, Health Care and Philosophy 27 (2):227-240.
    This manuscript draws on the moral norms arising from the nuanced accounts of epistemic (in)justice and social identity in relational autonomy to normatively assess and articulate the ethical problems associated with using AI in patient care in light of the Black Box problem. The article also describes how black-boxed AI may be used within the healthcare system. The manuscript highlights what needs to happen to align AI with the moral norms it draws on. Deeper thinking – from other backgrounds (...)
  43. Two Victim Paradigms and the Problem of ‘Impure’ Victims. Diana Tietjens Meyers - 2011 - Humanity 2 (2):255-275.
    Philosophers have had surprisingly little to say about the concept of a victim although it is presupposed by the extensive philosophical literature on rights. Proceeding in four stages, I seek to remedy this deficiency and to offer an alternative to the two current paradigms that eliminates the Othering of victims. First, I analyze two victim paradigms that emerged in the late 20th century along with the initial iteration of the international human rights regime – the pathetic victim paradigm and the (...)
    1 citation
  44.  44
    The AI Commander Problem: Ethical, Political, and Psychological Dilemmas of Human-Machine Interactions in AI-enabled Warfare. James Johnson - 2022 - Journal of Military Ethics 21 (3):246-271.
    Can AI solve the ethical, moral, and political dilemmas of warfare? How is artificial intelligence (AI)-enabled warfare changing the way we think about the ethical-political dilemmas and practice of war? This article explores the key elements of the ethical, moral, and political dilemmas of human-machine interactions in modern digitized warfare. It provides a counterpoint to the argument that AI “rational” efficiency can simultaneously offer a viable solution to human psychological and biological fallibility in combat while retaining “meaningful” human control over (...)
  45.  29
    Artificial intelligence and democratic legitimacy. The problem of publicity in public authority. Ludvig Beckman, Jonas Hultin Rosenberg & Karim Jebari - forthcoming - AI and Society.
    Machine learning algorithms are increasingly used to support decision-making in the exercise of public authority. Here, we argue that an important consideration has been overlooked in previous discussions: whether the use of ML undermines the democratic legitimacy of public institutions. From the perspective of democratic legitimacy, it is not enough that ML contributes to efficiency and accuracy in the exercise of public authority, which has so far been the focus in the scholarly literature engaging with these developments. According to one (...)
    2 citations
  46.  23
    Is AI a Problem for Forward Looking Moral Responsibility? The Problem Followed by a Solution. Fabio Tollon - 2022 - In Communications in Computer and Information Science. Cham: pp. 307-318.
    Recent work in AI ethics has come to bear on questions of responsibility. Specifically, questions of whether the nature of AI-based systems renders various notions of responsibility inappropriate. While substantial attention has been given to backward-looking senses of responsibility, there has been little consideration of forward-looking senses of responsibility. This paper aims to plug this gap, and will concern itself with responsibility as moral obligation, a particular kind of forward-looking sense of responsibility. Responsibility as moral obligation is predicated on the (...)
  47. How does Artificial Intelligence Pose an Existential Risk? Karina Vold & Daniel R. Harris - 2023 - In Carissa Véliz (ed.), The Oxford Handbook of Digital Ethics. Oxford University Press.
    Alan Turing, one of the fathers of computing, warned that Artificial Intelligence (AI) could one day pose an existential risk to humanity. Today, recent advancements in the field of AI have been accompanied by a renewed set of existential warnings. But what exactly constitutes an existential risk? And how exactly does AI pose such a threat? In this chapter we aim to answer these questions. In particular, we will critically explore three commonly cited reasons for thinking that AI poses an existential (...)
    1 citation
  48. Mapping the Stony Road toward Trustworthy AI: Expectations, Problems, Conundrums. Gernot Rieder, Judith Simon & Pak-Hang Wong - forthcoming - In Marcello Pelillo & Teresa Scantamburlo (eds.), Machines We Trust: Perspectives on Dependable AI. Cambridge, Mass.:
    The notion of trustworthy AI has been proposed in response to mounting public criticism of AI systems, in particular with regard to the proliferation of such systems into ever more sensitive areas of human life without proper checks and balances. In Europe, the High-Level Expert Group on Artificial Intelligence has recently presented its Ethics Guidelines for Trustworthy AI. To some, the guidelines are an important step for the governance of AI. To others, the guidelines distract effort from genuine AI regulation. (...)
    1 citation
  49.  41
    Shared Moral Foundations of Embodied Artificial Intelligence. Joe Cruz - 2019 - In Vincent Conitzer, Gillian Hadfield & Shannon Vallor (eds.), AIES '19: Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society. pp. 139-146.
    Sophisticated AI's will make decisions about how to respond to complex situations, and we may wonder whether those decisions will align with the moral values of human beings. I argue that pessimistic worries about this value alignment problem are overstated. In order to achieve intelligence in its full generality and adaptiveness, cognition in AI's will need to be embodied in the sense of the Embodied Cognition research program. That embodiment will yield AI's that share our moral foundations, namely (...)
    1 citation
  50.  33
    AI in the noosphere: an alignment of scientific and wisdom traditions. Stephen D. Edwards - 2021 - AI and Society 36 (1):397-399.