Search results for `ai alignment problem`

AI, alignment, and the categorical imperative.Fritz McDonald - 2023 - AI and Ethics 3:337-344.

Tae Wan Kim, John Hooker, and Thomas Donaldson make an attempt, in recent articles, to solve the alignment problem. As they define the alignment problem, it is the issue of how to give AI systems moral intelligence. They contend that one might program machines with a version of Kantian ethics cast in deontic modal logic. On their view, machines can be aligned with human values if such machines obey principles of universalization and autonomy, as well as (...)

Machine Ethics in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

Aligning artificial intelligence with moral intuitions: an intuitionist approach to the alignment problem.Dario Cecchini, Michael Pflanzer & Veljko Dubljevic - 2024 - AI and Ethics:1-11.

As artificial intelligence (AI) continues to advance, one key challenge is ensuring that AI aligns with certain values. However, in the current diverse and democratic society, reaching a normative consensus is complex. This paper delves into the methodological aspect of how AI ethicists can effectively determine which values AI should uphold. After reviewing the most influential methodologies, we detail an intuitionist research agenda that offers guidelines for aligning AI applications with a limited set of reliable moral intuitions, each underlying a (...)

Ethics of Artificial Intelligence in Philosophy of Cognitive Science

Moral Intuition in Normative Ethics

Direct download

Export citation

Bookmark

34

Calibrating machine behavior: a challenge for AI alignment.Erez Firt - 2023 - Ethics and Information Technology 25 (3):1-8.

When discussing AI alignment, we usually refer to the problem of teaching or training advanced autonomous AI systems to make decisions that are aligned with human values or preferences. Proponents of this approach believe it can be employed as means to stay in control over sophisticated intelligent systems, thus avoiding certain existential risks. We identify three general obstacles on the path to implementation of value alignment: a technological/technical obstacle, a normative obstacle, and a calibration problem. Presupposing, (...)

Computer Ethics in Applied Ethics

Direct download (2 more)

Export citation

Bookmark

58

Can the predictive processing model of the mind ameliorate the value-alignment problem?William Ratoff - 2021 - Ethics and Information Technology 23 (4):739-750.

How do we ensure that future generally intelligent AI share our values? This is the value-alignment problem. It is a weighty matter. After all, if AI are neutral with respect to our wellbeing, or worse, actively hostile toward us, then they pose an existential threat to humanity. Some philosophers have argued that one important way in which we can mitigate this threat is to develop only AI that shares our values or that has values that ‘align with’ ours. (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Machine Ethics in Philosophy of Cognitive Science

Moral Status of Artificial Systems in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

56

Human-aligned artificial intelligence is a multiobjective problem.Peter Vamplew, Richard Dazeley, Cameron Foale, Sally Firmin & Jane Mummery - 2018 - Ethics and Information Technology 20 (1):27-40.

As the capabilities of artificial intelligence systems improve, it becomes important to constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of ethical, legal and safety-based frameworks have been proposed as a basis for designing these constraints. Despite their variations, these frameworks share the common characteristic that decision-making must consider multiple potentially conflicting factors. We demonstrate that these alignment frameworks can be represented as utility functions, but that the widely used Maximum Expected Utility paradigm provides (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

10 citations

43

Applying AI for social good: Aligning academic journal ratings with the United Nations Sustainable Development Goals (SDGs).David Steingard, Marcello Balduccini & Akanksha Sinha - 2023 - AI and Society 38 (2):613-629.

This paper offers three contributions to the burgeoning movements of AI for Social Good (AI4SG) and AI and the United Nations Sustainable Development Goals (SDGs). First, we introduce the SDG-Intense Evaluation framework (SDGIE) that aims to situate variegated automated/AI models in a larger ecosystem of computational approaches to advance the SDGs. To foster knowledge collaboration for solving complex social and environmental problems encompassed by the SDGs, the SDGIE framework details a benchmark structure of data-algorithm-output to effectively standardize AI approaches to (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (3 more)

Export citation

Bookmark

465

Saliva Ontology: An ontology-based framework for a Salivaomics Knowledge Base.Jiye Ai, Barry Smith & David Wong - 2010 - BMC Bioinformatics 11 (1):302.

The Salivaomics Knowledge Base (SKB) is designed to serve as a computational infrastructure that can permit global exploration and utilization of data and information relevant to salivaomics. SKB is created by aligning (1) the saliva biomarker discovery and validation resources at UCLA with (2) the ontology resources developed by the OBO (Open Biomedical Ontologies) Foundry, including a new Saliva Ontology (SALO). We define the Saliva Ontology (SALO; http://www.skb.ucla.edu/SALO/) as a consensus-based controlled vocabulary of terms and relations dedicated to the salivaomics (...)

Medicine in Professional Areas

Ontology in Metaphysics

Direct download

Export citation

Bookmark

4 citations

8

Some Problems of Historical Research on the 1911 Revolution.Chang K'ai-yüan - 1980 - Chinese Studies in History 13 (4):37-53.

Social and Political Philosophy

Direct download (3 more)

Export citation

Bookmark

L'analyse logique du problème des conséquences négatives et la classification des méthodes de sa résolution.Ai Ouemov - 1971 - Revue Internationale de Philosophie 98 (4):528.

No categories

Export citation

Bookmark

9

Toleration and Justice in the Laozi: Engaging with Tao Jiang's Origins of Moral-Political Philosophy in Early China.Ai Yuan - 2023 - Philosophy East and West 73 (2):466-475.

In lieu of an abstract, here is a brief excerpt of the content:Toleration and Justice in the Laozi:Engaging with Tao Jiang's Origins of Moral-Political Philosophy in Early ChinaAi Yuan (bio)IntroductionThis review article engages with Tao Jiang's ground-breaking monograph on the Origins of Moral-Political Philosophy in Early China with particular focus on the articulation of toleration and justice in the Laozi (otherwise called the Daodejing).1 Jiang discusses a naturalistic turn and the re-alignment of values in the Laozi, resulting in a (...) naturalization of justice (impartiality) by rejecting artificial humaneness and rigid hierarchical moral-political structures. In this Laozian just world, there is no room for human intervention and rigid top-down enforcement of values, thus leaving justice to naturalized Heaven or Dao as the ultimate source of the cosmos.Based on Jiang's interpretative context, I show that there is a twofold justification for the "paradoxical" and "elusive"2 value of political toleration.3 A negative expression of toleration focuses on non-interference with choices, and non-enforcement of values toward those whom one reasonably disagrees with and regards as morally wrong. Such an attitude of toleration results in practical advantages such as avoiding entrenchment in bloodshed, abandonment, and deprivation.A positive expression of toleration operating in the Laozian ideal world requires equal protection of people as capable knowers, including those to whom we object. Such protection includes their ways of expression and an unbiased recognition of them as equally capable knowers so that they are heard without bias and prejudice. Such a kind of toleration is revealed as a constituent component of justice since it enables equal contribution toward the shaping of society without discrimination. In other words, toleration is seen as naturalized justice in a Laozian world within which no one is "being wronged with the capacity as a knower"—a kind of epistemic justice articulated by Miranda Fricker (Fricker 2007).4 [End Page 466]This article discusses how Jiang's political reconstruction of Laozian philosophy contains the seeds for a discussion of toleration, a value associated with distributive justice and against epistemic injustice. First, it introduces Jiang's arguments on a naturalization of justice and impartiality as Heavenly attributes in the Laozi. Second, it articulates the value of tolerance in the Laozi. Finally, I compare Laozian toleration with Confucian toleration using youwei- wuwei metaethical criticism.Moral-Political Philosophy in the LaoziJiang's interpretative framework about the origins of distributive justice in the Laozi starts with a discussion of the cosmogonic-mystical worldviews that signal "a new understanding of the nature of the cosmos, as well as a broad reorientation in the Heaven-human relationship in the mid-Warring States period" (p. 192). Differing from the interests in the origins of human culture or civilization in the Ru and Mo traditions, Jiang agrees with Franklin Perkins and identifies a "cosmogonic turn" in the late fourth century b.c. that reveals a demotion of anthropomorphic Heaven. By paring Heaven with earth and accordingly decentering human beings in the intellectual discourse, the Laozi directs equal values to the ten thousand things, with humans as just one among them (Perkins 2016, quoted in Jiang 2021, p. 196).5 Laozi's philosophy thus stands out by rejecting Heaven as caring for human affairs while taking Dao as the ultimate origin of the cosmos, as claimed in chapter 5 of the received text: "Heaven and earth are not humane; they treat ten-thousand things in the world as straw dogs; the sage is not humane; he treats people as straw dogs" (p. 199).With Jiang's reading of a naturalistic heaven, we see a re-alignment between cosmos and the human since sage-rulers follow the Heaven-Dao. The Laozian Dao is "self-so-ing" (ziran 自然), "accommodating" (rong 容), and "impartial" (gong 公), and thus marks a sharp contrast to Confucian familial bias resulting in "actively intervening in human affairs on behalf of the (certain) humans by appointing them to carry out its mission" (p. 200). With the sage emulating the ultimate Dao, the Laozi replaces the Confucian anthropocentric Heaven, which focuses on partiality (qin 親) and humaneness (ren 仁).This Laozian understanding of the cosmos also grounds its philosophical departure from Mohism despite both having a political... (shrink)

Asian Philosophy

Justice in Social and Political Philosophy

Direct download (2 more)

Export citation

Bookmark

26

An explanation space to align user studies with the technical development of Explainable AI.Garrick Cabour, Andrés Morales-Forero, Élise Ledoux & Samuel Bassetto - 2023 - AI and Society 38 (2):869-887.

Providing meaningful and actionable explanations for end-users is a situated problem requiring the intersection of multiple disciplines to address social, operational, and technical challenges. However, the explainable artificial intelligence community has not commonly adopted or created tangible design tools that allow interdisciplinary work to develop reliable AI-powered solutions. This paper proposes a formative architecture that defines the explanation space from a user-inspired perspective. The architecture comprises five intertwined components to outline explanation requirements for a task: (1) the end-users’ mental (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (3 more)

Export citation

Bookmark

28

A finite model property for RMImin.Ai-ni Hsieh & James G. Raftery - 2006 - Mathematical Logic Quarterly 52 (6):602-612.

It is proved that the variety of relevant disjunction lattices has the finite embeddability property. It follows that Avron's relevance logic RMImin has a strong form of the finite model property, so it has a solvable deducibility problem. This strengthens Avron's result that RMImin is decidable.

Logic and Philosophy of Logic, Miscellaneous in Logic and Philosophy of Logic

Nonclassical Logics in Logic and Philosophy of Logic

Direct download (2 more)

Export citation

Bookmark

3 citations

16

A Defect Detection Method for the Surface of Metal Materials Based on an Adaptive Ultrasound Pulse Excitation Device and Infrared Thermal Imaging Technology.Yibo Ai, Yingjie Zhang, Xingzhao Cao & Weidong Zhang - 2021 - Complexity 2021:1-9.

Ultrasonic excitation has been widely used in the detection of microcracks on metal surfaces, but there are problems such as poor excitation effect of ultrasonic pulse, long time to reach the best excitation, and difficult to find microcracks. In this paper, an adaptive ultrasonic pulse excitation device and infrared thermal imaging technology have been combined, as well as their control method, to solve the problem. The adaptive ultrasonic pulse excitation device adds intelligent modules to realize automatic adjustment of detection (...)

Natural Sciences

Direct download (2 more)

Export citation

Bookmark

324

Artificial Intelligence, Values, and Alignment.Iason Gabriel - 2020 - Minds and Machines 30 (3):411-437.

This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive engagement between people working in both domains. Second, it is important to be clear about the goal of alignment. There are significant differences between AI that aligns with instructions, intentions, revealed preferences, ideal preferences, interests and values. A principle-based approach to AI (...), which combines these elements in a systematic way, has considerable advantages in this context. Third, the central challenge for theorists is not to identify ‘true’ moral principles for AI; rather, it is to identify fair principles for alignment that receive reflective endorsement despite widespread variation in people’s moral beliefs. The final part of the paper explores three ways in which fair principles for AI alignment could potentially be identified. (shrink)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Direct download (3 more)

Export citation

Bookmark

46 citations

259

Robustness to Fundamental Uncertainty in AGI Alignment.G. G. Worley Iii - 2020 - Journal of Consciousness Studies 27 (1-2):225-241.

The AGI alignment problem has a bimodal distribution of outcomes with most outcomes clustering around the poles of total success and existential, catastrophic failure. Consequently, attempts to solve AGI alignment should, all else equal, prefer false negatives (ignoring research programs that would have been successful) to false positives (pursuing research programs that will unexpectedly fail). Thus, we propose adopting a policy of responding to points of philosophical and practical uncertainty associated with the alignment problem by (...)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Philosophy of Mind, Misc in Philosophy of Mind

Thought and Thinking in Philosophy of Mind

Direct download (2 more)

Export citation

Bookmark

390

From Confucius to Coding and Avicenna to Algorithms: Cultivating Ethical AI Development through Cross-Cultural Ancient Wisdom.Ammar Younas & Yi Zeng - manuscript

This paper explores the potential of integrating ancient educational principles from diverse eastern cultures into modern AI ethics curricula. It draws on the rich educational traditions of ancient China, India, Arabia, Persia, Japan, Tibet, Mongolia, and Korea, highlighting their emphasis on philosophy, ethics, holistic development, and critical thinking. By examining these historical educational systems, the paper establishes a correlation with modern AI ethics principles, advocating for the inclusion of these ancient teachings in current AI development and education. The proposed integration (...)

Chinese Philosophy in Asian Philosophy

Teaching Philosophy

Direct download

Export citation

Bookmark

49

Current cases of AI misalignment and their implications for future risks.Leonard Dung - 2023 - Synthese 202 (5):1-23.

How can one build AI systems such that they pursue the goals their designers want them to pursue? This is the alignment problem. Numerous authors have raised concerns that, as research advances and systems become more powerful over time, misalignment might lead to catastrophic outcomes, perhaps even to the extinction or permanent disempowerment of humanity. In this paper, I analyze the severity of this risk based on current instances of misalignment. More specifically, I argue that contemporary large language (...)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Machine Ethics in Philosophy of Cognitive Science

Moral Motivation in Meta-Ethics

Direct download (2 more)

Export citation

Bookmark

2 citations

31

Possibilities and ethical issues of entrusting nursing tasks to robots and artificial intelligence.Tomohide Ibuki, Ai Ibuki & Eisuke Nakazawa - forthcoming - Nursing Ethics.

In recent years, research in robotics and artificial intelligence (AI) has made rapid progress. It is expected that robots and AI will play a part in the field of nursing and their role might broaden in the future. However, there are areas of nursing practice that cannot or should not be entrusted to robots and AI, because nursing is a highly humane practice, and therefore, there would, perhaps, be some practices that should not be replicated by robots or AI. Therefore, (...)

Biomedical Ethics in Applied Ethics

Direct download (2 more)

Export citation

Bookmark

11

Some Opinions on "the Problem of Inheriting the Legacy of Chinese Philosophy".Ai Ssu-ch'I. - 1968 - Chinese Studies in History 2 (2):92-97.

Chinese Philosophy: Topics, Misc in Asian Philosophy

Direct download (2 more)

Export citation

Bookmark

54

Aligning artificial intelligence with human values: reflections from a phenomenological perspective.Shengnan Han, Eugene Kelly, Shahrokh Nikou & Eric-Oluf Svee - 2022 - AI and Society 37 (4):1383-1395.

Artificial Intelligence (AI) must be directed at humane ends. The development of AI has produced great uncertainties of ensuring AI alignment with human values (AI value alignment) through AI operations from design to use. For the purposes of addressing this problem, we adopt the phenomenological theories of material values and technological mediation to be that beginning step. In this paper, we first discuss the AI value alignment from the relevant AI studies. Second, we briefly present what (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (3 more)

Export citation

Bookmark

1 citation

44

Textile Diagrams. Florian Pumhösl's Abstraction as Method.T'ai Smith - 2015 - Zeitschrift für Medien- Und Kulturforschung 2015 (1):101-116.

For Viennese artist Florian Pumhösl »abstraction is a method«, not a category. Or rather, if abstraction is the defining category of modernism, the objective is to reproduce modernism's problems and limits and exploit relationships among its parts. Considering what Pumhösl calls the »textile complex« of modernism, this essay examines the artist's work in parallel with Charles Sanders Peirce's diagram concept and Gottfried Semper's use of textile diagrams throughout Style in the Technical and Tectonic Arts. _German_ »Abstraktion« ist für den Wiener (...)

No categories

Direct download (3 more)

Export citation

Bookmark

3

Textile Diagrams. Florian Pumhösl's Abstraction as Method.T'ai Smith - 2015 - Zeitschrift für Medien- Und Kulturforschung 6 (1):101-116.

For Viennese artist Florian Pumhösl »abstraction is a method«, not a category. Or rather, if abstraction is the defining category of modernism, the objective is to reproduce modernism's problems and limits and exploit relationships among its parts. Considering what Pumhösl calls the »textile complex« of modernism, this essay examines the artist's work in parallel with Charles Sanders Peirce's diagram concept and Gottfried Semper's use of textile diagrams throughout Style in the Technical and Tectonic Arts.

No categories

Direct download (2 more)

Export citation

Bookmark

259

An Enactive Approach to Value Alignment in Artificial Intelligence: A Matter of Relevance.Michael Cannon - 2022 - In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence 2021. pp. 119-135.

The “Value Alignment Problem” is the challenge of how to align the values of artificial intelligence with human values, whatever they may be, such that AI does not pose a risk to the existence of humans. A fundamental feature of how the problem is currently understood is that AI systems do not take the same things to be relevant as humans, whether turning humans into paperclips in order to “make more paperclips” or eradicating the human race to (...)

No categories

Direct download

Export citation

Bookmark

65

Value alignment, human enhancement, and moral revolutions.Ariela Tubert & Justin Tiehen - forthcoming - Inquiry: An Interdisciplinary Journal of Philosophy.

Human beings are internally inconsistent in various ways. One way to develop this thought involves using the language of value alignment: the values we hold are not always aligned with our behavior, and are not always aligned with each other. Because of this self-misalignment, there is room for potential projects of human enhancement that involve achieving a greater degree of value alignment than we presently have. Relatedly, discussions of AI ethics sometimes focus on what is known as the (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

180

An Enactive Approach to Value Alignment in Artificial Intelligence: A Matter of Relevance.Michael Cannon - 2021 - In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence 2021. Springer Cham. pp. 119-135.

The “Value Alignment Problem” is the challenge of how to align the values of artificial intelligence with human values, whatever they may be, such that AI does not pose a risk to the existence of humans. Existing approaches appear to conceive of the problem as "how do we ensure that AI solves the problem in the right way", in order to avoid the possibility of AI turning humans into paperclips in order to “make more paperclips” or (...)

No categories

Direct download

Export citation

Bookmark

234

The Blood Ontology: An ontology in the domain of hematology.Almeida Mauricio Barcellos, Proietti Anna Barbara de Freitas Carneiro, Ai Jiye & Barry Smith - 2011 - In Proceedings of the Second International Conference on Biomedical Ontology, Buffalo, NY, July 28-30, 2011 (CEUR 883). pp. (CEUR Workshop Proceedings, 833).

Despite the importance of human blood to clinical practice and research, hematology and blood transfusion data remain scattered throughout a range of disparate sources. This lack of systematization concerning the use and definition of terms poses problems for physicians and biomedical professionals. We are introducing here the Blood Ontology, an ongoing initiative designed to serve as a controlled vocabulary for use in organizing information about blood. The paper describes the scope of the Blood Ontology, its stage of development and some (...)

Ontology in Metaphysics

Direct download

Export citation

Bookmark

529

The argument for near-term human disempowerment through AI.Leonard Dung - 2024 - AI and Society:1-14.

Many researchers and intellectuals warn about extreme risks from artificial intelligence. However, these warnings typically came without systematic arguments in support. This paper provides an argument that AI will lead to the permanent disempowerment of humanity, e.g. human extinction, by 2100. It rests on four substantive premises which it motivates and defends: first, the speed of advances in AI capability, as well as the capability level current systems have already reached, suggest that it is practically possible to build AI systems (...)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Existential Risk in Philosophy of Action

Moral Status of Artificial Systems in Philosophy of Cognitive Science

Philosophy of AI, Misc in Philosophy of Cognitive Science

Direct download (4 more)

Export citation

Bookmark

1 citation

6

Minangkabaunese matrilineal: The correlation between the Qur’an and gender.Halimatussa’Diyah Halimatussa’Diyah, Kusnadi Kusnadi, Ai Y. Yuliyanti, Deddy Ilyas & Eko Zulfikar - 2024 - HTS Theological Studies 80 (1):7.

Upon previous research, the matrilineal system seems to oppose Islamic teaching. However, the matrilineal system practiced by the Minangkabau society in West Sumatra, Indonesia has its uniqueness. Thus, this study aims to examine the correlation between the Qur’an and gender roles within the context of Minangkabau customs, specifically focusing on the matrilineal aspect. The present study employs qualitative methods for conducting library research through critical analysis. This study discovered that the matrilineal system practiced by the Minangkabau society aligns with Qur’anic (...)

Arts and Humanities

Direct download (3 more)

Export citation

Bookmark

379

ChatGPT: towards AI subjectivity.Kristian D’Amato - 2024 - AI and Society 39:1-15.

Motivated by the question of responsible AI and value alignment, I seek to offer a uniquely Foucauldian reconstruction of the problem as the emergence of an ethical subject in a disciplinary setting. This reconstruction contrasts with the strictly human-oriented programme typical to current scholarship that often views technology in instrumental terms. With this in mind, I problematise the concept of a technological subjectivity through an exploration of various aspects of ChatGPT in light of Foucault’s work, arguing that current (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Machine Ethics in Philosophy of Cognitive Science

Michel Foucault in Continental Philosophy

Moral Status of Artificial Systems in Philosophy of Cognitive Science

Phenomenalism in Metaphysics

Robot Ethics in Applied Ethics

Direct download (6 more)

Export citation

Bookmark

59

Challenges of Aligning Artificial Intelligence with Human Values.Margit Sutrop - 2020 - Acta Baltica Historiae Et Philosophiae Scientiarum 8 (2):54-72.

As artificial intelligence systems are becoming increasingly autonomous and will soon be able to make decisions on their own about what to do, AI researchers have started to talk about the need to align AI with human values. The AI ‘value alignment problem’ faces two kinds of challenges—a technical and a normative one—which are interrelated. The technical challenge deals with the question of how to encode human values in artificial intelligence. The normative challenge is associated with two questions: (...)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

4 citations

819

Varieties of Artificial Moral Agency and the New Control Problem.Marcus Arvan - 2022 - Humana.Mente - Journal of Philosophical Studies 15 (42):225-256.

This paper presents a new trilemma with respect to resolving the control and alignment problems in machine ethics. Section 1 outlines three possible types of artificial moral agents (AMAs): (1) 'Inhuman AMAs' programmed to learn or execute moral rules or principles without understanding them in anything like the way that we do; (2) 'Better-Human AMAs' programmed to learn, execute, and understand moral rules or principles somewhat like we do, but correcting for various sources of human moral error; and (3) (...)

Direct download

Export citation

Bookmark

8

Identify and Assess Hydropower Project’s Multidimensional Social Impacts with Rough Set and Projection Pursuit Model.Hui An, Wenjing Yang, Jin Huang, Ai Huang, Zhongchi Wan & Min An - 2020 - Complexity 2020:1-16.

To realize the coordinated and sustainable development of hydropower projects and regional society, comprehensively evaluating hydropower projects’ influence is critical. Usually, hydropower project development has an impact on environmental geology and social and regional cultural development. Based on comprehensive consideration of complicated geological conditions, fragile ecological environment, resettlement of reservoir area, and other factors of future hydropower development in each country, we have constructed a comprehensive evaluation index system of hydropower projects, including 4 first-level indicators of social economy, environment, safety, (...)

Natural Sciences

Direct download (2 more)

Export citation

Bookmark

1 citation

30

Instilling moral value alignment by means of multi-objective reinforcement learning.Juan Antonio Rodriguez-Aguilar, Maite Lopez-Sanchez, Marc Serramia & Manel Rodriguez-Soto - 2022 - Ethics and Information Technology 24 (1).

AI research is being challenged with ensuring that autonomous agents learn to behave ethically, namely in alignment with moral values. Here, we propose a novel way of tackling the value alignment problem as a two-step process. The first step consists on formalising moral values and value aligned behaviour based on philosophical foundations. Our formalisation is compatible with the framework of (Multi-Objective) Reinforcement Learning, to ease the handling of an agent’s individual and ethical objectives. The second step consists (...)

No categories

Direct download (2 more)

Export citation

Bookmark

10

Comparative Analysis of Food Related Sustainable Development Goals in the North Asia Pacific Region.Charles V. Trappey, Amy J. C. Trappey, Hsin-Jung Lin & Ai-Che Chang - 2023 - Food Ethics 8 (2):1-24.

Member States of the United Nations proposed Seventeen Sustainable Development Goals (SDGs) in 2015, emphasizing the well-being of people, planet, prosperity, peace, and partnership. Countries are expected to work diligently to achieve these goals by the year 2030. The paths chosen to achieve the SDGs depend on each country’s specific needs, challenges, and opportunities. This contribution conducts a bibliometric study of selected SDG research related to hunger and climate change among countries of the North Asia Pacific region. A review of (...)

No categories

Direct download (2 more)

Export citation

Bookmark

8

Research on the application of search algorithm in computer communication network.Kayhan Zrar Ghafoor, Shaweta Khanna, Jilei Zhang, Jianwei Chai & Hua Ai - 2022 - Journal of Intelligent Systems 31 (1):1150-1159.

This article mitigates the challenges of previously reported literature by reducing the operating cost and improving the performance of network. A genetic algorithm-based tabu search methodology is proposed to solve the link capacity and traffic allocation problem in a computer communication network. An efficient modern super-heuristic search method is used to influence the fixed cost, delay cost, and variable cost of a link on the total operating cost in the computer communication network are discussed. The article analyses a large (...)

No categories

Direct download

Export citation

Bookmark

14

Topology optimization of computer communication network based on improved genetic algorithm.Kayhan Zrar Ghafoor, Jilei Zhang, Yuhong Fan & Hua Ai - 2022 - Journal of Intelligent Systems 31 (1):651-659.

The topology optimization of computer communication network is studied based on improved genetic algorithm, a network optimization design model based on the establishment of network reliability maximization under given cost constraints, and the corresponding improved GA is proposed. In this method, the corresponding computer communication network cost model and computer communication network reliability model are established through a specific project, and the genetic intelligence algorithm is used to solve the cost model and computer communication network reliability model, respectively. It has (...)

Topology in Philosophy of Mathematics

Direct download

Export citation

Bookmark

236

Human-Centered AI: The Aristotelian Approach.Jacob Sparks & Ava Wright - 2023 - Divus Thomas 126 (2):200-218.

As we build increasingly intelligent machines, we confront difficult questions about how to specify their objectives. One approach, which we call human-centered, tasks the machine with the objective of learning and satisfying human objectives by observing our behavior. This paper considers how human-centered AI should conceive the humans it is trying to help. We argue that an Aristotelian model of human agency has certain advantages over the currently dominant theory drawn from economics.

Aristotle: Free Will and Agency in Ancient Greek and Roman Philosophy

Artificial Intelligence Safety in Philosophy of Cognitive Science

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Instrumental Reasoning in Philosophy of Action

Machine Ethics in Philosophy of Cognitive Science

Rational Requirements in Epistemology

Reasons and Rationality in Philosophy of Action

Robot Ethics in Applied Ethics

Direct download

Export citation

Bookmark

24

Where lies the grail? AI, common sense, and human practical intelligence.William Hasselberger & Micah Lott - forthcoming - Phenomenology and the Cognitive Sciences:1-22.

The creation of machines with intelligence comparable to human beings—so-called "human-level” and “general” intelligence—is often regarded as the Holy Grail of Artificial Intelligence (AI) research. However, many prominent discussions of AI lean heavily on the notion of human-level intelligence to frame AI research, but then rely on conceptions of human cognitive capacities, including “common sense,” that are sketchy, one-sided, philosophically loaded, and highly contestable. Our goal in this essay is to bring into view some underappreciated features of the practical intelligence (...)

Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

47

Social Choice for AI Alignment: Dealing with Diverse Human Feedback.Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mosse, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde & William S. Zwicker - manuscript

Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, so that, for example, they refuse to comply with requests for help with committing crimes or with producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How (...)

Artificial Intelligence Methodology in Philosophy of Cognitive Science

Artificial Intelligence Safety in Philosophy of Cognitive Science

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Large Language Models in Philosophy of Cognitive Science

Reinforcement Learning in Philosophy of Cognitive Science

Social Choice Theory, Misc in Social and Political Philosophy

Direct download

Export citation

Bookmark

Deontology and Safe Artificial Intelligence.William D'Alessandro - forthcoming - Philosophical Studies.

The field of AI safety aims to prevent increasingly capable artificially intelligent systems from causing humans harm. Research on moral alignment is widely thought to offer a promising safety strategy: if we can equip AI systems with appropriate ethical rules, according to this line of thought, they'll be unlikely to disempower, destroy or otherwise seriously harm us. Deontological morality looks like a particularly attractive candidate for an alignment target, given its popularity, relative technical tractability and commitment to harm-avoidance (...)

Direct download

Export citation

Bookmark

117

The value alignment problem: a geometric approach.Martin Peterson - 2019 - Ethics and Information Technology 21 (1):19-28.

Stuart Russell defines the value alignment problem as follows: How can we build autonomous systems with values that “are aligned with those of the human race”? In this article I outline some distinctions that are useful for understanding the value alignment problem and then propose a solution: I argue that the methods currently applied by computer scientists for embedding moral values in autonomous systems can be improved by representing moral principles as conceptual spaces, i.e. as Voronoi (...)

Computer Ethics, Misc in Applied Ethics

Direct download (2 more)

Export citation

Bookmark

3 citations

17

Epistemic (in)justice, social identity and the Black Box problem in patient care.Muneerah Khan & Cornelius Ewuoso - 2024 - Medicine, Health Care and Philosophy 27 (2):227-240.

This manuscript draws on the moral norms arising from the nuanced accounts of epistemic (in)justice and social identity in relational autonomy to normatively assess and articulate the ethical problems associated with using AI in patient care in light of the Black Box problem. The article also describes how black-boxed AI may be used within the healthcare system. The manuscript highlights what needs to happen to align AI with the moral norms it draws on. Deeper thinking – from other backgrounds (...)

Biomedical Ethics in Applied Ethics

Justice in Social and Political Philosophy

Direct download (2 more)

Export citation

Bookmark

1972

Two Victim Paradigms and the Problem of ‘Impure’ Victims.Diana Tietjens Meyers - 2011 - Humanity 2 (2):255-275.

Philosophers have had surprisingly little to say about the concept of a victim although it is presupposed by the extensive philosophical literature on rights. Proceeding in four stages, I seek to remedy this deficiency and to offer an alternative to the two current paradigms that eliminates the Othering of victims. First, I analyze two victim paradigms that emerged in the late 20th century along with the initial iteration of the international human rights regime – the pathetic victim paradigm and the (...) heroic victim paradigm. Holocaust victims are quintessential instances of the pathetic victim paradigm. They are marked by passivity and innocence in the face of overpowering force and unspeakable humanly inflicted suffering. Aung San Suu Kyi is an exemplar of the heroic victim paradigm – prisoners of conscience, in Amnesty International’s terms. Because heroic victims face off against the repressive power of the state to fight injustice, they are by no means passive, but they must be innocent of wrongdoing – that is, they must use nonviolent means of dissent – to qualify as heroic victims. Second, I problematize the asymmetrical conceptions of innocence that underwrite the two victim paradigms. Whereas the pathetic victim paradigm identifies innocence with passivity, the heroic victim paradigm countenances agentic victims and adverts to a universalist, absolutist stance on the limits of the legitimate use of state power to ascribe innocence to heroic victims. Both conceptions of innocence are out of keeping with well established social and legal practices regarding what constitutes coercive force and innocent victimhood. Consequently, there is reason to be skeptical of the two victim paradigms. Third, I identify two kinds of human rights violations and two categories of victims that AI defends despite their failure to fit the two paradigms – women trafficked into sex work and prisoners on death row. In many cases, women forced to do sex work are not innocent girls who are ignorant of the trafficking system and who helplessly fall prey to smugglers. They are desperately poor women who for that reason are willing to take enormous risks to try to relieve their own and often their families’ deprivation and suffering. Although these women act nonviolently for irreproachable reasons, they lack the public political agendas that characterize heroic victims. Unless non-fulfillment of subsistence rights is recognized as a form of overpowering force that inflicts severe, avoidable suffering, these women do not qualify as pathetic victims either. The victim paradigms pose an even greater obstacle to recognizing that the death penalty is a human rights violation and that death row prisoners are victims. Because a jury concluded that these individuals committed heinous, violent crimes, they are excluded by the heroic victim paradigm. Only if death row prisoners can be proven (usually through DNA evidence) not to have committed the crimes for which they were convicted can these individuals qualify as pathetic victims. In the absence of any reason to believe that they are innocent and especially if they are unrepentant, they are widely regarded as brutal victimizers of others who deserve no sympathy for, let along relief from, the suffering they “brought on themselves.” Finally, I confront the Othering of victims that results from the two victim paradigms, which leads many victims to eschew the label, thereby opting out of human right discourse. I propose revisions in the victim paradigms that eliminate the real-world exclusions they sponsor as well as the Othering of victims of human rights abuses. In particular, I endorse greater attention to what people and the institutions they create do to other people, and I favor a presumption that unnecessary and severe humanly inflicted suffering is a human rights violation. Moreover, I reject the innocence criterion embedded in the two paradigms and urge that it be replaced by a burdened agency criterion. These modifications better align the concept of a victim with a realistic understanding of human subjectivity and agency and allow for a more capacious understanding of who is a bearer of human rights and under what conditions right-holders become victims of rights violations. (shrink)

Human Rights, Misc in Social and Political Philosophy

Direct download

Export citation

Bookmark

1 citation

44

The AI Commander Problem: Ethical, Political, and Psychological Dilemmas of Human-Machine Interactions in AI-enabled Warfare.James Johnson - 2022 - Journal of Military Ethics 21 (3):246-271.

Can AI solve the ethical, moral, and political dilemmas of warfare? How is artificial intelligence (AI)-enabled warfare changing the way we think about the ethical-political dilemmas and practice of war? This article explores the key elements of the ethical, moral, and political dilemmas of human-machine interactions in modern digitized warfare. It provides a counterpoint to the argument that AI “rational” efficiency can simultaneously offer a viable solution to human psychological and biological fallibility in combat while retaining “meaningful” human control over (...)

Military Ethics in Applied Ethics

Direct download (3 more)

Export citation

Bookmark

29

Artificial intelligence and democratic legitimacy. The problem of publicity in public authority.Ludvig Beckman, Jonas Hultin Rosenberg & Karim Jebari - forthcoming - AI and Society.

Machine learning algorithms are increasingly used to support decision-making in the exercise of public authority. Here, we argue that an important consideration has been overlooked in previous discussions: whether the use of ML undermines the democratic legitimacy of public institutions. From the perspective of democratic legitimacy, it is not enough that ML contributes to efficiency and accuracy in the exercise of public authority, which has so far been the focus in the scholarly literature engaging with these developments. According to one (...)

Government and Democracy in Social and Political Philosophy

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (3 more)

Export citation

Bookmark

2 citations

23

Is AI a Problem for Forward Looking Moral Responsibility? The Problem Followed by a Solution.Fabio Tollon - 2022 - In Communications in Computer and Information Science. Cham: pp. 307-318.

Recent work in AI ethics has come to bear on questions of responsibility. Specifically, questions of whether the nature of AI-based systems render various notions of responsibility inappropriate. While substantial attention has been given to backward-looking senses of responsibility, there has been little consideration of forward-looking senses of responsibility. This paper aims to plug this gap, and will concern itself with responsibility as moral obligation, a particular kind of forward-looking sense of responsibility. Responsibility as moral obligation is predicated on the (...)

Applied Ethics

Moral Responsibility in Meta-Ethics

Normative Ethics

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Philosophy of Technology in Philosophy of Computing and Information

Direct download

Export citation

Bookmark

6312

How does Artificial Intelligence Pose an Existential Risk?Karina Vold & Daniel R. Harris - 2023 - In Carissa Véliz (ed.), The Oxford Handbook of Digital Ethics. Oxford University Press.

Alan Turing, one of the fathers of computing, warned that Artificial Intelligence (AI) could one day pose an existential risk to humanity. Today, recent advancements in the field AI have been accompanied by a renewed set of existential warnings. But what exactly constitutes an existential risk? And how exactly does AI pose such a threat? In this chapter we aim to answer these questions. In particular, we will critically explore three commonly cited reasons for thinking that AI poses an existential (...)

Artificial Intelligence Safety in Philosophy of Cognitive Science

Existential Risk in Philosophy of Action

$117.54 used $123.46 new $136.30 from Amazon (collection) View on Amazon.com

Direct download

Export citation

Bookmark

1 citation

108

Mapping the Stony Road toward Trustworthy AI: Expectations, Problems, Conundrums.Gernot Rieder, Judith Simon & Pak-Hang Wong - forthcoming - In Marcello Pelillo & Teresa Scantamburlo (eds.), Machines We Trust: Perspectives on Dependable AI. Cambridge, Mass.:

The notion of trustworthy AI has been proposed in response to mounting public criticism of AI systems, in particular with regard to the proliferation of such systems into ever more sensitive areas of human life without proper checks and balances. In Europe, the High-Level Expert Group on Artificial Intelligence has recently presented its Ethics Guidelines for Trustworthy AI. To some, the guidelines are an important step for the governance of AI. To others, the guidelines distract effort from genuine AI regulation. (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Moral Status of Artificial Systems in Philosophy of Cognitive Science

Trust in Normative Ethics

$4.76 used (collection) View on Amazon.com

Direct download

Export citation

Bookmark

1 citation

41

Shared Moral Foundations of Embodied Artificial Intelligence.Joe Cruz - 2019 - In Vincent Conitzer, Gillian Hadfield & Shannon Vallor (eds.), AIES '19: Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society. pp. 139-146.

Sophisticated AI's will make decisions about how to respond to complex situations, and we may wonder whether those decisions will align with the moral values of human beings. I argue that pessimistic worries about this value alignment problem are overstated. In order to achieve intelligence in its full generality and adaptiveness, cognition in AI's will need to be embodied in the sense of the Embodied Cognition research program. That embodiment will yield AI's that share our moral foundations, namely (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Machine Ethics in Philosophy of Cognitive Science

Direct download

Export citation

Bookmark

1 citation

33

AI in the noosphere: an alignment of scientific and wisdom traditions.Stephen D. Edwards - 2021 - AI and Society 36 (1):397-399.

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Direct download (2 more)

Export citation

Bookmark

	show categories
	categorization shortcuts
	hide abstracts
	open articles in new windows

	show categories
	categorization shortcuts
	hide abstracts
	open articles in new windows

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Results for 'ai alignment problem'