Search results for `speech recognition technology`

Export citation

Bookmark

41

DLD: An Optimized Chinese Speech Recognition Model Based on Deep Learning.Hong Lei, Yue Xiao, Yanchun Liang, Dalin Li & Heow Pueh Lee - 2022 - Complexity 2022:1-8.

Speech recognition technology has played an indispensable role in realizing human-computer intelligent interaction. However, most of the current Chinese speech recognition systems are provided online or offline models with low accuracy and poor performance. To improve the performance of offline Chinese speech recognition, we propose a hybrid acoustic model of deep convolutional neural network, long short-term memory, and deep neural network. This model utilizes DCNN to reduce frequency variation and adds a batch normalization (...)

Export citation

Bookmark

8

English Flipped Classroom Teaching Mode Based on Emotion Recognition Technology.Lin Lai - 2022 - Frontiers in Psychology 13.

With the development of modern information technology, the flipped classroom teaching mode came into being. It has gradually become one of the hotspots of contemporary educational circles and has been applied to various disciplines at the same time. The domestic research on the flipped classroom teaching mode is still in the exploratory stage. The application of flipped classroom teaching mode is still in the exploratory stage. It also has many problems, such as low class efficiency, poor teacher-student interaction, outdated (...)

Computational Linguistics in Philosophy of Language

Export citation

Bookmark

26

Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition.Dan Jurafsky & James H. Martin - 2000 - Prentice-Hall.

The first of its kind to thoroughly cover language technology at all levels and with all modern technologies this book takes an empirical approach to the ...

Knowledge of Language in Philosophy of Language

$10.94 used $100.56 new View on Amazon.com

Philosophy of Linguistics in Philosophy of Language

Export citation

Bookmark

32 citations

11

Locus equation and hidden parameters of speech.Li Deng - 1998 - Behavioral and Brain Sciences 21 (2):263-264.

Locus equations contain an economical set of hidden (i.e., not directly observable in the data) parameters of speech that provide an elegant way of characterizing the ubiquitous context-dependent behaviors exhibited in speech acoustics. These hidden parameters can be effectively exploited to constrain the huge set of context-dependent speech model parameters currently in use in modern, mainstream speech recognition technology.

The Nature of Context in Philosophy of Language

Direct download (6 more)

Export citation

Bookmark

4

Speech trasformations solutions.Dimitri Kanevsky, Sara Basson, Alexander Faisman, Leonid Rachevsky, Alex Zlatsin & Sarah Conrod - 2006 - Pragmatics and Cognition 14 (2):411-442.

This paper outlines the background development of “intelligent“ technologies such as speech recognition. Despite significant progress in the development of these technologies, they still fall short in many areas, and rapid advances in areas such as dictation are actually stalled. In this paper we have proposed semi-automatic solutions — smart integration of human and intelligent efforts. One such technique involves improvement to the speech recognition editing interface, thereby reducing the perception of errors to the viewer. Other (...)

Metaphysics and Epistemology

Philosophy of Linguistics in Philosophy of Language

Export citation

Bookmark

51

Intelligent Aging Home Control Method and System for Internet of Things Emotion Recognition.Xu Wu & Qian Zhang - 2022 - Frontiers in Psychology 13.

To solve a series of pension problems caused by aging, based on the emotional recognition of the Internet of Things, the control method and system research of smart homes are proposed. This article makes a detailed analysis and research on the necessity, feasibility, and how to realize speech emotion recognition technology in smart families, introduces the definition and classification of emotion, and puts forward five main emotions to be recognized in speech emotion recognition based (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

4

Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops.Amita Dev - 2009 - AI and Society 23 (4):603-612.

As development of the speech recognition system entirely depends upon the spoken language used for its development, and the very fact that speech technology is highly language dependent and reverse engineering is not possible, there is an utmost need to develop such systems for Indian languages. In this paper we present the implementation of a time delay neural network system (TDNN) in a modular fashion by exploiting the hidden structure of previously phonetic subcategory network for (...) of Hindi consonants. For the present study we have selected all the Hindi phonemes for srecognition. A vocabulary of 207 Hindi words was designed for the task-specific environment and used as a database. For the recognition of phoneme, a three-layered network was constructed and the network was trained using the back propagation learning algorithm. Experiments were conducted to categorize the Hindi voiced, unvoiced stops, semi vowels, vowels, nasals and fricatives. A close observation of confusion matrix of Hindi stops revealed maximum confusion of retroflex stops with their non-retroflex counterparts. (shrink)

Export citation

Bookmark

13

Agreeing on a Norm: What Sort of Speech Act?Cristina Corredor - 2023 - Topoi 42 (2):495-507.

What type of speech act is a norm of action, when the norm is agreed upon as the conclusion of an argumentative dialogue? My hypothesis is that, whenever a norm of action is the conclusion of an argument, it should be analyzed as the statement of a norm and thus as a verdictive speech act. If the context is appropriate, and the interlocutors are sincerely (or institutionally) committed to their argumentative exchange and its conclusion, then this verdictive motivates (...)

Value Theory

Export citation

Bookmark

2 citations

3

Optimization of Intelligent English Pronunciation Training System Based on Android Platform.Qianyu Cao & Hanmei Hao - 2021 - Complexity 2021:1-11.

Oral English, as a language tool, is not only an important part of English learning but also an essential part. For nonnative English learners, effective and meaningful voice feedback is very important. At present, most of the traditional recognition and error correction systems for oral English training are still in the theoretical stage. At the same time, the corresponding high-end experimental prototype also has the disadvantages of large and complex system. In the speech recognition technology, the (...)

Ethics of Artificial Intelligence, Misc in Philosophy of Cognitive Science

Export citation

Bookmark

31

The Future of Work: Augmentation or Stunting?Markus Furendal & Karim Jebari - 2023 - Philosophy and Technology (2):1-22.

The last decade has seen significant improvements in Artificial Intelligence (AI) technologies, including robotics, machine vision, speech recognition and text generation. Increasing automation will undoubtedly affect the future of work, and discussions on how the development of AI in the workplace will impact labor markets often include two scenarios: (1) labor replacement and (2) labor enabling. The former involves replacing workers with machines, while the latter assumes that human-machine cooperation can significantly improve worker productivity. In this context, it (...)

Social and Political Philosophy

Technology Ethics in Applied Ethics

Export citation

Bookmark

3

Automatic Speech Recognition: A Comprehensive Survey.Arbana Kadriu & Amarildo Rista - 2020 - Seeu Review 15 (2):86-112.

Speech recognition is an interdisciplinary subfield of natural language processing (NLP) that facilitates the recognition and translation of spoken language into text by machine. Speech recognition plays an important role in digital transformation. It is widely used in different areas such as education, industry, and healthcare and has recently been used in many Internet of Things and Machine Learning applications. The process of speech recognition is one of the most difficult processes in computer (...)

No categories

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

30

Intelligent service robots for elderly or disabled people and human dignity: legal point of view.Katarzyna Pfeifer-Chomiczewska - 2023 - AI and Society 38 (2):789-800.

This article aims to present the problem of the impact of artificial intelligence on respect for human dignity in the sphere of care for people who, for various reasons, are described as particularly vulnerable, especially seniors and people with various disabilities. In recent years, various initiatives and works have been undertaken on the European scene to define the directions in which the development and use of artificial intelligence should go. According to the human-centric approach, artificial intelligence should be developed, used (...) and monitored with people in mind, their needs and rights. It is artificial intelligence that should adapt to the rules set by people, not the other way around. The source of all human rights and freedoms is respect for human dignity. This is evidenced by numerous European, international, national regulations and documents. Respecting this value is also one of the works on the AI development. One of the areas of our life into which AI enters more and more boldly and which in the near future, due to demographic reasons and aging population, may be almost dominated by modern technologies, is the medical and care system. The usefulness of industrial co-robots made us think about the possibility of using similar devices in the non-industrial sphere, including in the sphere of care, nursing and therapy. Actually, despite the impressive development of technology and AI, intelligent robots/devices used for care, nursing or therapy (so-called care/nursing or therapy robots) perform rather simple activities. They provide information, navigate, bring something and give it (e.g. medicines, food), help to get up. They are an important support for medical staff in this respect. Performing tasks that are more complex in terms of technology and movement, such as feeding, washing, intimate hygiene, dressing or undressing, is still beyond the reach of the currently used robots. The question we must ask ourselves is whether, in the future, care robots will be able to perform such tasks, and should they do so? Will this lead to the replacement of human medical personnel in the performance of the above-mentioned activities with such robots. Would anyone want to be "looked after" by a soulless machine? Should potential wards/patients, in other words all of us in fact, have the right to object to being taken into the care of an intelligent robot? There is no doubt that intelligent robots have great potential, especially when it comes to ensuring or maintaining the independence or mobility of people with various health limitations. However, we must not forget about the risks and dangers they may generate. Even a simple robot generates serious legal and ethical problems. Who should be liable for damage caused in connection with improper care performed with the participation of a robot? Robots based on AI systems are able to collect a significant amount of sensitive data, for example through the face and speech recognition function. There is considerable scope for abuse in this area, not only in terms of personal data protection but also in terms of informational self-determination, privacy and dignity. Bearing in mind the above risks, the creation of a new right to object to being looked after by a robot should be considered. (shrink)

Export citation

Bookmark

1 citation

8

Restricted Speech Recognition in Noise and Quality of Life of Hearing-Impaired Children and Adolescents With Cochlear Implants – Need for Studies Addressing This Topic With Valid Pediatric Quality of Life Instruments.Maria Huber & Clara Havas - 2019 - Frontiers in Psychology 10.

Cochlear implants (CI) support the development of oral language in hearing-impaired children. However, even with CI, speech recognition in noise (SRiN) is limited. This raised the question, whether these restrictions are related to the quality of life (QoL) of children and adolescents with CI and how SRiN and QoL are related to each other. As a result of a systematic literature research only three studies were found, indicating positive moderating effects between SRiN and QoL of young CI users. (...)

Export citation

Bookmark

7

Integrated Design of Financial Self-Service Terminal Based on Artificial Intelligence Voice Interaction.Huizhong Chen, Shu Chen & Jingfeng Zhao - 2022 - Frontiers in Psychology 13.

Integrated design of financial self-service terminal based on artificial intelligence voice interaction with the rapid development of science and technology, artificial intelligence technology is deepening in the field of intelligence and automation. The financial industry is the lifeblood of a country’s economy, with great growth potential and high growth rate. The integrated design of intelligent financial self-service terminal has become an important topic in the field of rapid development of social economy and science and technology. Therefore, this (...)

Science of Consciousness, Miscellaneous in Philosophy of Cognitive Science

Export citation

Bookmark

18

Merging information in speech recognition: Feedback is never necessary.Dennis Norris, James M. McQueen & Anne Cutler - 2000 - Behavioral and Brain Sciences 23 (3):299-325.

Top-down feedback does not benefit speech recognition; on the contrary, it can hinder it. No experimental data imply that feedback loops are required for speech recognition. Feedback is accordingly unnecessary and spoken word recognition is modular. To defend this thesis, we analyse lexical involvement in phonemic decision making. TRACE (McClelland & Elman 1986), a model with feedback from the lexicon to prelexical processes, is unable to account for all the available data on phonemic decision making. (...)

Direct download (8 more)

Export citation

Bookmark

47 citations

1

Detecting Pronunciation Errors in Spoken English Tests Based on Multifeature Fusion Algorithm.Yinping Wang - 2021 - Complexity 2021:1-11.

In this study, multidimensional feature extraction is performed on the U-language recordings of the test takers, and these features are evaluated separately, with five categories of features: pronunciation, fluency, vocabulary, grammar, and semantics. A deep neural network model is constructed to model the feature values to obtain the final score. Based on the previous research, this study uses a deep neural network training model instead of linear regression to improve the correlation between model score and expert score. The method of (...)

Export citation

Bookmark

9

Image Recognition Technology in Texture Identification of Marine Sediment Sonar Image.Chao Sun, Li Wang, Nan Wang & Shaohua Jin - 2021 - Complexity 2021:1-8.

Through the recognition of ocean sediment sonar images, the texture in the image can be classified, which provides an important basis for the classification of ocean sediment. Aiming at the problems of low efficiency, waste of human resources, and low accuracy in the traditional manual side-scan sonar image discrimination, this paper studies the application of image recognition technology in sonar image substrate texture discrimination, which is popular in many fields. At the same time, considering the scale complexity, (...)

No categories

Philosophy of Cognitive Science

Export citation

Bookmark

18

Effects of Semantic Context and Fundamental Frequency Contours on Mandarin Speech Recognition by Second Language Learners.Linjun Zhang, Yu Li, Han Wu, Xin Li, Hua Shu, Yang Zhang & Ping Li - 2016 - Frontiers in Psychology 7:189783.

Speech recognition by second language (L2) learners in optimal and suboptimal conditions has been examined extensively with English as the target language in most previous studies. This study extended existing experimental protocols ( Wang et al., 2013 ) to investigate Mandarin speech recognition by Japanese learners of Mandarin at two different levels (elementary vs. intermediate) of proficiency. The overall results showed that in addition to L2 proficiency, semantic context, F0 contours, and listening condition all affected the (...)

Direct download (6 more)

Export citation

Bookmark

3 citations

7

Philosophical Lessons for Emotion Recognition Technology.Rosalie Waelen - 2024 - Minds and Machines 34 (1):1-13.

Emotion recognition technology uses artificial intelligence to make inferences about a person’s emotions, on the basis of their facial expressions, body language, tone of voice, or other types of input. Underlying such technology are a variety of assumptions about the manifestation, nature, and value of emotions. To assure the quality and desirability of emotion recognition technology, it is important to critically assess the assumptions embedded in the technology. Within philosophy, there is a long tradition (...)

Emotions in Philosophy of Mind

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

6

Masked Speech Recognition in School-Age Children.Lori J. Leibold & Emily Buss - 2019 - Frontiers in Psychology 10.

Export citation

Bookmark

10

Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models.H. J. Nock & S. J. Young - 2002 - Cognitive Science 26 (3):283-301.

Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model (...)

Export citation

Bookmark

3

Longitudinal Speech Recognition in Noise in Children: Effects of Hearing Status and Vocabulary.Elizabeth A. Walker, Caitlin Sapp, Jacob J. Oleson & Ryan W. McCreery - 2019 - Frontiers in Psychology 10.

Export citation

Bookmark

2

Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling.R. K. Aggarwal & A. Kumar - 2020 - Journal of Intelligent Systems 30 (1):165-179.

This paper implements the continuous Hindi Automatic Speech Recognition (ASR) system using the proposed integrated features vector with Recurrent Neural Network (RNN) based Language Modeling (LM). The proposed system also implements the speaker adaptation using Maximum-Likelihood Linear Regression (MLLR) and Constrained Maximum likelihood Linear Regression (C-MLLR). This system is discriminatively trained by Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) techniques with 256 Gaussian mixture per Hidden Markov Model(HMM) state. The training of the baseline system has been (...)

No categories

Export citation

Bookmark

Facial recognition technology : ethical and legal implication.Ellen Raineri, Erin Brennan & Audrey Ryder - 2022 - In Tamara Phillips Fudge (ed.), Exploring ethical problems in today's technological world. Hershey PA: Engineering Science Reference.

Technology Ethics in Applied Ethics

$185.00 new (collection) View on Amazon.com

Export citation

Bookmark

20

EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.James S. Magnuson, Heejo You, Sahil Luthra, Monica Li, Hosung Nam, Monty Escabí, Kevin Brown, Paul D. Allopenna, Rachel M. Theodore, Nicholas Monto & Jay G. Rueckl - 2020 - Cognitive Science 44 (4):e12823.

Despite the lack of invariance problem (the many‐to‐many mapping between acoustics and percepts), human listeners experience phonetic constancy and typically perceive what a speaker intends. Most models of human speech recognition (HSR) have side‐stepped this problem, working with abstract, idealized inputs and deferring the challenge of working with real speech. In contrast, carefully engineered deep learning networks allow robust, real‐world automatic speech recognition (ASR). However, the complexities of deep learning architectures and training regimens make it (...)

No categories

Democracy in Social and Political Philosophy

Export citation

Bookmark

4 citations

6

Biased Face Recognition Technology Used by Government: A Problem for Liberal Democracy.Michael Gentzel - 2021 - Philosophy and Technology 34 (4):1639-1663.

This paper presents a novel philosophical analysis of the problem of law enforcement’s use of biased face recognition technology in liberal democracies. FRT programs used by law enforcement in identifying crime suspects are substantially more error-prone on facial images depicting darker skin tones and females as compared to facial images depicting Caucasian males. This bias can lead to citizens being wrongfully investigated by police along racial and gender lines. The author develops and defends “A Liberal Argument Against Biased (...)

Export citation

Bookmark

1 citation

29

Race, again: how face recognition technology reinforces racial discrimination.Fabio Bacchini & Ludovica Lorusso - 2019 - Journal of Information, Communication and Ethics in Society 17 (3):321-335.

Purpose This study aims to explore whether face recognition technology – as it is intensely used by state and local police departments and law enforcement agencies – is racism free or, on the contrary, is affected by racial biases and/or racist prejudices, thus reinforcing overall racial discrimination. Design/methodology/approach The study investigates the causal pathways through which face recognition technology may reinforce the racial disproportion in enforcement; it also inquires whether it further discriminates black people by making (...)

No categories

Export citation

Bookmark

2 citations

Speech recognition: Statistical methods.L. R. Rabiner & B. H. Juang - 2005 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier. pp. 1--18.

Linguistics in Cognitive Sciences

$11850.00 new (collection) View on Amazon.com

Export citation

Bookmark

8

Vocabulary size has been suggested as a useful measure of “verbal abilities” that correlates with speech recognition scores. Knowing more words is linked to better speech recognition. How vocabulary knowledge translates to general speech recognition mechanisms, how these mechanisms relate to offline speech recognition scores, and how they may be modulated by acoustical distortion or age, is less clear. Age-related differences in linguistic measures may predict age-related differences in speech recognition (...)

Psycholinguistics in Philosophy of Language

Export citation

Bookmark

3 citations

10

Merging information versus speech recognition.Irene Appelbaum - 2000 - Behavioral and Brain Sciences 23 (3):325-326.

Norris, McQueen & Cutler claim that all known speech recognition data can be accounted for with their autonomous model, “Merge.” But this claim is doubly misleading. (1) Although speech recognition is autonomous in their view, the Merge model is not. (2) The body of data which the Merge model accounts for, is not, in their view, speech recognition data. Footnotes1 Author is also affiliated with the Center for the Study of Language and Information, Stanford (...)

Export citation

Bookmark

5

English Phrase Speech Recognition Based on Continuous Speech Recognition Algorithm and Word Tree Constraints.Haifan Du & Haiwen Duan - 2021 - Complexity 2021:1-11.

This paper combines domestic and international research results to analyze and study the difference between the attribute features of English phrase speech and noise to enhance the short-time energy, which is used to improve the threshold judgment sensitivity; noise addition to the discrepancy data set is used to enhance the recognition robustness. The backpropagation algorithm is improved to constrain the range of weight variation, avoid oscillation phenomenon, and shorten the training time. In the real English phrase sound (...) system, there are problems such as massive training data and low training efficiency caused by the super large-scale model parameters of the convolutional neural network. To address these problems, the NWBP algorithm is based on the oscillation phenomenon that tends to occur when searching for the minimum error value in the late training period of the network parameters, using the K-MEANS algorithm to obtain the seed nodes that approach the minimal error value, and using the boundary value rule to reduce the range of weight change to reduce the oscillation phenomenon so that the network error converges as soon as possible and improve the training efficiency. Through simulation experiments, the NWBP algorithm improves the degree of fitting and convergence speed in the training of complex convolutional neural networks compared with other algorithms, reduces the redundant computation, and shortens the training time to a certain extent, and the algorithm has the advantage of accelerating the convergence of the network compared with simple networks. The word tree constraint and its efficient storage structure are introduced, which improves the storage efficiency of the word tree constraint and the retrieval efficiency in the English phrase recognition search. (shrink)

Export citation

Bookmark

2

On Dynamic Pitch Benefit for Speech Recognition in Speech Masker.Jing Shen & Pamela E. Souza - 2018 - Frontiers in Psychology 9.

Previous work demonstrated that dynamic pitch (i.e., pitch variation in speech) aids speech recognition in various types of noises. While this finding suggests dynamic pitch enhancement in target speech can benefit speech recognition in noise, it is of importance to know what noise characteristics affect dynamic pitch benefit and who will benefit from enhanced dynamic pitch cues. Following our recent finding that temporal modulation in noise influences dynamic pitch benefit, we examined the effect of (...)

Psycholinguistics in Philosophy of Language

Export citation

Bookmark

9

Perceptual units in speech recognition.Dominic W. Massaro - 1974 - Journal of Experimental Psychology 102 (2):199.

Export citation

Bookmark

2 citations

25

Mandarin-Speaking Children’s Speech Recognition: Developmental Changes in the Influences of Semantic Context and F0 Contours.Zhou Hong, Li Yu, Liang Meng, Guan Connie Qun, Zhang Linjun, Shu Hua & Zhang Yang - 2017 - Frontiers in Psychology 8.

Autonomy in Applied Ethics in Applied Ethics

Direct download (5 more)

Export citation

Bookmark

1 citation

12

Identity crisis: Face recognition technology and freedom of the will.Benjamin Hale - 2005 - Ethics, Place and Environment 8 (2):141 – 158.

In this paper I present the position that the use of face recognition technology (FRT) in law enforcement and in business is restrictive of individual autonomy. I reason that FRT severely undermines autonomous self-determination by hobbling the idea of freedom of the will. I distinguish this position from two other common arguments against surveillance technologies: the privacy argument (that FRT is an invasion of privacy) and the objective freedom argument (that FRT is restrictive of one's freedom to act). (...)

Environmental Ethics in Applied Ethics

Export citation

Bookmark

2 citations

10

Differences in Speech Recognition Between Children with Attention Deficits and Typically Developed Children Disappear When Exposed to 65 dB of Auditory Noise.Göran B. W. Söderlund & Elisabeth Nilsson Jobs - 2016 - Frontiers in Psychology 7.

Direct download (7 more)

Export citation

Bookmark

1 citation

6

A Hybrid of Deep CNN and Bidirectional LSTM for Automatic Speech Recognition.Rajesh Kumar Aggarwal & Vishal Passricha - 2019 - Journal of Intelligent Systems 29 (1):1261-1274.

Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional neural networks (CNNs) are the advanced version of DNNs that achieve 4–12% relative gain in the word error rate (WER) over DNNs. Existence of spectral variations and local correlations in speech signal makes CNNs more capable of speech recognition. Recently, it has been demonstrated that bidirectional long short-term memory (BLSTM) produces higher recognition rate in acoustic modeling because they are adequate to reinforce (...)

No categories

Export citation

Bookmark

13

Shortlist B: A Bayesian model of continuous speech recognition.Dennis Norris & James M. McQueen - 2008 - Psychological Review 115 (2):357-395.

No categories

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

66 citations

45

The ethical application of biometric facial recognition technology.Marcus Smith & Seumas Miller - 2022 - AI and Society 37 (1):167-175.

Biometric facial recognition is an artificial intelligence technology involving the automated comparison of facial features, used by law enforcement to identify unknown suspects from photographs and closed circuit television. Its capability is expanding rapidly in association with artificial intelligence and has great potential to solve crime. However, it also carries significant privacy and other ethical implications that require law and regulation. This article examines the rise of biometric facial recognition, current applications and legal developments, and conducts an (...)

Aspects of Consciousness in Philosophy of Mind

Export citation

Bookmark

6 citations

Audio-visual speech recognition.G. Potamianos & J. Luettin - 2005 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier.

Philosophy of Consciousness in Philosophy of Mind

$11850.00 new (collection) View on Amazon.com

Export citation

Bookmark

6

Single-Channel Speech Enhancement Techniques for Distant Speech Recognition.Ramaswamy Kumaraswamy & Jaya Kumar Ashwini - 2013 - Journal of Intelligent Systems 22 (2):81-93.

This article presents an overview of the single-channel dereverberation methods suitable for distant speech recognition application. The dereverberation methods are mainly classified based on the domain of enhancement of speech signal captured by a distant microphone. Many single-channel speech enhancement methods focus on either denoising or dereverberating the distorted speech signal. There are very few methods that consider both noise and reverberation effects. Such methods are discussed under a multistage approach in this article. The article (...)

Philosophy of Linguistics in Philosophy of Language

Export citation

Bookmark

10

Multitask Learning with Local Attention for Tibetan Speech Recognition.Hui Wang, Fei Gao, Yue Zhao, Li Yang, Jianjian Yue & Huilin Ma - 2020 - Complexity 2020:1-10.

In this paper, we propose to incorporate the local attention in WaveNet-CTC to improve the performance of Tibetan speech recognition in multitask learning. With an increase in task number, such as simultaneous Tibetan speech content recognition, dialect identification, and speaker recognition, the accuracy rate of a single WaveNet-CTC decreases on speech recognition. Inspired by the attention mechanism, we introduce the local attention to automatically tune the weights of feature frames in a window and (...)

Philosophy of Cognitive Science

Export citation

Bookmark

7

Temporal cortex activation during speech recognition: an optical topography study.Hiroki Sato, Tatsuya Takeuchi & Kuniyoshi L. Sakai - 1999 - Cognition 73 (3):B55-B66.

Philosophy of Psychology in Philosophy of Cognitive Science

Direct download (5 more)

Export citation

Bookmark

8

Deconstructing public participation in the governance of facial recognition technologies in Canada.Maurice Jones & Fenwick McKelvey - forthcoming - AI and Society:1-14.

On February 13, 2020, the Toronto Police Services (TPS) issued a statement admitting that its members had used Clearview AI’s controversial facial recognition technology (FRT). The controversy sparked widespread outcry by the media, civil society, and community groups, and put pressure on policy-makers to address FRTs. Public consultations presented a key tool to contain the scandal in Toronto and across Canada. Drawing on media reports, policy documents, and expert interviews, we investigate four consultations held by the Toronto Police (...)

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

30

Eyes on the Streets: Media Use and Public Opinion About Facial Recognition Technology.David C. Wilson, Ashley Paintsil, Wyatt Dawson, James Bingaman & Paul R. Brewer - 2022 - Bulletin of Science, Technology and Society 42 (4):133-143.

This study examines how different forms of media use predict attitudes toward the development of facial recognition technology (FRT) and applications of it by law enforcement to identify criminal suspects, identify potential terrorists, and monitor public protests. The theoretical framework builds on theories of cultivation and genre-specific viewing to develop hypotheses and research questions. The analyses draw on original data from two nationally representative surveys of the U.S. public conducted in 2020, amid a series of controversies and protests (...)

No categories

Philosophy of Artificial Intelligence in Philosophy of Cognitive Science

Export citation

Bookmark

19

A phenomenological perspective on AI ethical failures: The case of facial recognition technology.Yuni Wen & Matthias Holweg - forthcoming - AI and Society:1-18.

As more and more companies adopt artificial intelligence to increase the efficiency and effectiveness of their products and services, they expose themselves to ethical crises and potentially damaging public controversy associated with its use. Despite the prevalence of AI ethical problems, most companies are strategically unprepared to respond effectively to the public. This paper aims to advance our empirical understanding of company responses to AI ethical crises by focusing on the rise and fall of facial recognition technology. Specifically, (...)

Philosophy of Cognitive Science, Miscellaneous in Philosophy of Cognitive Science

Export citation

Bookmark

9

Shortlist: a connectionist model of continuous speech recognition.Dennis Norris - 1994 - Cognition 52 (3):189-234.

Export citation

Bookmark

77 citations

6

Do Age and Linguistic Status Alter the Effect of Sound Source Diffuseness on Speech Recognition in Noise?Meital Avivi-Reich, Rupinder Kaur Sran & Bruce A. Schneider - 2022 - Frontiers in Psychology 13.

One aspect of auditory scenes that has received very little attention is the level of diffuseness of sound sources. This aspect has increasing importance due to growing use of amplification systems. When an auditory stimulus is amplified and presented over multiple, spatially-separated loudspeakers, the signal’s timbre is altered due to comb filtering. In a previous study we examined how increasing the diffuseness of the sound sources might affect listeners’ ability to recognize speech presented in different types of background noise. (...)