Results for 'speech recognition technology'

1000+ found
Order:
  1. Speech recognition technology.F. Beaufays, H. Bourlard, Horacio Franco & Nelson Morgan - 2002 - In Michael A. Arbib (ed.), The Handbook of Brain Theory and Neural Networks, Second Edition. MIT Press.
     
    Export citation  
     
    Bookmark  
  2.  98
    Neutrosophic speech recognition Algorithm for speech under stress by Machine learning.Florentin Smarandache, D. Nagarajan & Said Broumi - 2023 - Neutrosophic Sets and Systems 53.
    It is well known that the unpredictable speech production brought on by stress from the task at hand has a significant negative impact on the performance of speech processing algorithms. Speech therapy benefits from being able to detect stress in speech. Speech processing performance suffers noticeably when perceptually produced stress causes variations in speech production. Using the acoustic speech signal to objectively characterize speaker stress is one method for assessing production variances brought on (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  3.  41
    DLD: An Optimized Chinese Speech Recognition Model Based on Deep Learning.Hong Lei, Yue Xiao, Yanchun Liang, Dalin Li & Heow Pueh Lee - 2022 - Complexity 2022:1-8.
    Speech recognition technology has played an indispensable role in realizing human-computer intelligent interaction. However, most of the current Chinese speech recognition systems are provided online or offline models with low accuracy and poor performance. To improve the performance of offline Chinese speech recognition, we propose a hybrid acoustic model of deep convolutional neural network, long short-term memory, and deep neural network. This model utilizes DCNN to reduce frequency variation and adds a batch normalization (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  8
    English Flipped Classroom Teaching Mode Based on Emotion Recognition Technology.Lin Lai - 2022 - Frontiers in Psychology 13.
    With the development of modern information technology, the flipped classroom teaching mode came into being. It has gradually become one of the hotspots of contemporary educational circles and has been applied to various disciplines at the same time. The domestic research on the flipped classroom teaching mode is still in the exploratory stage. The application of flipped classroom teaching mode is still in the exploratory stage. It also has many problems, such as low class efficiency, poor teacher-student interaction, outdated (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  5.  26
    Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition.Dan Jurafsky & James H. Martin - 2000 - Prentice-Hall.
    The first of its kind to thoroughly cover language technology at all levels and with all modern technologies this book takes an empirical approach to the ...
    Direct download  
     
    Export citation  
     
    Bookmark   32 citations  
  6.  11
    Locus equation and hidden parameters of speech.Li Deng - 1998 - Behavioral and Brain Sciences 21 (2):263-264.
    Locus equations contain an economical set of hidden (i.e., not directly observable in the data) parameters of speech that provide an elegant way of characterizing the ubiquitous context-dependent behaviors exhibited in speech acoustics. These hidden parameters can be effectively exploited to constrain the huge set of context-dependent speech model parameters currently in use in modern, mainstream speech recognition technology.
    Direct download (6 more)  
     
    Export citation  
     
    Bookmark  
  7.  4
    Speech trasformations solutions.Dimitri Kanevsky, Sara Basson, Alexander Faisman, Leonid Rachevsky, Alex Zlatsin & Sarah Conrod - 2006 - Pragmatics and Cognition 14 (2):411-442.
    This paper outlines the background development of “intelligent“ technologies such as speech recognition. Despite significant progress in the development of these technologies, they still fall short in many areas, and rapid advances in areas such as dictation are actually stalled. In this paper we have proposed semi-automatic solutions — smart integration of human and intelligent efforts. One such technique involves improvement to the speech recognition editing interface, thereby reducing the perception of errors to the viewer. Other (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  8.  51
    Intelligent Aging Home Control Method and System for Internet of Things Emotion Recognition.Xu Wu & Qian Zhang - 2022 - Frontiers in Psychology 13.
    To solve a series of pension problems caused by aging, based on the emotional recognition of the Internet of Things, the control method and system research of smart homes are proposed. This article makes a detailed analysis and research on the necessity, feasibility, and how to realize speech emotion recognition technology in smart families, introduces the definition and classification of emotion, and puts forward five main emotions to be recognized in speech emotion recognition based (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  9.  4
    Effect of retroflex sounds on the recognition of Hindi voiced and unvoiced stops.Amita Dev - 2009 - AI and Society 23 (4):603-612.
    As development of the speech recognition system entirely depends upon the spoken language used for its development, and the very fact that speech technology is highly language dependent and reverse engineering is not possible, there is an utmost need to develop such systems for Indian languages. In this paper we present the implementation of a time delay neural network system (TDNN) in a modular fashion by exploiting the hidden structure of previously phonetic subcategory network for (...) of Hindi consonants. For the present study we have selected all the Hindi phonemes for srecognition. A vocabulary of 207 Hindi words was designed for the task-specific environment and used as a database. For the recognition of phoneme, a three-layered network was constructed and the network was trained using the back propagation learning algorithm. Experiments were conducted to categorize the Hindi voiced, unvoiced stops, semi vowels, vowels, nasals and fricatives. A close observation of confusion matrix of Hindi stops revealed maximum confusion of retroflex stops with their non-retroflex counterparts. (shrink)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  10.  13
    Agreeing on a Norm: What Sort of Speech Act?Cristina Corredor - 2023 - Topoi 42 (2):495-507.
    What type of speech act is a norm of action, when the norm is agreed upon as the conclusion of an argumentative dialogue? My hypothesis is that, whenever a norm of action is the conclusion of an argument, it should be analyzed as the statement of a norm and thus as a verdictive speech act. If the context is appropriate, and the interlocutors are sincerely (or institutionally) committed to their argumentative exchange and its conclusion, then this verdictive motivates (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  11.  3
    Optimization of Intelligent English Pronunciation Training System Based on Android Platform.Qianyu Cao & Hanmei Hao - 2021 - Complexity 2021:1-11.
    Oral English, as a language tool, is not only an important part of English learning but also an essential part. For nonnative English learners, effective and meaningful voice feedback is very important. At present, most of the traditional recognition and error correction systems for oral English training are still in the theoretical stage. At the same time, the corresponding high-end experimental prototype also has the disadvantages of large and complex system. In the speech recognition technology, the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  12.  31
    The Future of Work: Augmentation or Stunting?Markus Furendal & Karim Jebari - 2023 - Philosophy and Technology (2):1-22.
    The last decade has seen significant improvements in Artificial Intelligence (AI) technologies, including robotics, machine vision, speech recognition and text generation. Increasing automation will undoubtedly affect the future of work, and discussions on how the development of AI in the workplace will impact labor markets often include two scenarios: (1) labor replacement and (2) labor enabling. The former involves replacing workers with machines, while the latter assumes that human-machine cooperation can significantly improve worker productivity. In this context, it (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  13.  3
    Automatic Speech Recognition: A Comprehensive Survey.Arbana Kadriu & Amarildo Rista - 2020 - Seeu Review 15 (2):86-112.
    Speech recognition is an interdisciplinary subfield of natural language processing (NLP) that facilitates the recognition and translation of spoken language into text by machine. Speech recognition plays an important role in digital transformation. It is widely used in different areas such as education, industry, and healthcare and has recently been used in many Internet of Things and Machine Learning applications. The process of speech recognition is one of the most difficult processes in computer (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  14.  30
    Intelligent service robots for elderly or disabled people and human dignity: legal point of view.Katarzyna Pfeifer-Chomiczewska - 2023 - AI and Society 38 (2):789-800.
    This article aims to present the problem of the impact of artificial intelligence on respect for human dignity in the sphere of care for people who, for various reasons, are described as particularly vulnerable, especially seniors and people with various disabilities. In recent years, various initiatives and works have been undertaken on the European scene to define the directions in which the development and use of artificial intelligence should go. According to the human-centric approach, artificial intelligence should be developed, used (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  15.  8
    Restricted Speech Recognition in Noise and Quality of Life of Hearing-Impaired Children and Adolescents With Cochlear Implants – Need for Studies Addressing This Topic With Valid Pediatric Quality of Life Instruments.Maria Huber & Clara Havas - 2019 - Frontiers in Psychology 10.
    Cochlear implants (CI) support the development of oral language in hearing-impaired children. However, even with CI, speech recognition in noise (SRiN) is limited. This raised the question, whether these restrictions are related to the quality of life (QoL) of children and adolescents with CI and how SRiN and QoL are related to each other. As a result of a systematic literature research only three studies were found, indicating positive moderating effects between SRiN and QoL of young CI users. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  16.  7
    Integrated Design of Financial Self-Service Terminal Based on Artificial Intelligence Voice Interaction.Huizhong Chen, Shu Chen & Jingfeng Zhao - 2022 - Frontiers in Psychology 13.
    Integrated design of financial self-service terminal based on artificial intelligence voice interaction with the rapid development of science and technology, artificial intelligence technology is deepening in the field of intelligence and automation. The financial industry is the lifeblood of a country’s economy, with great growth potential and high growth rate. The integrated design of intelligent financial self-service terminal has become an important topic in the field of rapid development of social economy and science and technology. Therefore, this (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  17.  18
    Merging information in speech recognition: Feedback is never necessary.Dennis Norris, James M. McQueen & Anne Cutler - 2000 - Behavioral and Brain Sciences 23 (3):299-325.
    Top-down feedback does not benefit speech recognition; on the contrary, it can hinder it. No experimental data imply that feedback loops are required for speech recognition. Feedback is accordingly unnecessary and spoken word recognition is modular. To defend this thesis, we analyse lexical involvement in phonemic decision making. TRACE (McClelland & Elman 1986), a model with feedback from the lexicon to prelexical processes, is unable to account for all the available data on phonemic decision making. (...)
    Direct download (8 more)  
     
    Export citation  
     
    Bookmark   47 citations  
  18.  1
    Detecting Pronunciation Errors in Spoken English Tests Based on Multifeature Fusion Algorithm.Yinping Wang - 2021 - Complexity 2021:1-11.
    In this study, multidimensional feature extraction is performed on the U-language recordings of the test takers, and these features are evaluated separately, with five categories of features: pronunciation, fluency, vocabulary, grammar, and semantics. A deep neural network model is constructed to model the feature values to obtain the final score. Based on the previous research, this study uses a deep neural network training model instead of linear regression to improve the correlation between model score and expert score. The method of (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  19.  9
    Image Recognition Technology in Texture Identification of Marine Sediment Sonar Image.Chao Sun, Li Wang, Nan Wang & Shaohua Jin - 2021 - Complexity 2021:1-8.
    Through the recognition of ocean sediment sonar images, the texture in the image can be classified, which provides an important basis for the classification of ocean sediment. Aiming at the problems of low efficiency, waste of human resources, and low accuracy in the traditional manual side-scan sonar image discrimination, this paper studies the application of image recognition technology in sonar image substrate texture discrimination, which is popular in many fields. At the same time, considering the scale complexity, (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  20.  18
    Effects of Semantic Context and Fundamental Frequency Contours on Mandarin Speech Recognition by Second Language Learners.Linjun Zhang, Yu Li, Han Wu, Xin Li, Hua Shu, Yang Zhang & Ping Li - 2016 - Frontiers in Psychology 7:189783.
    Speech recognition by second language (L2) learners in optimal and suboptimal conditions has been examined extensively with English as the target language in most previous studies. This study extended existing experimental protocols ( Wang et al., 2013 ) to investigate Mandarin speech recognition by Japanese learners of Mandarin at two different levels (elementary vs. intermediate) of proficiency. The overall results showed that in addition to L2 proficiency, semantic context, F0 contours, and listening condition all affected the (...)
    Direct download (6 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  21.  7
    Philosophical Lessons for Emotion Recognition Technology.Rosalie Waelen - 2024 - Minds and Machines 34 (1):1-13.
    Emotion recognition technology uses artificial intelligence to make inferences about a person’s emotions, on the basis of their facial expressions, body language, tone of voice, or other types of input. Underlying such technology are a variety of assumptions about the manifestation, nature, and value of emotions. To assure the quality and desirability of emotion recognition technology, it is important to critically assess the assumptions embedded in the technology. Within philosophy, there is a long tradition (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  22.  6
    Masked Speech Recognition in School-Age Children.Lori J. Leibold & Emily Buss - 2019 - Frontiers in Psychology 10.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  23.  10
    Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models.H. J. Nock & S. J. Young - 2002 - Cognitive Science 26 (3):283-301.
    Hidden Markov models (HMMs) have been successful for modelling the dynamics of carefully dictated speech, but their performance degrades severely when used to model conversational speech. Since speech is produced by a system of loosely coupled articulators, stochastic models explicitly representing this parallelism may have advantages for automatic speech recognition (ASR), particularly when trying to model the phonological effects inherent in casual spontaneous speech. This paper presents a preliminary feasibility study of one such model (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  24.  3
    Longitudinal Speech Recognition in Noise in Children: Effects of Hearing Status and Vocabulary.Elizabeth A. Walker, Caitlin Sapp, Jacob J. Oleson & Ryan W. McCreery - 2019 - Frontiers in Psychology 10.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  25.  2
    Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling.R. K. Aggarwal & A. Kumar - 2020 - Journal of Intelligent Systems 30 (1):165-179.
    This paper implements the continuous Hindi Automatic Speech Recognition (ASR) system using the proposed integrated features vector with Recurrent Neural Network (RNN) based Language Modeling (LM). The proposed system also implements the speaker adaptation using Maximum-Likelihood Linear Regression (MLLR) and Constrained Maximum likelihood Linear Regression (C-MLLR). This system is discriminatively trained by Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) techniques with 256 Gaussian mixture per Hidden Markov Model(HMM) state. The training of the baseline system has been (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  26. Facial recognition technology : ethical and legal implication.Ellen Raineri, Erin Brennan & Audrey Ryder - 2022 - In Tamara Phillips Fudge (ed.), Exploring ethical problems in today's technological world. Hershey PA: Engineering Science Reference.
     
    Export citation  
     
    Bookmark  
  27.  20
    EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.James S. Magnuson, Heejo You, Sahil Luthra, Monica Li, Hosung Nam, Monty Escabí, Kevin Brown, Paul D. Allopenna, Rachel M. Theodore, Nicholas Monto & Jay G. Rueckl - 2020 - Cognitive Science 44 (4):e12823.
    Despite the lack of invariance problem (the many‐to‐many mapping between acoustics and percepts), human listeners experience phonetic constancy and typically perceive what a speaker intends. Most models of human speech recognition (HSR) have side‐stepped this problem, working with abstract, idealized inputs and deferring the challenge of working with real speech. In contrast, carefully engineered deep learning networks allow robust, real‐world automatic speech recognition (ASR). However, the complexities of deep learning architectures and training regimens make it (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   4 citations  
  28.  6
    Biased Face Recognition Technology Used by Government: A Problem for Liberal Democracy.Michael Gentzel - 2021 - Philosophy and Technology 34 (4):1639-1663.
    This paper presents a novel philosophical analysis of the problem of law enforcement’s use of biased face recognition technology in liberal democracies. FRT programs used by law enforcement in identifying crime suspects are substantially more error-prone on facial images depicting darker skin tones and females as compared to facial images depicting Caucasian males. This bias can lead to citizens being wrongfully investigated by police along racial and gender lines. The author develops and defends “A Liberal Argument Against Biased (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  29.  29
    Race, again: how face recognition technology reinforces racial discrimination.Fabio Bacchini & Ludovica Lorusso - 2019 - Journal of Information, Communication and Ethics in Society 17 (3):321-335.
    Purpose This study aims to explore whether face recognition technology – as it is intensely used by state and local police departments and law enforcement agencies – is racism free or, on the contrary, is affected by racial biases and/or racist prejudices, thus reinforcing overall racial discrimination. Design/methodology/approach The study investigates the causal pathways through which face recognition technology may reinforce the racial disproportion in enforcement; it also inquires whether it further discriminates black people by making (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  30. Speech recognition: Statistical methods.L. R. Rabiner & B. H. Juang - 2005 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier. pp. 1--18.
     
    Export citation  
     
    Bookmark  
  31.  8
    Age-Related Differences in Lexical Access Relate to Speech Recognition in Noise.Rebecca Carroll, Anna Warzybok, Birger Kollmeier & Esther Ruigendijk - 2016 - Frontiers in Psychology 7:170619.
    Vocabulary size has been suggested as a useful measure of “verbal abilities” that correlates with speech recognition scores. Knowing more words is linked to better speech recognition. How vocabulary knowledge translates to general speech recognition mechanisms, how these mechanisms relate to offline speech recognition scores, and how they may be modulated by acoustical distortion or age, is less clear. Age-related differences in linguistic measures may predict age-related differences in speech recognition (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  32.  10
    Merging information versus speech recognition.Irene Appelbaum - 2000 - Behavioral and Brain Sciences 23 (3):325-326.
    Norris, McQueen & Cutler claim that all known speech recognition data can be accounted for with their autonomous model, “Merge.” But this claim is doubly misleading. (1) Although speech recognition is autonomous in their view, the Merge model is not. (2) The body of data which the Merge model accounts for, is not, in their view, speech recognition data. Footnotes1 Author is also affiliated with the Center for the Study of Language and Information, Stanford (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  33.  5
    English Phrase Speech Recognition Based on Continuous Speech Recognition Algorithm and Word Tree Constraints.Haifan Du & Haiwen Duan - 2021 - Complexity 2021:1-11.
    This paper combines domestic and international research results to analyze and study the difference between the attribute features of English phrase speech and noise to enhance the short-time energy, which is used to improve the threshold judgment sensitivity; noise addition to the discrepancy data set is used to enhance the recognition robustness. The backpropagation algorithm is improved to constrain the range of weight variation, avoid oscillation phenomenon, and shorten the training time. In the real English phrase sound (...) system, there are problems such as massive training data and low training efficiency caused by the super large-scale model parameters of the convolutional neural network. To address these problems, the NWBP algorithm is based on the oscillation phenomenon that tends to occur when searching for the minimum error value in the late training period of the network parameters, using the K-MEANS algorithm to obtain the seed nodes that approach the minimal error value, and using the boundary value rule to reduce the range of weight change to reduce the oscillation phenomenon so that the network error converges as soon as possible and improve the training efficiency. Through simulation experiments, the NWBP algorithm improves the degree of fitting and convergence speed in the training of complex convolutional neural networks compared with other algorithms, reduces the redundant computation, and shortens the training time to a certain extent, and the algorithm has the advantage of accelerating the convergence of the network compared with simple networks. The word tree constraint and its efficient storage structure are introduced, which improves the storage efficiency of the word tree constraint and the retrieval efficiency in the English phrase recognition search. (shrink)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  34.  2
    On Dynamic Pitch Benefit for Speech Recognition in Speech Masker.Jing Shen & Pamela E. Souza - 2018 - Frontiers in Psychology 9.
    Previous work demonstrated that dynamic pitch (i.e., pitch variation in speech) aids speech recognition in various types of noises. While this finding suggests dynamic pitch enhancement in target speech can benefit speech recognition in noise, it is of importance to know what noise characteristics affect dynamic pitch benefit and who will benefit from enhanced dynamic pitch cues. Following our recent finding that temporal modulation in noise influences dynamic pitch benefit, we examined the effect of (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  35.  9
    Perceptual units in speech recognition.Dominic W. Massaro - 1974 - Journal of Experimental Psychology 102 (2):199.
  36.  25
    Mandarin-Speaking Children’s Speech Recognition: Developmental Changes in the Influences of Semantic Context and F0 Contours.Zhou Hong, Li Yu, Liang Meng, Guan Connie Qun, Zhang Linjun, Shu Hua & Zhang Yang - 2017 - Frontiers in Psychology 8.
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  37.  12
    Identity crisis: Face recognition technology and freedom of the will.Benjamin Hale - 2005 - Ethics, Place and Environment 8 (2):141 – 158.
    In this paper I present the position that the use of face recognition technology (FRT) in law enforcement and in business is restrictive of individual autonomy. I reason that FRT severely undermines autonomous self-determination by hobbling the idea of freedom of the will. I distinguish this position from two other common arguments against surveillance technologies: the privacy argument (that FRT is an invasion of privacy) and the objective freedom argument (that FRT is restrictive of one's freedom to act). (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  38.  10
    Differences in Speech Recognition Between Children with Attention Deficits and Typically Developed Children Disappear When Exposed to 65 dB of Auditory Noise.Göran B. W. Söderlund & Elisabeth Nilsson Jobs - 2016 - Frontiers in Psychology 7.
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  39.  6
    A Hybrid of Deep CNN and Bidirectional LSTM for Automatic Speech Recognition.Rajesh Kumar Aggarwal & Vishal Passricha - 2019 - Journal of Intelligent Systems 29 (1):1261-1274.
    Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional neural networks (CNNs) are the advanced version of DNNs that achieve 4–12% relative gain in the word error rate (WER) over DNNs. Existence of spectral variations and local correlations in speech signal makes CNNs more capable of speech recognition. Recently, it has been demonstrated that bidirectional long short-term memory (BLSTM) produces higher recognition rate in acoustic modeling because they are adequate to reinforce (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  40.  13
    Shortlist B: A Bayesian model of continuous speech recognition.Dennis Norris & James M. McQueen - 2008 - Psychological Review 115 (2):357-395.
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   66 citations  
  41.  45
    The ethical application of biometric facial recognition technology.Marcus Smith & Seumas Miller - 2022 - AI and Society 37 (1):167-175.
    Biometric facial recognition is an artificial intelligence technology involving the automated comparison of facial features, used by law enforcement to identify unknown suspects from photographs and closed circuit television. Its capability is expanding rapidly in association with artificial intelligence and has great potential to solve crime. However, it also carries significant privacy and other ethical implications that require law and regulation. This article examines the rise of biometric facial recognition, current applications and legal developments, and conducts an (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  42. Audio-visual speech recognition.G. Potamianos & J. Luettin - 2005 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier.
  43.  6
    Single-Channel Speech Enhancement Techniques for Distant Speech Recognition.Ramaswamy Kumaraswamy & Jaya Kumar Ashwini - 2013 - Journal of Intelligent Systems 22 (2):81-93.
    This article presents an overview of the single-channel dereverberation methods suitable for distant speech recognition application. The dereverberation methods are mainly classified based on the domain of enhancement of speech signal captured by a distant microphone. Many single-channel speech enhancement methods focus on either denoising or dereverberating the distorted speech signal. There are very few methods that consider both noise and reverberation effects. Such methods are discussed under a multistage approach in this article. The article (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  44.  10
    Multitask Learning with Local Attention for Tibetan Speech Recognition.Hui Wang, Fei Gao, Yue Zhao, Li Yang, Jianjian Yue & Huilin Ma - 2020 - Complexity 2020:1-10.
    In this paper, we propose to incorporate the local attention in WaveNet-CTC to improve the performance of Tibetan speech recognition in multitask learning. With an increase in task number, such as simultaneous Tibetan speech content recognition, dialect identification, and speaker recognition, the accuracy rate of a single WaveNet-CTC decreases on speech recognition. Inspired by the attention mechanism, we introduce the local attention to automatically tune the weights of feature frames in a window and (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  45.  7
    Temporal cortex activation during speech recognition: an optical topography study.Hiroki Sato, Tatsuya Takeuchi & Kuniyoshi L. Sakai - 1999 - Cognition 73 (3):B55-B66.
  46.  8
    Deconstructing public participation in the governance of facial recognition technologies in Canada.Maurice Jones & Fenwick McKelvey - forthcoming - AI and Society:1-14.
    On February 13, 2020, the Toronto Police Services (TPS) issued a statement admitting that its members had used Clearview AI’s controversial facial recognition technology (FRT). The controversy sparked widespread outcry by the media, civil society, and community groups, and put pressure on policy-makers to address FRTs. Public consultations presented a key tool to contain the scandal in Toronto and across Canada. Drawing on media reports, policy documents, and expert interviews, we investigate four consultations held by the Toronto Police (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  47.  30
    Eyes on the Streets: Media Use and Public Opinion About Facial Recognition Technology.David C. Wilson, Ashley Paintsil, Wyatt Dawson, James Bingaman & Paul R. Brewer - 2022 - Bulletin of Science, Technology and Society 42 (4):133-143.
    This study examines how different forms of media use predict attitudes toward the development of facial recognition technology (FRT) and applications of it by law enforcement to identify criminal suspects, identify potential terrorists, and monitor public protests. The theoretical framework builds on theories of cultivation and genre-specific viewing to develop hypotheses and research questions. The analyses draw on original data from two nationally representative surveys of the U.S. public conducted in 2020, amid a series of controversies and protests (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  48.  19
    A phenomenological perspective on AI ethical failures: The case of facial recognition technology.Yuni Wen & Matthias Holweg - forthcoming - AI and Society:1-18.
    As more and more companies adopt artificial intelligence to increase the efficiency and effectiveness of their products and services, they expose themselves to ethical crises and potentially damaging public controversy associated with its use. Despite the prevalence of AI ethical problems, most companies are strategically unprepared to respond effectively to the public. This paper aims to advance our empirical understanding of company responses to AI ethical crises by focusing on the rise and fall of facial recognition technology. Specifically, (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  49.  9
    Shortlist: a connectionist model of continuous speech recognition.Dennis Norris - 1994 - Cognition 52 (3):189-234.
  50.  6
    Do Age and Linguistic Status Alter the Effect of Sound Source Diffuseness on Speech Recognition in Noise?Meital Avivi-Reich, Rupinder Kaur Sran & Bruce A. Schneider - 2022 - Frontiers in Psychology 13.
    One aspect of auditory scenes that has received very little attention is the level of diffuseness of sound sources. This aspect has increasing importance due to growing use of amplification systems. When an auditory stimulus is amplified and presented over multiple, spatially-separated loudspeakers, the signal’s timbre is altered due to comb filtering. In a previous study we examined how increasing the diffuseness of the sound sources might affect listeners’ ability to recognize speech presented in different types of background noise. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
1 — 50 / 1000