Results for ' corpus data(sets)'

61 found
  1.  5
    Individual corpus data predict variation in judgments: testing the usage-based nature of mental representations in a language transfer setting. Maria Mos, Ad Backus & Marie Barking - 2022 - Cognitive Linguistics 33 (3):481-519.
    This study puts the usage-based assumption that our linguistic knowledge is based on usage to the test. To do so, we explore individual variation in speakers’ language use as established based on corpus data – both in terms of frequency of use and productivity of use – and link this variation to the same participants’ responses in an experimental judgment task. The empirical focus is on transfer by native German speakers living in the Netherlands, who oftentimes experience transfer (...)
  2.  9
    The VIDAS Data Set: A Spoken Corpus of Migrant and Refugee Spanish Learners. Margarita Planelles Almeida, Jon Andoni Duñabeitia & Anna Doquin de Saint Preux - 2022 - Frontiers in Psychology 12:798614.
    The VIDAS data set presents data from 200 participants from different countries and language backgrounds. They completed an oral expression and interaction test in the context of a Spanish certification exam for adult migrants. The aim of the VIDAS data set is to provide researchers in psycholinguistics and second language acquisition with a Spanish spoken corpus of traditionally marginalized and underrepresented learners, providing a compelling data set of oral interactions by migrants and refugees. The (...) contains more than 29 h of recordings of the oral interactions of the participants with trained interviewers, as well as background information about the participants. It furthermore contains the scores obtained by the participants in the oral expression and interaction exam. The VIDAS corpus allows for the development of studies on L2 spoken language comprehension and processing, as well as for comparative analyses of language acquisition between different L1 groups at different linguistic levels.
  3.  64
    Building an ACT‐R Reader for Eye‐Tracking Corpus Data. Jakub Dotlačil - 2018 - Topics in Cognitive Science 10 (1):144-160.
    Cognitive architectures have often been applied to data from individual experiments. In this paper, I develop an ACT-R reader that can model a much larger set of data, eye-tracking corpus data. It is shown that the resulting model has a good fit to the data for the considered low-level processes. Unlike previous related works, the model achieves the fit by estimating free parameters of ACT-R using Bayesian estimation and Markov-Chain Monte Carlo techniques, rather than by (...)
    5 citations
  4.  42
    Unsupervised context sensitive language acquisition from a large corpus. Shimon Edelman - unknown
    We describe a pattern acquisition algorithm that learns, in an unsupervised fashion, a streamlined representation of linguistic structures from a plain natural-language corpus. This paper addresses the issues of learning structured knowledge from a large-scale natural language data set, and of generalization to unseen text. The implemented algorithm represents sentences as paths on a graph whose vertices are words. Significant patterns, determined by recursive context-sensitive statistical inference, form new vertices. Linguistic constructions are represented by trees composed of significant (...)
    4 citations
  5. The Inclusion of Polysemes in Non-native English Textbooks: A Corpus-based Study. Hicham Lahlou & Hajar Abdul Rahim - 2023 - Arab World English Journal 14 (2):19-29.
    Despite the large number of studies conducted on polysemy, they mostly compare the different methods and techniques to learn a language and establish the extent to which particular sense relations facilitate the learning of second language vocabulary. To our best knowledge, no research has been conducted to determine whether or not polysemy is emphasized in non-native English textbooks. The objective of the present research was to determine the degree to which polysemy is incorporated in English textbooks. Thus, the research question (...)
  6. Mining social media data: How are research sponsors and researchers addressing the ethical challenges? Joanna Taylor & Claudia Pagliari - 2018 - Research Ethics 14 (2):1-39.
    Background: Data representing people’s behaviour, attitudes, feelings and relationships are increasingly being harvested from social media platforms and re-used for research purposes. This can be ethically problematic, even where such data exist in the public domain. We set out to explore how the academic community is addressing these challenges by analysing a national corpus of research ethics guidelines and published studies in one interdisciplinary research area. Methods: Ethics guidelines published by Research Councils UK, its seven-member councils and guidelines cited within (...)
    6 citations
  7.  13
    Structure and Grammaticalization of Serial Verb Constructions in Sign Language of the Netherlands—A Corpus-Based Study. Sascha Couvee & Roland Pfau - 2018 - Frontiers in Psychology 9:355519.
    In serial verb constructions (SVCs), multiple independent lexical verbs are combined in a mono-clausal construction. SVCs express a range of grammatical meanings and are attested in numerous spoken languages all around the world. Yet, to date only few studies have investigated the existence and functions of SVCs in sign languages. For the most part, these studies – including a previous study on Sign Language of the Netherlands (NGT) – relied on elicited data. In this article, we offer a cross-modal (...)
  8.  6
    From Wodehouse to the White House: A Corpus-Assisted Study of Play, Fantasy and Dramatic Incongruity in Comic Writing and Laughter-Talk. Alan Partington - 2008 - Lodz Papers in Pragmatics 4 (2):189-213.
    In this paper I consider two discourse types, one written and literary, the other spoken and semi-conversational, in an attempt to discover if there are any similarities in the ways in which humour is generated in such apparently diverse forms of communication. The first part of the paper is concerned with the explicitly comic prose of P. G. Wodehouse, whilst (...)
  9. The Pandemic Experience Survey II: A Second Corpus of Subjective Reports of Life Under Social Restrictions During COVID-19 in the UK, Japan, and Mexico. Mark M. James, Havi Carel, Matthew Ratcliffe, Tom Froese, Jamila Rodrigues, Ekaterina Sangati, Morgan Montoya, Federico Sangati & Natalia Koshkina - 2022 - Frontiers in Public Health.
    In August 2021, Froese et al. published survey data collected from 2,543 respondents on their subjective experiences living under imposed social distancing measures during COVID-19 (1). The questionnaire was issued to respondents in the UK, Japan, and Mexico. By combining the authors’ expertise in phenomenological philosophy, phenomenological psychopathology, and enactive cognitive science, the questions were carefully phrased to prompt reports that would be useful to phenomenological investigation and theorizing (2–4). These questions reflected the various authors’ research interests (e.g., technology, (...)
    1 citation
  10.  12
    An integrated explicit and implicit offensive language taxonomy. Barbara Lewandowska-Tomaszczyk, Anna Bączkowska, Chaya Liebeskind, Giedre Valunaite Oleskeviciene & Slavko Žitnik - 2023 - Lodz Papers in Pragmatics 19 (1):7-48.
    The current study represents an integrated model of explicit and implicit offensive language taxonomy. First, it focuses on a definitional revision and enrichment of the explicit offensive language taxonomy by reviewing the collection of available corpora and comparing tagging schemas applied there. The study relies mainly on the categories originally proposed by Zampieri et al. (2019) in terms of offensive language categorization schemata. After the explanation of semantic differences between particular concepts used in the tagging systems and the analysis of (...)
    4 citations
  11.  16
    Analysing discourse around COVID-19 in the Australian Twittersphere: A real-time corpus-based analysis. Sam Hames, Michael Haugh & Martin Schweinberger - 2021 - Big Data and Society 8 (1).
    Public discourse about the COVID-19 that appears on Twitter and other social media platforms provides useful insights into public concerns and responses to the pandemic. However, acknowledging that public discourse around COVID-19 is multi-faceted and evolves over time poses both analytical and ontological challenges. Studies that use text-mining approaches to analyse responses to major events commonly treat public discourse on social media as an undifferentiated whole, without systematically examining the extent to which that discourse consists of distinct sub-discourses or which (...)
  12.  27
    Public responses to the sharing and linkage of health data for research purposes: a systematic review and thematic synthesis of qualitative studies. Mhairi Aitken, Jenna de St Jorre, Claudia Pagliari, Ruth Jepson & Sarah Cunningham-Burley - 2016 - BMC Medical Ethics 17 (1):73.
    Background: The past 10 years have witnessed a significant growth in sharing of health data for secondary uses. Alongside this there has been growing interest in the public acceptability of data sharing and data linkage practices. Public acceptance is recognised as crucial for ensuring the legitimacy of current practices and systems of governance. Given the growing international interest in this area this systematic review and thematic synthesis represents a timely review of current evidence. It highlights the key factors (...)
    16 citations
  13.  89
    Attrition and revival in Awjila Berber: Facebook posts as a new data source for an endangered Berber language. Marijn van Putten & Lameen Souag - 2015 - Corpus 14:23-58.
    Awjila Berber is a highly endangered Berber variety spoken in eastern Libya. The minimal material available on it reveals that the language is in some respects very archaic and in others grammatically unique, and as such is of particular comparative and historical interest. Fieldwork has been impossible for decades due to the political situation. Recently, however, several inhabitants of Awjila have set up a Facebook group Ašal=ənnax (“our village”), posting largely in Awjili. Analysis of this partly conversational corpus makes (...)
  14.  10
    Interactional strategies for progressing through quizzes in dementia settings. Val Williams, Camilla Lindholm & Joseph Webb - 2020 - Discourse Studies 22 (4):503-522.
    People with early-to-mid stage dementia frequently attend groups that provide opportunities for socialising and engaging in group activities, such as quizzes. This article uses conversation analysis to investigate the interactional strategies that the staff use to initiate and keep these quizzes ‘on track’, and what they orient to as impediments and facilitators of quiz progression. Specifically, we outline how staff deal with incorrect or ‘non-answers’, and what happens when players have their own goals or ‘projects’ that do not align with (...)
  15. The Telegram Chronicles of Online Harm. Mihaela Popa-Wyatt - manuscript
    Harmful and dangerous language is frequent in social media, in particular in spaces which are considered anonymous and/or allow free participation. In this paper, we analyse the language in a Telegram channel populated by followers of Donald Trump, in order to identify the ways in which harmful language is used to create a specific narrative in a group of mostly like-minded discussants. Our research has several aims. First, we create an extended taxonomy of potentially harmful language that includes not only (...)
    2 citations
  16.  95
    The degree functions of negative adjectives. Galit Weidman Sassoon - 2010 - Natural Language Semantics 18 (2):141-181.
    This paper provides a new account of positive versus negative antonyms. The data includes well-known linguistic generalizations regarding negative adjectives, such as their incompatibility with measure phrases (cf. two meters tall/ *short) and ratio phrases (twice as tall/ #short) as well as the impossibility of truly crosspolar comparisons (*Dan is taller than Sam is short). These generalizations admit a variety of exceptions, e.g., positive adjectives that do not license measure phrases (cf. #two degrees warm/cold) and rarely also negative adjectives (...)
    5 citations
  17. Social media opposition to the 2022/2023 UK nurse strikes. Erika Kalocsányiová, Ryan Essex, Sorcha A. Brophy & Veena Sriram - forthcoming - Nursing Inquiry:e12600.
    Previous research has established that the success of strikes, and social movements more broadly, depends on their ability to garner support from the public. However, there is scant published research investigating the response of the public to strike action by healthcare workers. In this study, we address this gap through a study of public responses to UK nursing strikes in 2022–2023, using a data set drawn from Twitter of more than 2300 publicly available tweets. We focus on negative tweets, (...)
    1 citation
  18.  27
    The Way We Ask for Money… The Emergence and Institutionalization of Grant Writing Practices in Academia. Kathia Serrano Velarde - 2018 - Minerva 56 (1):85-107.
    Although existing scholarship offers critical insights into the working mechanisms of project-based research funding, little is known about the actual practice of writing grant proposals. Our study seeks to add a longitudinal dimension to the ongoing debate on the implications of competitive research funding by focusing on the incremental adjustment of the funder/fundee relationship around a common discursive practice that consists in describing and evaluating research projects: How has the perception of what constitutes a legitimate funding claim changed over time (...)
    4 citations
  19.  69
    Similarity-based Word Sense Disambiguation. Shimon Edelman - unknown
    We describe a method for automatic word sense disambiguation using a text corpus and a machine-readable dictionary (MRD). The method is based on word similarity and context similarity measures. Words are considered similar if they appear in similar contexts; contexts are similar if they contain similar words. The circularity of this definition is resolved by an iterative, converging process, in which the system learns from the corpus a set of typical usages for each of the senses of (...)
  20.  34
    Robert Boyle and the early Royal Society: a reciprocal exchange in the making of Baconian science. Michael Hunter - 2007 - British Journal for the History of Science 40 (1):1-23.
    This paper documents an important development in Robert Boyle's natural-philosophical method – his use from the 1660s onwards of ‘heads’ and ‘inquiries’ as a means of organizing his data, setting himself an agenda when studying a subject and soliciting information from others. Boyle acknowledged that he derived this approach from Francis Bacon, but he had not previously used it in his work, and the reason why it came to the fore when it did is not apparent from his printed (...)
    14 citations
  21. Theoretical Virtues in Scientific Practice: An Empirical Study. Moti Mizrahi - 2022 - British Journal for the Philosophy of Science 73 (4):879-902.
    It is a common view among philosophers of science that theoretical virtues (also known as epistemic or cognitive values), such as simplicity and consistency, play an important role in scientific practice. In this article, I set out to study the role that theoretical virtues play in scientific practice empirically. I apply the methods of data science, such as text mining and corpus analysis, to study large corpora of scientific texts in order to uncover patterns of usage. These patterns (...)
    6 citations
  22.  19
    Lifelong learning for tactile emotion recognition. Jiaqi Wei, Huaping Liu, Bowen Wang & Fuchun Sun - 2019 - Interaction Studies 20 (1):25-41.
    Tactile emotion recognition provides a lot of valuable information in human-computer interaction, and it has strong application prospects in many aspects such as smart home and medical treatment. So this situation raises a question: How to quickly and efficiently let the robot perform the correct emotion recognition? In this work, we develop a lifelong learning algorithm which is based on the efficient dictionary learning technology, to tackle the tactile emotion recognition across different tasks. To verify the efficiency of the proposed (...)
  23.  21
    A chained metonymic approach to ίdὸ ‘eye’ constructional metonymies in Hausa. Mustapha Bala Tsakuwa, Xu Wen & Ibrahim Lamido - 2023 - Cognitive Linguistics 34 (2):165-196.
    Unlike previous studies which generally seem to focus more on Hausa metaphorical expressions, this study investigates a wide range of uses of ίdὸ ‘eye’ in its constructional metonymy patterns in the language by exploring corpus data that contain over 300 eye-related expressions. We observe that some constructional metonymies maintain a set of fixed words and syntax in activating conceptual shifts and producing eye metonymies while others have semi-fixed patterns and produce the same metonymies. Lexical items like tsόkάlế, kὰn, ὰ, dὰ, and bὰsίrὰ among others are constant (...)
  24.  6
    “Shut up! Don’t say that! You’ve got to say ḤASHĀKEM!” The pragmatics of Ḥashāk and its variants in colloquial Algerian Arabic. Boudjemaa Dendenne - 2023 - Lodz Papers in Pragmatics 19 (1):145-174.
    In this paper, the pragmatic functions served by ḥāshāk and its variants in colloquial Algerian Arabic (CAA) are unravelled. Literally, ḥāshāk means “You’re exalted/exempt from X/I distance you from X,” where X is a bad thing or socially/religiously unacceptable act. Its variants include ḥāsha, ḥāshākem, ḥāshāh/ḥāshāha/ḥāshāhem, maḥashākesh, and the verb ḥāsha/ḥāshi. As far as the author is aware, this is the first study on the pragmatics of ḥāshāk and its variants in colloquial (Algerian) Arabic. Two complementary data sets (...)
  25.  7
    Constructional Meaning and Knowledge-Driven Interpretation of Motion Events: Examples from Three Romance Varieties. Alfonsina Buoniconto - 2020 - Gestalt Theory 42 (1):31-42.
    Summary: Covert encoding is one of the strategies available to languages for the encoding of motion, in which, in accordance with the laws of Gestalt, the meaning of an expression encoding motion is not coincident with the mere sum of the meanings of each of its constitutive units, relying on the mediation of grammatical and co(n)text-established knowledge for its interpretability. Moving on from a data set gathered for a previous study and adopting a holistic, constructional approach, several strategies were (...)
  26.  14
    The eye-movement engine. Wayne S. Murray - 2003 - Behavioral and Brain Sciences 26 (4):494-495.
    E-Z Reader fits key parameters from one corpus of eye movement data, but has not really been tested with new data sets. More critically, it is argued that the key mechanism driving eye movements – a serial process involving a proportion of word recognition time – is implausible on the basis of a broad range of experimental findings.
  27. Philosophical reasoning about science: a quantitative, digital study. Moti Mizrahi & Michael Adam Dickinson - 2022 - Synthese 200 (2).
    In this paper, we set out to investigate the following question: if science relies heavily on induction, does philosophy of science rely heavily on induction as well? Using data mining and text analysis methods, we study a large corpus of philosophical texts mined from the JSTOR database (n = 14,199) in order to answer this question empirically. If philosophy of science relies heavily on induction, just as science supposedly does, then we would expect to find significantly more inductive (...)
    2 citations
  28. Learning a Generative Probabilistic Grammar of Experience: A Process‐Level Model of Language Acquisition. Oren Kolodny, Arnon Lotem & Shimon Edelman - 2014 - Cognitive Science 38 (4):227-267.
    We introduce a set of biologically and computationally motivated design choices for modeling the learning of language, or of other types of sequential, hierarchically structured experience and behavior, and describe an implemented system that conforms to these choices and is capable of unsupervised learning from raw natural-language corpora. Given a stream of linguistic input, our model incrementally learns a grammar that captures its statistical patterns, which can then be used to parse or generate new data. The grammar constructed in (...)
    5 citations
  29.  15
    Learning a Generative Probabilistic Grammar of Experience: A Process‐Level Model of Language Acquisition. Oren Kolodny, Arnon Lotem & Shimon Edelman - 2015 - Cognitive Science 39 (2):227-267.
    We introduce a set of biologically and computationally motivated design choices for modeling the learning of language, or of other types of sequential, hierarchically structured experience and behavior, and describe an implemented system that conforms to these choices and is capable of unsupervised learning from raw natural‐language corpora. Given a stream of linguistic input, our model incrementally learns a grammar that captures its statistical patterns, which can then be used to parse or generate new data. The grammar constructed in (...)
    2 citations
  30.  21
    The Constructivist Foundations Bibliography: Humberto Maturana. R. Whitaker - 2011 - Constructivist Foundations 6 (3):393-406.
    Context: Maturana’s published corpus is vast, and his publications span multiple venues, formats, and languages. For these and other reasons, the corpus is as complex as it is daunting in its scale. Problem: Over the last two decades, bibliographic data on Maturana’s publications had proliferated in terms of available resources, scope of coverage, and accessibility. However, as of 2011 the degree of accessibility was not matched by the inclusiveness, detail, and accuracy of the relatively few dedicated bibliographies (...)
  31.  10
    A communicative conception of discourse. Patrick Charaudeau - 2002 - Discourse Studies 4 (3):301-318.
    This article sets out to define an approach to discourse that takes into account the characteristics of the phenomenon of social communication. First, the article examines different conceptions of discourse analysis such as ‘cognitive’, ‘representational’ and ‘communicational’. These distinctions are made using various criteria: definitions of the subject of analysis, the nature of the speaker, the corpus of data resulting from the discourse. This is followed by an examination of the types of competence that have to be (...)
    1 citation
  32. Intuition Talk is Not Methodologically Cheap: Empirically Testing the “Received Wisdom” About Armchair Philosophy. Zoe Ashton & Moti Mizrahi - 2018 - Erkenntnis 83 (3):595-612.
    The “received wisdom” in contemporary analytic philosophy is that intuition talk is a fairly recent phenomenon, dating back to the 1960s. In this paper, we set out to test two interpretations of this “received wisdom.” The first is that intuition talk is just talk, without any methodological significance. The second is that intuition talk is methodologically significant; it shows that analytic philosophers appeal to intuition. We present empirical and contextual evidence, systematically mined from the JSTOR corpus and HathiTrust’s Digital (...)
    12 citations
  33.  96
    The Prevalence of Mind–Body Dualism in Early China. Edward Slingerland & Maciej Chudek - 2011 - Cognitive Science 35 (5):997-1007.
    We present the first large-scale, quantitative examination of mind and body concepts in a set of historical sources by measuring the predictions of folk mind–body dualism against the surviving textual corpus of pre-Qin (pre-221 BCE) China. Our textual analysis found clear patterns in the historically evolving reference of the word xin (heart/heart–mind): It alone of the organs was regularly contrasted with the physical body, and during the Warring States period it became less associated with emotions and increasingly portrayed as (...)
    18 citations
  34.  36
    Spatial Subsystem of Moral Metaphors: A Cognitive Semantic Study. Ning Yu, Tianfang Wang & Yingliang He - 2016 - Metaphor and Symbol 31 (4):195-211.
    Cognitive semantic studies have shown that our conceptualization of morality is at least partially metaphorical and that our moral cognition is grounded in some fundamental contrastive categories of our embodied experience in the physical environment. It is argued that our moral cognition is built on a moral metaphor system. Within the framework of conceptual metaphor theory, this study aims to examine the spatial subsystem of moral metaphors in English. We set out with five pairs of moral metaphors that involve the (...)
    5 citations
  35.  1
    The Ecosemiotics of Human-Wolf Relations in a Northern Tourist Economy: A Case Study. Andrew Mark Creighton - forthcoming - Biosemiotics:1-20.
    This article investigates the use of wolves to enchant the rationalization of Thompson, Manitoba. The city attempted to refocus towards a more touristic economy based around the large wolf population in the surrounding regions. The paper also examines why this attempt at a tourist economy has not produced its intended results. I accomplish this by first discussing the McDonaldization and enchantment of the city. This discussion is framed through George Ritzer and Jeffery C. Alexander’s work. I then integrate Umwelt analysis (...)
  36.  8
    Offers of assistance in politician–constituent interaction. Elizabeth Stokoe & Emily Hofstetter - 2015 - Discourse Studies 17 (6):724-751.
    How do politicians engage with and offer to assist their constituents: the people who vote them into power? We address the question by analysing a corpus of 80 interactions recorded at the office of a Member of Parliament in the United Kingdom, and comprising telephone calls between constituents and the MP’s clerical ‘caseworkers’ as well as face-to-face encounters with MPs in their fortnightly ‘surgeries’. The data were transcribed, and then analysed using conversation analysis, focusing on the design and (...)
    2 citations
  37. Exploring the rhetorical semiotic brand image structure of ad films with multivariate mapping techniques. George Rossolatos - 2014 - Semiotica 2014 (200):335-358.
    The aim of this paper is to demonstrate the applicability of multivariate mapping techniques to the exploration of the rhetorical semiotic brand image structure of ad films. By drawing on correspondence analysis and multidimensional scaling, two techniques that are amply used in corpus linguistics and in marketing research, but also on the data reduction technique of factor analysis, it will be displayed how a set of nuclear semes and classemes or an intended semic structure that underlies ad filmic (...)
    1 citation
  38.  13
    Heritage Speakers as Part of the Native Language Continuum. Heike Wiese, Artemis Alexiadou, Shanley Allen, Oliver Bunk, Natalia Gagarina, Kateryna Iefremenko, Maria Martynova, Tatiana Pashkova, Vicky Rizou, Christoph Schroeder, Anna Shadrova, Luka Szucsich, Rosemarie Tracy, Wintai Tsehaye, Sabine Zerbian & Yulia Zuban - 2022 - Frontiers in Psychology 12.
    We argue for a perspective on bilingual heritage speakers as native speakers of both their languages and present results from a large-scale, cross-linguistic study that took such a perspective and approached bilinguals and monolinguals on equal grounds. We targeted comparable language use in bilingual and monolingual speakers, crucially covering broader repertoires than just formal language. A main database was the open-access RUEG corpus, which covers comparable informal vs. formal and spoken vs. written productions by adolescent and adult bilinguals with (...)
    1 citation
  39.  6
    Repair: Comparing Facebook ‘chat’ with spoken interaction. Elizabeth Stokoe & Joanne Meredith - 2014 - Discourse and Communication 8 (2):181-207.
    Previous research on the conversation analytic phenomenon of ‘repair’ has focused on its design and function in spoken interaction. Conversely, research on written text or writing rarely focuses on interaction. In this article, we examine repair in written discourse; specifically in online settings. The data corpus comprises one-to-one quasi-synchronous Facebook ‘chat’. First, we show that, as in spoken interaction, repair happens. This basic observation supports conversation analytic arguments that features of talk, like repair and laughter, do not ‘leak (...)
    2 citations
  40.  21
    Event Mining Through Clustering.T. V. Geetha & E. Umamaheswari - 2014 - Journal of Intelligent Systems 23 (1):59-73.
    Traditional document clustering algorithms consider text-based features such as unique word count, concept count, etc. to cluster documents. Meanwhile, event mining is the extraction of specific events, their related sub-events, and the associated semantic relations from documents. This work discusses an approach to event mining through clustering. The Universal Networking Language-based subgraph, a semantic representation of the document, is used as the input for clustering. Our research focuses on exploring the use of three different feature sets for event (...)
  41.  12
    Exploring the Metaphorical Models of Transgenderism.Jenny Lederer - 2015 - Metaphor and Symbol 30 (2):95-117.
    This article explores the metaphorical models English speakers employ in their understanding of transgenderism. Transgender is the term ascribed to those who have begun or completed a change in their sex characteristics from male to female or female to male. Using both qualitative and quantitative measures, I examine an archive of narrative data and a transition-specific corpus to show how spoken and written narrative support a spatially based representation of gender identity and transition. Two robust models are revealed (...)
    1 citation
  42.  7
    The Metaphorical Construction of Complex Domains: The Case of Speech Activity in English.Elena Semino - 2005 - Metaphor and Symbol 20 (1):35-70.
    In this article I provide an account of the way in which the domain of spoken communication is metaphorically constructed in English, on the basis of the analysis of over 450 metaphorical references to speech activity in a corpus of contemporary written British English. I show how spoken communication is mainly structured via a set of source domains that conventionally apply to a wide variety of target domains, such as the source domains of MOTION, PHYSICAL TRANSFER, PHYSICAL CONSTRUCTION, and (...)
    2 citations
  43.  12
    Morphological Tagging and Lemmatization in the Albanian Language.Elissa Mollakuqe, Mentor Hamiti & Diellza Nagavci Mati - 2021 - SEEU Review 16 (2):3-16.
    An important element of Natural Language Processing is part-of-speech tagging. With fine-grained word-class annotations, the word forms in a text can be enhanced and can also be used in downstream processes, such as dependency parsing. The improved search options that tagged data offers also greatly benefit linguists and lexicographers. Natural language processing research is becoming increasingly popular and important as unsupervised learning methods are developed. There are some aspects of the Albanian language that make the creation of (...)
  44.  70
    Appellate Court Modifications Extraction for Portuguese.William Paulo Ducca Fernandes, Luiz José Schirmer Silva, Isabella Zalcberg Frajhof, Guilherme da Franca Couto Fernandes de Almeida, Carlos Nelson Konder, Rafael Barbosa Nasser, Gustavo Robichez de Carvalho, Simone Diniz Junqueira Barbosa & Hélio Côrtes Vieira Lopes - 2020 - Artificial Intelligence and Law 28 (3):327-360.
    Appellate Court Modifications Extraction consists of, given an Appellate Court decision, identifying the proposed modifications by the upper Court of the lower Court judge’s decision. In this work, we propose a system to extract Appellate Court Modifications for Portuguese. Information extraction for legal texts has been previously addressed using different techniques and for several languages. Our proposal differs from previous work in two ways: our corpus is composed of Brazilian Appellate Court decisions, in which we look for a set (...)
    2 citations
  45.  10
    Exploring Sources of Satisfaction and Dissatisfaction in Airbnb Accommodation Using Unsupervised and Supervised Topic Modeling.Kai Ding, Wei Chong Choo, Keng Yap Ng, Siew Imm Ng & Pu Song - 2021 - Frontiers in Psychology 12.
    This study aims to examine key attributes affecting Airbnb users' satisfaction and dissatisfaction through the analysis of online reviews. A corpus that comprises 59,766 Airbnb reviews from 27,980 listings located in 12 different cities is analyzed by using both Latent Dirichlet Allocation (LDA) and a supervised LDA approach. Unlike previous LDA-based Airbnb studies, this study examines positive and negative Airbnb reviews separately, and results reveal the heterogeneity of satisfaction and dissatisfaction attributes in Airbnb accommodation. In particular, the emergence of the (...)
    1 citation
  46.  5
    Analysis and Argumentation in Rabbinic Judaism.Jacob Neusner - 2003 - University Press of America.
    Do ubiquitous modes of thought (types of analysis, types of argumentation) pervade the entire corpus of the Rabbinic writings of late antiquity and impart coherence to those diverse documents? Here are the results of a systematic probe of representative Halakhic and Aggadic documents in search of the answer to that question. The result is limited but one-sided: the answer is yes, they do. The inquiry proves urgent, because the bases for supposing the Rabbinic documents coalesce have diminished, and the (...)
  47.  4
    Some Considerations on the Attribution of the ‘New Apuleius’.Dmitry Nikolaev & Mikhail Shumilin - 2021 - Classical Quarterly 71 (2):819-848.
    The ‘New Apuleius’ is a set of Latin summaries of Plato's works first published in 2016 by Justin Stover, who attributed it to Apuleius. The present article attempts to assess two key aspects of Stover's argument, viz. his reconstruction of the manuscript transmission of the new text and his use of computer-assisted stylometric techniques. The authors suggest that both strands of his argument are inconclusive. First, it is argued that the transposition of gatherings in the archetype of the Apuleian philosophica (...)
  48. “Identifying Phrasal Connectives in Italian Using Quantitative Methods”.Edoardo Zamuner, Fabio Tamburini & Cristiana de Sanctis - 2002 - In Stefania Nuccorini (ed.), Phrases and Phraseology – Data and Descriptions. Peter Lang Verlag.
    In recent decades, the analysis of phraseology has made use of the exploration of large corpora as a source of quantitative information about language. This paper intends to present the main lines of work in progress based on this empirical approach to linguistic analysis. In particular, we focus our attention on some problems relating to the morpho-syntactic annotation of corpora. The CORIS/CODIS corpus of contemporary written Italian, developed at CILTA – University of Bologna (Rossini Favretti 2000; Rossini Favretti, Tamburini, (...)
  49. Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library.Jaimie Murdock, Colin Allen, Katy Börner, Robert Light, Simon McAlister, Andrew Ravenscroft, Robert Rose, Doori Rose, Jun Otsuka, David Bourget, John Lawrence & Chris Reed - 2017 - PLoS ONE 12 (9).
    We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other research areas and ultimately support a system for semi-automatic identification of argument structures. We provide a case study for the application of the methods to (...)
    1 citation
  50.  11
    Field-specific Conventions in the Translation of Commercial Law Documentation for Court Proceedings.Edyta Więcławska - 2019 - Studies in Logic, Grammar and Rhetoric 58 (1):221-243.
    The paper presents findings gathered in an exploratory, descriptive, corpus-based analysis of a parallel corpus composed of English corporate documents and their translations into Polish with regard to the frequency-related, binary strategy distribution pattern. In general, the author posits a distinctiveness of interlingual communication in the domain of law, as delineated by the institutional and disciplinary framework. The material extracted from the corpus and studied for its generic features points to the hermetic character of corporate written communication (...)
    1 citation