Optimization of Scientific Reasoning: a Data-Driven Approach

Dissertation, (2019)
  Copy   BIBTEX

Abstract

Scientific reasoning represents complex argumentation patterns that eventually lead to scientific discoveries. Social epistemology of science provides a perspective on the scientific community as a whole and on its collective knowledge acquisition. Different techniques have been employed with the goal of maximization of scientific knowledge on the group level. These techniques include formal models and computer simulations of scientific reasoning and interaction. Still, these models have tested mainly abstract hypothetical scenarios. The present thesis instead presents data-driven approaches in social epistemology of science. A data-driven approach requires data collection and curation for its further usage, which can include creating empirically calibrated models and simulations of scientific inquiry, performing statistical analyses, or employing data- mining techniques and other procedures. We present and analyze in detail three co-authored research projects on which the thesis’ author was engaged during her PhD. The first project sought to identify optimal team composition in high energy physics laboratories using data-mining techniques. The results of this project are published in (Perović et al. 2016), and indicate that projects with smaller numbers of teams and team members outperform bigger ones. In the second project, we attempted to determine whether there is an epistemic saturation point in experimentation in high energy physics. The initial results from this project are published in (Sikimić et al. 2018). In the thesis, we expand on this topic by using computer simulations to test for biases that could induce scientists to invest in projects beyond their epistemic saturation point. Finally, in previous examples of data-driven analyses, citations are used as a measure of epistemic efficiency of projects in high energy physics. In order to additionally justify and analyze the usage of this parameter in their data-driven research, in the third project Perović & Sikimić (2019) analyzed and compared inductive patterns in experimental physics and biology with the reliability of citation records in these fields. They conclude that while citations are a relatively reliable measure of efficiency in high energy physics research, the same does not hold for the majority of research in experimental biology. Additionally, contributions of the author that are for the first time published in this theses are: (a) an empirically calibrated model of scientific interaction of research groups in biology, (b) a case study of irregular argumentation patterns in some pathogen discoveries, and (c) an introductory discussion of the benefits and limitations of data- driven approaches to the social epistemology of science. Using computer simulations of an empirically calibrated model, we demonstrate that having several levels of hierarchy and division into smaller research sub-teams is epistemically beneficial for researchers in experimental biology. We also show that argumentation analysis in biology represents a good starting point for further data-driven analyses in the field. Finally, we conclude that a data-driven approach is informative and useful for science policy, but requires careful considerations about data collection, curation, and interpretation.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,628

External links

  • This entry has no external links. Add one.
Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Scientific perspectivism: A philosopher of science's response to the challenge of big data biology.Werner Callebaut - 2012 - Studies in History and Philosophy of Science Part C: Studies in History and Philosophy of Biological and Biomedical Sciences 43 (1):69-80.
What Counts as Scientific Data? A Relational Framework.Sabina Leonelli - 2015 - Philosophy of Science 82 (5):810-821.
How Theories of Induction Can Streamline Measurements of Scientific Performance.Slobodan Perović & Vlasta Sikimić - 2020 - Journal for General Philosophy of Science / Zeitschrift für Allgemeine Wissenschaftstheorie 51 (2):267-291.
Data Interpretation in the Digital Age.Sabina Leonelli - 2014 - Perspectives on Science 22 (3):397-417.
Virtual Models and Simulations.Peter Krebs - 2007 - Techné: Research in Philosophy and Technology 11 (1):42-54.
How non-epistemic values can be epistemically beneficial in scientific classification.Soohyun Ahn - 2020 - Studies in History and Philosophy of Science Part A 84:57-65.

Analytics

Added to PP
2020-11-03

Downloads
0

6 months
0

Historical graph of downloads

Sorry, there are not enough data points to plot this chart.
How can I increase my downloads?

Author's Profile

Vlasta Sikimić
Eindhoven University of Technology

Citations of this work

No citations found.

Add more citations