Big Data solutions on a small scale: Evaluating accessible high-performance computing for social research

Big Data and Society 1 (2) (2014)
  Copy   BIBTEX

Abstract

Though full of promise, Big Data research success is often contingent on access to the newest, most advanced, and often expensive hardware systems and the expertise needed to build and implement such systems. As a result, the accessibility of the growing number of Big Data-capable technology solutions has often been the preserve of business analytics. Pay as you store/process services like Amazon Web Services have opened up possibilities for smaller scale Big Data projects. There is high demand for this type of research in the digital humanities and digital sociology, for example. However, scholars are increasingly finding themselves at a disadvantage as available data sets of interest continue to grow in size and complexity. Without a large amount of funding or the ability to form interdisciplinary partnerships, only a select few find themselves in the position to successfully engage Big Data. This article identifies several notable and popular Big Data technologies typically implemented using large and extremely powerful cloud-based systems and investigates the feasibility and utility of development of Big Data analytics systems implemented using low-cost commodity hardware in basic and easily maintainable configurations for use within academic social research. Through our investigation and experimental case study, we found that not only are solutions like Cloudera’s Hadoop feasible, but that they can also enable robust, deep, and fruitful research outcomes in a variety of use-case scenarios across the disciplines.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,928

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Data Storage, Security And Techniques In Cloud Computing.R. Dinesh Arpitha & Shobha R. Sai - 2018 - International Journal of Research and Analytical Reviews 5 (4).
Internet of Things future in Edge Computing.C. Pvandana & Ajeet Chikkamannur - 2016 - International Journal of Advanced Engineering Research and Science 3 (12):148-154.
Challenges, Approaches and Solutions in Data Integration for Research and Innovation.Maurizio Lenzerini & Cinzia Daraio - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 397-420.
Bodies of Data: Genomic Data and Bioscience Data Sharing.Pilar Ossorio - 2011 - Social Research: An International Quarterly 78 (4):907-932.
Bodies of data: genomic data and bioscience data sharing.Pilar N. Ossorio - 2011 - Social Research: An International Quarterly 78 (3):907-932.
Pitfalls and promises: The use of secondary data analysis in educational research.Emma Smith - 2008 - British Journal of Educational Studies 56 (3):323-339.
Developing Current Research Information Systems as Data Sources for Studies of Research.Gunnar Sivertsen - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 667-683.

Analytics

Added to PP
2020-11-24

Downloads
10 (#1,194,153)

6 months
8 (#361,341)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations