File(s) under permanent embargo

Reason: This item is currently closed access.

Text mining and social media: when quantitative meets qualitative, and software meets humans [Working paper]

preprint
posted on 15.12.2017 by Lawrence Ampofo, Simon Collister, Ben O'Loughlin, Andrew Chadwick
The ongoing production of staggeringly huge volumes of digital data is a ubiquitous part of life in the early twenty-first century. A large proportion of this data is text. This development has serious implications for almost all scholarly endeavour. It is now possible for researchers from a wide range of disciplines to use text mining techniques and software tools in their daily practice. In our own field of political communication, the prospect of cheap access to what, how, and to whom very large numbers of citizens communicate in social media environments provides opportunities that are often too good to miss as we seek to understand how and why citizens think and feel the way they do about policies, political organizations, and political events. But what are the methods and tools on offer, how should they best be used, and what sorts of ethical issues are raised by their use? In this article we proceed as follows. First, we provide a basic definition of text mining. Second, we provide examples of how text mining has been used recently in a diverse range of analytical contexts, from business to media to politics. Third, we discuss the challenges of conducting text mining in online social media environments, focusing on issues such as the problem of gaining access to social media data, research ethics, and the integrity of the data corpuses that are available from social media companies. Fourth, we present a basic but comprehensive survey of the text mining tools that are currently available. Finally, we present two brief case studies of the application of text mining in the authors’ field of political communication. We conclude with some observations about the proper place of text mining in social science research.

History

School

  • Social Sciences

Department

  • Communication, Media, Social and Policy Studies

Citation

AMPOFO, L. ... et al, 2013. Text mining and social media: when quantitative meets qualitative, and software meets humans. Royal Holloway, University of London New Political Communication Unit Working Paper. London: Royal Holloway, University of London.

Publisher

Royal Holloway, University of London

Version

VoR (Version of Record)

Publisher statement

This work is made available according to the conditions of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) licence. Full details of this licence are available at: https://creativecommons.org/licenses/by-nc-nd/4.0/

Publication date

2013

Notes

This working paper is closed access. This paper is available at http://www.newpolcom.rhul.ac.uk/working-papers-1/.

Language

en

Exports

Logo branding

Keyword(s)

Exports