Document Type
Book Chapter
Publication Date
5-1-2016
First Page
126
Last Page
145
URL with Digital Object Identifier
https://doi.org/10.4135/9781473983847
Abstract
This chapter provides a broad introduction to the modelling, cleaning, and transformation techniques that must be applied to social media data before it can be imported into storage and analysis software. While each of these topics in itself encompasses a wide range of issues, they are also inextricably related in that each relies in some way upon the others. In order to discuss these processes as a group, we employ the term "data processing" to describe the preparatory phase between data collection and data analysis. The sections that follow demonstrate how data processing can be broken down into a pipeline of three phases: modelling, cleaning, and transformation.