Thesis-2020-Palmer.pdf (5.95 MB)

Reusable templates for the extraction of knowledge

Download (5.95 MB)
posted on 21.05.2021, 10:11 by Paul Palmer
‘Big Data’ is typically noted to contain undesirable imperfections that are usually described using terminology such as ‘messy’, ‘untidy’ or ‘ragged’ requiring ‘cleaning’ as preparation for analysis. Once the data has been cleaned, a vast amount of literature exists exploring how best to proceed. The use of this pejorative terminology implies that it is imperfect data hindering analysis, rather than recognising that the encapsulated knowledge is presented in an inconvenient state for the chosen analytical tools, which in turn leads to a presumption about the unsuitability of desktop computers for this task. As there is no universally accepted definition of ‘Big Data’ this inconvenient starting state is described here as ‘nascent data’ as it carries no baggage associated with popular usage. This leads to the primary research question: Can an empirical theory of the knowledge extraction process be developed that guides the creation of tools that gather, transform and analyse nascent data? A secondary pragmatic question follows naturally from the first: Will data stakeholders use these tools?


DTP 2018-19 Loughborough University

Engineering and Physical Sciences Research Council

Find out more...



  • Mechanical, Electrical and Manufacturing Engineering


Loughborough University

Rights holder

© Paul J. Palmer

Publication date



A thesis submitted in partial fulfilment of the requirements for the award of the degree of Doctor of Philosophy of Loughborough University.




Michael Henshaw ; Russell Lock

Qualification name


Qualification level


This submission includes a signed certificate in addition to the thesis file(s)

I have submitted a signed certificate

Usage metrics

Mechanical, Electrical and Manufacturing Engineering Theses