Loughborough University
Browse
Thesis-2020-Palmer.pdf (5.95 MB)

Reusable templates for the extraction of knowledge

Download (5.95 MB)
thesis
posted on 2021-05-21, 10:11 authored by Paul Palmer
‘Big Data’ is typically noted to contain undesirable imperfections that are usually described using terminology such as ‘messy’, ‘untidy’ or ‘ragged’ requiring ‘cleaning’ as preparation for analysis. Once the data has been cleaned, a vast amount of literature exists exploring how best to proceed. The use of this pejorative terminology implies that it is imperfect data hindering analysis, rather than recognising that the encapsulated knowledge is presented in an inconvenient state for the chosen analytical tools, which in turn leads to a presumption about the unsuitability of desktop computers for this task. As there is no universally accepted definition of ‘Big Data’ this inconvenient starting state is described here as ‘nascent data’ as it carries no baggage associated with popular usage. This leads to the primary research question: Can an empirical theory of the knowledge extraction process be developed that guides the creation of tools that gather, transform and analyse nascent data? A secondary pragmatic question follows naturally from the first: Will data stakeholders use these tools?

Funding

DTP 2018-19 Loughborough University

Engineering and Physical Sciences Research Council

Find out more...

History

School

  • Mechanical, Electrical and Manufacturing Engineering

Publisher

Loughborough University

Rights holder

© Paul J. Palmer

Publication date

2020

Notes

A thesis submitted in partial fulfilment of the requirements for the award of the degree of Doctor of Philosophy of Loughborough University.

Language

  • en

Supervisor(s)

Michael Henshaw ; Russell Lock

Qualification name

  • PhD

Qualification level

  • Doctoral

This submission includes a signed certificate in addition to the thesis file(s)

  • I have submitted a signed certificate

Usage metrics

    Mechanical, Electrical and Manufacturing Engineering Theses

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC