A Discussion of the Validity of Data Analytics

Bernard J. Jansen*, Kholoud K. Aldous, Joni Salminen, Hind Almerekhi, Soon gyo Jung

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

Validity is often understood as the “correctness” of a method or instrument. However, many data collection issues can degrade the validity of findings about people. Data validity is also a crucial issue for web and social media analytics. Data validity is data accuracy, where accuracy is the degree to which the data conforms to actual values. If the data has high validity, it means that the values in the dataset correspond to true properties in the physical (or virtual) world. If the numbers do not correspond to the actual values, there is a data validity issue; in other words, the data validity is low. In this chapter, you will gain a better understanding of data validity, the major causes of data validity, and how to account for and correct data validity.

Original languageEnglish
Title of host publicationSynthesis Lectures on Information Concepts, Retrieval, and Services
PublisherSpringer Nature
Pages139-145
Number of pages7
DOIs
Publication statusPublished - 6 Sept 2024

Publication series

NameSynthesis Lectures on Information Concepts, Retrieval, and Services
VolumePart F1359
ISSN (Print)1947-945X
ISSN (Electronic)1947-9468

Fingerprint

Dive into the research topics of 'A Discussion of the Validity of Data Analytics'. Together they form a unique fingerprint.

Cite this