admin@publications.scrs.in   
New Frontiers in Communication and Intelligent Systems

Big Data Quality - A Survey Paper to Attain Data Quality

Authors: Anusha Y., Visalakshi R and Srinivas K.


Publishing Date: 28-08-2022

ISBN: 978-81-95502-00-4

DOI: https://doi.org/10.52458/978-81-95502-00-4-71

Abstract

Data, collection of data, and assessing the quality of data are crucial aspects. The World Federation of Hemophilia (WFH) collects data from its members, which helps it to keep track of the health details of the people. In such cases, if the available current data is accurate then the results will be correct. Ensuring high-quality data and assessing the quality of the data is also important in health systems. Quality data has a few measures to be checked. Timeliness, relevance, understandability, completeness, and reliability are some of the main measures of data quality. After reviewing data quality assessment methods, we found that institutions and health organizations review their data frequently to achieve data quality. Huge data with massive quantity and complicated nature is generally not structured well and is in transliterated language and it becomes an issue to deal with this big data. Big data comprises a 4V model which means huge volume, velocity, variety, and veracity. Huge volume is a chunk of gathered data that contains different formats like graphics, videos, text, images, etc. Data from all these areas will be collected and assessed. We will be working with the health databases and their outcomes. There are many limitations to assessing data quality; it is because of incomplete definitions of data quality and its measures. In the future, research can be done to improve data quality and its attributes. We can increase efforts to improvise the data collection process and define the attributes more clearly and precisely.

Keywords

Data quality, Data mining, Big data, Data assessment methods

Cite as

Anusha Y., Visalakshi R and Srinivas K., "Big Data Quality - A Survey Paper to Attain Data Quality", In: Rahul Srivastava and Aditya Kr. Singh Pundir (eds), New Frontiers in Communication and Intelligent Systems, SCRS, India, 2022, pp. 703-711. https://doi.org/10.52458/978-81-95502-00-4-71

Recent