Data science is the study of collecting, processing and extracting value from big and diverse data sets. It enables the scientists to create data-driven solutions to boost profits, reduce costs and make solutions to various problems.
Data cleaning is an important part of data analysis. This is done to eliminate, modify, or restore data depending on its state. Data that is corrupt or redundant apart from duplicate files is removed. Inaccurate data is identified and sorted. Incomplete data is marked and modified. Back up of data is taken before cleaning it to prevent loss of information.