Building a process for cleaning data
Cleaning data involves several key steps that together form a systematic, comprehensive approach.
While the specific steps may vary depending on the nature of the data and the organization’s requirements, the following general process provides a framework for effective data cleaning.
The steps for cleaning data follow this flow:
- Data assessment
- Data profiling
- Data validation
- Data cleaning strategies
- Data transformation
- Data quality assurance
- Documentation
Let’s go through each of these steps in detail.
Data assessment
First, it’s imperative to assess the quality of the data before you start cleaning it. This may sound obvious; however, recording what you find at this stage will help you later to confirm that you have not missed any data transformations.
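As a rough illustration of what recording an assessment might look like, the following is a minimal sketch using only the Python standard library. The sample `records`, the `assess_quality` function, and the rule that `None` and empty strings count as missing are all assumptions made for this example, not part of any particular tool.

```python
from collections import Counter

# Hypothetical sample records; None and "" are treated as missing (an assumption).
records = [
    {"id": 1, "name": "Ana", "email": "ana@example.com"},
    {"id": 2, "name": "", "email": None},
    {"id": 1, "name": "Ana", "email": "ana@example.com"},  # exact duplicate of the first row
]


def assess_quality(rows):
    """Return a simple report: row count, missing values per column, duplicate rows."""
    # Collect every column name that appears in any row
    columns = sorted({key for row in rows for key in row})

    # Count missing values per column (absent keys also count as missing)
    missing = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            if row.get(col) in (None, ""):
                missing[col] += 1

    # Count rows that exactly repeat an earlier row
    seen = Counter(tuple(row.get(col) for col in columns) for row in rows)
    duplicates = sum(count - 1 for count in seen.values())

    return {"rows": len(rows), "missing": missing, "duplicate_rows": duplicates}


report = assess_quality(records)
print(report)
```

Saving a report like this before any cleaning starts gives you a baseline to compare against once the later steps are complete.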
Equally, in the world of data analysis, it is always...