Data Profiling

Data Profiling is the process of familiarizing oneself with the data.

Data Familiarization/Profiling is needed because:

  • Data may come from multiple sources

  • The meaning of the data/attributes may not be evident

  • Sufficient effort and interaction with subject matter experts may be needed to understand the meaning of the data/attributes in the data

It is also important to ensure data quality before attempting to create a visualization. Some quality issues include erroneous values, values with wrong types, missing values etc. The process of transforming poor quality data into usable data is called data wrangling.

Last updated