Data Transformation

As discussed earlier, the first step in creating a visualization is data selection. This refers to choosing from the data the attributes needed for the visualization.

Once we select the attributes, the next step is usually to perform aggregation or other transformations in order to be able to create appropriate visualizations, that communicate the required information effectively.

Common aggregation functions include SUM, MAX, MIN, AVERAGE, MEDIAN and STDDEV.

These involve transformations at different levels/resolutions: seconds, minutes, hours; days, weeks, months, years etc.

These involve transformations at different levels/resolutions: zip code, county, city, state, country etc.

Geo coding/decoding refers to going from the name of a place to its geo coordinates and vice versa.

Binning

It is the process of transforming a quantitative attribute to an ordinal attribute. Ex. restaurant inspection scores from numbers to letters.

Rescaling/Re-Expression

There are few ways to accomplish this:

- Normalization

This refers to bringing all the values to a scale of say [-1, 1].

- Percentages

This refers to converting quantitative values to percentages, which makes it easier to make comparisons.

Last updated