CS-GY 6313: Information Visualization
1.0.0
1.0.0
  • Introduction
  • Defining Information Visualization
  • Why Use Visualization?
  • Popular Visualization Sources and Tools
  • Why Use a Graphical Representation?
  • The Problem with Statistics
  • Why Use a Computer to Visualize Data?
  • Why Use Interaction?
  • Assessing the Quality of a Visualization
  • Data Abstraction
    • Types of Datasets
    • Types of Attributes
    • Attribute Semantics
    • Data Abstraction to Visualization
    • Data Profiling
  • Fundamental Graphs
    • Alternate Representations
    • Visualizing More Than 2 Attributes
    • Faceting
  • Data Transformation
  • Graphical Components and Mapping Strategies
    • Marks
    • Channels
    • Graphical Decoding
    • Evaluating the Quality of a Visual Encoding
    • Contextual Components
  • Color
    • Color Perception
    • Color Specification
    • Color Use
      • Quantitative Color Scales
      • Categorical Color Scales
      • Diverging Color Scales
      • Highlighting
    • Perceptual Issues with Color
  • Geo Visualization
    • When to Use Maps
    • Geo Visualization Techniques
      • Dot Maps
      • Heat Maps
      • Hexbin Maps
      • Choropleth Maps
      • Graduated Symbol Maps
      • Summary of Map Types
    • Issues with Maps
    • Visualizing Geo Data with Time
  • Visualizing Temporal Data
    • Time Structures
    • Visualization Methods
    • Increasing Visual Scalability
    • Beyond Using Position
  • Networks and Trees
    • Visualizing Network Data
      • Node-Link Diagrams
        • Clutter Reduction
      • Matrices
    • Visualizing Trees
      • Node-Link Diagrams
      • Special Kinds of Trees
      • Space-Partitioning and Containment
        • Sunburst and Icicle Plots
  • Interaction and Multiple Views
    • Single and Multiple View Methods
      • Single View Methods
      • Multiple (Linked) Views Methods
    • Common Scenarios
  • Exploring Data
  • Animation, Pacing and Exposition
Powered by GitBook
On this page

Was this helpful?

  1. Data Abstraction

Data Profiling

Data Profiling is the process of familiarizing oneself with the data.

Data Familiarization/Profiling is needed because:

  • Data may come from multiple sources

  • The meaning of the data/attributes may not be evident

  • Sufficient effort and interaction with subject matter experts may be needed to understand the meaning of the data/attributes in the data

It is also important to ensure data quality before attempting to create a visualization. Some quality issues include erroneous values, values with wrong types, missing values etc. The process of transforming poor quality data into usable data is called data wrangling.

PreviousData Abstraction to VisualizationNextFundamental Graphs

Last updated 5 years ago

Was this helpful?