8  Descriptive Visualizations

8.1 Categorical Data

8.1.1 Bar Chart

  • Used to display frequencies or proportions of categorical variables
  • Each bar represents a category with its height corresponding to frequency
  • Easily compares values across different categories

8.1.2 Pie Chart

  • Represents proportions as slices of a circle
  • Useful for showing part-to-whole relationships
  • Less effective for comparing similar values

8.1.3 Frequency Table

  • Tabular summary of frequency and proportion
  • Can be accompanied by bar chart or pie chart for visual representation
  • Enhances understanding of distribution in categorical data

8.2 Numerical Data

8.2.1 Histogram

  • Displays the distribution of a continuous numeric variable
  • Bins group values into intervals with bar heights representing frequency
  • Useful for identifying skewness, modality, and spread

8.2.2 Boxplot (Box-and-Whisker Plot)

  • Visualizes median, quartiles, and potential outliers
  • Shows distribution shape and variability
  • Ideal for comparing multiple numeric groups

8.2.3 Density Plot

  • Smoothed version of a histogram using a kernel function
  • Visualizes the probability distribution of a continuous variable
  • Useful for overlaying multiple distributions

8.2.4 Line Chart

  • Typically used for time series or ordered data
  • Shows trends, patterns, or changes over intervals
  • Each point connected with lines to indicate progression

8.3 Multivariate Visualizations

8.3.1 Scatter Plot

  • Displays relationship between two numeric variables
  • Shows trends, clusters, and possible outliers
  • Basis for correlation and regression analysis

8.3.2 Bubble Chart

  • Extension of scatter plot with size of bubbles representing a third variable
  • Adds one more dimension of data insight
  • Useful in business and demographic visualizations

8.3.3 Faceted Plots

  • Create multiple subplots by splitting data based on a categorical variable
  • Useful for comparing distributions or relationships across categories
  • Often used with histograms, scatter plots, or boxplots

8.3.4 Heatmap

  • Color-coded matrix showing values of two variables (often categorical or correlation)
  • Commonly used for visualizing correlation matrices or frequency tables
  • Colors represent magnitude or intensity

8.4 Distribution Shape and Outliers

8.4.1 Histogram and Density Plot

  • Enhances distribution interpretation by showing both bar and smoothed curve
  • Ideal for understanding shape, central tendency, and spread

8.4.2 Boxplot with Outlier Labels

  • Explicitly marks outliers beyond whiskers
  • Helps in detecting unusual data points

8.4.3 Violin Plot

  • Merges boxplot and density plot
  • Reveals detailed distribution shape while showing summary statistics