11  Measures of Skewness

The shape of a dataset’s distribution is characterized by its skewness and kurtosis, offering insights into the data’s symmetry and peakness.

Understanding these measures helps in identifying the symmetry and the peakedness of the distribution, respectively, which are crucial for analyzing the data’s behavior and making informed decisions.

11.1 Skewness

Skewness measures the degree of asymmetry or deviation from symmetry in the distribution of data. A distribution is symmetrical if it looks the same to the left and right of the center point.

  • Zero Skewness: Indicates a perfectly symmetrical distribution.
  • Positive Skewness: Indicates a distribution with a tail that stretches out more towards the positive side of the scale.
  • Negative Skewness: Indicates a distribution with a tail that stretches out more towards the negative side of the scale.

Formula for Skewness: \[ Skewness = \frac{N \sum (X_i - \overline{X})^3}{(N-1)(N-2)S^3} \]

Where:

  • \(N\) is the number of observations,
  • \(X_i\) is each individual observation,
  • \(\overline{X}\) is the mean of the observations,
  • \(S\) is the standard deviation.

Skewness measures the asymmetry of a distribution: - Skewness > 0 → Positively skewed (Right-skewed) - Skewness = 0 → Symmetric (Normal distribution) - Skewness < 0 → Negatively skewed (Left-skewed)

Example of Skewness: Consider a dataset of exam scores: [55, 60, 65, 65, 70, 75, 80]. The distribution of these scores might show slight skewness (positive or negative) depending on how they deviate from the mean. If the data were more concentrated on the lower end (more high scores), the distribution would be positively skewed.

Calculation in R