The concept of Standard Deviation is a crucial measure in understanding how data is spread out in a set of numbers. It indicates the average amount of variation or dispersion from the mean of the data. It helps in analyzing the consistency or variability within a dataset.
Its history traces back to the pioneering work of renowned statistician Karl Pearson in the late 19th century, who contributed significantly to its development.
The goal of this article is to explain the concept of Standard deviation with the help of different points.
- Standard Deviation: Definition
- Classification of Standard deviation.
- How to evaluate the Standard deviation
- Computation of grouped data Standard deviation.
- Solved problems Related to Standard Deviation.
Let's explain all the points and gain more insight into the standard deviation
Standard Deviation: Definition
A standard deviation measures how spread-out data is compared to the average. When it's low then data tend to cluster around the average. When it's high then data are more widely spread. This kind of deviation is also called volatility and is often linked to risk.
It calculates the average gap between data points and the mean within a dataset.
Classification of Standard Deviation:
Standard Deviation can be classified based on whether it's calculated from a sample or an entire population.
1. Population Standard Deviation: This is used when we have data for an entire population, encompassing all the elements we're interested in analyzing. It considers the full set of data points.
Formula to calculate:
2. Sample Standard Deviation: This is used when we have a subset (sample) of the population. It estimates the standard deviation of the entire population based on the characteristics of the sample, making adjustments to account for the smaller dataset.
Formula to calculate:
s = √ (Σ (x - x̄) 2 / (n - 1))
Differences to Note in formulas:
- Sample SD factors in the sample mean (x̄) and sample size (n).
- Population SD factors in the population mean (μ) and total population size (N).
- >The Sample SD compensates for potential underestimation resulting from sampling by including an extra term in the denominator
How to Evaluate/ Calculate SD?
To calculate standard deviation (SD):
- Compute the Mean: Find the average of the data points.
- Find the Differences: Subtract the mean from every data value.
- Square the Differences: Square separately of these differences.
- Calculate the Variance: The average of the squared differences.
- Determine SD: Take the sqrt of the Var. to obtain the Standard Deviation.
Computation of Grouped Data Standard Deviation:
The Standard Deviation for grouped data involves using a modified formula that incorporates the frequency of each interval or class. It considers the midpoints of the classes along with their frequencies to calculate the variance.
This adjusted formula accommodates the grouped nature of the data and provides a measure of the spread or dispersion within the grouped intervals.
The formula to calculate the Standard Deviation for grouped data is:
SD = Sqrt [1/ (N - 1) [∑fxi2 - 1/N(∑fxi)2]
Where:
- f represents the frequency of each class interval.
- x denotes the midpoint of each class interval.
Solved Problems Related to Standard Deviation:
Example 1:
Calculate the SD of the following grouped Data:
Consider the following grouped data representing the weights (in kilograms) of a group of people:
Weight (kg) |
Frequency |
51-60 |
5 |
61-70 |
10 |
71-80 |
15 |
81-90 |
8 |
91-100 |
4 |
Solution:
Class Interval |
Frequency (f) |
Mid Value (xi) |
fxi |
fxi2 |
51 - 60 |
5 |
55.5 |
277.5 |
15401.25 |
61 - 70 |
10 |
65.5 |
655 |
42902.5 |
71 - 80 |
15 |
75.5 |
1132.5 |
85503.75 |
81 - 90 |
8 |
85.5 |
684 |
58482 |
91 - 100 |
4 |
95. 5 |
382 |
36481 |
∑f = 42 |
∑fxi = 3131 |
∑fxi2= 238770.5 |
x̄ (Mean) = ∑fxi / ∑f = 3131 / 42 = 74.54
Variance = 1/ (N - 1) [∑fxi2 - 1/N(∑fxi)2]
= 1 / 41 [ 238770.5 - (1 /42) (3131)2] = 130.7781
SD = √variance = √ 130.7781 = 11.4358
Example 2:
Calculate both the Sample Standard Deviation and the Population Standard Deviation for the dataset
: 12, 15, 18, 20, 22.
Sample Standard Deviation:
Given the dataset has 5 values (n = 5), we'll use the Sample Standard Deviation formula:
s = √ (Σ (x - x̄) 2 / (n - 1))
Step 1: Find the Average
Mean=12+15+18+20+225=875=17.4
Step 2: Calculate the Squared Differences from the Mean Squared Differences:
Xi |
Xi - μ (x̄) |
12 |
-5.399 |
15 |
-2.399 |
18 |
0.600 |
20 |
2.600 |
22 |
4.600 |
Step 3: Sum the Squared Differences Sum of Squared Differences:
(Xi - x̄ or (μ))2 |
29.16 |
5.76 |
0.36 |
6.76 |
21.16 |
∑ (Xi - X)2 = 63.2 |
By substituting values in the formula.
s = √ (Σ (x - x̄) 2 / (n - 1)) = √ (63.2)2 / 5-1
s = √ 0.25 *63.2
s = √ 15.8
s = 3.97
Population Standard Deviation: Using the Population Standard Deviation formula:
= √ (Σ (x - μ) 2 / N)
By substituting values in the formula.
σ = [√ 1 / 5(63.2)]
σ = √ 12.64
σ = 3.56
Final words:
In this article, we delved into the fascinating world of Standard Deviation, a key metric for understanding data spread. We explored its definition, different types, and the calculation methods for individual and grouped data. Solved examples showcased how it works in practice.
Tags:
Image Credits: Freepik