Title: Statistics and Probability – A Comprehensive Guide
Introduction: What is Statistics and Probability?
Statistics and Probability are two intertwined branches of mathematics that help us interpret and understand data and randomness in real-world situations. Statistics deals with data collection, analysis, interpretation, and presentation, while Probability focuses on measuring the likelihood of events occurring.
Both domains are fundamental in fields like economics, science, engineering, and social studies, helping us make informed decisions and predictions based on data.
Why Study Statistics and Probability?
Understanding statistics and probability equips us to make data-driven decisions, understand patterns in unpredictable events, and grasp the uncertainty that shapes our world.
Notable modern-day mathematicians and statisticians in the field include:
– Persi Diaconis – Known for work in probability theory and statistical mechanics.
– Susan Holmes – Works on applied statistics in biology and data science.
– Bradley Efron – Developer of the bootstrap method in statistics.
Learning these concepts is crucial in today’s data-driven world, and students are encouraged to join Skorminda to fully benefit from expert-led classes and engaging coursework in statistics and probability.
Descriptive Statistics
Descriptive Statistics summarizes and describes the main features of a data set using measures like mean, median, mode, and standard deviation.
Equations:
– Mean:
$$ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i $$
– Standard Deviation:
$$ \sigma = \sqrt{ \frac{1}{n} \sum_{i=1}^{n} (x_i – \bar{x})^2 } $$
Easy Questions:
1. Find the mean of the numbers: 2, 4, 6, 8, 10.
Solution:
$$ \bar{x} = \frac{2+4+6+8+10}{5} = \frac{30}{5} = 6 $$
2. Identify the mode from the data: 7, 8, 9, 7, 5, 7, 6.
Solution: Mode is 7 (appears 3 times).
Medium Questions:
1. A data set: 3, 5, 7, 7, 10. Find median and range.
Solution:
Median = 7 (middle value), Range = 10 – 3 = 7
2. Calculate the standard deviation of 4, 6, 8.
Solution:
Mean = 6
$$ \sigma = \sqrt{ \frac{(4-6)^2 + (6-6)^2 + (8-6)^2}{3} } = \sqrt{ \frac{4 + 0 + 4}{3} } = \sqrt{ \frac{8}{3} } \approx 1.63 $$
Hard Questions:
1. Given data: 12, 15, 18, 20, 25. Find variance and standard deviation.
Solution:
Mean = 18
$$ \sigma^2 = \frac{(12-18)^2 + (15-18)^2 + (18-18)^2 + (20-18)^2 + (25-18)^2}{5} = \frac{36 + 9 + 0 + 4 + 49}{5} = \frac{98}{5} = 19.6 $$
$$ \sigma = \sqrt{19.6} \approx 4.43 $$
2. A class has scores: 70, 75, 80, 85, 90. A new student scored 100. Calculate new mean and variance.
Solution:
Original mean = 80, new mean:
$$ \bar{x} = \frac{70+75+80+85+90+100}{6} = \frac{500}{6} \approx 83.33 $$
Variance:
$$ \text{New Variance} = \frac{\sum (x_i – \bar{x})^2}{6} $$
Continue with the calculation based on the mean.
Inferential Statistics
Inferential Statistics uses sample data to make generalizations about a population using methods like hypothesis testing and confidence intervals.
Equation:
– Confidence Interval:
$$ \bar{x} \pm z \cdot \frac{\sigma}{\sqrt{n}} $$
Easy Questions:
1. From a survey, the sample mean score is 75 with standard deviation of 5. Find 95% CI for a sample of 25.
Solution:
$$ CI = 75 \pm 1.96 \cdot \frac{5}{\sqrt{25}} = 75 \pm 1.96 $$
CI = (73.04, 76.96)
2. What is the purpose of hypothesis testing?
Solution: To determine whether there’s enough evidence in a sample to infer that a condition holds for an entire population.
Medium Questions:
1. Null hypothesis $H_0$: mean = 80. Sample mean = 85, standard deviation = 10, $n = 16$. Test at $\alpha = 0.05$.
Solution:
$$ z = \frac{85 – 80}{10/\sqrt{16}} = \frac{5}{2.5} = 2 $$
Compare with critical z = ±1.96 → Reject $H_0$.
2. Construct 99% CI for sample mean 50, $\sigma = 6$, $n = 100$.
Solution:
$$ CI = 50 \pm 2.576 \cdot \frac{6}{10} = 50 \pm 1.5456 $$
CI = (48.4544, 51.5456)
Hard Questions:
1. A factory tests light bulbs: Sample mean lifetime = 980 hours, $\sigma = 30$, $n = 36$. Test if mean lifetime is less than 1000 at 0.01 level.
$$ z = \frac{980 – 1000}{30/\sqrt{36}} = \frac{-20}{5} = -4 $$
Critical z = -2.33 → Reject $H_0$.
2. A researcher claims diet reduces weight by more than 5kg. Sample mean = 5.5kg, $\sigma = 1$, $n = 25$. Test at $\alpha = 0.05$.
$$ z = \frac{5.5 – 5}{1/\sqrt{25}} = 2.5 $$
Critical value (one-tailed) = 1.645 → Reject $H_0$.
Probability Concepts
Probability is a measure from 0 to 1 representing the likelihood an event occurs, using principles like addition, multiplication, and conditional probability.
Equations:
– Probability of event A:
$$ P(A) = \frac{\text{Number of favorable outcomes}}{\text{Total outcomes}} $$
– Conditional Probability:
$$ P(A|B) = \frac{P(A \cap B)}{P(B)} $$
Easy Questions:
1. Roll a die. Probability of getting even number?
$$ P = \frac{3}{6} = 0.5 $$
2. Toss a coin. What’s the chance of heads?
$$ P(\text{Heads}) = \frac{1}{2} $$
Medium Questions:
1. Two cards drawn from deck without replacement. Find probability both are aces.
$$ P = \frac{4}{52} \cdot \frac{3}{51} = \frac{1}{221} $$
2. If P(A) = 0.4 and P(B) = 0.5 and A, B are independent, find P(A ∩ B).
$$ P(A \cap B) = 0.4 \cdot 0.5 = 0.2 $$
Hard Questions:
1. Probability that at least one head appears in 3 coin tosses.
Total outcomes = 8, only 1 outcome is all tails.
$$ P = 1 – \frac{1}{8} = \frac{7}{8} $$
2. A box has 3 red, 2 green, 5 blue balls. Draw 2 without replacement. Probability both are green:
$$ P = \frac{2}{10} \cdot \frac{1}{9} = \frac{1}{45} $$
Probability Distributions
Probability Distributions describe how probabilities are distributed over values, including discrete (Binomial) and continuous (Normal) distributions.
Equations:
– Binomial:
$$ P(X = k) = {n \choose k} p^k (1-p)^{n-k} $$
– Normal Distribution PDF:
$$ f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{ -\frac{(x – \mu)^2}{2\sigma^2} } $$
Easy Questions:
1. Flip a coin 2 times. Find probability of 1 head.
$$ P(X=1) = {2 \choose 1} \cdot \frac{1}{2}^1 \cdot \frac{1}{2}^1 = 2 \cdot \frac{1}{4} = 0.5 $$
2. In normal dist. with $\mu = 0$, $\sigma = 1$, what’s probability that value is between -1 and 1?
Approx = 68%
Medium Questions:
1. Binomial (n=5, p=0.6). Find P(X=3):
$$ P(X=3) = {5 \choose 3} (0.6)^3 (0.4)^2 = 10 \cdot 0.216 \cdot 0.16 = 0.3456 $$
2. Find Z-score: x = 85, $\mu = 70$, $\sigma = 10$.
$$ Z = \frac{85 – 70}{10} = 1.5 $$
Hard Questions:
1. Normal Distribution: $\mu = 50$, $\sigma = 5$. Find P(45 ≤ X ≤ 55).
Convert to Z-scores:
$$ Z = \frac{45 – 50}{5} = -1, \quad Z = \frac{55 – 50}{5} = 1 $$
P = 68%
2. Binomial: n=10, p=0.3. Probability of getting at most 2 successes:
$$ P(X \leq 2) = P(0) + P(1) + P(2) $$
Evaluate each using binomial formula.
Random Variables
Random Variables assign numerical value to outcomes of a random experiment and can be discrete or continuous.
Equations:
– Expected value:
$$ E(X) = \sum x_i P(x_i) $$
– Variance:
$$ Var(X) = E(X^2) – [E(X)]^2 $$
Easy Questions:
1. A dice is rolled. Expected outcome:
$$ E(X) = \frac{1+2+3+4+5+6}{6} = 3.5 $$
2. In coin toss, what value does random variable take for head (1) and tail (0)?
Values: 0 or 1
Medium Questions:
1. Probabilities: X = 0: 0.2, X = 1: 0.5, X = 2: 0.3. Find E(X):
$$ E(X) = 0 \cdot 0.2 + 1 \cdot 0.5 + 2 \cdot 0.3 = 0 + 0.5 + 0.6 = 1.1 $$
2. Find Var(X) using values and E(X) = 1.1.
$$ E(X^2) = 0^2 \cdot 0.2 + 1^2 \cdot 0.5 + 2^2 \cdot 0.3 = 0 + 0.5 + 1.2 = 1.7 $$
$$ Var(X) = 1.7 – (1.1)^2 = 1.7 – 1.21 = 0.49 $$
Hard Questions:
1. X takes 1, 2, 3 with probabilities 0.2, 0.5, 0.3. Find E(X) and Var(X).
E(X) = 2.1
$$ E(X^2) = 1^2 \cdot 0.2 + 2^2 \cdot 0.5 + 3^2 \cdot 0.3 = 0.2 + 2 + 2.7 = 4.9 $$
$$ Var(X) = 4.9 – (2.1)^2 = 0.49 $$
2. If P(X=1)=0.3, P(X=2)=0.4, P(X=3)=0.3, find SD(X).
Variance as above, SD = √variance.
Conclusion:
Statistics and Probability form the backbone of data analysis and decision-making in modern industries. With real-world applications in economics, science, and AI, mastering these topics is essential.
To unlock your true potential, join Skorminda and build a strong foundation with interactive lectures, real-world projects, and expert mentorship in Statistics and Probability.




No Comments