What is Inferential Statistics? #
Inferential statistics is used to make conclusions about a population based on a sample.
Example:
Survey 100 people → predict behavior of 1 million
Probability Basics #
What is Probability? #
Probability measures how likely an event is to happen0≤P(Event)≤1
- 0 → Impossible
- 1 → Certain
Formula #
P(E)=Total outcomesFavorable outcomes
Example #
Toss a coinP(Heads)=21=0.5
Key Concepts #
Independent Events #
- One event doesn’t affect another
xample: coin tosses
Conditional Probability #
- Probability depends on another event
P(A∣B)
Normal Distribution #
What is Normal Distribution? #
A bell-shaped curve where:
- Most values are near the mean
- Symmetrical distribution
Key Properties #
- Mean = Median = Mode
- Symmetrical
- Defined by:
- Mean (μ)
- Standard Deviation (σ)
Empirical Rule (Important) #
| Range | Percentage |
|---|---|
| ±1σ | 68% |
| ±2σ | 95% |
| ±3σ | 99.7% |
Example #
Exam scores:
- Mean = 70
- Std Dev = 10
- 68% students → between 60–80
Z-Score #
Definition #
Z-score tells how far a value is from the mean (in standard deviations)
Formula #
Z=σX−μ
Example #
- X = 80
- Mean = 70
- Std Dev = 10
Z=(80−70)/10=1
Value is 1 standard deviation above mean
Use #
- Detect outliers
- Standardize data
- Compare different datasets
Hypothesis Testing #
What is Hypothesis Testing? #
A method to test assumptions using data
Key Terms #
| Term | Meaning |
|---|---|
| Null Hypothesis (H₀) | No effect |
| Alternative Hypothesis (H₁) | There is an effect |
Example #
- H₀: New drug has no effect
- H₁: New drug works
p-value #
Definition #
p-value = probability of getting results if H₀ is true
Decision Rule #
| p-value | Decision |
|---|---|
| p < 0.05 | Reject H₀ |
| p ≥ 0.05 | Fail to reject H₀ |
Meaning #
- Small p-value → strong evidence
- Large p-value → weak evidence
T-Test #
What is T-Test? #
Used to compare means of two groups
Types #
| Type | Use |
|---|---|
| One-sample | Compare with known value |
| Two-sample | Compare two groups |
| Paired | Same group before/after |
Example #
Compare:
- Old method vs New method scores
Chi-Square Test #
What is Chi-Square? #
Used for categorical data to test relationships
Example #
Test:
- Gender vs Product preference
Use #
- Check independence
- Compare observed vs expected
Correlation vs Causation #
Correlation #
Measures relationship between variables
- Positive → both increase
- Negative → one increases, other decreases
Example #
- Ice cream sales ↑
- Temperature ↑
Correlated
9.3 Causation #
One variable directly causes another
Key Difference #
| Aspect | Correlation | Causation |
|---|---|---|
| Meaning | Relationship | Cause-effect |
| Proof | Weak | Strong |
| Example | Sales & weather | Smoking → cancer |
Important Note #
Correlation ≠ Causation
Summary Table #
| Topic | Purpose |
|---|---|
| Probability | Measure likelihood |
| Normal Distribution | Understand data spread |
| Z-score | Distance from mean |
| Hypothesis Testing | Test assumptions |
| p-value | Decision making |
| T-test | Compare means |
| Chi-Square | Categorical analysis |
| Correlation | Relationship |
| Causation | Cause-effect |
