aimnova.
DashboardMy LearningPaper MasteryStudy Plan

Stay in the loop

Study tips, product updates, and early access to new features.

aimnova.

AI-powered IB study platform with personalised plans, instant feedback, and examiner-style marking.

IB Subjects
  • All IB Subjects
  • IB Diploma
  • IB ESS
  • IB Economics
  • IB Business Management
  • IB Math AI
  • IB Math AA
Question Banks
  • ESS Question Bank
  • Economics Question Bank
  • Business Management Question Bank
  • Math AI Question Bank
  • Math AA Question Bank
Predicted Topics 2026
  • ESS Predictions 2026
  • Economics Predictions 2026
  • Business Management Predictions 2026
  • Math AI Predictions 2026
  • Math AA Predictions 2026

Study Resources

  • Free Study Notes
  • Mock Exams
  • Revision Guide
  • Flashcards
  • Exam Skills
  • Command Terms
  • Past Paper Feedback
  • Grade Calculator
  • Exam Timetable 2026

Company

  • Features
  • Pricing
  • About Us
  • Blog
  • Contact
  • Terms
  • Privacy
  • Cookies

© 2026 Aimnova. All rights reserved.

Made with 💜 for IB students worldwide

v0.1.895
NotesMath AI HLTopic 4.1
Unit 4 · Statistics & Probability · Topic 4.1

IB Math AI HL — Introduction to statistics

IB Mathematics AI SL topic covering core concepts and exam-style applications.

Exam technique guidePractice questions

Key concepts in Introduction to statistics

Key Idea: Before you can analyse data, you need to understand what kind of data you have and how it was collected. Topic 4.1 covers the vocabulary of statistics: population vs sample, types of data, and sampling methods. Getting these right matters because the method of collection affects the validity of any conclusions you draw.

✅ Types of data


✅ Population and sampling

Reliability: A random sample tends to produce reliable results (low bias) if it is large enough. Non-random methods are faster but less reliable. Outlier impact: A single extreme value (outlier) can distort the mean significantly. Always identify outliers before drawing conclusions.
Paper 1: Questions often ask you to identify data type or explain why a sampling method is biased. Write a specific reason — 'convenience sampling means people who are easy to reach are over-represented' earns the mark; vague answers do not. Paper 2: You may need to calculate sample size per stratum. Divide: nₛₜᵣₐₜᵤₘ = (stratum size / population size) × total sample size.

IB-style question [6 marks]

A town with 2000 adult residents is surveyed about a proposed cycle lane. The residents are grouped by age: 800 are under 30, 700 are aged 30 to 60, and 500 are over 60. A stratified sample of 80 residents is taken. (a) Explain why stratified sampling is more suitable here than simple random sampling. (b) Find the number of residents that should be selected from each age group. (c) State one type of data being collected (support for the cycle lane) and classify it.

Step by step:

  1. (a) Stratified sampling guarantees every age group is represented in proportion to its size, so the views of older and younger residents are not under- or over-counted by chance.

    each stratum sampled in proportion→representative\text{each stratum sampled in proportion} \to \text{representative}each stratum sampled in proportion→representative
  2. (b) Use the stratified-sample rule for each group. State it first.

    ngroup=group size2000×80n_{\text{group}} = \frac{\text{group size}}{2000}\times 80ngroup​=2000group size​×80
  3. Under 30: 800 out of 2000.

    8002000×80=32\frac{800}{2000}\times 80 = 322000800​×80=32
  4. Aged 30–60: 700 out of 2000.

    7002000×80=28\frac{700}{2000}\times 80 = 282000700​×80=28
  5. Over 60: 500 out of 2000 (or 80 − 32 − 28). Check the total.

    5002000×80=20,32+28+20=80 ✓\frac{500}{2000}\times 80 = 20,\qquad 32+28+20 = 80\ \checkmark2000500​×80=20,32+28+20=80 ✓
  6. (c) 'Support for the cycle lane' (yes / no) is a category, not a number.

    support (yes/no)→qualitative (categorical)\text{support (yes/no)} \to \text{qualitative (categorical)}support (yes/no)→qualitative (categorical)
Final answer:

(a) It keeps each age group represented in proportion, avoiding chance over-representation. (b) 32 under 30, 28 aged 30–60, 20 over 60. (c) Support (yes/no) is qualitative (categorical) data.

What you'll learn in Topic 4.1

  • 4.1.1 Population and Samples
  • 4.1.2 Data Classification
  • 4.1.3 Sampling Techniques
  • 4.1.4 Data Reliability and Outliers
  • 4.1.5 Data Quality Management
Suggested study order: Read the notes for each sub-topic below → test yourself with flashcards → attempt practice questions → review exam technique.

Study resources — 4.1 Introduction to statistics

4.1.1

Population and Samples

Notes
4.1.2

Data Classification

Notes
4.1.3

Sampling Techniques

Notes
4.1.4

Data Reliability and Outliers

Notes
4.1.5

Data Quality Management

Notes

Ready to study Introduction to statistics?

Get AI-powered practice questions, personalised feedback, and a study planner tailored to your IB Math AI HL exam date.

Start studying free

Topic 4.1 Introduction to statistics forms a core part of Unit 4: Statistics & Probability in IB Math AI HL. Mastering these concepts will strengthen your understanding of connected topics across the syllabus and prepare you for exam questions that require analysis, evaluation, and real-world application.

Previous topic
3.16 Route algorithms (HL only)
Next topic
4.2 Presentation of data
All Math AI HL topics
Exam technique

Ready to practice?

Get AI-graded practice questions, mock exams, flashcards, and a personalised study plan — all aligned to your IB syllabus.

Start Studying Free

No credit card required · Cancel anytime