aimnova.
DashboardMy LearningPaper MasteryStudy Plan

Stay in the loop

Study tips, product updates, and early access to new features.

aimnova.

AI-powered IB study platform with personalised plans, instant feedback, and examiner-style marking.

IB Subjects
  • All IB Subjects
  • IB Diploma
  • IB ESS
  • IB Economics
  • IB Business Management
  • IB Math AI
  • IB Math AA
Question Banks
  • ESS Question Bank
  • Economics Question Bank
  • Business Management Question Bank
  • Math AI Question Bank
  • Math AA Question Bank
Predicted Topics 2026
  • ESS Predictions 2026
  • Economics Predictions 2026
  • Business Management Predictions 2026
  • Math AI Predictions 2026
  • Math AA Predictions 2026

Study Resources

  • Free Study Notes
  • Mock Exams
  • Revision Guide
  • Flashcards
  • Exam Skills
  • Command Terms
  • Past Paper Feedback
  • Grade Calculator
  • Exam Timetable 2026

Company

  • Features
  • Pricing
  • About Us
  • Blog
  • Contact
  • Terms
  • Privacy
  • Cookies

© 2026 Aimnova. All rights reserved.

Made with 💜 for IB students worldwide

v0.1.899
NotesMath AI HLTopic 4.18t-tests and z-tests
Back to Math AI HL Topics
4.18.12 min read

t-tests and z-tests

IB Mathematics: Applications and Interpretation • Unit 4

Smart study tools

Turn reading into results

Move beyond passive notes. Answer real exam questions, get AI feedback, and build the skills that earn top marks.

Get Started Free

Contents

  • Hypotheses and the p-value rule
  • t-test (σ unknown) vs z-test (σ known); one- and two-sample
A test weighs the evidence against H₀: Imagine a coffee chain claims its cups hold 250 ml on average. A sample comes out a bit low — but is that a real shortfall, or just random cup-to-cup variation?

A hypothesis test answers this. You set up two rival statements:

H₀ (null): nothing unusual — the mean is exactly the claimed value (μ = 250).

H₁ (alternative): the claim you suspect (μ < 250, or μ ≠ 250).

The GDC returns a p-value = the probability of getting data this extreme if H₀ were true.

The decision rule on its own line:

p < significance level → reject H₀ (the data are too surprising for H₀ to stand).

p ≥ significance level → do not reject H₀ (the data are consistent with H₀).
α is the significance level (often 5% = 0.05). It is the bar the p-value must beat.
One-tailed vs two-tailed: One-tailed (μ < 250 or μ > 250): you only care about one direction. Use when the claim is directional ('cups are short-filled').

Two-tailed (μ ≠ 250): you care about a difference either way. Use when the claim is just 'the mean has changed'.

The GDC needs to know which — pick the tail that matches H₁.

IB-style question — set up and conclude

A machine should fill bottles to 500 ml. A consumer group suspects under-filling and tests at the 5% significance level. Their test gives p = 0.018.

State the hypotheses and the conclusion in context.

Step by step

  1. 'Suspects under-filling' is directional, so this is a one-tailed (lower) test.
  2. Compare the p-value to α = 0.05.
  3. Since p < α, reject H₀.

Final answer

Reject H₀. There is significant evidence (at the 5% level) that the machine under-fills the bottles — the mean is below 500 ml.

Which test? Look at the standard deviation: Both tests compare means, but they differ in what you know about the spread:

z-test — use when the population standard deviation σ is given (known).

t-test — use when σ is unknown and you only have the sample standard deviation. This is the common case in real data, so the t-test dominates AI HL.

Then count the groups:

One-sample — compare one group's mean to a fixed claimed value (e.g. 'is the mean weight 250 g?').

Two-sample — compare two independent groups' means (e.g. 'do brand A and brand B last equally long?').
On the GDC: pick T-Test or Z-Test, enter the data (or stats), choose the tail, read off p.

IB-style question — one-sample t-test

A nutritionist claims a snack bar contains 200 kcal on average. A sample of 12 bars has mean 207 kcal and sample standard deviation 9 kcal. The population σ is unknown. Test at the 5% level whether the mean differs from 200 kcal.

Step by step

  1. σ unknown → t-test. 'Differs' (either way) → two-tailed.
  2. Enter the summary stats into the GDC's one-sample t-test (μ₀ = 200, x̄ = 207, sₙ₋₁ = 9, n = 12, two-tailed).
  3. The GDC returns the p-value.
  4. Compare to α = 0.05.

Final answer

Reject H₀. There is significant evidence at the 5% level that the mean calorie content differs from the claimed 200 kcal (the sample suggests it is higher).

IB-style question — two-sample t-test

Two factories produce LED bulbs. A sample from factory A lasts longer on average than a sample from factory B. A two-sample t-test of H₀: μA = μB against H₁: μA > μB gives p = 0.16 at the 5% level.

State the conclusion in context.

Step by step

  1. Two independent groups, σ unknown → two-sample t-test (one-tailed, since H₁ is directional).
  2. Compare the p-value to α.
  3. Since p ≥ α, do not reject H₀.

Final answer

Do not reject H₀. There is not enough evidence at the 5% level to conclude that factory A's bulbs last longer — the observed difference could be due to chance.

IB Exam Questions on t-tests and z-tests

Practice with IB-style questions filtered to Topic 4.18.1. Get instant AI feedback on every answer.

Practice Topic 4.18.1 QuestionsBrowse All Math AI HL Topics

How t-tests and z-tests Appears in IB Exams

Examiners use specific command terms when asking about this topic. Here's what to expect:

Define

Give the precise meaning of key terms related to t-tests and z-tests.

AO1
Describe

Give a detailed account of processes or features in t-tests and z-tests.

AO2
Explain

Give reasons WHY — cause and effect within t-tests and z-tests.

AO3
Evaluate

Weigh strengths AND limitations of approaches in t-tests and z-tests.

AO3
Discuss

Present arguments FOR and AGAINST with a balanced conclusion.

AO3

See the full IB Command Terms guide →

Related Math AI HL Topics

Continue learning with these related topics from the same unit:

4.1.1Population and Samples
4.1.2Data Classification
4.1.3Sampling Techniques
4.1.4Data Reliability and Outliers
View all Math AI HL topics

Improve your exam technique

Command terms, paper structure, and mark-scheme tips for Math AI HL

Previous
4.17.1Poisson distribution
Next
Type I and Type II errors4.18.2

11 questions to test your understanding

Reading is just the start. Students who tested themselves scored 82% on average — try IB-style questions with AI feedback.

Start Free TrialView All Math AI HL Topics