• Topic Specification

Hypothesis Testing Revision

You may also like, mme learning portal, hypothesis testing.

A hypothesis test uses some sample data to test whether a hypothesis (a belief about the distribution of a random variable) is true. Hypothesis testing comes with a substantial amount of terminology.

Terminology for Hypothesis Testing

  • Hypothesis tests are based on two hypotheses . The null hypothesis , H_{0} , is a statement about the value of a population parameter (a parameter of the distribution of a random variable) which our data will tell us whether or not to reject . The alternate hypothesis , H_{1} , is what we believe the parameter is if we reject the null hypothesis.
  • In general, the null hypothesis is something that we want to show is false . This is because hypothesis tests do not show if something is true , only if something is false, but we can find out something true that we want to know by showing that the null hypothesis is false. To this end, the null hypothesis is usually that a parameter takes a specific value , while the alternate hypothesis is usually that the parameter differs from the specific value , rather than specifying another value.
  • A hypothesis test is the means by which we generate a test statistic that directs us to either reject or not reject the null hypothesis.
  • The test statistic is a “summary” of the collected data , and should have a sampling distribution specified by the null hypothesis .

One or Two Tailed Tests

Hypothesis tests can be one tailed or two tailed . This depends on H_{1} .

  • In a one tailed test , H_{1} takes the form p>x or p<x , where H_{0} is that p=x .
  • In a two tailed test , H_{1} takes the form p\neq x , where H_{0} is that p=x .

MME Logo

Your 2024 Revision Partner

Significant Data

We reject the null hypothesis when the data we observe is unlikely to have occurred if it were true.

Specifically, we state a significance level \alpha before we perform the hypothesis test , and if the probability of getting the data we got is less than \alpha in a one tail test or less than \dfrac{\alpha}{2} in a two tail test if we assume H_{0} is true, then we reject H_{0} .

The way we test the probability of getting the data is by looking at the sampling distribution of the test statistic , which is set by the null hypothesis .

You will usually be told what significance level to use . Common significance levels include 5\%\;(\alpha)=0.05 and 1\%\;(\alpha)=0.01

Critical Region

The critical region is the set of values of the test statistic that would cause H_{0} to be rejected. The first value inside the critical region is called the critical value . If the test statistic is as extreme or more extreme than the critical value, then we reject H_{0} .

A one tailed test has a single critical region , containing the highest or lowest values. A two tailed test has two critical regions , one containing high values and one containing low values.

You can test whether your data is significant by finding the critical region and seeing if the test statistic falls within it.

Actual Significance Level and p-value

The p-value is the probability of obtaining the results we got if H_{0} is true. If the p-value is less than \alpha (or \dfrac{\alpha}{2} for two tail test) we reject H_{0} . If the p-value is greater than \alpha (or \dfrac{\alpha}{2} for two tail test) we do not reject H_{0} .

The actual significance level is the probability of the data being in the critical region if H_{0} is true. For continuous data, this is the same as the significance level. However, for discrete data, it might differ.

A Level Maths Predicted Papers 2024

The MME A level maths predicted papers are an excellent way to practise, using authentic exam style questions that are unique to our papers. Our examiners have studied A level maths past papers to develop predicted A level maths exam questions in an authentic exam format. The profit from every pack is reinvested into making free content on MME, which benefits millions of learners across the country.

hypothesis testing in statistics a level

A Level Maths Revision Cards

The best A level maths revision cards for AQA, Edexcel, OCR, MEI and WJEC. MME is here to help you prepare effectively for your A Level maths exams. The profit from every pack is reinvested into making free content on MME, which benefits millions of learners across the country.

hypothesis testing in statistics a level

Example 1: Binomial Hypothesis Test

X\sim B(10,p) and we observe x=6

Test, at the 5\% significance level , if p is larger than 0.4

H_{0}: p=0.4

H_{1}: p>0.4

\alpha=0.05

\mathbb{P}(X\geq 6)=0.1662>0.05

So we are more likely than the significance level to get the data we observe.

Hence, do not reject H_{0} . There is not significant evidence to suggest p>0.4 .

( Note: More on binomial hypothesis testing can be found in the section Binomial Distribution Hypothesis Tests)

Example 2: Normal Hypothesis Test

The amount by which the train Nicola takes to work is delayed is normally distributed. Observations over a number of years show this delay has a mean of five minutes and a standard deviation of two minutes. Nicola believes this has changed. If she is delayed by ten minutes on Friday, is she right at the 5\% significance level ?

X\sim N(\mu,2)

H_{0}: \mu=5

H_{1}: \mu\neq 5

Significance level \alpha=0.05

Two tail test so we are looking for a probability less than \dfrac{0.05}{2}=0.025

Observed data: x=10

Reject H_{0} . There is sufficient evidence to suggest that the average delay has changed.

( Note: More on normal hypothesis testing can be found in the section Normal Distribution Hypothesis Tests)

Hypothesis Testing Example Questions

Question 1:  Jane has a die that she believes is biased towards rolling 6. Give hypotheses that could be used to test this.

H_{0}:\mathbb{P}(X=6)=\dfrac{1}{6}

H_{1}:\mathbb{P}(X=6)>\dfrac{1}{6}

Question 2:  Is the following a one tail or two tail hypothesis test?

H_{0}: p=0.5

H_{1}: p\neq 0.5

The alternate hypothesis is p\neq 0.5 so this is a two tail test.

Question 3: How close to the bullseye a dart lands in the dartboard when thrown by Phil has a normal distribution. Phil believes his throws have a mean distance from the bullseye of 1 cm, with a standard deviation of 0.4 cm. He throws a dart and it lands 1.5 cm from the centre. Test, at the 5\% significance level, if he is as good as he says.

X\sim N(\mu,0.4)

H_{0}: \mu=1

H_{1}: \mu>1

Significance level \alpha=5\%

Observed data: x=1.5

Do not reject H_{0} . Insufficient evidence to suggest Phil is not as good at darts as he says.

Question 4:  Consider the hypothesis test on X\sim B(20,p) .

H_{0}: p=0.75

H_{1}: p\leq 0.75

Take the significance level to be 0.05

i) What is the critical region for this test?

ii) What is the actual significance level of this test?

i) The critical region is the region for x for which we reject H_{0} .

Critical region is such that \mathbb{P}(X\leq x)<0.05

In this case, \mathbb{P}(X\leq 12)=0.1018>0.05 while \mathbb{P}(X\leq 11)=0.0409<0.05

So the critical region must be the values 11 and under.

ii) The actual significance is the probability of landing in the critical region so it is \mathbb{P}(X\leq 11)=0.0409

Question 5:  Marsha notices that her neighbourhood seems to contain far more blue cars than would be normal. She finds a statistic online that says nationally, around 4\% of cars are blue. She then observes 50 cars near her house and 5 of them are blue.

Construct a hypothesis test for this at the 5\% level and find whether or not Marsha is right that her neighbourhood contains more blue cars.

X\sim B(50,0.04)

H_{0}: p=0.04

H_{1}: p>0.04

Observed data: x=5

Reject H_{0} . There is sufficient evidence to suggest that there are more blue cars in Marsha’s neighbourhood.

Additional Resources

Exam tips cheat sheet, formula booklet, you may also like....

hypothesis testing in statistics a level

Online exams, practice questions and revision videos for every GCSE level 9-1 topic! No fees, no trial period, just totally free access to the UK’s best GCSE maths revision platform.

By clicking continue and using our website you are consenting to our use of cookies in accordance with our Cookie Policy

MME Revision Challenge

hypothesis testing in statistics a level

Report a Question

You must be logged in to vote for this question..

Your personal data will be used to support your experience throughout this website, to manage access to your account, and for other purposes described in our privacy policy .

Hypothesis Testing

This page looks at Hypothesis testing. Topics include null hypothesis, alternative hypothesis, testing and critical regions.

The parameters of a distribution are those quantities that you need to specify when describing the distribution. For example, a normal distribution has parameters μ and σ 2 and a Poisson distribution has parameter λ.

If we know that some data comes from a certain distribution, but the parameter is unknown, we might try to predict what the parameter is. Hypothesis testing is about working out how likely our predictions are.

The null hypothesis , denoted by H 0 , is a prediction about a parameter (so if we are dealing with a normal distribution, we might predict the mean or the variance of the distribution).

We also have an alternative hypothesis , denoted by H 1 . We then perform a test to decide whether or not we should reject the null hypothesis in favour of the alternative.

Suppose we are given a value and told that it comes from a certain distribution, but we don"t know what the parameter of that distribution is.

Suppose we make a null hypothesis about the parameter. We test how likely it is that the value we were given could have come from the distribution with this predicted parameter.

For example, suppose we are told that the value of 3 has come from a Poisson distribution. We might want to test the null hypothesis that the parameter (which is the mean) of the Poisson distribution is 9. So we work out how likely it is that the value of 3 could have come from a Poisson distribution with parameter 9. If it"s not very likely, we reject the null hypothesis in favour of the alternative.

Critical Region

But what exactly is "not very likely"?

We choose a region known as the critical region . If the result of our test lies in this region, then we reject the null hypothesis in favour of the alternative.

Pass Your GCSE Maths Banner

AS Level Mechanics and Statistics - Hypothesis Testing

Library homepage

  • school Campus Bookshelves
  • menu_book Bookshelves
  • perm_media Learning Objects
  • login Login
  • how_to_reg Request Instructor Account
  • hub Instructor Commons
  • Download Page (PDF)
  • Download Full Book (PDF)
  • Periodic Table
  • Physics Constants
  • Scientific Calculator
  • Reference & Cite
  • Tools expand_more
  • Readability

selected template will load here

This action is not available.

Statistics LibreTexts

1.2: The 7-Step Process of Statistical Hypothesis Testing

  • Last updated
  • Save as PDF
  • Page ID 33320

  • Penn State's Department of Statistics
  • The Pennsylvania State University

We will cover the seven steps one by one.

Step 1: State the Null Hypothesis

The null hypothesis can be thought of as the opposite of the "guess" the researchers made: in this example, the biologist thinks the plant height will be different for the fertilizers. So the null would be that there will be no difference among the groups of plants. Specifically, in more statistical language the null for an ANOVA is that the means are the same. We state the null hypothesis as: \[H_{0}: \ \mu_{1} = \mu_{2} = \ldots = \mu_{T}\] for \(T\) levels of an experimental treatment.

Why do we do this? Why not simply test the working hypothesis directly? The answer lies in the Popperian Principle of Falsification. Karl Popper (a philosopher) discovered that we can't conclusively confirm a hypothesis, but we can conclusively negate one. So we set up a null hypothesis which is effectively the opposite of the working hypothesis. The hope is that based on the strength of the data, we will be able to negate or reject the null hypothesis and accept an alternative hypothesis. In other words, we usually see the working hypothesis in \(H_{A}\).

Step 2: State the Alternative Hypothesis

\[H_{A}: \ \text{treatment level means not all equal}\]

The reason we state the alternative hypothesis this way is that if the null is rejected, there are many possibilities.

For example, \(\mu_{1} \neq \mu_{2} = \ldots = \mu_{T}\) is one possibility, as is \(\mu_{1} = \mu_{2} \neq \mu_{3} = \ldots = \mu_{T}\). Many people make the mistake of stating the alternative hypothesis as \(mu_{1} \neq mu_{2} \neq \ldots \neq \mu_{T}\), which says that every mean differs from every other mean. This is a possibility, but only one of many possibilities. To cover all alternative outcomes, we resort to a verbal statement of "not all equal" and then follow up with mean comparisons to find out where differences among means exist. In our example, this means that fertilizer 1 may result in plants that are really tall, but fertilizers 2, 3, and the plants with no fertilizers don't differ from one another. A simpler way of thinking about this is that at least one mean is different from all others.

Step 3: Set \(\alpha\)

If we look at what can happen in a hypothesis test, we can construct the following contingency table:

You should be familiar with type I and type II errors from your introductory course. It is important to note that we want to set \(\alpha\) before the experiment ( a priori ) because the Type I error is the more grievous error to make. The typical value of \(\alpha\) is 0.05, establishing a 95% confidence level. For this course, we will assume \(\alpha\) =0.05, unless stated otherwise.

Step 4: Collect Data

Remember the importance of recognizing whether data is collected through an experimental design or observational study.

Step 5: Calculate a test statistic

For categorical treatment level means, we use an \(F\) statistic, named after R.A. Fisher. We will explore the mechanics of computing the \(F\) statistic beginning in Chapter 2. The \(F\) value we get from the data is labeled \(F_{\text{calculated}}\).

Step 6: Construct Acceptance / Rejection regions

As with all other test statistics, a threshold (critical) value of \(F\) is established. This \(F\) value can be obtained from statistical tables or software and is referred to as \(F_{\text{critical}}\) or \(F_{\alpha}\). As a reminder, this critical value is the minimum value for the test statistic (in this case the F test) for us to be able to reject the null.

The \(F\) distribution, \(F_{\alpha}\), and the location of acceptance and rejection regions are shown in the graph below:

Graph of the F distribution, with the point F_alpha marked on the x-axis. The area under the curve to the left of this point is marked "Accept null", and the area under the curve to the right of this point is marked "Reject null."

Step 7: Based on steps 5 and 6, draw a conclusion about H0

If the \(F_{\text{\calculated}}\) from the data is larger than the \(F_{\alpha}\), then you are in the rejection region and you can reject the null hypothesis with \((1 - \alpha)\) level of confidence.

Note that modern statistical software condenses steps 6 and 7 by providing a \(p\)-value. The \(p\)-value here is the probability of getting an \(F_{\text{calculated}}\) even greater than what you observe assuming the null hypothesis is true. If by chance, the \(F_{\text{calculated}} = F_{\alpha}\), then the \(p\)-value would exactly equal \(\alpha\). With larger \(F_{\text{calculated}}\) values, we move further into the rejection region and the \(p\) - value becomes less than \(\alpha\). So the decision rule is as follows:

If the \(p\) - value obtained from the ANOVA is less than \(\alpha\), then reject \(H_{0}\) and accept \(H_{A}\).

If you are not familiar with this material, we suggest that you review course materials from your basic statistics course.

Hypothesis Testing (cont...)

Hypothesis testing, the null and alternative hypothesis.

In order to undertake hypothesis testing you need to express your research hypothesis as a null and alternative hypothesis. The null hypothesis and alternative hypothesis are statements regarding the differences or effects that occur in the population. You will use your sample to test which statement (i.e., the null hypothesis or alternative hypothesis) is most likely (although technically, you test the evidence against the null hypothesis). So, with respect to our teaching example, the null and alternative hypothesis will reflect statements about all statistics students on graduate management courses.

The null hypothesis is essentially the "devil's advocate" position. That is, it assumes that whatever you are trying to prove did not happen ( hint: it usually states that something equals zero). For example, the two different teaching methods did not result in different exam performances (i.e., zero difference). Another example might be that there is no relationship between anxiety and athletic performance (i.e., the slope is zero). The alternative hypothesis states the opposite and is usually the hypothesis you are trying to prove (e.g., the two different teaching methods did result in different exam performances). Initially, you can state these hypotheses in more general terms (e.g., using terms like "effect", "relationship", etc.), as shown below for the teaching methods example:

Depending on how you want to "summarize" the exam performances will determine how you might want to write a more specific null and alternative hypothesis. For example, you could compare the mean exam performance of each group (i.e., the "seminar" group and the "lectures-only" group). This is what we will demonstrate here, but other options include comparing the distributions , medians , amongst other things. As such, we can state:

Now that you have identified the null and alternative hypotheses, you need to find evidence and develop a strategy for declaring your "support" for either the null or alternative hypothesis. We can do this using some statistical theory and some arbitrary cut-off points. Both these issues are dealt with next.

Significance levels

The level of statistical significance is often expressed as the so-called p -value . Depending on the statistical test you have chosen, you will calculate a probability (i.e., the p -value) of observing your sample results (or more extreme) given that the null hypothesis is true . Another way of phrasing this is to consider the probability that a difference in a mean score (or other statistic) could have arisen based on the assumption that there really is no difference. Let us consider this statement with respect to our example where we are interested in the difference in mean exam performance between two different teaching methods. If there really is no difference between the two teaching methods in the population (i.e., given that the null hypothesis is true), how likely would it be to see a difference in the mean exam performance between the two teaching methods as large as (or larger than) that which has been observed in your sample?

So, you might get a p -value such as 0.03 (i.e., p = .03). This means that there is a 3% chance of finding a difference as large as (or larger than) the one in your study given that the null hypothesis is true. However, you want to know whether this is "statistically significant". Typically, if there was a 5% or less chance (5 times in 100 or less) that the difference in the mean exam performance between the two teaching methods (or whatever statistic you are using) is as different as observed given the null hypothesis is true, you would reject the null hypothesis and accept the alternative hypothesis. Alternately, if the chance was greater than 5% (5 times in 100 or more), you would fail to reject the null hypothesis and would not accept the alternative hypothesis. As such, in this example where p = .03, we would reject the null hypothesis and accept the alternative hypothesis. We reject it because at a significance level of 0.03 (i.e., less than a 5% chance), the result we obtained could happen too frequently for us to be confident that it was the two teaching methods that had an effect on exam performance.

Whilst there is relatively little justification why a significance level of 0.05 is used rather than 0.01 or 0.10, for example, it is widely used in academic research. However, if you want to be particularly confident in your results, you can set a more stringent level of 0.01 (a 1% chance or less; 1 in 100 chance or less).

Testimonials

One- and two-tailed predictions

When considering whether we reject the null hypothesis and accept the alternative hypothesis, we need to consider the direction of the alternative hypothesis statement. For example, the alternative hypothesis that was stated earlier is:

The alternative hypothesis tells us two things. First, what predictions did we make about the effect of the independent variable(s) on the dependent variable(s)? Second, what was the predicted direction of this effect? Let's use our example to highlight these two points.

Sarah predicted that her teaching method (independent variable: teaching method), whereby she not only required her students to attend lectures, but also seminars, would have a positive effect (that is, increased) students' performance (dependent variable: exam marks). If an alternative hypothesis has a direction (and this is how you want to test it), the hypothesis is one-tailed. That is, it predicts direction of the effect. If the alternative hypothesis has stated that the effect was expected to be negative, this is also a one-tailed hypothesis.

Alternatively, a two-tailed prediction means that we do not make a choice over the direction that the effect of the experiment takes. Rather, it simply implies that the effect could be negative or positive. If Sarah had made a two-tailed prediction, the alternative hypothesis might have been:

In other words, we simply take out the word "positive", which implies the direction of our effect. In our example, making a two-tailed prediction may seem strange. After all, it would be logical to expect that "extra" tuition (going to seminar classes as well as lectures) would either have a positive effect on students' performance or no effect at all, but certainly not a negative effect. However, this is just our opinion (and hope) and certainly does not mean that we will get the effect we expect. Generally speaking, making a one-tail prediction (i.e., and testing for it this way) is frowned upon as it usually reflects the hope of a researcher rather than any certainty that it will happen. Notable exceptions to this rule are when there is only one possible way in which a change could occur. This can happen, for example, when biological activity/presence in measured. That is, a protein might be "dormant" and the stimulus you are using can only possibly "wake it up" (i.e., it cannot possibly reduce the activity of a "dormant" protein). In addition, for some statistical tests, one-tailed tests are not possible.

Rejecting or failing to reject the null hypothesis

Let's return finally to the question of whether we reject or fail to reject the null hypothesis.

If our statistical analysis shows that the significance level is below the cut-off value we have set (e.g., either 0.05 or 0.01), we reject the null hypothesis and accept the alternative hypothesis. Alternatively, if the significance level is above the cut-off value, we fail to reject the null hypothesis and cannot accept the alternative hypothesis. You should note that you cannot accept the null hypothesis, but only find evidence against it.

Statology

Statistics Made Easy

Introduction to Hypothesis Testing

A statistical hypothesis is an assumption about a population parameter .

For example, we may assume that the mean height of a male in the U.S. is 70 inches.

The assumption about the height is the statistical hypothesis and the true mean height of a male in the U.S. is the population parameter .

A hypothesis test is a formal statistical test we use to reject or fail to reject a statistical hypothesis.

The Two Types of Statistical Hypotheses

To test whether a statistical hypothesis about a population parameter is true, we obtain a random sample from the population and perform a hypothesis test on the sample data.

There are two types of statistical hypotheses:

The null hypothesis , denoted as H 0 , is the hypothesis that the sample data occurs purely from chance.

The alternative hypothesis , denoted as H 1 or H a , is the hypothesis that the sample data is influenced by some non-random cause.

Hypothesis Tests

A hypothesis test consists of five steps:

1. State the hypotheses. 

State the null and alternative hypotheses. These two hypotheses need to be mutually exclusive, so if one is true then the other must be false.

2. Determine a significance level to use for the hypothesis.

Decide on a significance level. Common choices are .01, .05, and .1. 

3. Find the test statistic.

Find the test statistic and the corresponding p-value. Often we are analyzing a population mean or proportion and the general formula to find the test statistic is: (sample statistic – population parameter) / (standard deviation of statistic)

4. Reject or fail to reject the null hypothesis.

Using the test statistic or the p-value, determine if you can reject or fail to reject the null hypothesis based on the significance level.

The p-value  tells us the strength of evidence in support of a null hypothesis. If the p-value is less than the significance level, we reject the null hypothesis.

5. Interpret the results. 

Interpret the results of the hypothesis test in the context of the question being asked. 

The Two Types of Decision Errors

There are two types of decision errors that one can make when doing a hypothesis test:

Type I error: You reject the null hypothesis when it is actually true. The probability of committing a Type I error is equal to the significance level, often called  alpha , and denoted as α.

Type II error: You fail to reject the null hypothesis when it is actually false. The probability of committing a Type II error is called the Power of the test or  Beta , denoted as β.

One-Tailed and Two-Tailed Tests

A statistical hypothesis can be one-tailed or two-tailed.

A one-tailed hypothesis involves making a “greater than” or “less than ” statement.

For example, suppose we assume the mean height of a male in the U.S. is greater than or equal to 70 inches. The null hypothesis would be H0: µ ≥ 70 inches and the alternative hypothesis would be Ha: µ < 70 inches.

A two-tailed hypothesis involves making an “equal to” or “not equal to” statement.

For example, suppose we assume the mean height of a male in the U.S. is equal to 70 inches. The null hypothesis would be H0: µ = 70 inches and the alternative hypothesis would be Ha: µ ≠ 70 inches.

Note: The “equal” sign is always included in the null hypothesis, whether it is =, ≥, or ≤.

Related:   What is a Directional Hypothesis?

Types of Hypothesis Tests

There are many different types of hypothesis tests you can perform depending on the type of data you’re working with and the goal of your analysis.

The following tutorials provide an explanation of the most common types of hypothesis tests:

Introduction to the One Sample t-test Introduction to the Two Sample t-test Introduction to the Paired Samples t-test Introduction to the One Proportion Z-Test Introduction to the Two Proportion Z-Test

hypothesis testing in statistics a level

Hey there. My name is Zach Bobbitt. I have a Master of Science degree in Applied Statistics and I’ve worked on machine learning algorithms for professional businesses in both healthcare and retail. I’m passionate about statistics, machine learning, and data visualization and I created Statology to be a resource for both students and teachers alike.  My goal with this site is to help you learn statistics through using simple terms, plenty of real-world examples, and helpful illustrations.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

User Preferences

Content preview.

Arcu felis bibendum ut tristique et egestas quis:

  • Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris
  • Duis aute irure dolor in reprehenderit in voluptate
  • Excepteur sint occaecat cupidatat non proident

Keyboard Shortcuts

Hypothesis testing.

Key Topics:

  • Basic approach
  • Null and alternative hypothesis
  • Decision making and the p -value
  • Z-test & Nonparametric alternative

Basic approach to hypothesis testing

  • State a model describing the relationship between the explanatory variables and the outcome variable(s) in the population and the nature of the variability. State all of your assumptions .
  • Specify the null and alternative hypotheses in terms of the parameters of the model.
  • Invent a test statistic that will tend to be different under the null and alternative hypotheses.
  • Using the assumptions of step 1, find the theoretical sampling distribution of the statistic under the null hypothesis of step 2. Ideally the form of the sampling distribution should be one of the “standard distributions”(e.g. normal, t , binomial..)
  • Calculate a p -value , as the area under the sampling distribution more extreme than your statistic. Depends on the form of the alternative hypothesis.
  • Choose your acceptable type 1 error rate (alpha) and apply the decision rule : reject the null hypothesis if the p-value is less than alpha, otherwise do not reject.
  • \(\frac{\bar{X}-\mu_0}{\sigma / \sqrt{n}}\)
  • general form is: (estimate - value we are testing)/(st.dev of the estimate)
  • z-statistic follows N(0,1) distribution
  • 2 × the area above |z|, area above z,or area below z, or
  • compare the statistic to a critical value, |z| ≥ z α/2 , z ≥ z α , or z ≤ - z α
  • Choose the acceptable level of Alpha = 0.05, we conclude …. ?

Making the Decision

It is either likely or unlikely that we would collect the evidence we did given the initial assumption. (Note: “likely” or “unlikely” is measured by calculating a probability!)

If it is likely , then we “ do not reject ” our initial assumption. There is not enough evidence to do otherwise.

If it is unlikely , then:

  • either our initial assumption is correct and we experienced an unusual event or,
  • our initial assumption is incorrect

In statistics, if it is unlikely, we decide to “ reject ” our initial assumption.

Example: Criminal Trial Analogy

First, state 2 hypotheses, the null hypothesis (“H 0 ”) and the alternative hypothesis (“H A ”)

  • H 0 : Defendant is not guilty.
  • H A : Defendant is guilty.

Usually the H 0 is a statement of “no effect”, or “no change”, or “chance only” about a population parameter.

While the H A , depending on the situation, is that there is a difference, trend, effect, or a relationship with respect to a population parameter.

  • It can one-sided and two-sided.
  • In two-sided we only care there is a difference, but not the direction of it. In one-sided we care about a particular direction of the relationship. We want to know if the value is strictly larger or smaller.

Then, collect evidence, such as finger prints, blood spots, hair samples, carpet fibers, shoe prints, ransom notes, handwriting samples, etc. (In statistics, the data are the evidence.)

Next, you make your initial assumption.

  • Defendant is innocent until proven guilty.

In statistics, we always assume the null hypothesis is true .

Then, make a decision based on the available evidence.

  • If there is sufficient evidence (“beyond a reasonable doubt”), reject the null hypothesis . (Behave as if defendant is guilty.)
  • If there is not enough evidence, do not reject the null hypothesis . (Behave as if defendant is not guilty.)

If the observed outcome, e.g., a sample statistic, is surprising under the assumption that the null hypothesis is true, but more probable if the alternative is true, then this outcome is evidence against H 0 and in favor of H A .

An observed effect so large that it would rarely occur by chance is called statistically significant (i.e., not likely to happen by chance).

Using the p -value to make the decision

The p -value represents how likely we would be to observe such an extreme sample if the null hypothesis were true. The p -value is a probability computed assuming the null hypothesis is true, that the test statistic would take a value as extreme or more extreme than that actually observed. Since it's a probability, it is a number between 0 and 1. The closer the number is to 0 means the event is “unlikely.” So if p -value is “small,” (typically, less than 0.05), we can then reject the null hypothesis.

Significance level and p -value

Significance level, α, is a decisive value for p -value. In this context, significant does not mean “important”, but it means “not likely to happened just by chance”.

α is the maximum probability of rejecting the null hypothesis when the null hypothesis is true. If α = 1 we always reject the null, if α = 0 we never reject the null hypothesis. In articles, journals, etc… you may read: “The results were significant ( p <0.05).” So if p =0.03, it's significant at the level of α = 0.05 but not at the level of α = 0.01. If we reject the H 0 at the level of α = 0.05 (which corresponds to 95% CI), we are saying that if H 0 is true, the observed phenomenon would happen no more than 5% of the time (that is 1 in 20). If we choose to compare the p -value to α = 0.01, we are insisting on a stronger evidence!

So, what kind of error could we make? No matter what decision we make, there is always a chance we made an error.

Errors in Criminal Trial:

Errors in Hypothesis Testing

Type I error (False positive): The null hypothesis is rejected when it is true.

  • α is the maximum probability of making a Type I error.

Type II error (False negative): The null hypothesis is not rejected when it is false.

  • β is the probability of making a Type II error

There is always a chance of making one of these errors. But, a good scientific study will minimize the chance of doing so!

The power of a statistical test is its probability of rejecting the null hypothesis if the null hypothesis is false. That is, power is the ability to correctly reject H 0 and detect a significant effect. In other words, power is one minus the type II error risk.

\(\text{Power }=1-\beta = P\left(\text{reject} H_0 | H_0 \text{is false } \right)\)

Which error is worse?

Type I = you are innocent, yet accused of cheating on the test. Type II = you cheated on the test, but you are found innocent.

This depends on the context of the problem too. But in most cases scientists are trying to be “conservative”; it's worse to make a spurious discovery than to fail to make a good one. Our goal it to increase the power of the test that is to minimize the length of the CI.

We need to keep in mind:

  • the effect of the sample size,
  • the correctness of the underlying assumptions about the population,
  • statistical vs. practical significance, etc…

(see the handout). To study the tradeoffs between the sample size, α, and Type II error we can use power and operating characteristic curves.

What type of error might we have made?

Type I error is claiming that average student height is not 65 inches, when it really is. Type II error is failing to claim that the average student height is not 65in when it is.

We rejected the null hypothesis, i.e., claimed that the height is not 65, thus making potentially a Type I error. But sometimes the p -value is too low because of the large sample size, and we may have statistical significance but not really practical significance! That's why most statisticians are much more comfortable with using CI than tests.

There is a need for a further generalization. What if we can't assume that σ is known? In this case we would use s (the sample standard deviation) to estimate σ.

If the sample is very large, we can treat σ as known by assuming that σ = s . According to the law of large numbers, this is not too bad a thing to do. But if the sample is small, the fact that we have to estimate both the standard deviation and the mean adds extra uncertainty to our inference. In practice this means that we need a larger multiplier for the standard error.

We need one-sample t -test.

One sample t -test

  • Assume data are independently sampled from a normal distribution with unknown mean μ and variance σ 2 . Make an initial assumption, μ 0 .
  • t-statistic: \(\frac{\bar{X}-\mu_0}{s / \sqrt{n}}\) where s is a sample st.dev.
  • t-statistic follows t -distribution with df = n - 1
  • Alpha = 0.05, we conclude ….

Testing for the population proportion

Let's go back to our CNN poll. Assume we have a SRS of 1,017 adults.

We are interested in testing the following hypothesis: H 0 : p = 0.50 vs. p > 0.50

What is the test statistic?

If alpha = 0.05, what do we conclude?

We will see more details in the next lesson on proportions, then distributions, and possible tests.

A Level Maths

A Level Maths

Maths A-Level Resources for AQA, OCR and Edexcel

A-Level Maths Statistical Hypothesis Testing

Everything you need to know about statistical hypothesis testing for A-Level Maths. Save countless hours of time!

Statistical Hypothesis Testing Topics for A-Level Maths

This module will teach you the following:

Hypothesis testing in a binomial distribution

Hypothesis testing in a normal distribution, hypothesis test using pearson’s correlation coefficient, what’s included.

We’ve created 52 modules covering every Maths topic needed for A level, and each module contains:

  • An editable PowerPoint lesson presentation
  • Editable revision handouts
  • A glossary which covers the key terminologies of the module
  • Topical mind maps for visualising the key concepts
  • Printable flashcards to help students engage active recall and confidence-based repetition
  • A quiz with accompanying answer key to test knowledge and understanding of the module

As a premium member, once rolled out you get access to the entire library of A-Level Maths resources. For now, we have made the first five topics completely free of charge for you to get a taste of what’s to come.

A Level Maths Resources Mapped by Exam Board

Once completed our modules can be used with both UK and international A Level examination board specifications.

We will put together comprehensive mapping documents which will show you exactly which modules align to the specification you are teaching or learning.

hypothesis testing in statistics a level

Statistical Hypothesis Testing Topics:

Want to pass your maths a-levels.

Join the waiting list now, get the 5 free chapters of our COMPREHENSIVE A-LEVEL MATHS COURSE and we’ll help you to pass your exams!

IMAGES

  1. PPT

    hypothesis testing in statistics a level

  2. PPT

    hypothesis testing in statistics a level

  3. PPT

    hypothesis testing in statistics a level

  4. Hypothesis Testing Solved Examples(Questions and Solutions)

    hypothesis testing in statistics a level

  5. Statistical Hypothesis Testing: Step by Step

    hypothesis testing in statistics a level

  6. PPT

    hypothesis testing in statistics a level

VIDEO

  1. A-Level Maths: O3-05 Sample Means: Hypothesis Test Example 4

  2. FA II STATISTICS/ Chapter no 7 / Testing of hypothesis/ Z distribution / Example 7.8

  3. 8a. Introduction to Hypothesis Testing

  4. Module8: Hypothesis Testing Sigma Unknown

  5. ONE SAMPLE HYPOTHESIS TESTING

  6. HYPOTHESIS TESTING WITH NORMAL DISTRIBUTION

COMMENTS

  1. 5.1.1 Hypothesis Testing

    A hypothesis test is carried out at the 5% level of significance to test if a normal coin is fair or not. (i) Describe what the population parameter could be for the hypothesis test. (ii) State whether the hypothesis test should be a one-tailed test or a two-tailed test, give a reason for your answer. (iii)

  2. Hypothesis Testing

    Table of contents. Step 1: State your null and alternate hypothesis. Step 2: Collect data. Step 3: Perform a statistical test. Step 4: Decide whether to reject or fail to reject your null hypothesis. Step 5: Present your findings. Other interesting articles. Frequently asked questions about hypothesis testing.

  3. Significance tests (hypothesis testing)

    Simple hypothesis testing Get 3 of 4 questions to level up! ... When to use z or t statistics in significance tests (Opens a modal) Example calculating t statistic for a test about a mean ... Level up on all the skills in this unit and collect up to 1,500 Mastery points!

  4. Hypothesis Testing

    A hypothesis test is the means by which we generate a test statistic that directs us to either reject or not reject the null hypothesis. The test statistic is a "summary" of the collected data, and should have a sampling distribution specified by the null hypothesis. A Level AQA Edexcel OCR.

  5. Hypothesis Testing

    Now we carry out the above steps in order to come to a conclusion. Step 1: We state the null hypothesis and the alternate hypothesis: and. Step 2: We select the level of significance which is stated in the problem as 5% or α = 0.05. Step 3: Compute the test statistics. We first identify the test to be used.

  6. Hypothesis Testing

    We choose a region known as the critical region. If the result of our test lies in this region, then we reject the null hypothesis in favour of the alternative. Hypothesis testing A-Level Maths Statistics revision looking at Hypothesis testing. Topics include null hypothesis, alternative hypothesis, testing and critical regions.

  7. 9.1: Introduction to Hypothesis Testing

    In hypothesis testing, the goal is to see if there is sufficient statistical evidence to reject a presumed null hypothesis in favor of a conjectured alternative hypothesis.The null hypothesis is usually denoted \(H_0\) while the alternative hypothesis is usually denoted \(H_1\). An hypothesis test is a statistical decision; the conclusion will either be to reject the null hypothesis in favor ...

  8. Maths Genie

    AS Level Mechanics and Statistics - Hypothesis Testing. Maths revision videos and notes on the topics of hypothesis testing, correlation hypothesis testing, mean of normal distribution hypothesis testing and non linear regression.

  9. PDF Hypothesis Testing Cheat Sheet

    The actual significance 5.level of a hypothesis test is the probability of incorrectly rejecting the null hypothesis. Example 2: A single observation is taken from the binomial distribution ) B(6,$). The observation is used to test H!:$ = 0.35 against H ":$ > 0.35 a. Using a 5% significance level, find the critical region for this test. Assume H

  10. Statistics: Hypothesis Testing

    Hypothesis testing involves using the null hypothesis and the alternative hypothesis in order to assess whether an assumption is true. This video takes a loo...

  11. 1.2: The 7-Step Process of Statistical Hypothesis Testing

    Step 7: Based on steps 5 and 6, draw a conclusion about H0. If the F\calculated F \calculated from the data is larger than the Fα F α, then you are in the rejection region and you can reject the null hypothesis with (1 − α) ( 1 − α) level of confidence. Note that modern statistical software condenses steps 6 and 7 by providing a p p -value.

  12. Hypothesis Testing

    Let's return finally to the question of whether we reject or fail to reject the null hypothesis. If our statistical analysis shows that the significance level is below the cut-off value we have set (e.g., either 0.05 or 0.01), we reject the null hypothesis and accept the alternative hypothesis. Alternatively, if the significance level is above ...

  13. Introduction to Hypothesis Testing

    A hypothesis test consists of five steps: 1. State the hypotheses. State the null and alternative hypotheses. These two hypotheses need to be mutually exclusive, so if one is true then the other must be false. 2. Determine a significance level to use for the hypothesis. Decide on a significance level.

  14. Edexcel A Level Maths Statistics 1

    Probably the toughest chapter for the 1st year material of Edexcel's A Level Maths and Statistics material.This chapter takes a look at Hypothesis Testing fo...

  15. Hypothesis Testing

    Using the p-value to make the decision. The p-value represents how likely we would be to observe such an extreme sample if the null hypothesis were true. The p-value is a probability computed assuming the null hypothesis is true, that the test statistic would take a value as extreme or more extreme than that actually observed. Since it's a probability, it is a number between 0 and 1.

  16. Statistical Hypothesis Testing

    Statistical Hypothesis Testing Topics for A-Level Maths. This module will teach you the following: Hypothesis testing in a binomial distribution. Hypothesis testing in a normal distribution. Hypothesis test using Pearson's correlation coefficient. Download the Sample Chapters →.

  17. CIE A Level Maths: Probability & Statistics 2

    Revision notes on 3.1.2 Type I & Type II Errors for the CIE A Level Maths: Probability & Statistics 2 syllabus, written by the Maths experts at Save My Exams. ... Any hypothesis test will only provide evidence about whether a parameter has changed or not. A conclusion can not claim with certainty whether to accept or reject the null hypothesis ...

  18. PDF Stats 2 Hypothesis Testing Questions

    2-tailed test crit under Ho Allow use Of t method AWFW 0.99 to 1.00 (allow 1) Or z = 1.96 On their z and critical value Or t Accept Ho at 5% level Of significance. Sufficient evidence at the 5% level Of significance to support the manufacturer's belief. Total 6(a) 471 = 94.2 - 6.058 4 1 -tailed test --2.132 Ho Hi : p < 100 94.2-100 --2.14 6.058