Making a t interval for paired data
In some studies, we make two observations on the same individual. For instance, we might look at each student's pre-test and post-test scores in a course. In other studies, we might make an observation on each of two similar individuals. For example, some medicine trials involve pairing similar subjects so one receives the medicine and the other receives a placebo.
In both types of studies, we're working with paired data, and whenever we're working with paired data, we're typically interested in the difference between each pair—for example, the difference between the pre-test and the post-test data, or the difference between the medicine and the placebo data.
If certain conditions are met, we can construct a t interval to estimate the mean of these differences and draw conclusions.
In this article, we'll be going through two examples of making a t interval for paired data. Importantly, you'll have a chance to work through the second example on your own to ensure you've picked up on the main ideas.
Example 1
A running magazine wanted to review two watches, watch A and watch B, that use the Global Positioning System (GPS) to calculate the distance someone runs. They noticed that the watches didn't usually agree on the distance someone traveled in a given run.
The magazine took a random sample of 5 subscribers and asked them to run a 10-kilometer route wearing both watches at the same time (they all agreed to participate). At the end of their runs, the participants recorded the distance each watch said they traveled. Here are the data (all distances are in kilometers):
Runner | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|
Watch A | 9.8 | 9.8 | 10.1 | 10.1 | 10.2 |
Watch B | 10.1 | 10.0 | 10.2 | 9.9 | 10.1 |
Construct a 95% confidence interval to estimate the mean difference in the distances reported by these watches. Does the interval suggest that there is a difference between the two watches?
Step 1: Calculate the differences
Even though it appears we have two sets of data (watch A and watch B), these data didn't come from two independent samples. The magazine took a single sample of 5 runners, and each runner wore both watches, so this is a matched pairs design. The one set of data we're interested in is the difference between watch A and watch B for each runner. Let's define this variable as $\text{difference} = \text{B} - \text{A}$ and calculate the difference for each runner:
Runner | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|
Watch A | 9.8 | 9.8 | 10.1 | 10.1 | 10.2 |
Watch B | 10.1 | 10.0 | 10.2 | 9.9 | 10.1 |
Difference ($\text{B} - \text{A}$) | 0.3 | 0.2 | 0.1 | -0.2 | -0.1 |
Key idea: When dealing with paired data, we're most interested in the distribution of the differences.
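As a minimal sketch of this step, the differences can be computed pairwise in Python. The variable names (`watch_a`, `watch_b`, `differences`) are illustrative choices, not part of the original article, and the rounding is an assumption that the data are recorded to the nearest 0.1 km.

```python
# Distances (km) reported by each watch, copied from the table above.
watch_a = [9.8, 9.8, 10.1, 10.1, 10.2]
watch_b = [10.1, 10.0, 10.2, 9.9, 10.1]

# difference = B - A for each runner. Rounding to 1 decimal place hides
# floating-point noise (the data are recorded to the nearest 0.1 km).
differences = [round(b - a, 1) for a, b in zip(watch_a, watch_b)]

print(differences)  # [0.3, 0.2, 0.1, -0.2, -0.1]
```

Pairing the values with `zip` keeps each runner's two measurements together, which is the point of a matched pairs design.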
Step 2: Check conditions
We want to use these $n = 5$ differences to construct a confidence interval for the mean difference. Since we don't know the population standard deviation of the differences, we'll have to use the sample standard deviation in its place. This makes it appropriate to use a t interval instead of a z interval to estimate the mean difference. Let's check the conditions for making a t interval.
- Random: The magazine took a random sample of their subscribers.
- Normal: Since our sample of $n = 5$ runners is small, we need to plot the data. The differences are roughly symmetric with no outliers, so it should be safe to proceed.
- Independent: It's reasonable to assume independence between each runner's measurements. They were randomly selected, and they shouldn't influence each other's results.
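For the Normal condition, a quick way to look at the differences is a rough text dot plot. The sketch below is only an illustration: `text_dot_plot` is a hypothetical helper, not a library function, and a plot like this can reveal strong skew or outliers but cannot prove that the population of differences is normal.

```python
from collections import Counter

# Differences (B - A, in km) from the table above.
differences = [0.3, 0.2, 0.1, -0.2, -0.1]

def text_dot_plot(values, step=0.1):
    """Return a crude dot plot: one row per grid value, one '*' per observation."""
    # Work in integer multiples of `step` to avoid using floats as keys.
    ticks = [round(v / step) for v in values]
    counts = Counter(ticks)
    rows = []
    for tick in range(min(ticks), max(ticks) + 1):
        rows.append(f"{tick * step:+.1f} | " + "*" * counts[tick])
    return "\n".join(rows)

print(text_dot_plot(differences))
```

Each observed value gets one star, and the empty row at 0.0 shows that no runner's watches agreed exactly.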
Step 3: Construct the interval
Here are the data:
Runner | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|
Watch A | 9.8 | 9.8 | 10.1 | 10.1 | 10.2 |
Watch B | 10.1 | 10.0 | 10.2 | 9.9 | 10.1 |
Difference ($\text{B} - \text{A}$) | 0.3 | 0.2 | 0.1 | -0.2 | -0.1 |
Here are the summary statistics:
Variable | Mean | Standard deviation |
---|---|---|
Watch A | $\bar{x}_\text{A} = 10.00$ | $s_\text{A} \approx 0.19$ |
Watch B | $\bar{x}_\text{B} = 10.06$ | $s_\text{B} \approx 0.11$ |
Difference ($\text{B} - \text{A}$) | $\bar{x}_\text{Diff} = 0.06$ | $s_\text{Diff} \approx 0.21$ |
Since we want to construct a confidence interval for the mean difference, we only need the summary statistics for the differences.
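The summary statistics can be reproduced with Python's standard `statistics` module. This is a sketch for checking the table, with variable names chosen here for illustration; note that `stdev` computes the sample standard deviation (dividing by $n - 1$), which is the one a t interval needs.

```python
from statistics import mean, stdev

watch_a = [9.8, 9.8, 10.1, 10.1, 10.2]
watch_b = [10.1, 10.0, 10.2, 9.9, 10.1]
differences = [b - a for a, b in zip(watch_a, watch_b)]

datasets = [
    ("Watch A", watch_a),
    ("Watch B", watch_b),
    ("Difference (B - A)", differences),
]
for label, data in datasets:
    # stdev() is the sample standard deviation (divides by n - 1).
    print(f"{label}: mean = {mean(data):.2f}, s = {stdev(data):.2f}")
```

Only the last row, for the differences, feeds into the interval.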
We'll use the formula for a one-sample t interval for a mean:

$$\bar{x}_\text{Diff} \pm t^* \cdot \frac{s_\text{Diff}}{\sqrt{n}}$$

Components of formula:
Our statistic is the sample mean $\bar{x}_\text{Diff} = 0.06 \text{ km}$.
Our sample size is $n = 5$ runners.
Our sample standard deviation is $s_\text{Diff} = 0.21 \text{ km}$.
Our degrees of freedom is $\text{df} = 5 - 1 = 4$, so for 95% confidence our critical value is $t^* = 2.776$.
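One way to picture the critical-value step is as a table lookup keyed by degrees of freedom. The sketch below hard-codes a small excerpt of a standard two-sided 95% t table. The names `T_STAR_95` and `critical_t_95` are hypothetical, and a statistics package or a full printed table would normally replace this excerpt.

```python
# Two-sided 95% critical values t* for small degrees of freedom,
# taken from a standard t table (a small excerpt, not a full table).
T_STAR_95 = {1: 12.706, 2: 4.303, 3: 3.182, 4: 2.776, 5: 2.571}

def critical_t_95(n):
    """Look up t* for a 95% interval from a sample of size n (df = n - 1)."""
    df = n - 1
    if df not in T_STAR_95:
        raise ValueError(f"df = {df} is outside this small excerpt of the table")
    return T_STAR_95[df]

print(critical_t_95(5))  # df = 4, so t* = 2.776
```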
Computations:

$$0.06 \pm 2.776 \cdot \frac{0.21}{\sqrt{5}} \approx 0.06 \pm 0.26$$

Interval $\approx (-0.20, 0.32)$
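The same arithmetic can be sketched in Python using the components listed above. The variable names are illustrative, and the rounded summary values from the article are used as inputs, so the result matches to two decimal places.

```python
from math import sqrt

# Values taken from the components listed above.
x_bar_diff = 0.06   # sample mean of the differences (km)
s_diff = 0.21       # sample standard deviation of the differences (km)
n = 5               # number of runners
t_star = 2.776      # critical value for 95% confidence, df = 4

standard_error = s_diff / sqrt(n)
margin_of_error = t_star * standard_error
lower = x_bar_diff - margin_of_error
upper = x_bar_diff + margin_of_error

print(f"margin of error = {margin_of_error:.2f} km")  # 0.26 km
print(f"interval = ({lower:.2f}, {upper:.2f})")       # (-0.20, 0.32)
```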
Step 4: Interpret the interval
Does the interval suggest that there is a difference between the two watches?
We're 95% confident that the interval $(-0.20, 0.32)$ captures the mean difference between the distances (in kilometers) reported by the watches on this sort of run. Notice that the interval contains 0 km, which represents no difference, so it's plausible that there is no difference between the distances reported by Watch A and Watch B.
If the entire interval had been above 0 (all positive values), or if it had been entirely below 0 (all negative values), then it would have suggested a difference between the two watches.
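The decision rule in the last two paragraphs can be written as a small check on the interval's endpoints. This is a sketch of that reasoning only; `describe_interval` is a hypothetical helper name and the wording of its return values is illustrative.

```python
def describe_interval(lower, upper):
    """Classify a confidence interval for a mean difference relative to 0."""
    if lower > 0:
        return "entirely above 0: suggests a positive mean difference"
    if upper < 0:
        return "entirely below 0: suggests a negative mean difference"
    return "contains 0: no difference is plausible"

print(describe_interval(-0.20, 0.32))  # contains 0: no difference is plausible
```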
Example 2—Try it!
An educational website offers a practice program for the Law School Admissions Test (LSAT). Users of the program take a pretest and posttest. Here are the scores and gains for a random sample of 6 users:
User | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|
Pre | 140 | 152 | 153 | 159 | 150 | 146 |
Post | 150 | 159 | 170 | 164 | 148 | 166 |
Gain ($\text{post} - \text{pre}$) | 10 | 7 | 17 | 5 | -2 | 20 |
Here are summary statistics:
Variable | Mean | Standard deviation |
---|---|---|
Pre | $\bar{x}_\text{pre} = 150$ | $s_\text{pre} \approx 6.48$ |
Post | $\bar{x}_\text{post} = 159.5$ | $s_\text{post} \approx 8.89$ |
Gain ($\text{post} - \text{pre}$) | $\bar{x}_\text{gain} = 9.5$ | $s_\text{gain} \approx 8.07$ |
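As in Example 1, the gains and their summary statistics can be checked with a few lines of Python. This sketch only reproduces the tables above, with illustrative variable names. It deliberately stops short of building the interval, which is left as the exercise.

```python
from statistics import mean, stdev

pre = [140, 152, 153, 159, 150, 146]
post = [150, 159, 170, 164, 148, 166]

# gain = post - pre for each user.
gains = [after - before for before, after in zip(pre, post)]

print(gains)  # [10, 7, 17, 5, -2, 20]
print(f"mean gain = {mean(gains):.1f}, s = {stdev(gains):.2f}")  # 9.5 and 8.07
```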
The makers of the website say that this interval provides strong evidence that using their program will cause an increase in a user's LSAT score.