T test pdf - د. جواد

Community

Lect. 3

Total lect. 10

T-test

Dr. Jawad Al-Deewan

Sampling Distributions of a Mean

The sampling distributions of a mean (SDM)

describes the behavior of a sampling mean











where

Example: two-sample t-test

• In 1980, some researchers reported that “men have

more mathematical ability than women” as
evidenced by the 1979 SAT’s, where a sample of 30
random male adolescents had a mean score ± 1
standard deviation of 436±77 and 30 random female
adolescents scored lower: 416±81 (genders were
similar in educational backgrounds, socio-economic
status, and age). Do you agree with the authors’
conclusions?

Data Summary

Sample

Mean

Sample

Standard

Deviation

Group 1:
women

416

Group 2:
men

436

Two-sample t-test

1. Define your hypotheses (null, alternative)
H

: ♂-♀ math = 0

Ha: ♂-♀ math ≠ 0 [two-sided]

Two-sample t-test

2. Specify your null distribution:

F and M have similar standard

deviations/variances, so make a “pooled”

estimate of variance.

6245

)

(

)

(

)

(

)

(



















)

6245

(





6245





Two-sample t-test

4. Calculate the p-value of what you observed







0.3311563454

5. Do not reject null! No evidence that men are better
in math ;)

Compare the Computed Test Statistic

Against a Tabled Value

If |t

| > |t

| Reject H

If p value < α Reject H

Available data

• For a portion of the study, a pair of doctors were

shown the same set of tumor pictures. The volume
of the tumor was measured by two separate
physicians under similar conditions.

• Question of interest: Did the measurements from the

two physicians significantly differ?

• If not, then there would be no evidence that the

volume measurements change based on physician.

• 20 scans were measured

by each physician (10 are
shown here)

• Measurements in cm

• What can you say about

these samples?

– Two measurement on the

same person

– They are related so we

must account for this

– Much research in statistics

deals with how to handle
correlated data, but in this
case it is pretty easy

Tumor

Dr. 1

Dr. 2

15.8

17.2

22.3

20.3

14.5

14.2

15.7

18.5

26.8

28.0

24.0

24.8

21.8

20.3

23.0

25.4

29.3

27.5

20.5

19.7

Dependent sample

• We can measure the effect

of the treatment in each
person by taking the
difference

• Instead of having two

samples, we can consider
our dataset to be one
sample of differences

– Just like the one sample

problem

Tumor Dr. 1 Dr. 2 Difference
1

15.8 17.2 -1.4

22.3 20.3 2.0

14.5 14.2 0.3

15.7 18.5 -2.8

26.8 28.0 -1.2

24.0 24.8 -0.8

21.8 20.3 1.5

23.0 25.4 -2.4

29.3 27.5 1.8

20.5 19.7 0.8





Differences

• Volume from Dr. 1

– Population mean:
– Sample mean:

• Volume from Dr. 2

– Population mean:
– Sample mean:

• Difference

– Population mean:

– Sample mean:















Distribution of differences

• Assuming d

’s are normally distributed, can use t-

distribution with n-1 dof where n is the number of
differences

• Standard deviation of differences

• Test statistic acts just like one sample



















Paired t-test

1) Null hypothesis: No difference between physicians effect

2) Two dependent samples; alpha=0.05
3) Test statistic: t-statistic with dof

4) p-value=0.53
5) Fail to reject null hypothesis
6) Conclusion: there is no evidence of a difference in tumor

volume measurement based on physician

















646















A researcher investigate whether children exhibit
a higher number of aggressive acts after watching
a violent television show. The number of
aggressive acts for the same 10 participants
before and after watching the show are as
follows:
(a) Subtracting before-scores from after-scores,
what are H

and H

? (b) Compute t

obt

. (c) With

.05, what is t

crit

? (d) What should the researcher

conclude about this relationship? (e) Compute
the appropriate confidence interval. (f) If you
want to understand children’s aggression, how
important is it to consider whether they watch
violent television shows?

After Before

Difference Scores

After Before D

5-4=+1

6-6=0

4-3=+1

4-2=+2

7-4=+3

3-1=+2

2-0=+2

1-0=+1

4-5=-1

3-2=+1

Difference scores can be
calculated by subtracting before-
after or after-before. The same
answer will be obtained (opposite
sign though). I personally choose
the order which creates the
fewest negative numbers. When
we interpret the results we need
to be careful to remember the
order we used.

(a) Subtracting before-
scores from after-scores,
what are H

and H

9-4a

After Before D

5-4=+1

6-6=0

4-3=+1

4-2=+2

7-4=+3

3-1=+2

2-0=+2

1-0=+1

4-5=-1

3-2=+1









(b) Compute t

obt

D = 1+0+1+2+3+2+2+1+-1+1=12

= 1

+ 0

+ 1

+ 2

+ 3

+ 2

+ -1

+ 1

= 26

N = 10

After Before D

5-4=+1

6-6=0

4-3=+1

4-2=+2

7-4=+3

3-1=+2

2-0=+2

1-0=+1

4-5=-1

3-2=+1



(b) Compute t

obt

After Before D

5-4=+1

6-6=0

4-3=+1

4-2=+2

7-4=+3

3-1=+2

2-0=+2

1-0=+1

4-5=-1

3-2=+1

)

(











135



359

135



359











obt



= .05, what is t

crit

= 10

df = n – 1 = 9

Researcher predicts higher aggressive acts after
watching violence, therefore, this is a one-tailed
test.

t

crit

(9)

=.05

= +1.833

(d) What should the researcher conclude about this
relationship?

Since the t

obt

is in the tail created by t

crit

, we reject H

and

conclude the results are significant. In the population,
children exhibit more aggressive acts after watching the show
(with



about 3.9) than they do before the show (with



about 2.7).