HW 6

Question 1

A sample of $n = 300$ urban adult residents of CA revealed that 120 favorably approved of the incumbent president’s job performance, whereas a sample of $m = 180$ rural residents yielded 75 who favorably approved of the incumbent president. We are interested in testing whether or not there is a difference in perception of the incumbent president’s performance across the two groups.

(a) (5 points) Let $X_{1}, X_{2}, \dots, X_{n}$ be the responses of the urban residents and $Y_{1}, Y_{2}, \dots, Y_{m}$ be the responses of the rural residents. In the setting of this problem, describe the distributions these random variables are sampled from.

SOLUTION

X_{1}, X_{2}, ..., X_{300} Y_{1}, Y_{2}, ..., Y_{180} \sim ii d B er (p_{X}) \sim ii d B er (p_{Y})

(b) (5 points) Identify the main parameter of interest, $θ$ .

SOLUTION

θ = p_{X} - p_{Y}

(c) (5 points) Write down the expression for the statistic $\hat{θ}$ , which is our best guess for the population parameter $θ$ .

SOLUTION

\hat{θ} = \overset{p}{^}_{X} - \overset{p}{^}_{Y}

(d) (5 points) Write down the null and alternative hypothesis for the question.

SOLUTION

$H_{0}$ : $θ = θ_{0} = 0$
$H_{a}$ : $θ \neq = 0$

(e) (5 points) What would the ideal rejection region look like for rejecting $H_{0}$ in favor of $H_{a}$ ?

SOLUTION

The ideal rejection region will be the 2 sides away from $θ_{0}$ . We reject $H_{0}$ if our observed statistic $\hat{θ}$ is significantly larger than 0 or significantly smaller than 0.

(f) (5 points) Assuming the null hypothesis $H_{0}$ is true, what is the sampling distribution of $\hat{θ}$ ? (Hint: Use CLT approximation for the sampling distribution of the statistic $\hat{θ}$ which we have encountered in class earlier. You just need to write down what this sampling distribution is under $H_{0}$ )

SOLUTION

\frac{θ ^ - θ _{0}}{SE} \approx N (0, 1) \equiv Z

(g) (15 points) By setting $α$ to be the Type-I error probability, write down the final expression for the rejection region $R (α, θ)$ in terms of $z_{α /2}$ .

SOLUTION

R (α, θ) \overset{p}{^}_{p oo l e d} SE R (α, θ) = (- \infty, θ_{0} - z_{α /2} \times SE] \cup [θ_{0} + z_{α /2} \times SE, \infty) = \frac{n p ^ _{X} + m p ^ _{Y}}{n + m} = \frac{120 + 75}{300 + 180} = \frac{195}{480} = 0.406 = \overset{p}{^}_{p oo l e d} \times (1 - \overset{p}{^}_{p oo l e d}) \times {\frac{1}{n} + \frac{1}{m}} = (0.406) \times (0.594) \times {\frac{1}{300} + \frac{1}{180}} = 0.00214 = 0.0463 = (- \infty, 0 - z_{α /2} \times (0.09428)] \cup [0 + z_{α /2} \times (0.09428), \infty) = (- \infty, 0.0463 \cdot - z_{α /2}] \cup [0.0463 \cdot z_{α /2}, \infty)

(h) (5 points) Fixing $α = 0.01$ find the level- $α$ rejection region $R (α, θ)$ .

SOLUTION

$z_{α /2} = 2.58$

R (α, θ) = (- \infty, 0.0463 \cdot - z_{α /2}] \cup [0.0463 \cdot z_{α /2}, \infty) = (- \infty, 0.0463 \cdot - 2.58] \cup [0.0463 \cdot 2.58, \infty) = (- \infty, - 0.1194] \cup [0.1194, \infty)

(i) (5 points) What is your final decision is based on the level $α = 0.01$ hypothesis test?

SOLUTION

$\hat{θ} = \overset{p}{^}_{X} - \overset{p}{^}_{Y} = \frac{120}{300} - \frac{75}{180} = - 0.0166$

Since $\hat{θ}$ is not within the rejection region, we failed to reject $H_{0}$ with $α = 0.01$

(j) (5 points) Compute the p-value for the hypothesis test, and specify what your decision will be if you were to, instead, perform a level $α = 0.05$ hypothesis test.

SOLUTION

\hat{T}_{o b s} p-value = \frac{θ ^ - θ _{0}}{SE} = \frac{( - 0.0166 ) - 0}{0.0463} = - 0.3585 = 2 \times z_{α /2} = 2 \times 0.36 = 0.72

Since p-value $= 0.72 > 0.05 = α$ we have failed to reject the null hypothesis with $α = 0.05$

Question 2

In a study to estimate the average height of adult male basketball players, a researcher wants to test if the average height is greater than 200cm. Prior studies indicate that the variance in height is $16 cm^{2}$ .

(a) (10 points) Write down any assumptions about the data and identify the setting of the problem.

SOLUTION

$X$ : The average height of a random chosen male adult basketball player

X_{1}, X_{2}, ..., X_{n} \sim ii d N (μ, σ^{2})

(b) (10 points) From part (a), identify the relevant population parameter, $θ$ , and the sample statistic, $\hat{θ}$ , the researcher will use to make any statistical inference.

SOLUTION

$θ$ : the true population mean of the height of adult male basketball players ( $μ$ ) $\hat{θ}$ : The sample proportion mean of the height of adult male basketball players ( $\overline{X}$ )

(c) (10 points) The researcher wants to compute a two-sided 99% confidence interval for the sample statistic $θ$ . If they want the margin of error to be 0.01cm, what is the minimum number of samples needed?

SOLUTION

$α = 0.01$
$ME (α, θ) = 0.01$
$SE = \frac{σ}{n}$
$z_{α /2} = 2.58$

ME (α, θ) n n n = q_{α /2} \times SE = z_{α /2} \times \frac{σ}{n} = \frac{z _{α /2} \times σ}{ME} = \frac{2.58 \times 4}{0.01} = 1032

(d) (10 points) In part (c), the researcher uses a two-sided confidence interval. In words, describe why/why not this type of a confidence interval is appropriate for the research question they wish to investigate.

SOLUTION

The two-sided confidence interval is inappropriate because the research question is directional (testing if heights are greater than 200cm). A two-sided interval wastes statistical power by accounting for a ‘less than’ direction that the researcher is not investigating; a one-sided lower bound would more accurately align with the alternate hypothesis.

(e) (10 points) Write down the appropriate null and alternate hypotheses for the question.

SOLUTION

$θ = μ$

$H_{0}$ : $θ = θ_{0} = 200$ $H_{a}$ : $θ > 200$

(f) (10 points) The researcher aims to have a power of 80% to detect an actual average height of 202cm. What sample size is required for this test at a $α = 0.01$ significance level?

SOLUTION

$p o w er = 0.8$
$μ_{a} = 202$
$α = 0.01$

Sampling Distribution:

\frac{θ ^ - θ}{SE} \sim N (0, 1) \equiv Z

Reject $H_{0}$ if $\hat{θ} > θ_{0} + z_{α} \times SE$ :

SE = \frac{σ}{n}

Calculate Power:

p o w er = P (reject H_{0} ∣ H_{a} is true) = P (\hat{θ} > θ_{0} + z_{α} \times SE ∣ \frac{θ ^ - θ _{a}}{SE} \sim Z) = P (\frac{θ ^ - θ _{a}}{SE} > \frac{θ _{0} + z _{α} \times SE - θ _{a}}{SE}) = P (Z > x = z_{1 - p o w er} \frac{θ _{0} - θ _{a}}{SE} + z_{α})

We found the $x = z_{1 - p o w er}$ and use it to find $n$

$x = z_{1 - p o w er} = z_{1 - 0.8} = z_{0.2} = 0.84$

$z_{α} = z_{0.01} = - 2.33$

Z_{1 - p o w er} n n = \frac{θ _{0} - θ _{a}}{SE} + z_{α} = \frac{θ _{0} - θ _{a}}{\frac{σ}{n}} + z_{α} = ((z_{1 - p o w er} - z_{α}) \times \frac{σ}{θ _{0} - θ _{a}})^{2} = ((0.84 - (- 2.33)) \times \frac{4}{200 - 202})^{2} = ((3.17) \times - 2)^{2} = 6.3 4^{2} = 40.1956 \approx 41

Question 3

A clinical trial is needed to compare the efficacy of a new diabetes drug $X$ in comparison to the baseline $Y$ . Prior pilot studies found the standard deviations for both drugs to be $σ_{X} = 10.0$ units and $σ_{Y} = 12.0$ units. The FDA requires there to be a reduction of $5 μg / m l$ in blood sugar to be considered “innovation” in order to release the drug into the market. Furthermore, all results need to be reported at a statistical significance level of $α = 0.01$ .

(a) (10 points) State the main assumptions in this problem and identify the problem setting.

SOLUTION

$X$ : The reduced blood sugar of a randomly chosen participant who used the drug $X$ $Y$ : The reduced blood sugar of a randomly chosen participant who used the drug $Y$

X_{1}, X_{2}, ..., X_{n_{X}} Y_{1}, Y_{2}, ..., Y_{n_{Y}} \sim ii d N (μ_{X}, σ_{X}^{2}) \sim ii d N (μ_{Y}, σ_{Y}^{2})

(b) (10 points) Identify the population parameter $θ$ and sample statistic $\hat{θ}$ the researchers are interested in.

SOLUTION

$θ$ : The difference of the true means of the new drug $X$ with the baseline $Y$ ( $μ_{X} - μ_{Y}$ ) $\hat{θ}$ : The difference of the sample means of the new drug $X$ with the baseline $Y$ ( $\overset{μ}{^}_{X} - \overset{μ}{^}_{Y}$ )

(c) (10 points) Identify the null and alternate hypotheses for this problem which will enable the researchers to make the necessary statistical inference.

SOLUTION

$H_{0} : θ = θ_{0} = 0$ $H_{a} : θ > 0$

(d) (10 points) The units for the standard deviation are intentionally left as units. What units should these be for this problem to make sense?

SOLUTION

When calculating sample distribution, the values of the numerator and denominator must be the same for the distribution to be unitless:

\frac{θ ^ - θ}{SE} \sim N (0, 1)

Since we know $θ_{a} = 5 μg / m l$ , the unit for $SE \to σ$ must also have the unit: $μg / m l$

(e) (10 points) Suppose the researchers choose to recruit $n$ volunteers for the research study and randomly split half of them to the two groups, i.e., $n /2$ to take drug X and the remaining $n /2$ volunteers to take the drug Y. What is the minimum sample size, $n$ , needed to detect if the new drug improves on the baseline with power 90%?

SOLUTION

$p o w er = 0.9$
$α = 0.1$
$σ_{X} = 10$ , $σ_{Y} = 12$

I. Sampling Distribution:

\frac{θ ^ - θ}{SE} \sim N (0, 1) \equiv Z

where:

SE = Va r (\hat{θ}) = \frac{σ _{X}^{2}}{n /2} + \frac{σ _{Y}^{2}}{n /2} = \frac{σ _{X}^{2} + σ _{Y}^{2}}{n /2}

II. Sampling Distribution if $H_{a}$ is true:

\frac{θ ^ - θ _{a}}{SE} \sim N (0, 1)

III. Reject $H_{0}$ if:

\hat{θ} > θ_{0} + z_{α} \times SE

IV. power

p o w er = P (\hat{θ} > θ_{0} + z_{α} \times SE ∣ \frac{θ ^ - θ _{a}}{SE} \sim N (0, 1)) = P (\frac{θ ^ - θ _{a}}{SE} > \frac{θ _{0} + z _{α} \times SE - θ _{a}}{SE}) = P (Z > x = Z_{p o w er} \frac{θ _{0} - θ _{a}}{SE} + z_{α})

We found the $x = Z_{p o w er}$ and use it to find $n$

$x = z_{p o w er} = z_{0.9} = - 1.28$

$z_{α} = z_{0.01} = 2.33$

Z_{p o w er} Z_{p o w er} - z_{α} n /2 n n = \frac{θ _{0} - θ _{a}}{SE} + z_{α} = \frac{θ _{0} - θ _{a}}{\frac{σ _{X}^{2} + σ _{Y}^{2}}{n /2}} + z_{α} = \frac{θ _{0} - θ _{a}}{\frac{σ _{X}^{2} + σ _{Y}^{2}}{n /2}} = (Z_{p o w er} - z_{α}) \times \frac{σ _{X}^{2} + σ _{Y}^{2}}{θ _{0} - θ _{a}} = 2 \cdot ((Z_{p o w er} - z_{α}) \times \frac{σ _{X}^{2} + σ _{Y}^{2}}{θ _{0} - θ _{a}})^{2} = 2 \cdot ((- 1.28 - (2.33)) \times \frac{1 0 ^{2} + 1 2 ^{2}}{0 - 5})^{2} = 2 \cdot ((3.61) \times \frac{15.62}{- 5})^{2} = 2 \cdot (- 11.27764)^{2} = 254.37 \approx 256

Question 4

In an upcoming national election, you are in charge of conducting exit polls to predict the winner. The race is between two parties: the orange party and the purple party. You decide to conduct a one-sided population proportion hypothesis test to assess the proportion of voters favoring the purple party candidate.

(a) (10 points) Write down the appropriate assumptions about the data, identify the population parameter of interest, $θ$ , and the sample statistic, $\hat{θ}$ , you intend to use.

SOLUTION

$X$ : The response of a randomly chosen individual whether or not they favor the purple party over the orange party

X_{1}, X_{2}, ..., X_{n} \sim ii d B er (p)

$θ$ : The true population proportion that favors the purple party over the orange party ( $p$ ) $\hat{θ}$ : The sample proportion that favors the purple party over the orange party ( $\overset{p}{^}$ )

(b) (20 points) Identify the null and the alternate hypotheses which will enable you to make the necessary inference for this question. Describe the sampling distribution of the sample statistic $\hat{θ}$ under the null hypothesis and the alternate hypothesis.

SOLUTION

$H_{0} : θ = θ_{0} = 0.5$ $H_{a} : θ > 0.5$

Sampling Distribution if $H_{0}$ is true:

\frac{θ ^ - θ _{0}}{S E _{0}} \approx N (0, 1) \equiv Z

where

S E_{0} = \frac{p _{0} ( 1 - p _{0} )}{n}

Sampling Distribution if $H_{a}$ is true:

\frac{θ ^ - θ _{a}}{S E _{a}} \approx N (0, 1) \equiv Z

Standard Error:

S E_{a} = \frac{p _{a} ( 1 - p _{a} )}{n}

(c) (20 points) Based on prior studies in electoral contexts, an election is considered to have a “moderate level of support” when the true population proportion is 55% or greater. Anything less than that is considered to be a “small margin”. You want your test to have a power of at least 95%, when the true political sentiment in favor of the purple party candidate is a moderate level of support. What is the minimum sample size you would need to achieve this? Assume a significance level of $α = 0.01$ .

SOLUTION

$θ_{a} = p_{a} = 0.55$
$α = 0.01$
$p o w er = 0.95$

Reject $H_{0}$ if :

\hat{θ} > θ_{0} + z_{α} \times S E_{0}

Power:

p o w er = P (\hat{θ} > θ_{0} + z_{α} \times S E_{0} ∣ \frac{θ ^ - θ _{a}}{S E _{a}} \approx Z) = P (\frac{θ ^ - θ _{a}}{S E _{a}} > \frac{θ _{0} + z _{α} \times S E _{0} - θ _{a}}{S E _{a}}) = P (Z > x = Z_{p o w er} \frac{θ _{0} - θ _{a}}{S E _{a}} + z_{α} \frac{S E _{0}}{S E _{a}})

We found the $x = Z_{p o w er}$ and use it to find $n$

$x = z_{p o w er} = z_{0.95} = - 1.64$

$z_{α} = z_{0.01} = 2.33$

Z_{p o w er} \frac{n ( θ _{0} - θ _{a} )}{p _{a} ( 1 - p _{a} )} n n n = \frac{θ _{0} - θ _{a}}{S E _{a}} + z_{α} \cdot \frac{S E _{0}}{S E _{a}} = \frac{θ _{0} - θ _{a}}{\frac{p _{a} ( 1 - p _{a} )}{n}} + z_{α} \cdot \frac{\frac{p _{0} ( 1 - p _{0} )}{n}}{\frac{p _{a} ( 1 - p _{a} )}{n}} = \frac{θ _{0} - θ _{a}}{\frac{p _{a} ( 1 - p _{a} )}{n}} + z_{α} \cdot \frac{\frac{p _{0} ( 1 - p _{0} )}{n}}{\frac{p _{a} ( 1 - p _{a} )}{n}} = \frac{n ( θ _{0} - θ _{a} )}{p _{a} ( 1 - p _{a} )} + z_{α} \cdot \frac{p _{0} ( 1 - p _{0} )}{p _{a} ( 1 - p _{a} )} = Z_{p o w er} - z_{α} \cdot \frac{p _{0} ( 1 - p _{0} )}{p _{a} ( 1 - p _{a} )} = (\frac{p _{a} ( 1 - p _{a} )}{θ _{0} - θ _{a}}) \times (Z_{p o w er} - z_{α} \cdot \frac{p _{0} ( 1 - p _{0} )}{p _{a} ( 1 - p _{a} )}) = [(\frac{p _{a} ( 1 - p _{a} )}{θ _{0} - θ _{a}}) \times (Z_{p o w er} - z_{α} \cdot \frac{p _{0} ( 1 - p _{0} )}{p _{a} ( 1 - p _{a} )})]^{2} = [(\frac{0.55 \times 0.45}{0.5 - 0.55}) \times (- 1.64 - 2.33 \cdot \frac{0.5 \times 0.5}{0.55 \times 0.45})]^{2} = [(\frac{0.2475}{- 0.05}) \times (- 1.64 - 2.33 \cdot \frac{0.25}{0.2475})]^{2} = [(\frac{0.497}{- 0.05}) \times (- 1.64 - 2.33 \cdot \frac{0.5}{0.497})]^{2} = [(- 9.94) \times (- 1.64 - 2.33 \cdot 1.006)]^{2} = [(- 9.94) \times (- 1.64 - 2.344)]^{2} = [(- 9.94) \times (- 3.984)]^{2} = 39.60 1^{2} = 1568.23 \approx 1569

Jason's Notebook

Explorer

HW 6

Question 1

(a) (5 points) Let $X_{1}, X_{2}, \dots, X_{n}$ be the responses of the urban residents and $Y_{1}, Y_{2}, \dots, Y_{m}$ be the responses of the rural residents. In the setting of this problem, describe the distributions these random variables are sampled from.

(b) (5 points) Identify the main parameter of interest, $θ$ .

(c) (5 points) Write down the expression for the statistic $\hat{θ}$ , which is our best guess for the population parameter $θ$ .

(d) (5 points) Write down the null and alternative hypothesis for the question.

(e) (5 points) What would the ideal rejection region look like for rejecting $H_{0}$ in favor of $H_{a}$ ?

(g) (15 points) By setting $α$ to be the Type-I error probability, write down the final expression for the rejection region $R (α, θ)$ in terms of $z_{α /2}$ .

(h) (5 points) Fixing $α = 0.01$ find the level- $α$ rejection region $R (α, θ)$ .

(i) (5 points) What is your final decision is based on the level $α = 0.01$ hypothesis test?

(j) (5 points) Compute the p-value for the hypothesis test, and specify what your decision will be if you were to, instead, perform a level $α = 0.05$ hypothesis test.

Question 2

(a) (10 points) Write down any assumptions about the data and identify the setting of the problem.

(b) (10 points) From part (a), identify the relevant population parameter, $θ$ , and the sample statistic, $\hat{θ}$ , the researcher will use to make any statistical inference.

(c) (10 points) The researcher wants to compute a two-sided 99% confidence interval for the sample statistic $θ$ . If they want the margin of error to be 0.01cm, what is the minimum number of samples needed?

(d) (10 points) In part (c), the researcher uses a two-sided confidence interval. In words, describe why/why not this type of a confidence interval is appropriate for the research question they wish to investigate.

(e) (10 points) Write down the appropriate null and alternate hypotheses for the question.

(f) (10 points) The researcher aims to have a power of 80% to detect an actual average height of 202cm. What sample size is required for this test at a $α = 0.01$ significance level?

Question 3

(a) (10 points) State the main assumptions in this problem and identify the problem setting.

(b) (10 points) Identify the population parameter $θ$ and sample statistic $\hat{θ}$ the researchers are interested in.

(c) (10 points) Identify the null and alternate hypotheses for this problem which will enable the researchers to make the necessary statistical inference.

(d) (10 points) The units for the standard deviation are intentionally left as units. What units should these be for this problem to make sense?

Question 4

(a) (10 points) Write down the appropriate assumptions about the data, identify the population parameter of interest, $θ$ , and the sample statistic, $\hat{θ}$ , you intend to use.

(b) (20 points) Identify the null and the alternate hypotheses which will enable you to make the necessary inference for this question. Describe the sampling distribution of the sample statistic $\hat{θ}$ under the null hypothesis and the alternate hypothesis.

Graph View

Table of Contents

Jason's Notebook

Explorer

HW 6

Question 1

(a) (5 points) Let X1​,X2​,…,Xn​ be the responses of the urban residents and Y1​,Y2​,…,Ym​ be the responses of the rural residents. In the setting of this problem, describe the distributions these random variables are sampled from.

(b) (5 points) Identify the main parameter of interest, θ.

(c) (5 points) Write down the expression for the statistic θ^, which is our best guess for the population parameter θ.

(d) (5 points) Write down the null and alternative hypothesis for the question.

(e) (5 points) What would the ideal rejection region look like for rejecting H0​ in favor of Ha​?

(g) (15 points) By setting α to be the Type-I error probability, write down the final expression for the rejection region R(α,θ) in terms of zα/2​.

(h) (5 points) Fixing α=0.01 find the level-α rejection region R(α,θ).

(i) (5 points) What is your final decision is based on the level α=0.01 hypothesis test?

(j) (5 points) Compute the p-value for the hypothesis test, and specify what your decision will be if you were to, instead, perform a level α=0.05 hypothesis test.

Question 2

(a) (10 points) Write down any assumptions about the data and identify the setting of the problem.

(b) (10 points) From part (a), identify the relevant population parameter, θ, and the sample statistic, θ^, the researcher will use to make any statistical inference.

(c) (10 points) The researcher wants to compute a two-sided 99% confidence interval for the sample statistic θ. If they want the margin of error to be 0.01cm, what is the minimum number of samples needed?

(d) (10 points) In part (c), the researcher uses a two-sided confidence interval. In words, describe why/why not this type of a confidence interval is appropriate for the research question they wish to investigate.

(e) (10 points) Write down the appropriate null and alternate hypotheses for the question.

(f) (10 points) The researcher aims to have a power of 80% to detect an actual average height of 202cm. What sample size is required for this test at a α=0.01 significance level?

Question 3

(a) (10 points) State the main assumptions in this problem and identify the problem setting.

(b) (10 points) Identify the population parameter θ and sample statistic θ^ the researchers are interested in.

(c) (10 points) Identify the null and alternate hypotheses for this problem which will enable the researchers to make the necessary statistical inference.

(d) (10 points) The units for the standard deviation are intentionally left as units. What units should these be for this problem to make sense?

Question 4

(a) (10 points) Write down the appropriate assumptions about the data, identify the population parameter of interest, θ, and the sample statistic, θ^, you intend to use.

(b) (20 points) Identify the null and the alternate hypotheses which will enable you to make the necessary inference for this question. Describe the sampling distribution of the sample statistic θ^ under the null hypothesis and the alternate hypothesis.

Graph View

Table of Contents

(a) (5 points) Let $X_{1}, X_{2}, \dots, X_{n}$ be the responses of the urban residents and $Y_{1}, Y_{2}, \dots, Y_{m}$ be the responses of the rural residents. In the setting of this problem, describe the distributions these random variables are sampled from.

(b) (5 points) Identify the main parameter of interest, $θ$ .

(c) (5 points) Write down the expression for the statistic $\hat{θ}$ , which is our best guess for the population parameter $θ$ .

(e) (5 points) What would the ideal rejection region look like for rejecting $H_{0}$ in favor of $H_{a}$ ?

(g) (15 points) By setting $α$ to be the Type-I error probability, write down the final expression for the rejection region $R (α, θ)$ in terms of $z_{α /2}$ .

(h) (5 points) Fixing $α = 0.01$ find the level- $α$ rejection region $R (α, θ)$ .

(i) (5 points) What is your final decision is based on the level $α = 0.01$ hypothesis test?

(j) (5 points) Compute the p-value for the hypothesis test, and specify what your decision will be if you were to, instead, perform a level $α = 0.05$ hypothesis test.

(b) (10 points) From part (a), identify the relevant population parameter, $θ$ , and the sample statistic, $\hat{θ}$ , the researcher will use to make any statistical inference.

(c) (10 points) The researcher wants to compute a two-sided 99% confidence interval for the sample statistic $θ$ . If they want the margin of error to be 0.01cm, what is the minimum number of samples needed?

(f) (10 points) The researcher aims to have a power of 80% to detect an actual average height of 202cm. What sample size is required for this test at a $α = 0.01$ significance level?

(b) (10 points) Identify the population parameter $θ$ and sample statistic $\hat{θ}$ the researchers are interested in.

(a) (10 points) Write down the appropriate assumptions about the data, identify the population parameter of interest, $θ$ , and the sample statistic, $\hat{θ}$ , you intend to use.

(b) (20 points) Identify the null and the alternate hypotheses which will enable you to make the necessary inference for this question. Describe the sampling distribution of the sample statistic $\hat{θ}$ under the null hypothesis and the alternate hypothesis.