Introduction To Statistics And Data Analysis
Introduction To Statistics And Data Analysis
6th Edition
ISBN: 9781337793612
Author: PECK, Roxy.
Publisher: Cengage Learning,
bartleby

Videos

Question
Book Icon
Chapter 14.3, Problem 46E

a.

To determine

Test whether the given model is useful or not at the 0.05 level of significance.

a.

Expert Solution
Check Mark

Answer to Problem 46E

There is convincing evidence that the given model is useful at the 0.05 level of significance.

Explanation of Solution

Calculation:

It is given that the variable y is absorption, x1 is flour protein, and x2 is starch damage. The sample size is 28 and the number of variables (k) is 2. The given regression equation is y=19.4+1.44x1+0.336x2.

1.

The model is y=α+β1x1+β2x2+e.

2.

Null hypothesis:

H0:β1=β2=0

That is, there is no useful relationship between y and any of the predictors.

3.

Alternative hypothesis:

Ha: At least one among βi's is not zero.

That is, there is a useful relationship between y and any of the predictors.

4.

Here, the significance level is α=0.05.

5.

Test statistic:

F=R2k(1R2)n(k+1)

Here, n is the sample size, and k is the number of variables in the model.

6.

Assumptions:

Since there is no availability of original data to check the assumptions, there is a need to assume that the variables are related to the model, and the random deviation is distributed normally with mean 0 and the fixed standard deviation.

7.

Calculation:

From the MINITAB output, the value of R2 is 0.964.

The value of F-test statistic is calculated as follows:

F=R2k(1R2)n(k+1)=0.9642(10.964)(28(2+1))=0.4820.03625=0.482×250.036=12.050.036=334.722

8.

P-value:

Software procedure:

Step-by-step procedure to find the P-value using the MINITAB software:

  • Choose Graph > Probability Distribution Plot choose View Probability > OK.
  • From Distribution, choose ‘F’ distribution.
  • Enter the Numerator df as 2 and Denominator df as 25.
  • Click the Shaded Area tab.
  • Choose X Value and Right Tail for the region of the curve to shade.
  • Enter the X value as 334.722.
  • Click OK.

Output obtained using the MINITAB software is represented as follows:

Introduction To Statistics And Data Analysis, Chapter 14.3, Problem 46E , additional homework tip  1

From the MINITAB output, the P-value is 0.

9.

Conclusion:

If the P-valueα, then reject the null hypothesis.

Therefore, the P-value of 0 is less than the 0.05 level of significance.

Hence, reject the null hypothesis.

Thus, there is convincing evidence that the given model is useful at the 0.05 level of significance.

b.

To determine

Calculate a 95% confidence interval for β2 and interpret it.

b.

Expert Solution
Check Mark

Answer to Problem 46E

The 95% confidence interval for β2 is (0.298,0.373_).

Explanation of Solution

Calculation:

Here, β2 is the coefficient of variable x2.

Since there is no availability of original data to check the assumptions, there is a need to assume that the variables are related to the model, and the random deviation is distributed normally with mean 0 and the fixed standard deviation.

The formula for confidence interval for β2 is as follows:

b2±(t critical value)sb2.

Where, b2 is the estimated value of β2 and sb2 is the estimated standard deviation of b2.

Degrees of freedom:

df=n(k+1)=28(2+1)=283=25

Software procedure:

Step-by-step procedure to find the P-value using the MINITAB software:

  • Choose Graph > Probability Distribution Plot choose View Probability > OK.
  • From Distribution, choose ‘t’ distribution.
  • Enter the Degrees of freedom as 25.
  • Click the Shaded Area tab.
  • Choose Probability and Both for the region of the curve to shade.
  • Enter the Probability value as 0.05.
  • Click OK.

Output obtained using the MINITAB software is represented as follows:

Introduction To Statistics And Data Analysis, Chapter 14.3, Problem 46E , additional homework tip  2

From the MINITAB output, the critical value is 2.060.

From the given MINITAB output, the value of b2 is 0.33563 and the value of sb2 is 0.01814.

The confidence interval for mean change in exam score is calculated as follows:

b2±(t critical value)sb2=0.33563±(2.060)0.01814=0.33563±0.03737=(0.335630.03737,0.33563+0.03737)=(0.29826,0.373)(0.298,0.373)

Thus, the 95% confidence interval for β2 is (0.298,0.373_).

There is 95% confident that the average increase in y is associated with 1-unit increase in starch damage that is between 0.298 and 0.373, when the other predictors are fixed.

c.

To determine

  1. i. Test the hypothesis H0:β1=0 versus Ha:β10.
  2. ii. Test the hypothesis H0:β2=0 versus Ha:β20

c.

Expert Solution
Check Mark

Answer to Problem 46E

It can be concluded that the quadratic term should not be eliminated from the model and simple linear model should not be sufficient.

Explanation of Solution

Calculation:

i.

1.

The predictor β1 is the coefficient of x1. Here, x1 is the flour protein.

2.

Null hypothesis:

H0:β1=0.

3.

Alternative hypothesis:

Ha:β10.

4.

Here, the common significance levels are α=0.05,0.01,0.10.

5.

Test statistic:

t=b1sb1

Here, b1 is the estimated value of β1, and sb1 is the estimated standard deviation of b1.

6.

Assumptions:

The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.

7.

Calculation:

From the MINITAB output, the t test statistic value of x1 is 6.95.

8.

P-value:

From the MINITAB output, the P-value for x1 is 0.000.

9.

Conclusion:

If the P-valueα, then reject the null hypothesis.

Therefore, the P-value of 0 is less than any common levels of significance, such as 0.05, 0.01, and 0.10.

Hence, reject the null hypothesis.

Thus, there is convincing evidence that the flour protein variable is important and it is β10_.

ii.

1.

The predictor β2 is the coefficient of x2. Here, x2 is the starch damage.

2.

Null hypothesis:

H0:β2=0.

3.

Alternative hypothesis:

Ha:β20.

4.

Here, the common significance levels are α=0.05,0.01,0.10.

5.

Test statistic:

t=b2sb2

Here, b2 is the estimated value of β2, and sb2 is the estimated standard deviation of b2.

6.

Assumptions:

The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.

7.

Calculation:

From the MINITAB output, the t test statistic value of x2 is 18.51.

8.

P-value:

From the MINITAB output, the P-value for x1 is 0.000.

9.

Conclusion:

If the P-valueα, then reject the null hypothesis.

Therefore, the P-value of 0 is less than any common levels of significance, such as 0.05, 0.01, and 0.10.

Hence, reject the null hypothesis.

Thus, there is convincing evidence that the starch damage variable is important and it is β20_.

d.

To determine

Explain whether both the independent variables are important or not.

d.

Expert Solution
Check Mark

Explanation of Solution

From the results of Part (c), there is evidence that the variables of flour protein and starch damage are important.

e.

To determine

Calculate a 90% confidence interval and interpret it.

e.

Expert Solution
Check Mark

Answer to Problem 46E

The 90% confidence interval is (54.5084,56.2916_).

Explanation of Solution

Calculation:

It is given that the value of x1 is 11.7 and the value of x2 is 57. The standard deviation is 0.522.

Assume that the exam score is associated with the predictors according to the model. The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.

The formula for prediction interval for mean y value is as follows:

y^±(tcritical value)sy^.

The value of y^ is calculated as follows:

y=19.4+1.44x1+0.336x2=19.4+1.44(11.7)+0.336(57)=19.4+16.848+19.152=55.4

Degrees of freedom:

df=n(k+1)=28(2+1)=283=25

Software procedure:

Step-by-step procedure to find the P-value using the MINITAB software:

  • Choose Graph > Probability Distribution Plot choose View Probability > OK.
  • From Distribution, choose ‘t’ distribution.
  • Enter the Degrees of freedom as 25.
  • Click the Shaded Area tab.
  • Choose Probability and Both for the region of the curve to shade.
  • Enter the Probability value as 0.10.
  • Click OK.

Output obtained using the MINITAB software is represented as follows:

Introduction To Statistics And Data Analysis, Chapter 14.3, Problem 46E , additional homework tip  3

From the MINITAB output, the critical value is 1.708.

The prediction interval for mean y value is calculated as follows:

y^±(tcritical value)sy^=55.4±(1.708)0.522=55.4±0.8916=(55.40.8916,55.4+0.8916)=(54.5084,56.2916)

Thus, the 90% confidence interval is (54.5084,56.2916_).

There is 90% confident that the mean water absorption for wheat with 11.7 flour protein and 57 starch damage is between 54.5084 and 56.2916.

f.

To determine

Predict the water absorption for the shipment by 90% interval.

f.

Expert Solution
Check Mark

Answer to Problem 46E

The water absorption values for the particular shipment are between 53.3297 and 57.4703 by 90% prediction interval.

Explanation of Solution

Calculation:

It is given that the value of x1 is 11.7 and the value of x2 is 57. The standard deviation is 0.522.

Assume that the exam score is associated with the predictors according to the model. The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.

The formula for prediction interval for mean y value is as follows:

y^±(tcritical value)se2+sy^2.

From the given MINITAB output, the standard deviation is 1.094.

From Part e., the value of y^ is 55.4.

From the MINITAB output in Part e., the critical value is 1.708.

The prediction interval for water absorption for shipment is calculated as follows:

y^±(tcritical value)se2+sy^2=55.4±(1.708)1.0942+0.5222=55.4±(1.708)1.1968+0.2725=55.4±(1.708)1.2121=55.4±2.0703=(55.42.0703,55.4+2.0703)=(53.3297,57.4703)

Thus, the 90% prediction interval is (53.3297,57.4703_).

By prediction at 90% interval, the water absorption values for the particular shipment are between 53.3297 and 57.4703.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
A survey of high school students was done to examine whether students had ever driven a car after consuming a substantial amount of alcohol (1=yes, 0=no). Data was collected on their sex (male/female), race (White/non-White), and grade level (9,10,11,12). Researchers realized that the impact of race on consuming alcohol before driving might vary by grade level and decided to fit the following model. Compute the OR of drinking before driving for students who self-reported as Black versus non-Black in the 9th grade, adjusting for gender. Hello, thank you for helping me out with this. I'm just confused about the concept of adjusting for gender in this case. I am not sure if I should plug in 0 and 1 for the gender and report those out separately or if I should take both odds ratios and divide them?
Consider a single variable model to estimate the effect of in-person lecture attendance on university students' WAMs (weighted average marks): (C1) WAM = β0 + β1Attend + u Where: WAMis a student's weight average mark Attendis the proportion of lectures attended by a student in an academic year Using the information above, answer the following 3 questions. [i] Explain why Attend might be endogenous in Model (C1). What does this suggest about E [u ∣ Attend]?    [ii] Suppose that student attendance is related to a student's conscientiousness. Does this mean that conscientiousness would be a good instrumental variable for Attend? Why or why not? Explain your reasoning.    [iii] Unfortunately a major strike by railway works disrupts transport links to 3 of the 7 universities in a city for a period of 2 weeks. This temporarily means that students who are enrolled at those 3 affected universities cannot attend lectures in person. Carefully explain how you would use this information to…
An article in Journal of Food Science [“Prevention of Potato Spoilage During Storage by Chlorine Dioxide” (2001, Vol. 66(3), pp. 472–477)] reported on a study of potato spoilage based on different conditions of acidified oxine (AO), which is a mixture of chlorite and chlorine dioxide. The data follow: ∑X1i=230, ∑X21i=17200, n1=3. ∑X2i=150, ∑X22i=8100, n2=3. ∑X3i=60, ∑X23i=4500, n3=3. ∑X4i=65, ∑X24i=1625, n4=3. where group 1 has AO = 50ppm, group 2 has AO = 100ppm, group 3 has AO = 200ppm, group 4 has AO = 400ppm. Complete the ANOVA table below Source df sum of square mean square F p-value Between           Within       N/A N/A Total 11   N/A N/A N/A   At α=0.05, do we reject the null hypothesis that AO does not affect spoilage?

Chapter 14 Solutions

Introduction To Statistics And Data Analysis

Ch. 14.1 - Prob. 11ECh. 14.1 - A manufacturer of wood stoves collected data on y...Ch. 14.1 - Prob. 13ECh. 14.1 - Prob. 14ECh. 14.1 - Prob. 15ECh. 14.2 - Prob. 16ECh. 14.2 - State as much information as you can about the...Ch. 14.2 - Prob. 18ECh. 14.2 - Prob. 19ECh. 14.2 - Prob. 20ECh. 14.2 - The ability of ecologists to identify regions of...Ch. 14.2 - Prob. 22ECh. 14.2 - Prob. 23ECh. 14.2 - Prob. 24ECh. 14.2 - Prob. 25ECh. 14.2 - Prob. 26ECh. 14.2 - This exercise requires the use of a statistical...Ch. 14.2 - Prob. 28ECh. 14.2 - The article The Undrained Strength of Some Thawed...Ch. 14.2 - Prob. 30ECh. 14.2 - Prob. 31ECh. 14.2 - Prob. 32ECh. 14.2 - Prob. 33ECh. 14.2 - This exercise requires the use of a statistical...Ch. 14.2 - This exercise requires the use of a statistical...Ch. 14.3 - Prob. 36ECh. 14.3 - Prob. 37ECh. 14.3 - When Coastal power stations take in large amounts...Ch. 14.3 - Prob. 39ECh. 14.3 - The article first introduced in Exercise 14.28 of...Ch. 14.3 - Data from a random sample of 107 students taking a...Ch. 14.3 - Benevolence payments are monies collected by a...Ch. 14.3 - Prob. 43ECh. 14.3 - Prob. 44ECh. 14.3 - Prob. 45ECh. 14.3 - Prob. 46ECh. 14.3 - Exercise 14.26 gave data on fish weight, length,...Ch. 14.3 - Prob. 48ECh. 14.3 - Prob. 49ECh. 14.3 - Prob. 50ECh. 14.4 - Prob. 51ECh. 14.4 - Prob. 52ECh. 14.4 - The article The Analysis and Selection of...Ch. 14.4 - Prob. 54ECh. 14.4 - Prob. 55ECh. 14.4 - Prob. 57ECh. 14.4 - Prob. 58ECh. 14.4 - Prob. 59ECh. 14.4 - Prob. 60ECh. 14.4 - This exercise requires use of a statistical...Ch. 14.4 - Prob. 62ECh. 14 - Prob. 63CRCh. 14 - Prob. 64CRCh. 14 - The accompanying data on y = Glucose concentration...Ch. 14 - Much interest in management circles has focused on...Ch. 14 - Prob. 67CRCh. 14 - Prob. 68CRCh. 14 - Prob. 69CRCh. 14 - A study of pregnant grey seals resulted in n = 25...Ch. 14 - Prob. 71CRCh. 14 - Prob. 72CRCh. 14 - This exercise requires the use of a statistical...
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
  • Text book image
    Calculus For The Life Sciences
    Calculus
    ISBN:9780321964038
    Author:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.
    Publisher:Pearson Addison Wesley,
Text book image
Calculus For The Life Sciences
Calculus
ISBN:9780321964038
Author:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.
Publisher:Pearson Addison Wesley,
Introduction to experimental design and analysis of variance (ANOVA); Author: Dr. Bharatendra Rai;https://www.youtube.com/watch?v=vSFo1MwLoxU;License: Standard YouTube License, CC-BY