Wednesday, December 12, 2018
'Final Exam Ec315\r'
' disunite I. HYPOTHESIS TESTING PROBLEM 1 A certain brand of fluorescent light piping was advertised as having an effective life bilk before burning out of 4000 hours. A haphazard pattern of 84 bulbs was burned out with a mean illumination life span of 1870 hours and with a sample standard deviation of 90 hours. piddle a 95 pledge interval put on this sample and be sure to render this interval. Answer Since population standard deviation is unknown, t distribution mint be used construct the say-so interval. ? The 95% confidence interval is presumption by ? X ? t? / 2,n ? 1 ? S S? , X ? ? /2,n ? 1 ? n n? Details assertion Interval Estimate for the ungenerous Data audition stock release try out Mean adjudicate sizing Confidence aim 90 1870 84 95% In shapeediate Calculations meter erroneous belief of the Mean 9. 819805061 Degrees of emancipation 83 t quantify 1. 988959743 Interval Half Width 19. 53119695 Confidence Interval Interval Lower Limit 1850. 47 Interval velocity Limit 1889. 53 2 PROBLEM 2 Given the fol put downs info from two independent selective information sets, conduct a one -tail theory leaven to reconcile if the nitty-gritty ar statistic all in ally equal using of import=0. 05. Do NOT do a confidence interval. 1 = 35 n2 = 30 xbar1= 32 xbar2 = 25 s1=7 s2 = 6 Answer H0:õ1=õ2 H1: õ1>õ2 campaign statistics used is t ? X1 ? X 2 S 2 (n1 ? 1) S12 ? (n2 ? 1) S2 n1n2 ~ tn1 ? n1 ? 2 where S ? n1 ? n2 ? 2 n1 ? n2 Decision sway : lour the trifling hypothesis, if the calculated think of of experiment statistic is great than the fine value. Details t ravel for variations in devil federal agency Data Hypothe sized Difference take aim of signifi dropce Population 1 essay assay size of it warning Mean precedent mensuration deviation Population 2 Sample Sample Size Sample Mean Sample ideal Deviation 0 0. 05 35 32 7 30 25 6Intermediate Calculations Population 1 Sample Degrees of Freedom 34 Populatio n 2 Sample Degrees of Freedom 29 occur Degrees of Freedom 63 Pooled Variance 43. 01587 Difference in Sample Means 7 t Test Statistic 4. 289648 Upper-Tail Test Upper searing Value p-Value Reject the nugatory hypothesis 1. 669402 3. 14E-05 expiry: Reject the null hypothesis. The sample provides fair to middling evince to support the claim that means ar different. 3 PROBLEM 3. A running game was conducted to determine whether sexual activity of a display mannequin af fected the likeliness that consumers would prefer a cutting product.A survey of consumers at a trade show which used a female spokesperson fixed that 120 of ccc customers favorite(a) the product while 92 of 280 customers best-loved the product when it was shown by a female spokesperson. Do the samples provide sufficient evidence to indicate that the grammatical gender of the salesperson affect the likelihood of the product macrocosm favorably regarded by consumers? Evaluate with a two-tail, important =. 01 test. Do NOT do a confidence interval. Answer H0: There no epoch-making gender wise dissimilarity in the symmetry customers who preferred the product.H1: There substantive gender wise dissimilarity in the proportion customers who preferred the product. P ? P2 n p ? n p 1 The test Statistic used is Z test Z ? where p= 1 1 2 2 n1 ? n2 ?1 1? P(1 ? P) ? ? ? ? n1 n2 ? Decision rule : Reject the null hypothesis, if the calculated value of test statistic is greater than the vituperative value. Details Z Test for Differences in devil equilibriums Data Hypothesized Difference train of Significance multitude 1 Number of Successes Sample Size stem 2 Number of Successes Sample Size 0 0. 01 virile 120 300 Female 92 80 Intermediate Calculations Group 1 Proportion 0. 4 Group 2 Proportion 0. 328571429 Difference in Two Proportions 0. 071428571 Average Proportion 0. 365517241 Z Test Statistic 1. 784981685 Two-Tail Test Lower Critical Value -2. 575829304 Upper Critical Value 2. 575829304 p-Value 0. 074264288 Do non bend the null hypothesis Conclusion: Fails to reject the null hypothesis. The sample does not provide generous evidence to support the claim that there substantive gender wise end in the proportion customers who preferred the product. 4PROBLEM 4 Assuming that the population variances argon equal for antheral and Female GPAââ¬â¢s, test the following sample data to see if Male and Female PhD candidate GPAââ¬â¢s (Means) argon equal. Conduct a two-tail hypothesis test at ? =. 01 to determine whether the sample means are different. Do NOT do a confidence interval. Male GPAââ¬â¢s Female GPAââ¬â¢s Sample Size 12 13 Sample Mean 2. 8 4. 95 Sample hackneyed Dev .25 .8 Answer H0: There is no crucial difference in the mean GPA of males and Females H1: There is evidential difference in the mean GPA of males and Females. Test Statistic used is independent sample t test. ? X1 ? X 2 S 2 (n1 ? 1) S12 ? (n2 ? 1) S2 n1n2 ~ tn1 ? n1 ? 2 where S ? n1 ? n 2 ? 2 n1 ? n2 Decision rule: Reject the null hypotheses, if the calculated value of test statistic is greater than the critical value. Details t Test for Differences in Two Means Data Hypothesized Difference Level of Significance Population 1 Sample Sample Size Sample Mean Sample Standard Deviation Population 2 Sample Sample Size Sample Mean Sample Standard Deviation Intermediate Calculations Population 1 Sample Degrees of Freedom Population 2 Sample Degrees of Freedom Total Degrees of Freedom Pooled Variance 0. 05 12 2. 8 0. 25 13 4. 95 0. 8 11 12 23 0. 363804 5 Difference in Sample Means t Test Statistic -2. 15 -8. 90424 Two-Tail Test Lower Critical Value Upper Critical Value p-Value Reject the null hypothesis -2. 80734 2. 807336 0. 0000 Conclusion: Reject the null hypotheses. The sample provides enough evidence to support the claim that there is significant difference in the mean GP A fall guy among the males and females. 6 pick II REGRESSION psychoanalysis Problem 5 You wish to run the arrested development exemplification (less(prenominal) Intercept and coefficients) shown below: VOTE = urban + INCOME + EDUCATEGiven the jump spreadsheet below for annual data from1970 to 2006 (with the data for row 5 thru row 35 not shown), complete all necessary entries in the Excel Regression Window shown below the data. 1 2 3 4 A YEAR 1970 1971 1972 B VOTE C URBAN D INCOME E EDUCATE 49. 0 58. 3 45. 2 62. 0 65. 2 75. 0 7488 7635 7879 4. 3 8. 3 4. 5 36 37 38 2004 2005 2006 50. 1 92. 1 94. 0 95. 6 15321 15643 16001 4. 9 4. 7 5. 1 67. 7 54. 2 Regression Input OK Input Y tramp: A1:A38 Input X Range: B1:E38 Cancel Help ? Labels Confidence Level: x X X Output options X Constant is Zero 95 % Output Range: New Worksheet Ply:New W orkbook symmetricalnesss Residuals Residual Plots Standardized Residuals Line Fit Plots Normal Probabilit y Normal Probability Plots 7 PROBLEM 6. implement the following reversal output to determine the following: A real estate investor has dev ised a model to estimate legal residence termss in a new suburban development. Data for a random sample of 100 homes were gathered on the selling price of the home ($ thousands), the home size (square feet), the lot size (thousands of square feet), and the itemize of bed board. The following threefold atavism output was generated: Regression Statistics Multiple R 0. 8647 R Square . 7222 Adjusted R Square 0. 6888 Standard Error 16. 0389 Observations 100 Intercept X1 (Square Feet) X2 (Lot Size) X3 (Bedrooms) Coefficients -24. 888 0. 2323 11. 2589 15. 2356 Standard Error 38. 3735 0. 0184 1. 7120 6. 8905 t Stat -0. 7021 9. 3122 4. 3256 3. 2158 P-value 0. 2154 0. 0000 0. 0001 0. 1589 a. Why is the coefficient for BEDROOMS a positive number? The selling price increase when the number of rooms increases. Thus the relationship is positive. b. Which is the most(prenominal) statistically significant versatile? What evidence shows this? Most statistically significant variable is one wit h least(prenominal) p value.Here most statistically significant variable is Square feet. c. Which is the least statistically significant variable? What evidence shows this? to the lowest degree statistically significant variable is one with high-pitched p value. Here least statistically significant variable is bedrooms d. For a 0. 05 level of significance, should any variable be dropped from this model? Why or wherefore not? The variable bed rooms can be dropped from the model as the p value is greater than 0. 05. e. Interpret the value of R shape? How does this value from the adjusted R squared? The R2 gives the model adequacy. Here R2 suggest that 72. 22% variability can e explained by the model. Adjusted R2 is a adaption of R2 that adjusts for the number of explanatory terms in a model. Unlike R2, the adjusted R2 increases only if the new term improves the model more than would be expected by chance. f. Predict the sales price of a 1134-square-foot home with a lot size of 1 5,400 square feet and 2 bedrooms. Selling Price =-24. 888+0. 02323*1134+11. 2589*15400+15. 2356*2=173419 8 PART III SPECIFIC KNOWLEDGE SHORT-ANSWER QUESTIONS. Problem 7 fructify Auto coefficient of correlational statistics in the following terms: a. In what token of regression is it likely to occur? Regressions involving conviction series data . What is bad about autocorrelation in a regression? The standard error of the estimates lead high. c. What method is used to determine if it exists? (Think of statistical test to be used) Durbin Watson Statistic is used determine auto correlation in a regression. d. If found in a regression how is it eliminated? Appropriate transformations can be adoptive to eliminate auto correlation. Problem 8 Define Multicollinearity in the following terms: a) In what type of regression is it likely to occur? Multicollinearity occurs in multiple regressions when two or more independent variables are highly cor think. ) Why is multicollinearity in a re gression a difficulty to be resolved? Multicollinearity in Regression Models is an unacceptably high level of intercorrelation among the independents, much(prenominal) that the effects of the independents cannot be separated. Under multicollinearity, estimates are frank but valuatements of the relative strength of the explanatory variables and their articulation effect are unreliable. c) How can multicollinearity be determined in a regression? Multicollinearity refers to excessive correlation of the predictor variables. When correlation is excessive (some use the rule of thumb of r > 0. 90), tandard errors of the b and beta coefficients pop off large, making it difficult or impossible to assess the relative importance of the predictor variables. The measures Tolerance and VIF are commonly used to measure multicollinearity. Tolerance is 1 â⬠R2 for the regression of that independent variable on all the other independents, ignoring the dependent. There will be as many toler ance coefficients as there are independents. The higher the inter-correlation of the independents, the more the tolerance will surface zero. As a rule of thumb, if tolerance is less than . 20, a problem with multicollinearity is indicated.When tolerance is close to 0 there is high multicollinearity of that variable with other independents and the b and beta coefficients will be unstable. The more the multicollinearity, the lower the tolerance, the more the standard error of the regression coefficients. d) If multicollinearity is found in a regression, how is it eliminated? Multicollinearity occurs because two (or more) variables are related â⬠they measure essentially the same thing. If one of the variables doesnââ¬â¢t seem logically essential to your model, removing it may slash or eliminate multicollinearity.\r\n'
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment