或
~(Y???t1Xt)?0 ~Y???t1?Xt ~求解得 ?1?Y/X 由第2个下规方程
?Xt?X)?0得 (Yt??1t
?XYtt2???1?Xt
??(求解得 ?1~?XY)/(?Xtt2t)
(2)对于?1?Y/X,求期望
11~E(?1)?E(YX)?E[(?1Xt??t)]Xn?X1 ?[E{1t)?E(?t)]
XnX??1??1X这里用到了Xt的非随机性。
??( 对于?1?XY)/(?Xtt2t),求期望
?)?E(XY/X2)E(??tt?t1?(?(11)E(XY)?()E[Xt(?1Xt??t)]tt2?2??Xt?Xt112)?(X)?()XtE(?t)??1?1t22??Xt?Xt
?X??X必须等于Y。但??X通过点(X,Y),????(3)要想拟合值Y111?XY?Xt2ttX,
?X上。 ???通常不等于Y。这就意味着点(X,Y)不太可能位于直线Y1???X经过点(X,Y)。 相反地,由于?1X?Y,所以直线Y1(4)OLS方法要求残差平方和最小
Min RSS?2?X)2 e?(Y???t?t1t~~?求偏导得 关于?1
10
?RSS?X)(?X)?0 ?2?(Yt??1tt???1即
?Xt?X)?0 (Yt??1t???XY???ii1?是OLS估计量。 可见?1 9、解:
??X?
2i??50?0.6?1000?650(元) (1) 当Yf?1000时,消费支出C的点预测值: Ci(2)平均值的预测区间:
??650,t(10)?2.23,??已知: C0.025i2?e?2in?2?300?30,
12?222(Y?Y)(Y?Y)11ff??t???t?[(C?),(C?)] f?2?f?2?22nn?yi?yi1(1000?800)21(1000?800)2?[(650?2.23?30??),(650?2.23?30??)]
128000128000=(650-27.5380,650+27.5380) =(622.46,677.54)
当Yf?1000时,在95%的置信概率下消费支出C平均值的预测区间为(622.46,677.54)元。 (3)个别值的预测区间:
22(Y?Y)(Y?Y)11ff??t???t?[(C?),(C?)] f?2?1?f?2?1?22nn?yi?yi1(1000?800)21(1000?800)2?[(650?2.23?30?1??),(650?2.23?30?1??)]128000128000=(650-30.1247,650+30.1247) =(619.88,680.12)元
当Yf?1000时,在95%的置信概率下消费支出C个别值的预测区间为(619.88,680.12)元。 10、解:
11
X?(1) ?X?ni?168,YY??ni?111
??(Xi?X)(Yi?Y)??(XiYi?YXi?YiX?XY)?204200?1680?111?168?1110?10?168?111 ?17720又??(Xi?X)2??(Xi2?2XiX?X2)??Xi2?2?10X2?10X2?315400?10?168?168?33160
??2??(X?X)(Y?Y)?17720?0.5344
33160?(X?X)ii2i?1?Y??2X?111?0.5344?168?21.22
?2(2) ?e??2in?2(Y??i?i)2?Y10?2(Y??2i?i?Y?i2)?2YiY8
?i?21.22?0.5344Xi ?Y?i?Y?i2)??(Yi2?2?21.22Yi?2?0.5344XiYi??12??22Xi2?2?1?2Xi)??(Yi2?2YiY?133300?2?21.22?1110?2?0.5344?204200?10?21.22?21.22?0.5344?0.5344?315400?2?21.22?0.5344?1680?620.81620.81?2????77.60
8?Var(?1)?n?(Xi?X)222X?i??77.60?315400?73.81,se(?1)?73.81?8.5913
10?33160Var(?2)??2?x2i?77.60?0.0023,se(?2)?0.0023?0.0484
331602i(3) r?1?2?(Yi?Y)22?e,
??ei2?620.81,620.81?0.9385
10090又??(Yi?Y)?133300?123210?10090?r2?1?
(4) ?p(t?2.306)?95%,自由度为8
12
21.22??1?2.306,解得: 1.4085??1?41.0315为?1的95%的置信区间。
8.59130.5344??2?2.306,解得:0.4227??2?0.646为?2的95%的置同理,??2.306?0.0484??2.306?信区间。
由于?2?0不在?2的置信区间内,故拒绝零假设:?2?0。
六、上机练习题 1、解:
(1)使用Eviews软件,ASP对GPA分数的回归结果如表所示。 Dependent Variable: ASP Variable Coefficient 105117.58 -273722.5 0.3624466 0.3396769 14779.439 6.116E+09 -329.563 1.0062756 t-Statistic 3.9897234 -3.19179 Std. Error Prob. GPA C R-squared Adjusted R-squared S.E. of regression Sum squared resid Log likelihood Durbin-Watson stat 26347.086 85758.314 0.0004319 0.0034766 68260 18187.78 22.104202 22.197615 15.917893 0.0004319 Mean dependent var S.D. dependent var Akaike info criterion Schwarz criterion F-statistic Prob(F-statistic) 从回归结果看,GPA分数的系数是统计显著的,对ASP有正的影响。 (2)使用Eviews软件,ASP对GMAT分数的回归结果如表所示。 Dependent Variable: ASP Variable GMAT C R-squared Adjusted R-squared S.E. of regression Sum squared resid Coefficient 641.6598 -332306.8 0.717175 0.707074 9843.701 2.71E+09 Std. Error 76.15036 47572.09 t-Statistic 8.426222 -6.985332 Prob. 0 0 68260 18187.78 21.29139 21.3848 Mean dependent var S.D. dependent var Akaike info criterion Schwarz criterion 13
Log likelihood Durbin-Watson stat -317.3709 1.128809 F-statistic Prob(F-statistic) 71.00122 0 从回归结果看,GMAT分数与ASP显著正相关。
(3)使用Eviews软件,ASP对学费X的回归结果如表所示。 Dependent Variable: ASP Variable X C R-squared Adjusted R-squared S.E. of regression Sum squared resid Log likelihood Durbin-Watson stat Coefficient 2.633483 23126.32 0.448748 0.429061 13742.78 5.29E+09 -327.3813 1.142178 Std. Error 0.551601 9780.863 t-Statistic 4.774252 2.364446 Prob. 0.0001 0.0252 68260 18187.78 21.95876 22.05217 22.79348 0.000051 Mean dependent var S.D. dependent var Akaike info criterion Schwarz criterion F-statistic Prob(F-statistic) 从计算结果看,每年的学费与ASP显著正相关。学费高,ASP就高;但学费仅解释了ASP变化的一部分(不到50%),明显还有其他因素影响着ASP。 (4)使用Eviews软件回归结果如表所示。
Dependent Variable: GPA Variable Coefficient 6.17E-06 Std. Error 4.09E-06 0.072559 Mean var S.D. var Akaike criterion Schwarz 14
t-Statistic 1.507952 Prob. X 0.1428 C 3.147579 43.37936 dependent 0 3.253333 R-squared Adjusted R-squared S.E. of regression Sum squared resid
0.075112 0.04208 dependent 0.104166 0.101951 0.291032 info -1.664311 -1.5708