Mathematical Statistics course teaching resources (reference material): Large sample properties of MLE 05

Section 8.5. Breakdown of assumptions
• Non-existence of the MLE
• Multiple solutions to the maximization problem
• Multiple solutions to the score equations
• Number of parameters increasing with the sample size
• Support of $p(x; \theta)$ depends on $\theta$
• Non-i.i.d. data

Non-Existence of the MLE

The MLE may fail to exist for all values of the sample $x_n$ or for only some of them. In general, this happens either because the parameter space is not compact or because the log-likelihood is discontinuous in $\theta$.

Example 8.1: Suppose that $X \sim \mathrm{Bernoulli}(1/(1 + \exp(\theta)))$, where $\Theta = \mathbb{R}$. If we observe $x = 1$, then $L(\theta; 1) = 1/(1 + \exp(\theta))$. The likelihood is a decreasing function of $\theta$, so its supremum, 1, is approached only as $\theta \to -\infty$ and the maximum is not attained on $\Theta$. If $\Theta$ were closed, i.e., $\Theta = \bar{\mathbb{R}}$, the MLE would be $-\infty$.

Example 8.2: Suppose that $X \sim \mathrm{Normal}(\mu, \sigma^2)$, so $\theta = (\mu, \sigma^2)$ and $\Theta = \mathbb{R} \times \mathbb{R}_+$. For a single observation $x$, up to an additive constant,
$$l(\theta; x) = -\log \sigma - \frac{1}{2\sigma^2}(x - \mu)^2.$$
Take $\mu = x$. Then as $\sigma \to 0$, $l(\theta; x) \to +\infty$. So the MLE does not exist.
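Both non-existence examples can be checked numerically. The following is a minimal sketch, not part of the original slides: the function names, the observed value `x = 1.3`, and the grids of parameter values are all illustrative assumptions. It evaluates the Bernoulli likelihood of Example 8.1 along a sequence $\theta \to -\infty$, and the normal log-likelihood of Example 8.2 (additive constant dropped) at $\mu = x$ along a sequence $\sigma \to 0$.

```python
import math


def bernoulli_lik(theta):
    """Likelihood L(theta; 1) = 1 / (1 + exp(theta)) from Example 8.1."""
    return 1.0 / (1.0 + math.exp(theta))


def normal_loglik(mu, sigma, x):
    """Single-observation Normal(mu, sigma^2) log-likelihood from Example 8.2,
    with the additive constant dropped."""
    return -math.log(sigma) - (x - mu) ** 2 / (2.0 * sigma ** 2)


# Example 8.1: the likelihood keeps increasing as theta decreases,
# but stays strictly below its supremum 1.
thetas = [0.0, -5.0, -10.0, -20.0]  # illustrative grid (assumption)
liks = [bernoulli_lik(t) for t in thetas]
for t, v in zip(thetas, liks):
    print(f"theta = {t:6.1f}: L = {v:.12f}")

# Example 8.2: fix mu = x and shrink sigma; the log-likelihood grows without bound.
x = 1.3  # hypothetical observed value
sigmas = [1.0, 0.1, 0.01, 0.001]  # illustrative grid (assumption)
logliks = [normal_loglik(x, s, x) for s in sigmas]
for s, v in zip(sigmas, logliks):
    print(f"sigma = {s:g}: l = {v:.4f}")
```

With $\mu = x$ the quadratic term vanishes, so the second loop simply prints $-\log \sigma$, which makes the divergence easy to see.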

Multiple Solutions

One reason for multiple solutions to the maximization problem is non-identification of the parameter $\theta$.

Example 8.3: Suppose that $Y \sim \mathrm{Normal}(X\theta, I)$, where $X$ is an $n \times k$ matrix with rank smaller than $k$ and $\theta \in \Theta \subset \mathbb{R}^k$. The density function is
$$p(y; \theta) = (2\pi)^{-n/2} \exp\left(-\tfrac{1}{2}(y - X\theta)'(y - X\theta)\right).$$
Since $X$ is not of full rank, there are infinitely many solutions to $X\theta = 0$. Adding any such solution to $\theta$ leaves $X\theta$ unchanged, so infinitely many values of $\theta$ generate the same density function. So $\theta$ is not identified. Furthermore, the likelihood is maximized at every $\theta$ satisfying $X'X\theta = X'y$.
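The non-identification in Example 8.3 can be illustrated with a small numerical sketch. Everything concrete here is an assumption made for illustration: the $4 \times 3$ matrix `X` (whose third column is the sum of the first two), the response `y`, and the null vector `v`. The sketch uses NumPy's `np.linalg.lstsq` to obtain one solution of the normal equations, shifts it along the null space of `X`, and checks that both points satisfy $X'X\theta = X'y$ and give the same log-likelihood.

```python
import numpy as np

# Hypothetical rank-deficient design: column 3 = column 1 + column 2, so rank 2 < k = 3.
X = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0],
              [1.0, 1.0, 2.0],
              [2.0, 1.0, 3.0]])
y = np.array([1.0, 2.0, 2.5, 4.0])  # hypothetical observations

rank = np.linalg.matrix_rank(X)

# One solution of the normal equations (lstsq returns the minimum-norm one).
theta_a, *_ = np.linalg.lstsq(X, y, rcond=None)

# v solves X v = 0, so shifting along v does not change X theta.
v = np.array([1.0, 1.0, -1.0])
theta_b = theta_a + 3.0 * v


def loglik(theta):
    """Log-likelihood of Normal(X theta, I) evaluated at the observed y."""
    r = y - X @ theta
    return -0.5 * len(y) * np.log(2.0 * np.pi) - 0.5 * (r @ r)


print("rank of X:", rank)
print("theta_a:", theta_a, "log-likelihood:", loglik(theta_a))
print("theta_b:", theta_b, "log-likelihood:", loglik(theta_b))
```

Any multiple of `v` could be used in place of `3.0 * v`, which is the sense in which the set of maximizers is infinite.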

Multiple Roots to the Score Equations

Even though the score equations may have multiple roots for fixed $n$, we can still use our theorems to show consistency and asymptotic normality. This will work provided that, as $n$ gets large, there is a unique maximum with high probability.

Example 8.4: Suppose that $X_n = (X_1, \ldots, X_n)$, where the $X_i$'s are i.i.d. $\mathrm{Cauchy}(\theta, 1)$. We assume that $\theta_0$ lies in the interior of a compact set $\Theta \subset \mathbb{R}$. So,
$$p(x; \theta) = \frac{1}{\pi\left(1 + (x - \theta)^2\right)}.$$
So the log-likelihood for the full sample is
$$l(\theta; x) = -n \log \pi - \sum_{i=1}^{n} \log\left(1 + (x_i - \theta)^2\right).$$
Note that as $\theta \to \pm\infty$, $l(\theta; x) \to -\infty$.

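The score for $\theta$ is given by
$$\frac{dl(\theta; x)}{d\theta} = \sum_{i=1}^{n} \frac{2(x_i - \theta)}{1 + (x_i - \theta)^2}.$$
There can be multiple roots to the score equations; the original slide illustrates this with a picture, which is not reproduced here.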
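The possibility of multiple roots in the Cauchy score equation of Example 8.4 can be checked with a rough grid search. This is only a sketch under stated assumptions: the three well-separated data points in `x`, the grid on $[-10, 10]$, and the helper name `cauchy_score` are all hypothetical choices, and counting sign changes on a grid is a crude stand-in for a proper root finder.

```python
import numpy as np


def cauchy_score(theta, x):
    """Score of the Cauchy(theta, 1) log-likelihood: sum of 2 d / (1 + d^2), d = x_i - theta."""
    d = x - theta
    return np.sum(2.0 * d / (1.0 + d ** 2))


# Hypothetical, well-separated sample (assumption, not from the slides).
x = np.array([-5.0, 0.0, 5.0])

# An even number of grid points keeps theta = 0, an exact root by symmetry, off the grid.
grid = np.linspace(-10.0, 10.0, 4000)
scores = np.array([cauchy_score(t, x) for t in grid])

# Each sign change between neighbouring grid points brackets a root of the score equation.
changes = np.sign(scores[:-1]) * np.sign(scores[1:]) < 0
sign_changes = int(np.sum(changes))
root_brackets = [(grid[i], grid[i + 1]) for i in np.flatnonzero(changes)]

print("number of sign changes:", sign_changes)
for lo, hi in root_brackets:
    print(f"root between {lo:.3f} and {hi:.3f}")
```

Multiplying the score through by the product of its denominators gives a polynomial of degree $2n - 1$ in $\theta$, so with $n = 3$ observations the score equation has at most five real roots; roots where the score changes from positive to negative are local maxima of the log-likelihood.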
