solving the problem of overfitting: regularization
- The overfitting problem as it arises in linear regression

- Overfitting as it arises in logistic regression
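To make the overfitting described above concrete, here is a minimal Python sketch (my own illustration, not part of the original post), using polynomial regression as the example: a degree-9 polynomial can pass almost exactly through 10 noisy training points, so its training error is tiny while its error on fresh test data is much larger.

```python
import numpy as np

# Fit the same noisy training data with a low-degree and a high-degree
# polynomial and compare training vs. test error. The degree-9 model has
# enough parameters to pass (almost) exactly through the 10 training points,
# so its training error is tiny while its test error blows up -- overfitting.
np.random.seed(0)
x_train = np.linspace(0, 1, 10)
y_train = np.sin(2 * np.pi * x_train) + 0.2 * np.random.randn(10)
x_test = np.linspace(0, 1, 100)
y_test = np.sin(2 * np.pi * x_test) + 0.2 * np.random.randn(100)

for degree in (1, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {degree}: train MSE = {train_mse:.4f}, test MSE = {test_mse:.4f}")
```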


- regularization: cost function of linear regression

- J(θ) = (1/(2m)) [ Σ_{i=1..m} (h_θ(x^(i)) − y^(i))² + λ Σ_{j=1..n} θ_j² ]
- If the parameters are kept small, the hypothesis becomes simpler (smoother), which helps prevent overfitting
- By convention, θ0 is not regularized
- The expression above is the cost function of linear regression with regularization, and the goal is to minimize it (a small code sketch follows this list)
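A minimal Python sketch of this regularized cost function, assuming a design matrix X whose first column is all ones; the function name and arguments are my own for illustration, not from the original post. Note that θ0 (theta[0]) is left out of the regularization term, matching the convention above.

```python
import numpy as np

def regularized_cost(theta, X, y, lmbda):
    """Regularized linear regression cost J(theta).

    X is the (m, n+1) design matrix whose first column is all ones (the
    theta_0 / intercept feature), theta is the (n+1,) parameter vector,
    y is the (m,) target vector and lmbda is the regularization parameter.
    """
    m = len(y)
    errors = X @ theta - y
    fit_term = np.sum(errors ** 2)              # first term: fits the training data
    reg_term = lmbda * np.sum(theta[1:] ** 2)   # second term: keeps parameters small; theta_0 is not regularized
    return (fit_term + reg_term) / (2 * m)
```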

- The regularized cost function consists of two terms (two objectives): the first term fits the training data well, and the second term helps avoid overfitting
- The second term is called the regularization term, and λ is called the regularization parameter; λ balances the two objectives
- If λ is very large, θ1 through θn are driven to almost 0, so the hypothesis becomes very simple (essentially just the constant θ0); this causes underfitting, i.e. a poor fit even on the training data (see the sketch below)
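A small sketch of the effect of λ (my own illustration, not from the original notes). It fits the parameters in closed form using the regularized normal equation, a technique not discussed in these notes, once with a tiny λ and once with a huge λ; with the huge λ, θ1..θn collapse toward 0 and only the intercept θ0 survives, which is the underfitting behaviour described above.

```python
import numpy as np

# Fit theta in closed form with the regularized normal equation
#   theta = (X^T X + lambda * L)^(-1) X^T y,
# where L is the identity matrix with a 0 in the theta_0 position so that
# the intercept is not regularized. A huge lambda forces theta_1..theta_n
# toward 0, leaving essentially only theta_0 -- i.e. underfitting.
np.random.seed(0)
m = 20
x = np.linspace(0, 1, m)
y = 1.0 + 2.0 * x + 0.1 * np.random.randn(m)
X = np.column_stack([np.ones(m), x, x ** 2, x ** 3])  # intercept + polynomial features

def fit(lmbda):
    L = np.eye(X.shape[1])
    L[0, 0] = 0.0                        # do not regularize theta_0
    return np.linalg.solve(X.T @ X + lmbda * L, X.T @ y)

print("lambda = 0.01:", fit(0.01))       # close to the unregularized fit
print("lambda = 1e6 :", fit(1e6))        # theta_1..theta_3 ~ 0, only theta_0 left
```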
machine learning(12) -- solving the problem of overfitting: regularization
Original: http://www.cnblogs.com/yan2015/p/4844930.html