首页 > 其他 > 详细

ML-Essential Techniques for Predictive Analysis 读书笔记

时间:2017-02-25 19:47:40      阅读:189      评论:0      收藏:0      [点我收藏+]

                                            CHAPTER   ONE

Broadly speaking, this book covers two classes of algorithms for solving   function approximation problems: penalized linear regression methods and ensemble methods. This chapter introduces you to both of these algorithms, outlines some of their characteristics, and reviews the results of comparative studies of algorithm performance in order to demonstrate their consistent high performance.

 

Why Are These Two Algorithms So Useful?  

 

Several factors make the penalized linear regression and ensemble methods a  useful collection. Stated simply, they will provide optimum or near-optimum  performance on the vast majority of predictive analytics (function approximation) problems encountered in practice, including big data sets, little data sets, wide data sets, tall skinny data sets, complicated problems, and simple problems.

 

One of their most important features is that they indicate which of their input  variables is most important for producing predictions. This turns out to be an invaluable feature in a machine learning algorithm. One of the most time‐consuming steps in the development of a predictive model is what is sometimes  called feature selectionor feature engineering.

What Are Penalized Regression Methods?

it is a derivative of ordinary least squares(OLS) regression—a method developed by Gauss and Legendre roughly 200 years ago. Penalized linear regression methods were designed to overcome some basic limitations of OLS regression. The basic problem with OLS is that sometimes it  overfits the problem.

Penalized linear regression provides a way to systematically reduce degrees of freedom to match the amount of data available and the complexity of the underlying phenomena.These methods have become very popular for problems  with very many degrees of freedom. They are a favorite for genetic problems  where the number of degrees of freedom (that is, the number of genes) can be several tens of thousands and for problems like text classification where the number of degrees of freedom can be more than a million

 

 

 

ML-Essential Techniques for Predictive Analysis 读书笔记

原文:http://www.cnblogs.com/hyqxln/p/6441578.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!