SAM--Chap 6: Logistic Regression and the Maximum Entropy Model, a Self-Review

Time: 2020-07-03 13:48:48

July 3

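In a dream I was back among the young, all hurried song and dance, until an old monk rang the midnight bell by mistake.
Startled awake at the west window, I cannot sleep again; the west wind sweeps the ground.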

1. Logistic regression

Some basic logic

source: https://www.vebuso.com/2020/02/linear-to-logistic-regression-explained-step-by-step/

For someone like me who has completely forgotten linear algebra, it is worth walking through this from the beginning.

The basic idea behind logits is to model the logarithm of the odds as a linear function of the inputs; inverting that transform keeps the predicted probability between 0 and 1.

[Image]

As for logit versus probit, the biggest difference is that logit is based on the logistic distribution, while probit is based on the normal distribution.
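To make the logit/probit comparison concrete, here is a minimal sketch that evaluates the two cumulative distribution functions side by side. The function names are my own, and only the Python standard library is assumed.

```python
import math


def logistic_cdf(x: float) -> float:
    """CDF of the standard logistic distribution (inverse of the logit link)."""
    return 1.0 / (1.0 + math.exp(-x))


def normal_cdf(x: float) -> float:
    """CDF of the standard normal distribution (inverse of the probit link)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


# Both curves are S-shaped and pass through 0.5 at x = 0;
# the logistic curve approaches 0 and 1 more slowly (heavier tails).
for x in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(f"x={x:+.1f}  logistic={logistic_cdf(x):.4f}  normal={normal_cdf(x):.4f}")
```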

Logistic assumptions

The model is correctly specified.
The cases are independent.
The independent variables are not linear combinations of each other.

For logits:

If the probability of success is P, then the odds of that event are P / (1 - P).

[Image]
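A small sketch of the odds and the log-odds (logit) in code may help here. The helper names `odds` and `logit` are my own choices for illustration.

```python
import math


def odds(p: float) -> float:
    """Odds of an event with probability p, defined for 0 < p < 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return p / (1.0 - p)


def logit(p: float) -> float:
    """Log-odds: maps a probability in (0, 1) to the whole real line."""
    return math.log(odds(p))


# A probability of 0.8 corresponds to odds of roughly 4 to 1.
# The logit is 0 at p = 0.5 and is antisymmetric around that point.
for p in (0.2, 0.5, 0.8):
    print(f"p={p:.1f}  odds={odds(p):.3f}  logit={logit(p):+.3f}")
```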

It's time to transform the model from linear regression to logistic regression using the logistic function.

[Image]
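For reference, the step from a linear model on the log-odds to the logistic function can be written out as follows. The symbols w (weights), x (inputs) and b (intercept) are my own notation, since the original image is not available.

```latex
\log\frac{P}{1-P} = w \cdot x + b
\;\Longrightarrow\;
\frac{P}{1-P} = e^{\,w \cdot x + b}
\;\Longrightarrow\;
P = \frac{e^{\,w \cdot x + b}}{1 + e^{\,w \cdot x + b}}
  = \frac{1}{1 + e^{-(w \cdot x + b)}}
```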

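We can see from the figure below that the output of the linear regression is passed through a sigmoid function (the logistic function, i.e. the inverse of the logit), which maps any real value into the interval between 0 and 1.

[Figure: output of the linear regression passed through a sigmoid function]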

Here is the definition of the logistic distribution:

[Image: definition of the logistic distribution]
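As a reference, the usual textbook form of the logistic distribution is given here. This is the standard definition rather than a transcription of the original image; μ is the location parameter and γ > 0 is the scale parameter.

```latex
F(x) = P(X \le x) = \frac{1}{1 + e^{-(x-\mu)/\gamma}},
\qquad
f(x) = F'(x) = \frac{e^{-(x-\mu)/\gamma}}{\gamma \left(1 + e^{-(x-\mu)/\gamma}\right)^{2}}
```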

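The closer the value of the linear function is to positive infinity, the closer the probability is to 1; the closer it is to negative infinity, the closer the probability is to 0.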
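This limiting behaviour is easy to check numerically. Below is a minimal sketch of the sigmoid, written in a branch form that avoids overflow for large negative inputs; the name `sigmoid` is my own choice.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic function 1 / (1 + exp(-z)), computed without overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, exp(-z) could overflow, so use the equivalent form.
    ez = math.exp(z)
    return ez / (1.0 + ez)


# As z grows the output approaches 1; as z decreases it approaches 0.
for z in (-50.0, -5.0, 0.0, 5.0, 50.0):
    print(f"z={z:+6.1f}  sigmoid={sigmoid(z):.6f}")
```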

Multinomial logistic regression:

Multinomial logistic regression is an extension of logistic regression in which we set up one equation for each logit relative to the reference outcome.

[Image]

Multinomial logistic regression is used when you have a categorical dependent variable with two or more unordered levels (i.e. two or more discrete outcomes).

This type of regression is usually performed with software. Essentially, the software will run a series of individual binomial logistic regressions for M – 1 categories (one calculation for each category, minus the reference category).
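To illustrate the "one logit per category relative to a reference" idea, here is a minimal sketch that turns M - 1 linear scores into M class probabilities. The function name, the weight values and the convention of putting the reference category last are all assumptions made for this example; real software would also estimate the weights from data.

```python
import math
from typing import List, Sequence


def multinomial_probs(x: Sequence[float],
                      weights: Sequence[Sequence[float]]) -> List[float]:
    """Class probabilities for a multinomial logit model with a reference class.

    `weights` holds one coefficient vector per non-reference class (M - 1 rows).
    The reference class has its logit fixed at 0 and is returned last.
    """
    logits = [sum(w_j * x_j for w_j, x_j in zip(w, x)) for w in weights]
    logits.append(0.0)  # reference category
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical example: 3 classes, 2 inputs (the first input is a constant 1
# that plays the role of the intercept).
weights = [[0.5, 1.0], [-0.2, 0.3]]
probs = multinomial_probs([1.0, 2.0], weights)
print(probs)
```

By construction, the log of the ratio between any class probability and the reference class probability equals that class's linear score, which is exactly the set of M - 1 logit equations.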

2. Maximum entropy model (MaxEnt)

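One criterion for learning a probabilistic model is that, among all candidate models, the one with the largest entropy is the best.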

Geometric interpretation:

[Figure: geometric interpretation of the maximum entropy model]

First, recall the concept of entropy:

[Image: definition of entropy]
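As a quick refresher in code, here is a minimal sketch of the Shannon entropy of a discrete distribution, H(p) = -Σ p_i log p_i, measured in nats. The function name and the input checks are my own.

```python
import math
from typing import Sequence


def entropy(p: Sequence[float]) -> float:
    """Shannon entropy in nats; terms with p_i = 0 contribute 0."""
    if any(q < 0 for q in p) or abs(sum(p) - 1.0) > 1e-9:
        raise ValueError("p must be a probability distribution")
    return -sum(q * math.log(q) for q in p if q > 0)


# With four outcomes, the uniform distribution has the largest entropy (log 4),
# and a distribution concentrated on one outcome has entropy 0.
print(entropy([0.25, 0.25, 0.25, 0.25]))
print(entropy([0.7, 0.1, 0.1, 0.1]))
print(entropy([1.0, 0.0, 0.0, 0.0]))
```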

The maximum entropy principle (MaxEnt) states that the most appropriate distribution to model a given set of data is the one with the highest entropy among all those that satisfy the constraints of our prior knowledge.
In other words, the constraints take the form of a set of expectations.

[Image]
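A classic toy illustration of an expectation constraint (not taken from the source, added here as a sketch) is a six-sided die whose average roll is known. Under a single mean constraint the maximum entropy distribution has the exponential form p_i ∝ exp(λ·i), so the code below only needs to find the λ that matches the target mean. The function names and the bisection bounds are my own assumptions.

```python
import math
from typing import List

FACES = [1, 2, 3, 4, 5, 6]


def gibbs(lam: float) -> List[float]:
    """Distribution with p_i proportional to exp(lam * i) over the die faces."""
    m = max(lam * i for i in FACES)  # subtract the maximum for stability
    w = [math.exp(lam * i - m) for i in FACES]
    z = sum(w)
    return [v / z for v in w]


def mean(p: List[float]) -> float:
    """Expected face value under distribution p."""
    return sum(i * q for i, q in zip(FACES, p))


def maxent_die(target_mean: float, tol: float = 1e-12) -> List[float]:
    """Maximum entropy distribution on a die subject to E[X] = target_mean."""
    if not 1.0 < target_mean < 6.0:
        raise ValueError("target_mean must lie strictly between 1 and 6")
    lo, hi = -50.0, 50.0  # assumed search range for lam
    # The mean increases monotonically with lam, so bisection applies.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mean(gibbs(mid)) < target_mean:
            lo = mid
        else:
            hi = mid
    return gibbs(0.5 * (lo + hi))


p = maxent_die(4.5)
print([round(q, 4) for q in p], round(mean(p), 6))
```

With a target mean of 3.5 the constraint adds no information and the result is, up to numerical tolerance, the uniform distribution; with 4.5 the probabilities tilt smoothly toward the higher faces.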