views, comments.

Author:LYY


Activation Functions

Sigmoid

Transfer logit to a probability by mapping to .

Definition: Derivative:

It means .

SoftMax

Transfer logits to probabilities by normalizing sequence to , where the summation is equal to 1.

Definition:

Derivative:

It means .

It means .

Odds

Introduction:

  • Win rate: 4 to 1, odds = {number of wins}/{number of loses} = 4

Definition: Assume have probability to win,

We usually use , because it has better symmetry property.

  • Win rate: 4 to 1, odds = 4, but .
  • Win rate: 1 to 4, odds = 1/4, but .

Relationship between probability and : sigmoid

Example for machine learning: 1.Logistic Regression: 对数线性模型

Easy to find

2.XgBoost for Classification

Derivative

Second Order Derivative