Statictical Learning

Probability Distribution

  • Probability that a random variable (r.v.) takes every possible value.
  • Satisfying:

Discrete Random Variable

Probability Mass Function (PMF)

  • Probability Mass Function (PMF) is the probability that the value of r.v. is :

Bernoulli Distribution

  • In a test, event happens with probability , does not happen with probability .

  • If using r.v. to indicate the number of occurrences of event , then can be 0 or 1. Its distribution is:

Binomial Distribution

  • In the times Bernoulli distribution, if r.v. represents the number of occurrences of event , the value of is , and its corresponding distribution:
  • The binomial coefficient represents the total number of combinations of elements taken out of elements regardless of their order.

Continuous Random Variable

Probability Density Function (PDF)

Probability distribution can be described by the Probability Density Function (PDF) :

Cumulative Distribution Function (CDF)

The Cumulative Distribution Function (CDF) is the probability that the value of r.v. is less than or equal to :

  • For continuous r.v., we have:

Gaussian Distribution

Marginal Distribution

Marginal Probability Mass Function

  • 的边际概率质量函数:

  • 的边际概率质量函数:

Marginal Probability Density Function

  • 的边际概率密度函数:

  • 的边际概率密度函数:

Conditional Probability

For a discrete random vector , when is known, the conditional probability of r.v. is:

Sampling

Sampling: given a probability distribution p(x), generate samples that meet the conditions

Expectation

Expectation: the average of random variable

For discrete r.v. :

For continuous r.v. :

Law of Large Numbers

When the number of samples is large, the sample mean and the real mean (expectation) are fully close.

Given N independently and identically distributed (I.I.D.) samples

Its mean value converges to the expected value: