date: 2024-12-24
title: "ML-Density Estimation"
status: UNFINISHED
author:
- AllenYGY
tags:
- NOTE
publish: True
ML-Density Estimation
For a random vector x, assuming that it obeys an unknown distribution p(x), the probability of falling into a small area R in the space is
Given
Approximation when
When n is very large, we can approximately think that
Assuming
Final approximation for
To accurately estimate
Fixed area size, counting the number falling into different areas, which includes histogram method and kernel method.
the area size so that the number of samples falling into each area is zero is called K-nearest neighbor method.
For low dimensional data we can use a histogram as a density model.
Kernel Density Estimation (KDE) is a non-parametric method to estimate the probability density function (PDF) of a random variable.
We have 5 data points:
For each
For
For
For
For
For
The total density at
In this example, the estimated density at