- Quantity of interest or parameter of a model
Frequentist:
is described as unknown and deterministic
Example: Throwing a coin for 100 times, there are 60 times that coin head appears, then. When sample data approaches infinity, this method from frequentist is able to give an accurate estimation of
. However, if sample data is very scarce, then severe bias could occur. To conclude, more data, more accurate estimation of
with frequentist method.
Bayesian:
is described as random. There are two inputs: prior
, likelihood (似然)
. There is one output: Posterior
Bayesian estimation is based on Bayesian rule:
Therefore, we have. Because,
as observation (or sample data) is given as a condition in
, so
.
Example: Considering this example of flipping a coin again.is a distribution, instead of a deterministic value of 0.6 in this example. With the increase of sample data,
trusts measurement more than prior.
Note: if prioris uniform distribution, then Bayesian method is equal to frequentist method.
Maximum likelihood Estimation (MLE) - frequentist method
A given set of observations, random sample data, which is independent and identical distribution. The estimation of
using MLE method can be expressed below:
The last line of above equation is called Negative Log likelihood (NLL)Maximum A Posteriori (MAP) - Bayesian method
A given set of observations, random sample data, which is independent and identical distribution. The estimation of
using MAP method can be expressed below:
Given that prior is a Gaussian distribution:
Then