Maximum A Posteriori (MAP) Estimation

November 24, 2024

The goal is essentially the same as MLE. We have an assumed model for $p (x_{j} ∣ ω_{j})$ parameterized by $θ$ . We want to classify a feature $x$ into some class $ω_{j}$ based on a labeled dataset $D$ . In MLE, we were trying to maximize the likelihood:

\hat{θ}_{MLE} = ar g θ max p (D ∣ θ)

In MAP, we instead maximize the a posteriori:

\hat{θ}_{MAP} = ar g θ max p (θ ∣ D) = ar g θ max p (D ∣ θ) p (θ)

We immediately notice that if $p (θ)$ is uniform, $\hat{θ}_{MAP} = \hat{θ}_{MLE}$ .

✦ No LLMs were used in the ideation, research, writing, or editing of this article.