The goal is essentially the same as MLE. We have an assumed model for parameterized by . We want to classify a feature into some class based on a labeled dataset . In MLE, we were trying to maximize the likelihood:
In MAP, we instead maximize the a posteriori:
We immediately notice that if is uniform, .