Cognitive Science > Statistical Natural Language Processing > 2
[edit] Properties of probability distributions
- Normalization
- 0 <= Pr(w) <= 1
- Additivity
if E1 & E2 are mutually exclusive
- Conditional Probability
- not symmetric
- Chain rule
- mutually exclusive:
- not symmetric
- Bayes' theorem
[edit] Information theory
- Amount of info
- -log2Pr(E)
- in bits
- associated with an event E
- Joint Info
- Entropy
- average info provided
- measure of homogeneity
- H[C] is max if all values of C have same prob