Bigram Soft Pattern Model
Bigram probability
To estimate the interpolation mixture weight λ
Expectation Maximization (EM) algorithm
Count words and general tags separately
Prevents the high frequency counts of general tags from overwhelming the word counts
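
As a rough illustration of estimating λ with EM, the sketch below assumes the usual linear interpolation P(t_i | t_{i-1}) = λ·P_ML(t_i | t_{i-1}) + (1 − λ)·P_ML(t_i) and fits λ on held-out token sequences. The interpolation form, the function names (train_counts, estimate_lambda), and the toy sequences are assumptions made for illustration, not the slides' own implementation; the separate counting of words and general tags is not modeled here.

```python
from collections import Counter


def train_counts(sequences):
    """Collect unigram, bigram, and left-context counts from token sequences."""
    unigrams, bigrams, contexts = Counter(), Counter(), Counter()
    for seq in sequences:
        unigrams.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    return unigrams, bigrams, contexts


def estimate_lambda(train, heldout, iterations=50, lam=0.5):
    """EM estimate of the mixture weight on held-out bigrams (assumed setup)."""
    unigrams, bigrams, contexts = train_counts(train)
    total = sum(unigrams.values())
    pairs = [(p, c) for seq in heldout for p, c in zip(seq, seq[1:])]
    for _ in range(iterations):
        expected, used = 0.0, 0
        for prev, cur in pairs:
            # Maximum-likelihood bigram and unigram estimates.
            p_bi = bigrams[(prev, cur)] / contexts[prev] if contexts[prev] else 0.0
            p_uni = unigrams[cur] / total if total else 0.0
            mix = lam * p_bi + (1.0 - lam) * p_uni
            if mix > 0.0:
                # E-step: posterior that the bigram component generated this token.
                expected += lam * p_bi / mix
                used += 1
        if used == 0:
            break  # no held-out event is explained by either component
        # M-step: new weight is the average posterior.
        lam = expected / used
    return lam


# Toy sequences mixing words and general tags (hypothetical data).
train = [["NP", "is", "a", "NN"], ["NP", "is", "the", "NN"]]
lam_seen = estimate_lambda(train, [["NP", "is", "a", "NN"]])
lam_unseen = estimate_lambda(train, [["NN", "NP"]])
```

When the held-out bigrams were all seen in training, λ moves toward 1; when they were never seen, the unigram component takes all the weight and λ falls to 0.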