Parts-of-Speech Tagging (POS) using Hidden Markov Model (HMM )and (MEMM)

Parts of speech tagging the process of assigning a part-of-speech tag (Noun, Verb, Adjective…) to each word in an input text. I have trained two different model HMM which is generative model and MEMM which is discriminative.

  • Train a Hidden Markov Model(HMM) and Bigram Maximum Entropy Model for POS tagging

  • Hidden Markov Model(HMM)

    • Compute transition matrix and emission matrix
    • Compute viterbi algorithm and evaluate accuracy of model on Wall Street Journal (WSJ)
  • Maximum Entropy Model

    • Preprocess and create features (eg. wordshape,number,hyphen) for each word in sentence
    • Train a logistic regression and compute log probability distributions over the tagset.
    • Compute viterbi algorithm and evaluate accuracy of model on Brown corpus

Related