Unigram model, Training corpus:


<s> I am sam </s>
<s> sam I am </s>

Find the probability p(sam)

4/11

3/11

4/10

2/10

To calculate the probability p(sam), count the number of occurrences of the token "sam" in the training corpus and divide it by the total number of tokens in the corpus.

In the given training corpus, "sam" appears twice. Each sentence contains 5 tokens when the boundary markers <s> and </s> are counted, so the corpus has 10 tokens in total. Therefore, p(sam) = 2/10.

So, the correct answer is 2/10.
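
The counting can be checked with a short script. This is only a minimal sketch: it assumes whitespace tokenization and treats the boundary markers <s> and </s> as ordinary tokens, and the helper name `unigram_counts` is made up for illustration.

```python
from collections import Counter

# The two training sentences from the question, boundary markers included.
corpus = [
    "<s> I am sam </s>",
    "<s> sam I am </s>",
]

def unigram_counts(sentences):
    """Return (per-token counts, total token count) using whitespace tokenization."""
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence.split())
    return counts, sum(counts.values())

counts, total = unigram_counts(corpus)
print(f"p(sam) = {counts['sam']}/{total} = {counts['sam'] / total}")
# prints: p(sam) = 2/10 = 0.2
```

Under a different convention (for example, dropping <s> from the counts) the denominator would change, so the result depends on the tokenization assumption stated above.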

To find the probability of the word "sam" under a unigram model, divide the number of times "sam" occurs in the training corpus by the total number of tokens in the corpus.

"sam" appears twice, once in each sentence.

Counting the boundary markers <s> and </s> as tokens, each sentence has 5 tokens, so the corpus has 10 tokens in total.

Therefore, the probability of the word "sam" is 2/10.

So, the correct answer is 2/10.
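
Written as a formula, this is the relative-frequency (maximum-likelihood) unigram estimate. The symbols c(w), the count of token w, and N, the total number of tokens, are notation introduced here for illustration, and N = 10 assumes the boundary markers <s> and </s> are counted as tokens:

```latex
P(\text{sam}) = \frac{c(\text{sam})}{N} = \frac{2}{10} = 0.2
```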