Differences

This shows you the differences between two versions of the page.


faq:newwords [2008/02/08 19:49] (current) – created - external edit 127.0.0.1
Line 1: Line 1:
 +===== How does POPFile evaluate new words? =====
 +
 +POPFile assigns a probability to every word, for words that it has seen it naturally assigns the probability from the corpus. For unseen words it assigns the probability 1/(10 * size of that bucket's corpus)---i.e. a probability that indicates that the word is "unlikely" to appear.
 +
 +The other possible choices are 0 (which would screw up classification since all classifications would be 0) or 1 (which would be a mistake since it would indicate that the word always appears).
  
 
Old revisions

Should you find anything in the documentation that is incomplete, unclear, outdated or just plain wrong, please let us know and leave a note in the Documentation Forum.

Recent changes RSS feed Donate Driven by DokuWiki
The content of this wiki is protected by the GNU Fee Documentation License