Ticket #169 (new task)

Opened 6 years ago

Unicode support

Reported by: amatubu Assigned to: amatubu
Priority: normal Milestone: 2.0.0
Component: Parser Version: 1.1.3
Severity: normal Keywords: Unicode, UTF-8, Multi-language
Cc:

Description

POPFile currently does not support Unicode. POPFile v2 will support it.

TODO:

  • Convert encoding of e-mails.
  • Convert encoding of existing corpus and stopwords.
  • Convert encoding of language files.
  • POPFile UI will be encoded in UTF-8.
  • Some works for Japanese and Korean. (currently depends on euc-jp/euc-kr)