summaryrefslogtreecommitdiff
path: root/src/nsMBCSGroupProber.cpp
AgeCommit message (Expand)AuthorFilesLines
2023-07-17src: handle long sequences of characters.Jehan1-10/+21
2023-07-17Issue #33: crafted sequence of bytes triggers memory write past the bounds of…Jehan1-2/+13
2022-12-20script, src, test: new Georgian support.Jehan1-0/+1
2022-12-20script, src, test: adding Catalan support.Jehan1-0/+1
2022-12-17script, src, test: add Serbian support.Jehan1-0/+1
2022-12-17src, script: add Macedonian support.Jehan1-0/+1
2022-12-17script, src: regenerate Russian models and add UTF-8/Russian support.Jehan1-0/+1
2022-12-17script, src, test: add Ukrainian support.Jehan1-0/+1
2022-12-17script, src, test: adding Belarusian support.Jehan1-0/+1
2022-12-17script, src, test: Bulgarian language models added.Jehan1-0/+1
2022-12-14src: when checking for candidates, make sure we haven't any unprocessed…Jehan1-1/+8
2022-12-14src: process pending language data when we are going to pass buffer size.Jehan1-0/+11
2022-12-14script, src: update Norwegian model with the new language features.Jehan1-0/+1
2022-12-14script, src: add English language model.Jehan1-0/+1
2022-12-14script, src: remove generated statistics data for Korean.Jehan1-1/+0
2022-12-14src: new nsCJKDetector specifically Chinese/Japanese/Korean recognition.Jehan1-1/+3
2022-12-14src: add Hindi/UTF-8 support.Jehan1-1/+2
2022-12-14script, src: add generic Korean model.Jehan1-0/+1
2022-12-14src, test: fix the new Johab prober and add a test.Jehan1-2/+2
2022-12-14src: build new charset prober for Johab Korean.Jehan1-1/+2
2022-12-14add charset prober for Johab KoreanLSY1-1/+4
2022-12-14script, src: generate the Hebrew models.Jehan1-0/+1
2022-12-14src: make nsMBCSGroupProber report all valid candidates.Jehan1-61/+161
2022-12-14src: allow for nsCharSetProber to return several candidates.Jehan1-10/+10
2022-12-14src: nsMBCSGroupProber confidence weighed by language confidence.Jehan1-2/+16
2022-12-14src: reset language detectors when resetting a nsMBCSGroupProber.Jehan1-0/+6
2022-12-14src, script: regenerate all existing language models.Jehan1-6/+28
2022-12-14Using the generic language detector in UTF-8 detection.Jehan1-10/+99
2022-12-14src: new API to get the detected language.Jehan1-0/+12
2015-11-17uchardet_get_charset() must return iconv-compatible names.Jehan1-4/+4
2011-07-11Update code from upstream.BYVoid1-49/+70
2011-07-10Initial release.BYVoid1-0/+209