path: root/script
Age | Commit message | Author | Files | Lines
2022-12-20 | script: improve a bit create-table.py and regenerate the Georgian charsets. | Jehan | 3 | -36/+53
2022-12-20 | script, src, test: new Georgian support. | Jehan | 5 | -0/+473
2022-12-20 | script: new create-table script. | Jehan | 1 | -0/+137
2022-12-20 | script: update the README. | Jehan | 1 | -6/+5
2022-12-20 | script, src, test: adding Catalan support. | Jehan | 3 | -0/+318
2022-12-18 | Issue #21: Greek CP737 support. | Jehan | 3 | -228/+291
2022-12-18 | script: fix a notice message. | Jehan | 1 | -1/+1
2022-12-18 | script: add a requirements.txt for our generation script. | Jehan | 2 | -0/+4
2022-12-18 | script, src: generate more code for language and sequence model listing. | Jehan | 3 | -693/+836
2022-12-17 | script, src, test: add Serbian support. | Jehan | 2 | -0/+309
2022-12-17 | src, script: add Macedonian support. | Jehan | 2 | -0/+306
2022-12-17 | script, src: regenerate Russian models and add UTF-8/Russian support. | Jehan | 6 | -0/+621
2022-12-17 | script, src, test: add Ukrainian support. | Jehan | 2 | -0/+337
2022-12-17 | script, src, test: adding Belarusian support. | Jehan | 2 | -0/+298
2022-12-17 | script, src, test: Bulgarian language models added. | Jehan | 4 | -0/+468
2022-12-17 | script: add an error handling for when iconv fail to convert from a codepoint. | Jehan | 1 | -0/+3
2022-12-16 | Issue #22: Hebrew CP862 support. | Jehan | 3 | -265/+334
2022-12-15 | src: all language models now rebuilt after the fix. | Jehan | 31 | -8018/+7982
2022-12-15 | script: fix BuildLangModel.py. | Jehan | 1 | -4/+6
2022-12-14 | scripts: all language models rebuilt with the new ratio data. | Jehan | 33 | -5285/+8253
2022-12-14 | script: model-building script updated to produce the 2 new ratios… | Jehan | 1 | -1/+26
2022-12-14 | script, src: rebuild the English model. | Jehan | 1 | -164/+235
2022-12-14 | script, src: rebuild the Danish model. | Jehan | 1 | -139/+223
2022-12-14 | script, src: update Norwegian model with the new language features. | Jehan | 2 | -1/+235
2022-12-14 | script: further fixing BuildLangModel.py. | Jehan | 1 | -0/+2
2022-12-14 | script: improve a bit the management of use_ascii option. | Jehan | 1 | -7/+5
2022-12-14 | script: work around recent issue of python wikipedia module. | Jehan | 1 | -3/+3
2022-12-14 | script, src: add English language model. | Jehan | 2 | -0/+245
2022-12-14 | script: generate more complete frequent characters when range is set. | Jehan | 1 | -19/+16
2022-12-14 | script, src: regenerate the Thai model. | Jehan | 2 | -119/+131
2022-12-14 | src, script: fix the order of characters for Vietnamese. | Jehan | 1 | -110/+104
2022-12-14 | src, script: add concept of alphabet_mapping in language models. | Jehan | 3 | -136/+87
2022-12-14 | script: regenerate Slovak and Slovene with better alphabet support. | Jehan | 4 | -275/+300
2022-12-14 | script: fix a stupid bug making same ratio for all frequent characters. | Jehan | 1 | -1/+1
2022-12-14 | script, src: regenerate the Vietnamese model. | Jehan | 2 | -70/+117
2022-12-14 | script, src: remove generated statistics data for Korean. | Jehan | 1 | -0/+2
2022-12-14 | src: add Hindi/UTF-8 support. | Jehan | 2 | -0/+265
2022-12-14 | script: fix a bit BuildLangModel.py when use_ascii is True. | Jehan | 1 | -3/+8
2022-12-14 | script, src: add generic Korean model. | Jehan | 3 | -40/+907
2022-12-14 | script, src: generate the Hebrew models. | Jehan | 4 | -0/+397
2022-12-14 | src, script: regenerate all existing language models. | Jehan | 20 | -2620/+2788
2022-12-14 | Rebuild a bunch of language models. | Jehan | 8 | -847/+947
2022-12-14 | script: update BuildLangModel.py to updated SequenceModel struct. | Jehan | 1 | -1/+2
2022-11-30 | script, src, test: add IBM865 support for Danish. | Jehan | 3 | -145/+243
2022-11-30 | script: fix small issues with commits e41e8a4 and 8d15d6b. | Jehan | 1 | -6/+10
2022-11-30 | Add norwegian support | Martin T. H. Sandsmark | 2 | -0/+126
2022-11-30 | improve model building script a bit | Martin T. H. Sandsmark | 1 | -1/+14
2022-11-30 | make the logfile usable | Martin T. H. Sandsmark | 1 | -1/+3
2020-04-22 | Issue #8: have BuildLangModel.py add ending newline to generated source. | Jehan | 1 | -0/+1
2016-09-28 | LangModels: add Swedish support. | Jehan | 2 | -0/+207