Age | Commit message | Author | Files | Lines |
|
And instead pass it as an argument to an implementation function.
Otherwise this is thread-unsafe for Calc's threaded calculation,
and transliteration is used in various places in Calc code.
Change-Id: Ibdf95e4b6867ec251618f6ff91e605acb69667c0
Reviewed-on: https://gerrit.libreoffice.org/56290
Tested-by: Jenkins
Reviewed-by: Luboš Luňák <l.lunak@collabora.com>
(cherry picked from commit e9430571c41367625110cdcde168da178b8b30a9)
Reviewed-on: https://gerrit.libreoffice.org/56450
|
|
Change-Id: I50b842afc505ce603225fb2d25281cc8e9240200
Reviewed-on: https://gerrit.libreoffice.org/56537
Reviewed-by: Eike Rathke <erack@redhat.com>
Tested-by: Eike Rathke <erack@redhat.com>
(cherry picked from commit d9388ab070ff99055650bf54b0e683a4be9bcdc9)
Reviewed-on: https://gerrit.libreoffice.org/56540
Tested-by: Jenkins
|
|
and spelling out using the new NatNum12 modifier
to support common "in", "on", "from", "to", etc. date
suffix variants, "dash-free" spell out format of years
of the new Hungarian orthographical reform, and months
with articles.
For example, "in 2018" is "2018-ban", "in 2019" is "2019-ben"
in Hungarian because of "vowel harmony", the word stem
dependent fluctuation of the suffix variants, a frequent
linguistic feature of other agglutinative languages, too,
including Estonian, Finnish and Turkish.
Note: some of the new date formats will work correctly only
with the upcoming update of the external libnumbertext.
Note 2: also add alternative (real) abbreviated month names,
because the default abbreviated month names are Roman
numerals.
Change-Id: Ibb33ff6a627b8e27fd02388653e3b33ebd446a10
Reviewed-on: https://gerrit.libreoffice.org/55637
Tested-by: Jenkins
Reviewed-by: László Németh <nemeth@numbertext.org>
Reviewed-on: https://gerrit.libreoffice.org/56530
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
to support variants of preposition, suffixation,
article or their combinations. For example, Catalan
"de març"/"d'abril", English "1st of May"/"First of
May", Hungarian "május 1-je/május 2-a" (May 1/2) or
Turkish "2018'de/2019'da" (in 2018/2019) usage can
be automatic in templates and mail merge.
When the date format contains more than one date keyword,
the NatNum12 argument must specify which date element
gets the special formatting (described by
libnumbertext formatting codes, plus an optional
initial capitalize/upper/title argument):
'[NatNum12 ordinal-number]D' -> "1st"
'[NatNum12 D=ordinal-number]D" of "MMMM' -> "1st of April"
'[NatNum12 D=ordinal]D" of "MMMM' -> "first of April"
'[NatNum12 YYYY=title year,D=capitalize ordinal]
D" of "MMMM", "YYYY' -> "First of April, Nineteen Ninety"
Note: this is implemented only for YYYY, MMMM, M, DDDD, D and
NNN/AAAA in date formats. It can be extended to other
keywords and date + time combinations, as language
requirements demand.
Note 2: l10n date formats can use the new NatNum12 date
formats from formatindex=60, see FormatElement in
i18npool/source/localedata/ XML files and FormatElement
specification:
https://opengrok.libreoffice.org/xref/core/i18npool/source/localedata/data/locale.dtd#223
Change-Id: I598849f1492f4012e83cef9293773badbff16206
Reviewed-on: https://gerrit.libreoffice.org/55613
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
Reviewed-on: https://gerrit.libreoffice.org/55767
Tested-by: Jenkins
Reviewed-by: Eike Rathke <erack@redhat.com>
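
The per-keyword argument syntax described in the commit message above ("YYYY=title year,D=capitalize ordinal") can be illustrated with a minimal sketch. This is not LibreOffice's parser: the function name parseNatNum12Arg is hypothetical, and the sketch assumes that items are separated by commas and that an item without '=' applies to the whole format.

```cpp
#include <map>
#include <string>

// Hypothetical helper (not LibreOffice code): split a NatNum12 argument such as
// "YYYY=title year,D=capitalize ordinal" into keyword -> formatting-code pairs.
// Assumption: an item without '=' is stored under the empty key, meaning
// "applies to the whole format", as in '[NatNum12 ordinal-number]D'.
std::map<std::string, std::string> parseNatNum12Arg(const std::string& arg)
{
    std::map<std::string, std::string> result;
    std::string::size_type start = 0;
    while (start <= arg.size())
    {
        std::string::size_type comma = arg.find(',', start);
        if (comma == std::string::npos)
            comma = arg.size();
        const std::string item = arg.substr(start, comma - start);
        const std::string::size_type eq = item.find('=');
        if (eq == std::string::npos)
        {
            if (!item.empty())
                result[""] = item; // whole-format code
        }
        else
        {
            result[item.substr(0, eq)] = item.substr(eq + 1);
        }
        start = comma + 1;
    }
    return result;
}
```

With this sketch, the argument of the last example above would map "YYYY" to "title year" and "D" to "capitalize ordinal".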
|
|
Limit NatNum12 conversion to the selected parts of the date
format only. (This bug, double calls of getNumberText, was hidden
by the space prefix " " and by empty return values at the first calls,
which so far left the dates unchanged.)
New prefixes: "capitalize", "upper" and "title" to handle optional
capitalization. (In Calc, it was not possible to format the result of
NatNum formatting, but some languages often need capitalization
or title case to format numbers and currencies.)
Thanks to Eike Rathke for the code clean-up using enum WhichCasing.
Reviewed-on: https://gerrit.libreoffice.org/55681
Tested-by: Jenkins
Reviewed-by: Eike Rathke <erack@redhat.com>
Conflicts:
svl/qa/unit/svl.cxx
Change-Id: I5fceb784930e6bc6d376116f5a42ad49cd248a54
Reviewed-on: https://gerrit.libreoffice.org/56202
Tested-by: Jenkins
Reviewed-by: László Németh <nemeth@numbertext.org>
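
A minimal sketch of how the "capitalize", "upper" and "title" prefixes from the commit message above could be separated from the rest of the argument and applied. Only the enum name WhichCasing comes from the commit message; its members, the function names and the ASCII-only casing are assumptions made for illustration, not the actual svl implementation.

```cpp
#include <cctype>
#include <string>
#include <utility>

// Enum name taken from the commit message; the members are assumed.
enum class WhichCasing { None, Capitalize, Upper, Title };

// Hypothetical helper: strip a leading casing prefix from a NatNum12 argument
// and report which casing was requested.
std::pair<WhichCasing, std::string> splitCasingPrefix(const std::string& arg)
{
    static const std::pair<const char*, WhichCasing> prefixes[] = {
        { "capitalize", WhichCasing::Capitalize },
        { "upper", WhichCasing::Upper },
        { "title", WhichCasing::Title },
    };
    for (const auto& p : prefixes)
    {
        const std::string name(p.first);
        if (arg == name)
            return { p.second, std::string() };
        if (arg.compare(0, name.size() + 1, name + " ") == 0)
            return { p.second, arg.substr(name.size() + 1) };
    }
    return { WhichCasing::None, arg };
}

// ASCII-only casing for illustration; real text needs locale-aware,
// Unicode-correct casing.
std::string applyCasing(std::string text, WhichCasing casing)
{
    bool atWordStart = true; // true until a letter of the current word is seen
    bool seenLetter = false; // true once any letter has been processed
    for (char& c : text)
    {
        const unsigned char uc = static_cast<unsigned char>(c);
        const bool isLetter = std::isalpha(uc) != 0;
        const bool raise = casing == WhichCasing::Upper
            || (casing == WhichCasing::Title && atWordStart)
            || (casing == WhichCasing::Capitalize && !seenLetter);
        if (isLetter && raise)
            c = static_cast<char>(std::toupper(uc));
        seenLetter = seenLetter || isLetter;
        atWordStart = !isLetter;
    }
    return text;
}
```

In this sketch, "title year" splits into WhichCasing::Title and the remaining code "year", and applying Title to "nineteen ninety" yields "Nineteen Ninety".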
|
|
dates and money amounts, supporting all the
XNumberText/libnumbertext formatting codes, for example
"ordinal", "ordinal-number", "ordinal-feminine", etc., and
ISO 4217 currency codes, also their possible combinations.
NatNum12 formatting codes are stored in the newly introduced
transliteration-spellout attribute (for now in the loext: namespace).
creator-initials also added to token list
Change-Id: I20f93c9d16778f142067a56d53b336d0acbe2d92
Reviewed-on: https://gerrit.libreoffice.org/54673
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
(cherry picked from commit 32f1c56d8ff5e6f87dbcf086fd3743d6d36182bc)
Reviewed-on: https://gerrit.libreoffice.org/55625
Reviewed-by: Eike Rathke <erack@redhat.com>
Tested-by: László Németh <nemeth@numbertext.org>
|
|
This allows using all the libnumbertext library functions.
[NatNum12] gives cardinal number names (one, two, three, ...)
[NatNum12 ordinal] gives ordinal number names (first, second, third, ...)
[NatNum12 ordinal-number] gives ordinal indicators (1st, 2nd, 3rd, ...)
[NatNum12 money USD][$-409] gives formal English (US) money text
... etc (see numbertext.org for syntax).
Change-Id: I16dbb44d8d4bdb82a1b950de6d438c8311b554ff
Reviewed-on: https://gerrit.libreoffice.org/54366
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
(cherry picked from commit 90d233b0311a208967a3e264820e00395bf057fb)
Reviewed-on: https://gerrit.libreoffice.org/55624
Reviewed-by: Eike Rathke <erack@redhat.com>
Tested-by: László Németh <nemeth@numbertext.org>
|
|
Change-Id: I421234e5e74bcdf83d55ed8b0e7a320e37f6a231
Reviewed-on: https://gerrit.libreoffice.org/54375
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
(cherry picked from commit 7b5f5d77d56ee494647d9e7868546b3f2140896e)
Reviewed-on: https://gerrit.libreoffice.org/55332
|
|
Change-Id: Ie93920bccfe5444e0066f8df85b4a9d2ff060a2d
Reviewed-on: https://gerrit.libreoffice.org/54650
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
The split is pointless and misleading; there are no other subclasses of
BreakIterator_CTL.
Change-Id: I66e66834e6e064cea29f543434a35682ee7cd35d
Reviewed-on: https://gerrit.libreoffice.org/54638
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
BreakIterator_CTL in the non CharacterIteratorMode::SKIPCELL mode did
not handle UTF-16 surrogate pairs at all, causing backspace to delete
lone surrogates, which is really bad. The fix copies the corresponding code
from BreakIterator_Unicode.
Additionally, BreakIterator_th was not correctly skipping non-Thai text
and always treating one character as Thai.
Change-Id: Ia379327e042ff602fc19a485c4cbd1a3683f9230
Reviewed-on: https://gerrit.libreoffice.org/54631
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
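
The surrogate-pair problem described in the commit message above can be sketched as follows. This is an illustration on std::u16string, not the code copied from BreakIterator_Unicode; the function names are hypothetical. The point is that stepping back one position must skip a whole surrogate pair, so that a backspace never removes only half of it.

```cpp
#include <cstddef>
#include <string>

inline bool isHighSurrogate(char16_t c) { return c >= 0xD800 && c <= 0xDBFF; }
inline bool isLowSurrogate(char16_t c) { return c >= 0xDC00 && c <= 0xDFFF; }

// Hypothetical helper: index of the first UTF-16 code unit of the code point
// that ends just before pos. Requires pos <= text.size().
std::size_t previousCodePointStart(const std::u16string& text, std::size_t pos)
{
    if (pos == 0)
        return 0;
    std::size_t prev = pos - 1;
    // If we landed on the low half of a pair, also step over the high half.
    if (prev > 0 && isLowSurrogate(text[prev]) && isHighSurrogate(text[prev - 1]))
        --prev;
    return prev;
}
```

A caller deleting backwards from pos would erase the range [previousCodePointStart(text, pos), pos) instead of a single code unit.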
|
|
Updated Hangul/Hanja conversion dictionary on LibreOffice
ref1: Text version of the Korean table of hanja for personal names (韓国の人名用漢字表のテキスト版) https://srad.jp/~yasuoka/journal/589283/
ref2: Proposal to add kKoreanName and kKoreanNameVariant to the Unihan Database https://www.unicode.org/L2/L2017/17084-korean-name-var.pdf
ref3: Libhangul's hanja.txt(Hangul-Hanja conversion char&words dictionary) https://github.com/choehwanjin/libhangul/blob/master/data/hanja/hanja.txt
Change-Id: I10358689548fb53a6c78f8e8f06beaede13d0561
Reviewed-on: https://gerrit.libreoffice.org/54562
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Mark Hung <marklh9@gmail.com>
|
|
Change-Id: I963c4a8d0afa44d235cf8271b1515c67299cbe94
|
|
Change-Id: Ib8f73ceefb8278b6233d46d86a34a3869622239f
|
|
... based on libnumbertext integrated since commit
f1579d3d6c5f5f3a651825e035b93bee7a4f43c6.
[NatNum12] gives cardinal number names (one, two, three, ...)
[NatNum13] gives ordinal number names (first, second, third, ...)
[NatNum14] gives ordinal indicators (1st, 2nd, 3rd, ...)
Change-Id: Ie2afdeeb82da1b36e9755c02d7b2276c77be9c72
Reviewed-on: https://gerrit.libreoffice.org/54186
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
in page number, chapter and outline numbering
in ~30 languages by integrating the libnumbertext library.
- offapi: add linguistic2::NumberText
New NumberingType constants:
- ordinal indicators (1st, 2nd, 3rd...)
- cardinal number names (One, Two, Three...)
- ordinal number names (First, Second, Third...)
Note: these numberings are parts of OOXML, too.
Plain text files of Libnumbertext's language data
are installed in share/numbertext (similar to
share/fingerprint), allowing further customization.
Change-Id: I4034da0a40a8c926f14a3f591749a89a8d807d5a
Reviewed-on: https://gerrit.libreoffice.org/53313
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
|
|
Change-Id: Iecfff4104ef19f9bc6f83a403d99aecb2eda2514
Reviewed-on: https://gerrit.libreoffice.org/53607
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
Change-Id: I3d1b88dbd0ff73fddc08d52f50e0efb42daab89b
Reviewed-on: https://gerrit.libreoffice.org/52756
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
Use "https://cgit.freedesktop.org/libreoffice/core"
instead of "http://cgit.freedesktop.org/libreoffice/core"
Change-Id: Ic7248eeb2a9452da7236eeee08414a77714dd234
Signed-off-by: Gulsah Kose <gulsah.1004@gmail.com>
Reviewed-on: https://gerrit.libreoffice.org/52926
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Adolfo Jayme Barrientos <fitojb@ubuntu.com>
|
|
Change-Id: I74dd0142562cb8698f19b2715fa1d514f82bd749
Reviewed-on: https://gerrit.libreoffice.org/52262
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Adolfo Jayme Barrientos <fitojb@ubuntu.com>
|
|
* remove redirects
* use https links
* replace old attachment links with working ones
Change-Id: Ic9a154f46e142138f0adea7d7b8be3b6cfe8af18
Reviewed-on: https://gerrit.libreoffice.org/52224
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
and fix fallout
Change-Id: Id06bf31f2075111e426ba40c84c885ae70697bee
Reviewed-on: https://gerrit.libreoffice.org/52206
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Jochen Nitschke <j.nitschke+logerrit@ok.de>
|
|
Casing fixes: “CCS” sorted as “CSCS”, not “cscs”.
“Ccs” and “CCS” are capitalized versions of the simplified
double consonant “cs”, but “CCs” is an abbreviation of words
beginning with “C” and “Cs” (similar to “AkH.”, “MHSz.”) etc.
To avoid the comparison result “equal” we set a precedence
between the simplified and compound-like long forms, too.
For example, “ésszerű” (old orthography before 2015) and
“észszerű” (not “észszerű”, “ésszerű”), or “mennyelv” and
“menynyelv” (words with different meanings) sorted as
“észszerű” and “észSzerű”, also “menynyelv” and “menyNyelv”.
Change-Id: If31c97262bc74429b514ede43a0384de80fe8ac5
Reviewed-on: https://gerrit.libreoffice.org/52194
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Andras Timar <andras.timar@collabora.com>
|
|
Apply the following extra requirements of Hungarian orthography
for sorting words and sentences:
– expand simplified double consonants;
– ignore spaces and hyphens;
– prefer lower case homonyms.
Note: automatic sorting is better this way, but it is still not error-free.
An important advantage is that it is now *possible* to fix all errors
in a semi-automatic way, using soft hyphens. Inserting them at bad
or ambiguous character positions fixes all later automatic sorting.
Example with the consonant "gy": the simplified form of the long "gygy" is
the ambiguous "ggy" (it can also be "g" followed by "gy", as in "meggyőz"):
= Bad =     = Now =     = Good (corrected, "|" marks a soft hyphen) =
megbíz      megbíz      megbíz
meggyíz     megzavar    meg|győz
meggyőz     megye       megzavar
megzavar    meggyíz     megye
megye       meggyőz     meggyíz
megyünk     megyünk     megyünk
Change-Id: Ia84f264ad9ea4cdebe5f3ea22212a9594b4fe44d
Reviewed-on: https://gerrit.libreoffice.org/51973
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
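
The first two requirements in the commit message above (expanding simplified double consonants, ignoring spaces and hyphens) and the soft-hyphen correction can be sketched as a string normalization step. This is only an illustration: the function name is hypothetical, the input is assumed to be lower-case UTF-8, the trigraph "dzs" is not handled, and the actual change presumably works through collation rules rather than by rewriting strings. The result is not a complete sort key, since Hungarian alphabetical order still has to be applied afterwards.

```cpp
#include <cstddef>
#include <string>

// Hypothetical sketch: expand "ggy" -> "gygy" etc. and drop spaces, hyphens
// and soft hyphens. A soft hyphen between the consonants ("meg|gyoz" style)
// prevents the expansion because the letters are no longer adjacent.
std::string expandForSorting(const std::string& word)
{
    static const char* const digraphs[]
        = { "cs", "dz", "gy", "ly", "ny", "sz", "ty", "zs" };
    const std::string softHyphen = "\xC2\xAD"; // U+00AD encoded as UTF-8
    std::string out;
    std::size_t i = 0;
    while (i < word.size())
    {
        if (word.compare(i, softHyphen.size(), softHyphen) == 0)
        {
            i += softHyphen.size(); // ignored, but it has split the letters
            continue;
        }
        const char c = word[i];
        if (c == ' ' || c == '-')
        {
            ++i; // ignored for sorting
            continue;
        }
        if (i + 2 < word.size() && word[i + 1] == c)
        {
            const std::string pair = word.substr(i + 1, 2);
            bool isDigraph = false;
            for (const char* d : digraphs)
                if (pair == d)
                    isDigraph = true;
            if (isDigraph)
            {
                out += pair; // simplified "XXY" becomes "XYXY"
                out += pair;
                i += 3;
                continue;
            }
        }
        out += c;
        ++i;
    }
    return out;
}
```

In this sketch "meggyíz" expands to "megygyíz", while "meggyőz" written with a soft hyphen after "meg" keeps its separate "g" and "gy", which mirrors the manual correction shown in the "Good" column.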
|
|
Change-Id: I980464162b73ed9ee0a09acbca1b9050af8d1027
Reviewed-on: https://gerrit.libreoffice.org/51492
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Julien Nabet <serval2412@yahoo.fr>
|
|
to handle bad word breaking of ")-ban", ")-ben" after
reference fields. (Field content is not expanded for
spell checking, resulting red underlined "ban" and "ben"
in the correct form "a)-ban", "b)-ben" etc., see the
test file of the issue.)
Change-Id: Ic4b1fd2c99bdd2509d85dd6f2aa43e2a53becaa7
Reviewed-on: https://gerrit.libreoffice.org/51284
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
|
|
Fix all comments
Change-Id: Ib11662de5cac3b073114ab4425e8f45a0c04c8c8
Reviewed-on: https://gerrit.libreoffice.org/47058
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
Change-Id: Ie2e1004c1ccc03777a8da9cb1144e89eb28ff313
Reviewed-on: https://gerrit.libreoffice.org/50928
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
Now ”, (, ), ], ?, ! are MidLetter characters to handle
(1)-ben, "ember(ek)ről", "„Tisza”-ban" etc. forms as
words for better Hungarian spell checking.
Change-Id: I47882e73af08531d14d555ced5dd4c98403686ec
Reviewed-on: https://gerrit.libreoffice.org/50473
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: László Németh <nemeth@numbertext.org>
|
|
Even if that's not what should be used; see sources in
commit 4b938c558d004537cf73ca4319019ff64f052270
Date: Fri Feb 9 12:59:51 2018 +0100
Use angle quotation marks (Guillemets) for [*-CH], tdf#115493 related
Change-Id: I733bc767ff4484c251f8624f90a5295875bf17ea
|
|
Source:
https://en.wikipedia.org/wiki/Quotation_mark#French
https://de.wikipedia.org/wiki/Anf%C3%BChrungszeichen#Schweiz,_Liechtenstein,_Frankreich
Change-Id: I015e8d68aecdc7c7ca3620105b7a2685648438d5
|
|
Source:
https://en.wikipedia.org/wiki/Quotation_mark#German
https://de.wikipedia.org/wiki/Anf%C3%BChrungszeichen#Schweiz,_Liechtenstein,_Frankreich
Change-Id: I9aa13eb610b39d1443acb0d2993a3e6f3e68a12e
|
|
Change-Id: Iff39b9298bfad474c5c011b6355b8ebf5be06318
Reviewed-on: https://gerrit.libreoffice.org/49091
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
Change-Id: Iacb0c1fba331aa76ea341bc21690f09109b14a0a
Reviewed-on: https://gerrit.libreoffice.org/48990
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Mike Kaganski <mike.kaganski@collabora.com>
|
|
Change-Id: I0afc3a08a31216efdbddc389dc23f0b04bc92b37
Reviewed-on: https://gerrit.libreoffice.org/48619
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
Change-Id: If994f615b4a9b976086546a80357b75e93952528
Reviewed-on: https://gerrit.libreoffice.org/48615
Reviewed-by: Eike Rathke <erack@redhat.com>
Tested-by: Eike Rathke <erack@redhat.com>
|
|
Follow up of tdf#105226.
Change-Id: I31816a36e427338034a30492f2a7ac7f2d0fddd2
Reviewed-on: https://gerrit.libreoffice.org/48616
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
...which triggers the assertion `index >= 0 && static_cast<sal_uInt32>(index) <
static_cast<sal_uInt32>(getLength())' in rtl::OUString::operator[] at
include/rtl/ustring.hxx:669 (see below for a reproducer).
I am not sure whether the case of `startPos == rIndexEntry.getLength()` should
really go into the following for loop inspecting `tables`, with a somewhat
random `code` of U+0000 (or some other value even?), or should rather be handled
explicitly in some other way.
Reproducer in an empty Writer document: "Insert - Table of Contents and Index -
Bibliography Entry...": "Insert Bibliography Entry" dialog: "ARJ00", "Insert",
"AVV00", "Insert", "Close"; "Insert - Table of Contents and Index - Table of
Contents, Index or Bibliography...": "Table of Contents, Index or Bibliography"
dialog: "Type - Type and Title - Type: Bibliography", "Entries - Sort by -
Content", "Entries - Sort Keys - 1: Address", "OK": assert fires
Change-Id: I2c9fad2c37bfa7a3509c197e678311fb45cb991a
Reviewed-on: https://gerrit.libreoffice.org/48564
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
Change-Id: I95ecc3163ea6081e51ce78653cebf292bee7b440
Reviewed-on: https://gerrit.libreoffice.org/48247
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Eike Rathke <erack@redhat.com>
|
|
These two words were switched for some unknown reason. It looks like it was
done by mistake.
Change-Id: I723e91bb93eed054969a09170d8307017187586d
Reviewed-on: https://gerrit.libreoffice.org/48228
Reviewed-by: Adolfo Jayme Barrientos <fitojb@ubuntu.com>
Tested-by: Adolfo Jayme Barrientos <fitojb@ubuntu.com>
|
|
we were previously excluding them
Change-Id: I48a68799b0de60b4995fae541eb363e043d4dd11
Reviewed-on: https://gerrit.libreoffice.org/48167
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Noel Grandin <noel.grandin@collabora.co.uk>
|
|
...instead of having to add it to more and more individual .cxx files, as the
need arises.
Change-Id: If1e893c79abf1c1bb7fb7b89dcb9bd423a7f1b94
Reviewed-on: https://gerrit.libreoffice.org/48108
Reviewed-by: Stephan Bergmann <sbergman@redhat.com>
Tested-by: Stephan Bergmann <sbergman@redhat.com>
|
|
Change-Id: If75ec666c0872875ab4e84e70424e540d44bc360
Reviewed-on: https://gerrit.libreoffice.org/48104
Reviewed-by: Stephan Bergmann <sbergman@redhat.com>
Tested-by: Michael Stahl <mstahl@redhat.com>
|
|
Change-Id: Id1c7ddf6c49ec709e38947a82731fe31a64aad04
Reviewed-on: https://gerrit.libreoffice.org/48076
Reviewed-by: Julien Nabet <serval2412@yahoo.fr>
Tested-by: Julien Nabet <serval2412@yahoo.fr>
|
|
Change-Id: I5fff9ccb1bc74b85ee378439ffa2272583ffcb0f
Reviewed-on: https://gerrit.libreoffice.org/48024
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Michael Stahl <mstahl@redhat.com>
|
|
Change-Id: Idf609a4012499e76acc5e7ae76565937f2a58fbd
|
|
If jday were guaranteed to be non-negative, the first could also be changed to
jday = std::round(jday);
but that would give different results for negative jday. So play it safe by
assuming that jday might be negative, and that if it were, the original way of
rounding was actually intended.
Change-Id: I137ee659e738791a7d260ce86cba60cdbbc084a6
Reviewed-on: https://gerrit.libreoffice.org/48004
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Stephan Bergmann <sbergman@redhat.com>
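
The concern about negative jday in the commit message above can be illustrated with a small sketch. The commit message does not show the original rounding expression, so the "add 0.5 and round down" form used here is an assumption, and the function names are hypothetical. std::round rounds halfway cases away from zero, so the two approaches agree for non-negative halfway values but differ for negative ones.

```cpp
#include <cmath>

// Assumed form of the "original way of rounding": add 0.5, then round down.
inline double roundByAddingHalf(double x) { return std::floor(x + 0.5); }

// std::round rounds halfway cases away from zero.
inline double roundWithStdRound(double x) { return std::round(x); }
```

For 2.5 both functions return 3.0, but for -2.5 the first returns -2.0 and the second returns -3.0, which is why keeping the original expression is the safe choice when the sign of jday is not guaranteed.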
|
|
...instead of converting from sal_Int32 to double and back to sal_Int32
Change-Id: I005777ff031dbbd919d46c971a84dff17f0adf9b
Reviewed-on: https://gerrit.libreoffice.org/47997
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Stephan Bergmann <sbergman@redhat.com>
|
|
Change-Id: I078626eaa3155cc429320cc8c1532003b382b22a
Reviewed-on: https://gerrit.libreoffice.org/47855
Tested-by: Jenkins <ci@libreoffice.org>
Reviewed-by: Julien Nabet <serval2412@yahoo.fr>
|
|
Change-Id: Iafdc3593b7136f24e741dc63e3c46344636154eb
|