Kaydet (Commit) 34ae19b1 authored tarafından László Németh's avatar László Németh Kaydeden (comit) Andras Timar

tdf#116666 Hungarian collation: casing and equality fixes

Casing fixes: “CCS” sorted as “CSCS”, not “cscs”.
“Ccs” and “CCS” are capitalized versions of the simplified
double consonant “cs”, but “CCs” is an abbreviation of words
beginning with “C” and “Cs” (similar to “AkH.”, “MHSz.”) etc.

To avoid the comparison result “equal” we set a precedence
between the simplified and compound-like long forms, too.
For example, “ésszerű” (old orthography before 2015) and
“észszerű” (not “észszerű”, “ésszerű”), or “mennyelv” and
“menynyelv” (words with different meanings) sorted as
“észszerű” and “észSzerű”, also “menynyelv” and “menyNyelv”.

Change-Id: If31c97262bc74429b514ede43a0384de80fe8ac5
Reviewed-on: https://gerrit.libreoffice.org/52194Tested-by: 's avatarJenkins <ci@libreoffice.org>
Reviewed-by: 's avatarAndras Timar <andras.timar@collabora.com>
üst 15956041
......@@ -35,12 +35,50 @@
& u < ü <<< Ü <<< ű <<< Ű
& z < zs <<< Zs <<< ZS
& cs <<< ccs / cs <<< Ccs / cs <<< CCs / cs <<< CCS / cs
& dz <<< ddz / dz <<< Ddz / dz <<< DDz / dz <<< DDZ / dz
& dzs <<< ddzs / dzs <<< Ddzs / dzs <<< DDzs / dzs <<< DDZs / dzs <<< DDZS / dzs
& gy <<< ggy / gy <<< Ggy / gy <<< GGy / gy <<< GGY / gy
& ly <<< lly / ly <<< Lly / ly <<< LLy / ly <<< LLY / ly
& ny <<< nny / ny <<< Nny / ny <<< NNy / ny <<< NNY / ny
& sz <<< ssz / sz <<< Ssz / sz <<< SSz / sz <<< SSZ / sz
& ty <<< tty / ty <<< Tty / ty <<< TTy / ty <<< TTY / ty
& zs <<< zzs / zs <<< Zzs / zs <<< ZZs / zs <<< ZZS / zs
# We expand simplified double consonants, for example,
# "ccs" sorted as "cscs". This is still not an error-free
# method, but better, than the old method, because now
# it's *possible* to fix all errors in a semi-automatic way,
# using soft hyphens. Inserting them in bad or ambiguous
# character positions will fix all automatic sortings later,
# for example, “arccsont” -> “arc|csont” (“|” signs soft
# hyphen position).
# Note: Ccs and CCS are capitalized versions of the simplified
# double consonant “cs”, but CCs is an abbreviation of words
# beginning with C and Cs (similar to “AkH.”, “MHSz.”).
# To avoid the comparison result “equal” we set a precedence
# between the simplified and compound-like long forms, too.
# For example, “ésszerű” (old orthography before 2015) and
# “észszerű” (not “észszerű”, “ésszerű”), or “mennyelv” and
# “menynyelv” (words with different meanings), sorted as
# “észszerű” and “észSzerű”, also “menynyelv” and “menyNyelv”.
& cs <<< ccs / cs <<< cscs / Cs
& Cs <<< Ccs / cs <<< CsCs / Cs
& CS <<< CCS / Cs <<< CSCS / CS
& dz <<< ddz / dz <<< dzdz / Dz
& Dz <<< Ddz / dz <<< DzDz / Dz
& DZ <<< DDZ / Dz <<< DZDZ / DZ
& dzs <<< ddzs / dzs <<< dzsdzs / Dzs
& Dzs <<< Ddzs / dzs <<< DzsDzs / Dzs
& DZS <<< DDZS / DZs <<< DZSDZS / DZS
& gy <<< ggy / gy <<< gygy / Gy
& Gy <<< Ggy / gy <<< GyGy / Gy
& GY <<< GGY / Gy <<< GYGY / GY
& ly <<< lly / ly <<< lyly / Ly
& Ly <<< Lly / ly <<< LyLy / Ly
& LY <<< LLY / Ly <<< LYLY / LY
& ny <<< nny / ny <<< nyny / Ny
& Ny <<< Nny / ny <<< NyNy / Ny
& NY <<< NNY / Ny <<< NYNY / NY
& sz <<< ssz / sz <<< szsz / Sz
& Sz <<< Ssz / sz <<< SzSz / Sz
& SZ <<< SSZ / Sz <<< SZSZ / SZ
& ty <<< tty / ty <<< tyty / Ty
& Ty <<< Tty / ty <<< TyTy / Ty
& TY <<< TTY / Ty <<< TYTY / TY
& zs <<< zzs / zs <<< zszs / Zs
& Zs <<< Zzs / zs <<< ZsZs / Zs
& ZS <<< ZZS / Zs <<< ZSZS / ZS
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment