[arch-general] Including compiled C.UTF-8 locale by default in glibc package? Inbox
Daan De Meyer
daan.j.demeyer at gmail.com
Mon Feb 14 23:08:04 UTC 2022
Hi,
Now that glibc 2.35 is available, could we enable and ship the
compiled form of the new C.UTF-8 locale in glibc by default in Arch
Linux?
>From the glibc 2.35 release notes
(https://sourceware.org/pipermail/libc-alpha/2022-February/136040.html):
> * Support for the C.UTF-8 locale has been added to glibc. The locale
> supports full code-point sorting for all valid Unicode code points. A
> limitation in the framework for fnmatch, regexec, and regcomp requires
> a compromise to save space and only ASCII-based range expressions are
> supported for now (see bug 28255). The full size of the locale is
> only ~400KiB, with 346KiB coming from LC_CTYPE information for
> Unicode. This locale harmonizes downstream C.UTF-8 already shipping
> in various downstream distributions. The locale is not built into
> glibc, and must be installed.
Being able to rely on the existence of a UTF-8 english locale
simplifies many use cases. A good example of issues introduced due to
a lack of a built-in UTF-8 locale is
https://github.com/systemd/systemd/pull/8340 which is a workaround
added in 2018 that still exists today. Having the C.UTF-8 locale
available by default in Arch would enable removing such workarounds.
Any thoughts?
Cheers,
Daan De Meyer
More information about the arch-general
mailing list