Hi all,
Unicode support is coming along and that's nice. So let's make it even nicer and add locale support too.
A problem with the Unicode table is that outside of the 26 characters of the rather impoverished English alphabet, the characters have just been dumped in likely-looking piles. And fair enough, really, because the additional characters are treated differently in different languages; sometimes they're proper letters in their own right, sometimes they're just diacritic variations.
But that should be taken into consideration when sorting things in EA's project browser, search results, etc. Right now, everything's sorted in Unicode order, and that's not the same as alphabetical order which is what people expect.
There are two issues here. The first is that letters with diacritical marks (accents etc) should be sorted with the unmarked letters. Right now, they all end up after Z because that's where they are in the Unicode table.
The other is that proper letters should be sorted into their correct alphabetical positions.
Scandinavian languages have more letters than English, appearing at the end of the alphabet. These are proper letters, not umlauts as in German. There are two main Scandinavian alphabets, Danish/Norwegian and Swedish, and just for fun, the characters are not identical in the two, nor are they sorted in the same order.
The Danish/Norwegian alphabet ends with Z Æ Ø Å, the Swedish with Z Å Ä Ö. EA will sort everything by Univode number, and the following table shows you how that clashes with expectations.
Unicode | Character | DK/NO | SE |
196/228 | Ä/ä | -- | 28 |
197/229 | Å/å | 29 | 27 |
198/230 | Æ/æ | 27 | -- |
214/246 | Ö/ö | -- | 29 |
216/248 | Ø/ø | 28 | -- |
With Finnish I'm not entirely sure, but I think you can treat it the same as Swedish. "Å" (197/229) isn't part of Finnish proper, but there is a Swedish-speaking minority in Finland and Swedish is an official language there.
So what it all boils down to is that the same Unicode character has different meanings in different locales (in some a proper letter with its own alphabetical position, and in others a mere diacritic variation), and the sorting should take that into consideration. The current implementation is simple enough, but not good enough.
I'm sure you'll be pleased to know that Unicode recently published the 32nd revision to its
Unicode Collation Algorithm. So if EA simply implements that, we should be OK.

/Uffe