2010/4/19 Norbert Hartl <norbert(a)hartl.name>me>:
I just recently discovered that actual browsers are
displaying utf-8 safe url encodet URL paths as the right characters. Meaning
http://en.wikipedia.org/wiki/Gew%C3%BCrztraminer
actually displays
http://en.wikipedia.org/wiki/Gewürztraminer
Nice, isn't it?
As far as I can see in pier the URL path is tightly
coupled to the name of a structure. And structure names are restricted to only a few
characters. The comments in the code reason this for safe usage in some object protocols.
What would be the way to go if I want to enable those types of URLs? What are the
problematic cases if a structure name could consist of non-7bit characters?
The following comes to my mind:
- If you post the second link with umlauts, if might cause problems if
the server is not in utf-8, ie5 or some non-modern browser user agent.
- If you use WAKomEncoded(39) (which you should) and go beyond Latin-1
(e.g. €) you enter the wonderful world of WideStrings.
- There might round trip problems with external systems (files, databases, ...)
I summary you might run into bugs, but then again someone has to in
order to get them fixed.
Cheers
Philippe