[MLton] Unicode / WideChar

skaller skaller@users.sourceforge.net
Mon, 21 Nov 2005 23:06:46 +1100


On Mon, 2005-11-21 at 22:49 +1100, skaller wrote:
> On Mon, 2005-11-21 at 09:19 +0100, Florian Weimer wrote:
> 
> > UTF-16 is the
> > replacement, and sorting that representation lexicographically
> > (potentially after byte-swapping) does not result in the codepoint
> > order!
> 
> Here is the algorithm (from ISO/IEC JTC1/SC2/WG2 N 1035):

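Actually you're right: consider comparing a code above Dxxx (that is,
above the surrogate range, E000..FFFF) with one at or above 10000.
The two-word (surrogate pair) code will then compare smaller instead
of bigger, so UTF-16 code unit order differs from codepoint order.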
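
To make the case concrete, here is a small sketch in Python (not the
N 1035 algorithm, just an illustration). The helper name utf16_units
and the two sample codes E000 and 10000 are my own choices for the
example; it relies on Python comparing strings by codepoint and lists
of ints element by element.

```python
def utf16_units(s):
    """Return the UTF-16 code units of s as a list of ints (hypothetical helper)."""
    data = s.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]

a = "\uE000"       # one code unit: E000 (above the surrogate range)
b = "\U00010000"   # two code units: the surrogate pair D800 DC00

# Python compares str values by codepoint, so E000 sorts before 10000.
codepoint_less = a < b

# Comparing the UTF-16 code units lexicographically gives the opposite
# answer, because the leading surrogate D800 is smaller than E000.
utf16_less = utf16_units(a) < utf16_units(b)

print(codepoint_less, utf16_less)
```

The two comparisons disagree for exactly this pair of ranges: any code
in E000..FFFF against any code at or above 10000.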

-- 
John Skaller <skaller at users dot sf dot net>
Felix, successor to C++: http://felix.sf.net