[MLton] Unicode... again

Matthew Fluet fluet at tti-c.org
Fri Feb 9 13:14:41 PST 2007


Gergely Buday wrote:
> On 08/02/07, Wesley W. Terpstra <terpstra at gkec.tu-darmstadt.de> wrote:
>> Once again I find myself needing Unicode in MLton.
> [...]
>> Agreed? Can I just whip this up and check it in? ;-)
> 
> I'm getting into using ml-ulex, which is a unicode-able lexer for
> sml/nj. As far as I understood, it uses   4-byte chars:
> ml-lpt/ml-ulex/BackEnds/SML/template-ml-ulex.sml contains
> 
>      structure W = Word32
>      type wchar = W.word
> 
> It would be nice to be able to use ml-ulex with your proposed unicode
> library. For the first step I'll try to compile ml-ulex with mlton.

The lastest version of SML/NJ (ver 110.62) includes
   signature UTF8
   structure UTF8 : UTF8
in the SML/NJ Library (added to support ml-ulex).  The signature there 
does specify
   type wchar = Word32.word

I got an older version of ml-ulex to work with MLton, and it wasn't too 
hard.




More information about the MLton mailing list