Leif Ekblad
2012-03-07 20:39:20 UTC
I've redesigned my font package, and will use UTF-8 throughout my OS and
applications. When I added RDOS support for libc, I made the choice not to
support wide-char character strings because I would eventually support only
UTF-8. Today, many applications, including web pages, use UTF-8, so I think
this is a reasonable choice.
However, there seem to be several issues with UTF-8 in OW:
1. The compiler cannot handle source files that MS tools have tagged as UTF-8
(the BOM sequence 0xEF, 0xBB, 0xBF gives a compilation error)
2. The Watcom editor (as well as IDE and debugger) does not handle UTF-8. It
doesn't display source files correctly, and it is not possible to insert
UTF-8 coded strings into the code with the editor
3. Because the UTF-8 BOM sequence cannot be used, source files cannot be
edited directly with Notepad either, as Notepad thinks the file is plain text
rather than UTF-8.
4. I'm not sure about resource files yet. The resource records are coded in
UTF-16, but I don't know if the resource compiler correctly translates UTF-8
coded strings to UTF-16 resources.
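For item 1, the check a tool would need before it starts reading a source file is small. The following is only a sketch of the idea, not code from the OW sources; the function name utf8_bom_length is made up for illustration.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Open Watcom.
   Returns how many leading bytes to skip: 3 if the buffer starts
   with the UTF-8 BOM (0xEF, 0xBB, 0xBF), otherwise 0. */
size_t utf8_bom_length(const unsigned char *buf, size_t len)
{
    static const unsigned char bom[3] = { 0xEF, 0xBB, 0xBF };

    if (len >= sizeof bom && memcmp(buf, bom, sizeof bom) == 0)
        return sizeof bom;
    return 0;
}
```

A reader of source files could call this on the first bytes of the file and start scanning at the returned offset, so files with and without the BOM would be treated the same.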
Has anybody looked into this before? Is it reasonable to add UTF-8 support
to the Win32 tools?
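For the resource case in item 4, the conversion itself is not large. Below is a rough sketch of a UTF-8 to UTF-16 converter, assuming 16-bit unsigned short code units; the name utf8_to_utf16 and its error convention are my own assumptions, and I don't know what the resource compiler actually does today.

```c
#include <stddef.h>

/* Hypothetical helper, not part of Open Watcom.
   Converts the UTF-8 bytes src[0..n) to UTF-16 code units in dst.
   Returns the number of units written, or (size_t)-1 if the input is
   malformed or dst (capacity cap) is too small. */
size_t utf8_to_utf16(const unsigned char *src, size_t n,
                     unsigned short *dst, size_t cap)
{
    size_t i = 0, out = 0;

    while (i < n) {
        unsigned long cp;
        size_t extra, k;
        unsigned char b = src[i++];

        /* The lead byte gives the sequence length and the top bits. */
        if (b < 0x80)                { cp = b;        extra = 0; }
        else if ((b & 0xE0) == 0xC0) { cp = b & 0x1F; extra = 1; }
        else if ((b & 0xF0) == 0xE0) { cp = b & 0x0F; extra = 2; }
        else if ((b & 0xF8) == 0xF0) { cp = b & 0x07; extra = 3; }
        else return (size_t)-1;

        if (n - i < extra)
            return (size_t)-1;              /* truncated sequence */
        for (k = 0; k < extra; k++) {
            unsigned char c = src[i++];
            if ((c & 0xC0) != 0x80)
                return (size_t)-1;          /* not a continuation byte */
            cp = (cp << 6) | (c & 0x3F);
        }

        /* Reject overlong forms, surrogates and values past U+10FFFF. */
        if ((extra == 1 && cp < 0x80) || (extra == 2 && cp < 0x800) ||
            (extra == 3 && cp < 0x10000) || cp > 0x10FFFF ||
            (cp >= 0xD800 && cp <= 0xDFFF))
            return (size_t)-1;

        if (cp < 0x10000) {
            if (out + 1 > cap)
                return (size_t)-1;
            dst[out++] = (unsigned short)cp;
        } else {
            /* Code points above the BMP become a surrogate pair. */
            if (out + 2 > cap)
                return (size_t)-1;
            cp -= 0x10000;
            dst[out++] = (unsigned short)(0xD800 | (cp >> 10));
            dst[out++] = (unsigned short)(0xDC00 | (cp & 0x3FF));
        }
    }
    return out;
}
```

On Win32 the same job could presumably be handed to MultiByteToWideChar with CP_UTF8, but a portable routine like this would also work in tools built for other hosts.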
Leif Ekblad