This is the mail archive of the cygwin mailing list for the Cygwin project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Encoding of German 'umlauts' - please explain


Ronald Fischer schrieb:
> Maybe someone could enlighten me about the following:
> 
> On Cygwin bash I see
> 
> $ echo à | od -cx
> 0000000 374  \n
>         0afc
> 0000002
> 
> That means, the German letter à has encoding 0xFC. If I do the same on CMD shell
> (the 'od' used here comes from the Gnu Utilities for Windows), I see:
> 
>   echo à | od -cx
> 0000000 201      \r  \n
>         2081 0a0d
> 0000004
> 
> That is, Ã is encoded as 0x81. Why is this different?

Because the code pages differ. 0xFC is ISO-8859-1 ("Latin 1") or -15 ("Latin 9")
or CP1252/Windows-1252 (Latin 1 Extended; the latter allocates 0x80...0x9f
differently than ISO-8859-1) and CMD uses CP437 or CP850.


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]