This is the mail archive of the
cygwin
mailing list for the Cygwin project.
Re: Encoding of German 'umlauts' - please explain
- From: Matthias Andree <matthias dot andree at gmx dot de>
- To: cygwin at cygwin dot com
- Date: Thu, 24 Sep 2009 12:26:23 +0200
- Subject: Re: Encoding of German 'umlauts' - please explain
- References: <loom.20090924T100848-137@post.gmane.org>
Ronald Fischer schrieb:
> Maybe someone could enlighten me about the following:
>
> On Cygwin bash I see
>
> $ echo à | od -cx
> 0000000 374 \n
> 0afc
> 0000002
>
> That means, the German letter à has encoding 0xFC. If I do the same on CMD shell
> (the 'od' used here comes from the Gnu Utilities for Windows), I see:
>
> echo à | od -cx
> 0000000 201 \r \n
> 2081 0a0d
> 0000004
>
> That is, Ã is encoded as 0x81. Why is this different?
Because the code pages differ. 0xFC is ISO-8859-1 ("Latin 1") or -15 ("Latin 9")
or CP1252/Windows-1252 (Latin 1 Extended; the latter allocates 0x80...0x9f
differently than ISO-8859-1) and CMD uses CP437 or CP850.
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple