Jeff Abrahamson on 17 Sep 2004 10:20:03 -0000


[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: [PLUG] charset encodings


On Thu, Sep 16, 2004 at 11:25:57AM -0400, Tobias DiPasquale wrote:
>   [31 lines, 169 words, 1312 characters]  Top characters: _teailon
> 
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> On Sep 16, 2004, at 11:17 AM, Jeff Abrahamson wrote:
> > Anyone know a way around this sort of thing?  I'm willing to believe
> > Outlook is not respecting standards, but it must surely be possible to
> > modify the body on display to make it readable.
> 
> Outlook is actually in the right on this one. UTF-8 is the character
> set for French. What you specified was a locale _and_ character set
> (fr_FR is a locale). The real problem here is that no matter what
> you specify, your terminal emulator is not able to display the
> high-bit characters correctly. You'll have to find a fully-i18n'ized
> terminal emulator (or turn that on in your current one). Anyone know
> any high-bit capable terminal emulators?

Not completely correct there, though.  When my locale is set to en_US,
I can read iso-8859-1 (Latin-1) just fine, which gives me all the
accented characters in French and German, at any rate.

Now, I would like to have other languages come through correctly, but
since I don't in general understand other languages, it's not
critical. ;-)

Now it may be that the email in question was correctly encoded as
UTF-8, but that doesn't mean I shouldn't be able to read it.  My
terminal correctly displays c-cedilla (both lower and upper case),
grave and accute accents on e's, umlaut's on a's, o's, and u's,
circumflexes on all vowels, etc.

Part of the problem is the content encoding and the envelope encoding
are different in this case.  The envelope is always (I think) encoded
in iso-8859-1.  So the person's name is Françoise (c-cedilla) in
iso-8859-1.  If I set my *terminal* to UTF-8, that name gets garbled.

So I really think that what I want is a program that translates the
*contents* of my email correctly for the locale in which my terminal
emulator is set.

Said another way, if I had a terminal emulator that was correctly
interpreting UTF-8 and I then received an iso-8859-1 email (the
majority for French email that I receive), I would need that to be
translated properly.

-- 
 Jeff

 Jeff Abrahamson  <http://www.purple.com/jeff/>    +1 215/837-2287
 GPG fingerprint: 1A1A BA95 D082 A558 A276  63C6 16BF 8C4C 0D1D AE4B

 A cool book of games, highly worth checking out:
 http://www.amazon.com/exec/obidos/ASIN/1931686963/purple-20

Attachment: signature.asc
Description: Digital signature