William H. Magill on 17 Jun 2004 15:26:02 -0000



Re: [PLUG] UTF-8 question



On 17 Jun, 2004, at 10:03, Jeff Abrahamson wrote:

I tried to view this web page and found it to look like garbage to me:

    http://phi.sinica.edu.tw/aspac/reports/94/94002/plot-2.html

The page appears to be UTF-8 encoded.

So I'm curious if this garbage display means that I have configured my
machine badly for UTF-8 or if the UTF-8 in question is specifying a
script system (glyph set?) that I do not have installed.

I try to install as many fonts as I can to avoid such things.  If
something shows up in Cyrillic, I still can't read it, but at least I
know why.  The above is a bit frustrating because I don't know why.

Thanks for any suggestions.

I believe the page is simply garbage(d) ...

If you look at the HTML source, no character encoding is specified.

They have simply embedded non-ASCII characters among the ASCII characters, either intentionally or inadvertently.
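
As a rough illustration of what "no encoding specified" means in practice, here is a minimal Python sketch that looks for a charset label in the two usual places: the HTTP Content-Type header and a meta tag near the top of the document. The function name declared_charset and the 4096-byte scan window are my own assumptions for the example, not anything the validator or a browser is known to use, and a real parser handles many more cases.

```python
import re

# Illustrative pattern only: matches both <meta charset="..."> and
# <meta http-equiv="Content-Type" content="text/html; charset=...">.
META_CHARSET = re.compile(
    rb"<meta[^>]+charset\s*=\s*[\"']?\s*([A-Za-z0-9._:-]+)", re.IGNORECASE
)
HEADER_CHARSET = re.compile(
    r"charset\s*=\s*[\"']?([A-Za-z0-9._:-]+)", re.IGNORECASE
)


def declared_charset(html_bytes, content_type_header=None):
    """Return the declared encoding label in lowercase, or None if absent.

    Hypothetical helper for this example; the name and the 4096-byte
    scan window are assumptions.
    """
    # A charset parameter on the HTTP Content-Type header is checked first.
    if content_type_header:
        match = HEADER_CHARSET.search(content_type_header)
        if match:
            return match.group(1).lower()
    # Otherwise look for a meta declaration near the start of the document.
    match = META_CHARSET.search(html_bytes[:4096])
    if match:
        return match.group(1).decode("ascii").lower()
    return None
```

A page like the one in question, with neither label present, would come back as None, which is the point where a browser or validator has to start guessing.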

The W3C validator says:

"I was not able to extract a character encoding labeling from any of the valid sources for such information. Without encoding information it is impossible to reliably validate the document. I'm falling back to the "UTF-8" encoding and will attempt to perform the validation, but this is likely to fail for all non-trivial documents."

and

"Sorry, I am unable to validate this document because on lines 15-16, 54-55, 63, 69-70, 91-92, 103-104, 108-109, 111, 114, 116-117, 119, 121, 123-128, 131, 133-135, 137, 142-143, 146, 149-150, 152-156, 172-174, 176-177, 180-181, 183-185, 190-191, 193-203, 209, 212, 215-221, 224 it contained one or more bytes that I cannot interpret as utf-8 (in other words, the bytes found are not valid values in the specified Character Encoding). Please check both the content of the file and the character encoding indication."
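
You can reproduce roughly the same kind of check locally. The following is only a sketch of the idea, not the validator's actual code: it tries to decode each line of a saved copy of the page as UTF-8 and reports the line numbers that fail. The function name invalid_utf8_lines and the sample bytes are made up for the example.

```python
def invalid_utf8_lines(data):
    """Return the 1-based numbers of lines whose bytes are not valid UTF-8.

    Hypothetical helper for this example. Splitting on the newline byte is
    safe here because 0x0A never occurs inside a UTF-8 multi-byte sequence.
    """
    bad = []
    for number, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            bad.append(number)
    return bad


# Made-up sample: line 2 holds bytes that are not valid UTF-8,
# line 3 holds a correctly encoded three-byte UTF-8 character.
sample = b"plain ascii\n\xa4\xa4\xa4\xe5\nok \xe4\xb8\xad\n"
print(invalid_utf8_lines(sample))
```

To try it on a real page, save the page to disk and pass in the result of open(path, "rb").read(); the file has to be read in binary mode so that nothing is decoded before the check runs.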

(PS: it also does not display correctly in Safari or OmniWeb, both of which use the OS X decoding framework, or in IE 5.2.)

You could write to the page author or webmaster, point them to the W3C validator (www.w3.org, select the HTML validator), and suggest they fix the HTML.

T.T.F.N.
William H. Magill
# Beige G3 - Rev A motherboard - 768 Meg
# Flat-panel iMac (2.1) 800MHz - Super Drive - 768 Meg
# PWS433a [Alpha 21164 Rev 7.2 (EV56)- 64 Meg]- Tru64 5.1a
# XP1000  [Alpha EV6]
magill@mcgillsociety.org
magill@acm.org
magill@mac.com

___________________________________________________________________________
Philadelphia Linux Users Group         --        http://www.phillylinux.org
Announcements - http://lists.phillylinux.org/mailman/listinfo/plug-announce
General Discussion  --   http://lists.phillylinux.org/mailman/listinfo/plug