Note: This is a beta release of Red Hat Bugzilla 5.0. The data contained within is a snapshot of the live data so any changes you make will not be reflected in the production Bugzilla. Also email is disabled so feel free to test any aspect of the site that you want. File any problems you find or give feedback here.
Bug 159051 - GET prints error message on
Summary: GET prints error message on
Alias: None
Product: Fedora
Classification: Fedora
Component: perl-libwww-perl
Version: 3
Hardware: i386
OS: Linux
Target Milestone: ---
Assignee: Jason Vas Dias
QA Contact:
Depends On:
TreeView+ depends on / blocked
Reported: 2005-05-28 08:04 UTC by Nigel Horne
Modified: 2007-11-30 22:11 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Last Closed: 2005-12-21 16:11:26 UTC

Attachments (Terms of Use)
file with non-7-bit-ascii characters stripped (deleted)
2005-12-21 16:12 UTC, Jason Vas Dias
no flags Details

Description Nigel Horne 2005-05-28 08:04:11 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.8) Gecko/20050513 Fedora/1.7.8-1.3.1

Description of problem:
GET says

Parsing of undecoded UTF-16 at /usr/lib/perl5/site_perl/5.8.3/LWP/ line 114.

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. GET

Actual Results:  This message appears

Parsing of undecoded UTF-16 at /usr/lib/perl5/site_perl/5.8.3/LWP/ line 114.

Expected Results:  The message shouldn't have appeared.

Additional info:

Comment 1 Jason Vas Dias 2005-12-21 16:11:26 UTC
We apologize for the delay in processing this bug report .

I've just downloaded page in question, with both: 
 $ wget
 $ lwp-request > /tmp/veday.htm

It appears the message from LWP/ is not in error - 
the veday.htm file is full of illegal binary characters - 
e.g. the first line:

 $ head -1 < /tmp/veday.htm | od -cx
0000000 377 376   <  \0   H  \0   T  \0   M  \0   L  \0   >  \0  \r  \0
        feff 003c 0048 0054 004d 004c 003e 000d
0000020  \n  \0

Perl does its best to figure out what kind of encoding is being used, but
there is no encoding in which all the 8-bit sequences in this file are legal.
I suggest converting the file to 7-bit ASCII:

perl -ne 'foreach $c ( split //, $_ )
{ if( ((ord($c) < 0x20) && !( $c =~ /[\n\r\t\v]/)) || (ord($c) > 0x7f) )
  { next; }; 
  print $c; 
};' < /tmp/veday.htm > /tmp/veday_ascii.htm

I've attached the /tmp/veday_ascii.htm file so you can see the differences.

Comment 2 Jason Vas Dias 2005-12-21 16:12:46 UTC
Created attachment 122493 [details]
file with non-7-bit-ascii characters stripped

Note You need to log in before you can comment on or make changes to this bug.