500 Server errors: fun...

carpeliam · Mar 17, 2000

I've been given a job nobody would ever want. I'm sure you all know that you can take an Microsoft Excel document and get it to spit out a webpage. Anybody who hates WYSIWYG editors knows just how many nasty table tags they spit out. My boss at a webdesign firm decided to make a catalog using Excel, and then export to HTML. 
 
Now I have an HTML document weighing in at 1.5 MB. 
 
That's crazy. He handed it to me and said, "it used to load so quickly- I don't know what happened to it. Could you take a look?" Hehehe... one look at the code and I knew what happened to it. TD tags up the yin yang and all sorts of code the W3C has never seen. So now I have to get rid of them all... At first, I tried cutting and pasting the necessary stuff. Basically, I started over from scratch. Well, the catalog is huge- if I trim all the fat I can, it's going to be 300K. That's a lot of typing to do by hand. 
 
So I thought I'd try writing a Perl CGI program that would extract things out of the table and put them into a new, nicer, cleaner table (with 4 columns instead of 40). Now the old server I used to program on ran Perl 4, but now that I'm on a new server, I'm trying to take advantage of Perl 5 using some of the new parsing code. I went to a tutorial website, copied things word for word, but it won't work. Help

) can you guys tell me what's wrong here? Thanks... here's my code. 
 
 
#!/usr/local/bin/perl 
use CGI; 
use LWP::Simple; 
use HTML::TokeParser; 
 
$cgiobject = new CGI; 
 
#retrieve web page 
$fetchURL=$cgiobject->param("name&quot

; 
unless ($fetchURL) 
{$fetchURL=""} 
$webPage=get($fetchURL); 
 
 
$p = HTML::TokeParser->new(\"<A HREF="

http://www.site.address.com/~directory/page-name.htm&quot"

TARGET="_new">

http://www.site.address.com/~directory/page-name.htm&quot</A>;);

print $cgiobject->header; 
$parser->get_tag("title&quot

; 
print "Content-type: text/html\n\n"; 
print "$parser->get_trimmed_text"; 
 
 
 
 
This is not working... 
 
another note- if I put print "Content-type: text/html\n\n"; towards the beginning of my program, everything after it tends to be ignored for some reason- I'm not sure why. The tutorial I've been working with often has something like that towards the top (usually, they have print $cgiobject->header;, with the same result). 
Liam Morley <a href=mailto:lmorley@wpi.edu>lmorley@wpi.edu</a> <a href=

http://www.wpi.edu/~lmorley/>::

imotic ::</a>

Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

500 Server errors: fun...

carpeliam

Programmer

Similar threads

Part and Inventory Search

Sponsor