HTML whitespace/comments cruncher

Garry Heaton · Oct 19, 2003

Can anyone recommend a perl script for crunching HTML whitespace and
comments? I wish to make duplicates of HTML files for uploading.

Garry Heaton

ko · Oct 19, 2003

Garry said:
Can anyone recommend a perl script for crunching HTML whitespace and
comments? I wish to make duplicates of HTML files for uploading.

Garry Heaton

Use one of the HTML parsing modules. For example:

http://search.cpan.org/~gaas/HTML-Parser-3.33/

Download and unpack the distribution, and check out the example scripts
in the 'eg' directory.

HTH - keith

Eric J. Roode · Oct 19, 2003

Can anyone recommend a perl script for crunching HTML whitespace and
comments? I wish to make duplicates of HTML files for uploading.

Why not gzip the html files? Seems to me that'd be even better.
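
For anyone who wants to try the gzip route, here is a minimal sketch. It is written in Python rather than Perl purely for illustration, and the helper names `gzip_bytes` and `gzip_copy` are made up for this example. It only produces a compressed duplicate next to each file; whether your server can actually deliver the `.gz` copies to browsers is an assumption you would need to check separately.

```python
import gzip
import shutil
from pathlib import Path


def gzip_bytes(data: bytes) -> bytes:
    """Return a gzip-compressed copy of the given bytes (hypothetical helper)."""
    return gzip.compress(data, compresslevel=9)


def gzip_copy(source: Path) -> Path:
    """Write 'page.html.gz' next to 'page.html' and leave the original untouched."""
    target = source.with_name(source.name + ".gz")
    with source.open("rb") as src, gzip.open(target, "wb", compresslevel=9) as dst:
        shutil.copyfileobj(src, dst)  # stream the file so large pages are fine
    return target
```

Something like `gzip_copy(Path("index.html"))` would leave `index.html` alone and create `index.html.gz` beside it. Repetitive markup and runs of indentation compress well, which is why this can beat stripping whitespace by hand.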

A quick google search turned up a couple freeware and commercial HTML
strippers. And I seem to recall there's an apache module that does it, but
I'm not sure.

--
Eric
$_ = reverse sort $ /. r , qw p ekca lre uJ reh
ts p , map $ _. $ " , qw e p h tona e and print


Gregory Toomey · Oct 20, 2003

It was a dark and stormy night, and Garry Heaton managed to scribble:

Can anyone recommend a perl script for crunching HTML whitespace and
comments? I wish to make duplicates of HTML files for uploading.

Garry Heaton

Would you believe I saw some code on the net yesterday that did this, but now I can't find it.

The basic algorithm used regular expressions and was only a few lines long:
convert consecutive whitespace characters to a single whitespace character
remove whitespace from the beginning of lines
convert consecutive newlines to a single newline
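
The three steps above can be sketched roughly as follows. This is not the code that was seen on the net, just one reading of the description, written in Python for illustration (the same patterns should carry over to Perl's `s///` with little change). The function name `crunch` is made up, "whitespace" in the first two steps is taken to mean spaces and tabs only, and the input is assumed to use `\n` line endings.

```python
import re


def crunch(html: str) -> str:
    """Apply the three regex steps to a whole document held in one string."""
    # Step 1: collapse runs of spaces and tabs into a single space.
    html = re.sub(r"[ \t]+", " ", html)
    # Step 2: drop spaces and tabs at the beginning of every line.
    html = re.sub(r"(?m)^[ \t]+", "", html)
    # Step 3: squeeze consecutive newlines (blank lines) into one newline.
    html = re.sub(r"\n{2,}", "\n", html)
    return html
```

Note that this treats every part of the file the same way, so it does not remove comments and it will also squeeze whitespace inside `<pre>` sections.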

gtoomey

Tad McClellan · Oct 20, 2003

Andrew Shitov said:
Look at the code on this page: http://webcode.ru/cgi/despace1/

It has several bugs in it.

It open()s FILE, but never reads from it.

It uses ampersand on function calls when it does not want the
semantics that go with using ampersand on function calls.

It will mangle spaces in <pre> sections.
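
One way around the `<pre>` problem is to split the page on `<pre>` blocks first and crunch only what lies between them. The sketch below is in Python for illustration and is not a fix for the script on that page. The name `crunch_outside_pre` is made up, and it assumes `\n` line endings, `<pre>` blocks that are not nested, and no `<pre>` tags hidden inside comments. It also strips comments outside `<pre>`, which is what the original question asked for. Other whitespace-sensitive elements (`<textarea>`, `<script>`, `<style>`) are not protected here, so a real parser is still the safer choice.

```python
import re

# The capturing group makes re.split() keep each <pre>...</pre> block in the result.
PRE_BLOCK = re.compile(r"(<pre[\s>].*?</pre\s*>)", re.IGNORECASE | re.DOTALL)
COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)


def crunch_outside_pre(html: str) -> str:
    """Strip comments and squeeze whitespace everywhere except inside <pre>."""
    pieces = PRE_BLOCK.split(html)
    out = []
    for index, piece in enumerate(pieces):
        if index % 2 == 1:
            # Odd positions hold the captured <pre> blocks: copy them verbatim.
            out.append(piece)
            continue
        piece = COMMENT.sub("", piece)                # remove comments
        piece = re.sub(r"[ \t]+", " ", piece)         # collapse spaces and tabs
        piece = re.sub(r"(?m)^[ \t]+", "", piece)     # trim leading whitespace
        piece = re.sub(r"\n{2,}", "\n", piece)        # squeeze blank lines
        out.append(piece)
    return "".join(out)
```

Everything inside a `<pre>` block, including its indentation and blank lines, comes out byte for byte as it went in.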
