remove all html tags by perl

Discussion in 'Perl' started by jjliu, Oct 10, 2003.

  1. jjliu

    jjliu Guest

    Could someone tell me how to remove all html tags (and anything inside tags)
    by perl. Some people suggested me to use HTML::TagFilter but i could not
    find window version. Thanks very much for your help.

    JJL
     
    jjliu, Oct 10, 2003
    #1
    1. Advertisements

  2. Sure.

    s/.*//s;
     
    Gunnar Hjalmarsson, Oct 10, 2003
    #2
    1. Advertisements

  3. jjliu

    jjliu Guest

    Thanks.What i wanted is to remove head tag and anything inside it. Could you
    help me out.
     
    jjliu, Oct 10, 2003
    #3
  4. jjliu

    Kris Wempa Guest

    That will remove ALL characters. He really needs something along the lines
    of:

    s/\<[^\<]+\>//;

    This only works if the entire TAG is within the same string. If the tag
    spans multiple lines, they will need to be concatenated into 1 string.
     
    Kris Wempa, Oct 10, 2003
    #4
  5. -----BEGIN PGP SIGNED MESSAGE-----
    Hash: SHA1

    Gunnar knows that. :)

    Why all the backslashes?
    It also doesn't work if anything within the tag or its attributes contain
    a > symbol. Example:

    <img src="mathexpression.gif" alt="5 is > 4" />
    <input type="submit" onclick="if (count > 1) true else false" />

    - --
    Eric
    $_ = reverse sort $ /. r , qw p ekca lre uJ reh
    ts p , map $ _. $ " , qw e p h tona e and print

    -----BEGIN PGP SIGNATURE-----
    Version: PGPfreeware 7.0.3 for non-commercial use <http://www.pgp.com>

    iQA/AwUBP4ftJGPeouIeTNHoEQJxpACghIOdjOo5xr7rh9N5zQ6d9EF3KvIAmwdA
    R0qdv3U33ZyBzW4L7u8Vq6jf
    =sIdz
    -----END PGP SIGNATURE-----
     
    Eric J. Roode, Oct 11, 2003
    #5
  6. Only the head tag? Well, in that case a regexp similar to what Kris
    suggested might be sufficient. But please note that normally you'd
    better use a module when dealing with HTML code, and even if I have
    never used the one you mentioned, it appears to be a good suggestion.
    What do you mean by Windows version? What makes you think that
    HTML::TagFilter doesn't work on Windows?
     
    Gunnar Hjalmarsson, Oct 15, 2003
    #6
    1. Advertisements

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments (here). After that, you can post your question and our members will help you out.