parse HTML by class rather than tag

lorean2007 · Feb 23, 2007

Hello,

i'm would be interested in parsing a HTML files by its corresponding
opening and closing tags but by taking into account the class
attributes and its values,

<html>
<body>
....
<div class="one">
....
<div class="two">
</div>
....
</div>
....
<div class="one">...</div>
<a href="..." class="three">
</body>
</html>

in this example, i will need all content inside div with class="two",
or only class="one",

so i wondering if i should go with regular expression, but i do not
think so as i must jumpt after inner closing div, or with a simple
parser, i've searched and found
http://www.diveintopython.org/html_processing/basehtmlprocessor.html
but i would like the parser not to change anything at all (no
lowercase).

can you help ?

best.

gatti · Feb 23, 2007

Hello,

i'm would be interested in parsing a HTML files by its corresponding
opening and closing tags but by taking into account the class
attributes and its values, [...]
so i wondering if i should go with regular expression, but i do not
think so as i must jumpt after inner closing div, or with a simple
parser, i've searched and foundhttp://www.diveintopython.org/html_processing/basehtmlprocessor.html
but i would like the parser not to change anything at all (no
lowercase).

Horribly brittle idea. Use a robust HTML parser (e.g.
http://www.crummy.com/software/BeautifulSoup/) to build a document
tree, then visit it top down and look at the value of the 'class'
attributes.

Regards,
Lorenzo Gatti

Background image not showing up on html page	3	Sep 23, 2023
Stuck with html and css	25	Dec 14, 2022
How to have two html audio players on one page?	0	May 3, 2022
Only one table shows up with the information	2	Mar 29, 2023
Is it possible an iframe can overlapp another?	3	Apr 20, 2022
Flip-Cards with Local Images	1	Mar 27, 2023
Need assistance finetuning HTML, CSS, Javascript - sticky header issue	3	Feb 25, 2022
Login form no longer working	2	Feb 18, 2023

parse HTML by class rather than tag

lorean2007

gatti

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads