Parsing HTML

Guest

I have a large number of HTML files (10,000+) and need to modify them
programmatically on a regular basis.

How can I determine the textual data that is present, ignoring tags etc.?

For example, if I have a line such as:
<TD WIDTH="524"><B><FONT SIZE="2" FACE="Times New Roman"
COLOR="#000000">Direction2</FONT></B></TD>

I need to return "Direction2" and nothing else.

Any ideas?

guy
 
Marina Levit [MVP]

The only way I know of would be to use MSHTML. It exposes the HTML object
model and lets you load a document into an HTML DOM, after which you can get
the inner text of a node, and so on.
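
To illustrate the general idea (parse the markup, then keep only the text
nodes), here is a minimal sketch. Note that it does not use MSHTML; it swaps
in Python's standard html.parser module, and the names TextExtractor and
extract_text are made up for this example. Treat it as one possible approach
under those assumptions, not a definitive implementation.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes, ignores tags and attributes."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        # Called for the text found between tags; skip whitespace-only chunks.
        text = data.strip()
        if text:
            self.parts.append(text)


def extract_text(html):
    """Return the text content of an HTML fragment, joined by single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


# The sample line from the question, including its line break inside the tag.
sample = ('<TD WIDTH="524"><B><FONT SIZE="2" FACE="Times New Roman"\n'
          'COLOR="#000000">Direction2</FONT></B></TD>')
print(extract_text(sample))  # prints: Direction2
```

For 10,000+ files you would presumably loop over the files and call
extract_text on each one's contents. One caveat: as written, this sketch also
picks up the contents of script and style elements, so those would need to be
filtered out if the pages contain them.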
 
