Parsing text file with ASP

SROSeaner · Sep 26, 2004

I have a text file that is the result of using the XMLHTTP object to pull
back a page of search results from a search engine.

So I have the entire results page in HTML, and want to break out each hit
result from the text file as a unique item and do what I want with each hit
result.

Are there any suggested algorithms or other techniques I could be
directed to?

Ray Costanzo [MVP] · Sep 27, 2004

What exactly is a "hit result?" As for what you want to do, it'd all
depend on what the HTML looks like and how consistent it remains. Do you
have control over this remote source? Or is it some other site that can
change on any given day without any forewarning?

Ray at home

SROSeaner · Sep 28, 2004

Actually, all I really need to do is pull out any text in the HTML that is
a web site address, i.e. anything in the form of http://www._____.__ or
starting with www.

I think I know how to find that by using InStr and passing it http: (for
example) as the text to look for, but that will only give me the starting
point of the address, correct?

Ray Costanzo [MVP] · Sep 28, 2004

Yes, that'd give you the starting point. The best you can do is have your
code make an educated guess about things when you have no idea what kind of
data will be thrown at it.
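To illustrate the "starting point" approach, here is a minimal sketch in Python rather than VBScript (note that VBScript's InStr is 1-based and returns 0 when nothing is found, while Python's find is 0-based and returns -1). The function name find_addresses and the set of delimiter characters are assumptions made for this example, not anything from the thread; the idea is simply to find each start position and then walk forward until a character that probably ends the address.

```python
def find_addresses(text, marker="http:"):
    """Return substrings that start at each marker and run to the next delimiter."""
    # Assumed delimiters: characters that rarely appear inside an address.
    delimiters = set('"\'<> \t\r\n')
    results = []
    start = text.find(marker)  # analogous to InStr: gives only the start position
    while start != -1:
        end = start
        # Walk forward until a delimiter or the end of the text.
        while end < len(text) and text[end] not in delimiters:
            end += 1
        results.append(text[start:end])
        # Resume the search after this match (always move forward at least one char).
        start = text.find(marker, max(end, start + 1))
    return results


sample = '<a href="http://something.com">click me</a> and <b>http://www.example.org/page</b>'
print(find_addresses(sample))
```

This guesses the end of each address purely from the delimiter set, so it will still pick up fragments that merely look like addresses.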

If the string contains:

<a href="http://something.com">click me</a>, should it be ignored because
there's no WWW? Should your code assume that as soon as it finds a ", that
is the end of the domain? What about a carriage return? What about a <
character? What about when it's in a sentence in the document, e.g.:

Most Web site addresses start with http://www.

Should that be found?

There are lots of variables to deal with, and all you can really do is hope
for accuracy.
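One way to encode a set of answers to the questions above is a regular expression plus a small sanity check. The sketch below is in Python for illustration, and every decision in it is an assumption: addresses may start with http://, https://, or a bare www.; they end at the first quote, angle bracket, or whitespace; trailing sentence punctuation is trimmed; and anything whose host part has no dot (such as the sentence fragment "http://www.") is skipped. The names URL_PATTERN and extract_urls are hypothetical.

```python
import re

# Assumed pattern: an http(s):// or bare "www." prefix, followed by any run of
# characters up to the first whitespace, quote, or angle bracket.
URL_PATTERN = re.compile(r'(?:https?://|www\.)[^\s"\'<>]+', re.IGNORECASE)


def extract_urls(html):
    """Return address-like strings found in html, under the assumptions above."""
    urls = []
    for match in URL_PATTERN.finditer(html):
        # Trim punctuation that usually belongs to the sentence, not the address.
        candidate = match.group(0).rstrip('.,;:!?)')
        # Isolate the host part and require a dot in it, so "http://www." is skipped.
        host = re.sub(r'^https?://', '', candidate, flags=re.IGNORECASE).split('/')[0]
        if '.' in host:
            urls.append(candidate)
    return urls


sample = (
    '<a href="http://something.com">click me</a>\n'
    'Most Web site addresses start with http://www.\n'
    'See www.example.org for details.'
)
print(extract_urls(sample))
```

Different answers to the same questions would mean a different pattern; this is only one educated guess among many.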

Ray at work

Patrice · Sep 28, 2004

You have DOM parsers available, but your code will break if the structure
of the page changes. I would rather use an API or a "service" if one is
available.
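As a rough illustration of the parser-based route, the sketch below uses Python's standard html.parser module (not an ASP component) to collect the href of every anchor tag. The class name LinkCollector and the helper collect_links are hypothetical, and the assumption that the addresses of interest live in <a href="..."> attributes may not hold for a given results page.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def collect_links(html):
    """Parse html and return the list of href values from its anchor tags."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links


page = (
    '<p>Most Web site addresses start with http://www.</p>'
    '<a href="http://something.com">click me</a>'
)
print(collect_links(page))
```

Because only real anchor tags are inspected, address-like text inside sentences is ignored, but the result still depends on the page keeping its links in anchor tags.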

Patrice

SROSeaner · Sep 28, 2004

Thanks for your help, guys. I figure I will just have to code it in a way
that takes care of all the variables in such a situation.
