parse page using tokenizer

Thread starter beth
Start date Oct 22, 2005

I have written a slightly bastardized version of the class web page
which is given as an example of how to use the HTML tokenizer.

What I've written is:

def parse(host)
  dict = {}
  @body = host.getHTML()
  if !@body
    return
  end
  theTags = ['select', 'input']
  for x in theTags do
    tokenizer = HTMLTokenizer.new(@body)
    while tag = tokenizer.getTag(x)
      name = tag.attr_hash['name']
      type = tag.attr_hash['type']
      if name != nil then
        dict[type] = name
      end # if
    end # while
  end # for x
  return dict
end # end parse

My problem is that parse will return all tag attributes of "type" (i.e.
"checkbox", "button", "hidden") except for those which are of type
"text". I haven't been able to figure out where I'm going wrong.

Thanks for the help,

Beth
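
A possible explanation, offered as a sketch under assumptions rather than a confirmed diagnosis: in HTML an input tag with no type attribute is a text field, so attr_hash['type'] would be nil for such tags. Separately, because dict is keyed by type, every field that shares a type overwrites the one before it. The Ruby below uses plain hashes as hypothetical stand-ins for tag.attr_hash so that it runs without HTMLTokenizer; collect_fields and the sample data are illustrative names, not part of the library, and the default of 'text' only makes sense for input tags.

```ruby
# Hypothetical helper: groups field names by type.
# attr_hashes stands in for the tag.attr_hash values of <input> tags.
def collect_fields(attr_hashes)
  # Each type maps to a list, so fields of the same type do not overwrite each other.
  fields = Hash.new { |hash, key| hash[key] = [] }
  attr_hashes.each do |attrs|
    name = attrs['name']
    next if name.nil?
    # Assumption: a missing type attribute means a text field (HTML default for <input>).
    # downcase guards against markup such as TYPE="TEXT".
    type = (attrs['type'] || 'text').downcase
    fields[type] << name
  end
  fields
end

# Made-up sample data, not taken from the page in question.
sample = [
  { 'name' => 'user' },                        # no type attribute
  { 'name' => 'email', 'type' => 'TEXT' },     # upper-case value
  { 'name' => 'token', 'type' => 'hidden' },
  { 'name' => 'agree', 'type' => 'checkbox' },
  { 'type' => 'submit' }                       # no name, skipped
]

fields = collect_fields(sample)
p fields['text']   # prints ["user", "email"]
```

If this is the cause, printing tag.attr_hash for each input tag inside the while loop should show which tags lack a 'type' key.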
