J
Jeff Sheffel
I'm looking for a simple estimate of a "level of effort", for a Perl
programming task.
The estimate should be in hours. (Maybe a range of programming hours -
based on level of Perl experience.) Any other additional estimate
information, and design comments are appreciated. (Please do not ask
questions about the requirements, since I did not write them; make any
assumptions necessary.)
Program Requirements:
---------------------
Design a web scraping utility to scrap information from various shopping
sites. The code should be written in Object Oriented Perl, with use strict
and warnings enabled.
The initial sites used should be http://www.shopzilla.com and
http://www.shopping.com. However, the program should be designed in a
manor that will allow other sites to be added in the future.
The minimum requirement for output is: Site scraped from, product name,
short description, low price, high price. For simplicity scrapings can be
limited to 60 items or less from each target site.
Optional features that can be added are throttling and threading.
Throttling will limit the number of hits to a particular site in a giving
time period and threading would allow the program to make several requests
simultaneously.
The program should be fully documented and run without warnings.
programming task.
The estimate should be in hours. (Maybe a range of programming hours -
based on level of Perl experience.) Any other additional estimate
information, and design comments are appreciated. (Please do not ask
questions about the requirements, since I did not write them; make any
assumptions necessary.)
Program Requirements:
---------------------
Design a web scraping utility to scrap information from various shopping
sites. The code should be written in Object Oriented Perl, with use strict
and warnings enabled.
The initial sites used should be http://www.shopzilla.com and
http://www.shopping.com. However, the program should be designed in a
manor that will allow other sites to be added in the future.
The minimum requirement for output is: Site scraped from, product name,
short description, low price, high price. For simplicity scrapings can be
limited to 60 items or less from each target site.
Optional features that can be added are throttling and threading.
Throttling will limit the number of hits to a particular site in a giving
time period and threading would allow the program to make several requests
simultaneously.
The program should be fully documented and run without warnings.