A
a
Hi,
I am writing a script to download all the links of the whole site. The link
of the web site is not a simple tree. There may be some replicated links
pointing to the same location.
So, I need to walk through the site to extract and push the URLs of each
page into a data structure.
I dont want the replicated links. Every link should only appear once in my
storage.
So, is there any effective way to achieve this?
Thanks
I am writing a script to download all the links of the whole site. The link
of the web site is not a simple tree. There may be some replicated links
pointing to the same location.
So, I need to walk through the site to extract and push the URLs of each
page into a data structure.
I dont want the replicated links. Every link should only appear once in my
storage.
So, is there any effective way to achieve this?
Thanks