R
Rolin Nelson
I am having trouble scrubbing a page that has bad markup. After
fetching the page, the Scrubyt::Extractor exits while parsing the
document. The Apple Safari web inspector shows numerous errors from the
page:
<meta> is not allowed inside <td>. Moving <meta> into the <head>.
Unmatched </embed> encountered. Ignoring tag.
Unmatched </span> encountered. Ignoring tag.
Unmatched </a> encountered. Ignoring tag.
Is there anyway to scrub a page with scrubyt that is poorly formated? I
am using the latest version (0.4.1) of scrubyt.
Thanks,
Rolin
fetching the page, the Scrubyt::Extractor exits while parsing the
document. The Apple Safari web inspector shows numerous errors from the
page:
<meta> is not allowed inside <td>. Moving <meta> into the <head>.
Unmatched </embed> encountered. Ignoring tag.
Unmatched </span> encountered. Ignoring tag.
Unmatched </a> encountered. Ignoring tag.
Is there anyway to scrub a page with scrubyt that is poorly formated? I
am using the latest version (0.4.1) of scrubyt.
Thanks,
Rolin