How to get the HTML source of an element in lxml?

Thread starter contro opinion
Start date Dec 31, 2012


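import urllib.request
import lxml.html

down = 'http://blog.sina.com.cn/s/blog_71f3890901017hof.html'
# Download the raw page bytes
file = urllib.request.urlopen(down).read()
root = lxml.html.document_fromstring(file)
# First div holding the article body (the class value ends with a space)
body = root.xpath('//div[@class="articalContent "]')[0]
print(body.text_content())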

When I run the code, I get only the text content of the element. How can I get
its HTML source instead?
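
One way to do this, as a minimal sketch: serialize the element with `lxml.html.tostring()` instead of calling `text_content()`. The inline `page` string below is a hypothetical stand-in for the downloaded blog page, so the example runs without network access. The names `text_only` and `html_source` are also just illustrative.

```python
import lxml.html

# Hypothetical stand-in for the downloaded page (assumption, not the real blog HTML).
page = (
    '<html><body>'
    '<div class="articalContent "><p>Hello <b>world</b></p></div>'
    '</body></html>'
)

root = lxml.html.document_fromstring(page)
body = root.xpath('//div[@class="articalContent "]')[0]

# text_content() drops all tags and returns only the text.
text_only = body.text_content()

# tostring() serializes the element with its tags;
# encoding="unicode" returns a str instead of bytes.
html_source = lxml.html.tostring(body, encoding="unicode")

print(text_only)
print(html_source)
```

With the real page, the same `tostring()` call on `body` should return the markup of the selected `div`. Without `encoding="unicode"`, the result is a bytes object.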
