UserPreferences

PythonLanguage/HttpTechniques


Problem to solve: how to programmatically push content to MyWikis. Some possible help in :

I will most likely end up using the [WWW]httplib library; [WWW]some examples of using httplib will be helpful.

HTML parsing

The example from MarkPilgrim's [WWW]Extracting data from HTML documents uses [WWW]SGMLParser, related to [WWW]htmllib. An alternative interface is [WWW]HTMLParser.

Which one to use? According to the discussion thread [WWW]htmllib vs. HTMLParser, HTMLParser is "A much improved HTML parser -- a replacement for sgmllib". So I'll give HTMLParser a try.

Is the solution here?

[WWW]mechanize: Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize

Crawling/Spidering/User Agent tool?

Stuart Langridge: [WWW]Browser.py ["Browser.py is an automated web browser for Python. It's something like Perl's WWW:Mechanize - use it to navigate to a page, follow links, fill out forms, and the like."]

Misc Notes and Questions

A [WWW]Python Web-SIG has been established. The [WWW]mailing list archive is off to a good start.

Is there any Python equivalent of PerlLanguage's [WWW]LWP (which I remember being really awesome? Last time I checked, I would say no, like [WWW]this post on usenet. I hope I'm wrong.

Lots of links at [WWW]Python Web programming wiki on lots of topics except for "Python-based Web Client Programming" -- maybe there is just a dearth of good modules in this area?

Is [WWW]Python Web Programming any good?

Good to note [WWW]a post I wrote in 2000 about using urllib and urllib2 in python -- one that got no response.

[WWW]Another thread that shows how LWP makes life easy in Perl but there's a lot of low-level munging in Python to get the same effect. Sigh. (Time to pull out PerlLanguage again!)

[WWW]Python and HTML Processing: