[Python-talk] Kent's Korner?

Lloyd Kvam python at venix.com
Fri Oct 19 10:33:13 EDT 2007


On Fri, 2007-10-19 at 08:01 -0400, Bill Sconce wrote:
> We do have certain ingredients for maybe solving a simple problem, and
> using it as an entertaining program (i.e., Ted's screen scraper).  The
> primary experimental aspect of it is that we don't know how it will
> turn out...  and that without having the code prepared and formatted on
> S5 slides beforehand, we'll have to allow the discussion to go wherever
> events take us.

I think this is likely to turn out pretty well.  Developing a program to
extract data from HTML should fit into an evening.
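
To give a sense of scale, here is a minimal sketch of pulling data out of
HTML that is already in hand.  It assumes Python 3's standard html.parser
module and collects link targets; the names LinkCollector and
extract_links are made up for illustration, and a real scraper would look
for whatever tags the target page actually uses.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text):
    """Return the href targets found in html_text, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

Calling extract_links() on a page's text returns a plain list of strings,
which is easy to inspect interactively while working out what the page
contains.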

I usually have the most trouble obtaining the HTML.  Getting urllib2 to
cope with cookies, user names, passwords, certificates, keep-alives,
user-agent strings, etc. can be a challenge.  Getting those issues right
can require a lot of experimenting.  Once the HTML is in hand, you can
see what you are dealing with.  Hopefully the connection issues are
already understood.
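
As a rough sketch of the kind of setup involved, the following builds an
opener that keeps cookies, answers HTTP basic-auth challenges, and sends a
custom user-agent string.  It uses urllib.request, the Python 3 successor
to urllib2; the function name make_opener and its parameters are
hypothetical, and sites with form logins, client certificates, or other
schemes will need more than this.

```python
import http.cookiejar
import urllib.request


def make_opener(base_url, username, password, user_agent):
    """Build an opener with cookie, basic-auth, and user-agent support.

    Returns the opener together with its cookie jar so the cookies a
    site sets can be inspected while experimenting.
    """
    cookie_jar = http.cookiejar.CookieJar()

    # Credentials are offered for any realm under base_url.
    password_mgr = urllib.request.HTTPPasswordMgrWithDefaultRealm()
    password_mgr.add_password(None, base_url, username, password)

    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(cookie_jar),
        urllib.request.HTTPBasicAuthHandler(password_mgr),
    )
    # Replace the default "Python-urllib/x.y" user-agent header.
    opener.addheaders = [("User-Agent", user_agent)]
    return opener, cookie_jar
```

With an opener built this way, opener.open(url).read() fetches a page, and
looping over the cookie jar afterwards shows which cookies the server
handed back.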

-- 
Lloyd Kvam
Venix Corp
DLSLUG/GNHLUG library
http://www.librarything.com/catalog.php?view=dlslug


