In pondering how to interface
CalPhotos with the ScholarsBox, I thought about how to screen-scrape the system. The query forms can be studied to figure out how to generate the right queries.
A good place to start is to read
How to Link to the Photos.
I was wondering whether there would be a way to generate some XML from the CalPhotos system. One idea is to do the query:
UCMP collection (200 items) and then pass the
results to
HTML Tidy. Trying to parse the output is probably not a great idea, because the resulting XHTML(?) is a mess, even though it might be the only way to get all the images.
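As a rough sketch of the scraping idea, the image references could be pulled out of a results page without requiring well-formed XHTML at all. This swaps HTML Tidy for Python's standard-library html.parser, which tolerates sloppy markup. The sample markup and the names ImageSrcCollector, extract_image_sources, and SAMPLE_RESULTS are made up for illustration; the real CalPhotos result pages may be structured quite differently.

```python
from html.parser import HTMLParser


class ImageSrcCollector(HTMLParser):
    """Collect the src attribute of every <img> tag, even in malformed HTML."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before calling this.
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def extract_image_sources(html_text):
    """Return the image URLs found in html_text, in document order."""
    collector = ImageSrcCollector()
    collector.feed(html_text)
    collector.close()
    return collector.sources


# Hypothetical, deliberately sloppy markup standing in for a results page
# (unquoted attributes, mixed case, unclosed tags). Not real CalPhotos output.
SAMPLE_RESULTS = """
<table><tr><td><a href=detail1><IMG SRC="thumbs/0001.jpeg" alt=one></a>
<td><a href=detail2><img src='thumbs/0002.jpeg'><p>unclosed paragraph
</table>
"""

if __name__ == "__main__":
    for src in extract_image_sources(SAMPLE_RESULTS):
        print(src)
```

This only gathers whatever image URLs appear in the page; any caption or taxon information would still have to be dug out of the surrounding markup.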
Another approach is to use the text query mode to get an HTML table.
Example 1 and
Example 2
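If the text query mode really does return a plain HTML table, turning it into rows of cell text is fairly simple. Here is a minimal sketch using only the standard library. The column names and the sample table are invented for illustration, and TableRowCollector and parse_table are hypothetical helper names; the actual CalPhotos table layout would need to be checked against a real response.

```python
from html.parser import HTMLParser


class TableRowCollector(HTMLParser):
    """Turn <tr>/<td>/<th> markup into a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._flush_row()
            self._row = []
        elif tag in ("td", "th"):
            # A new cell implicitly closes the previous one.
            self._flush_cell()
            if self._row is None:
                self._row = []
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._flush_cell()
        elif tag in ("tr", "table"):
            self._flush_row()

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def _flush_cell(self):
        if self._cell is not None:
            # Collapse runs of whitespace inside the cell.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None

    def _flush_row(self):
        self._flush_cell()
        if self._row:
            self.rows.append(self._row)
        self._row = None


def parse_table(html_text):
    """Return the table rows in html_text as lists of strings."""
    collector = TableRowCollector()
    collector.feed(html_text)
    collector.close()
    return collector.rows


# Hypothetical table; the last row omits closing tags on purpose.
SAMPLE_TABLE = """
<table>
<tr><th>ID</th><th>Taxon</th><th>Collection</th></tr>
<tr><td>0001</td><td>Example taxon A</td><td>UCMP</td></tr>
<tr><td>0002<td>Example taxon B<td>UCMP
</table>
"""

if __name__ == "__main__":
    header, *records = parse_table(SAMPLE_TABLE)
    for record in records:
        print(dict(zip(header, record)))
```

From rows like these it would be a short step to emit XML or any other format the ScholarsBox can ingest.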
Once you are looking at one picture (an
example from UCMP), you can get up to 3 sizes
by forming the right URL.
More useful info:
