How do I move through a range of URL IDs

(OP)

My objective: to move through a range of IDs, pull the HTML down, and convert it to plain text.

Below is the actual link:

CODE

http://www.albme.org/index.cfm?fuseaction=app.LicenseeDetails2&ID=86699

An example range: 86650 - 87000

Below is the actual code to pull down the requested data:

CODE

import sys, urllib
from StringIO import StringIO
import html2text

if __name__ == '__main__':
    url = 'http://www.albme.org/index.cfm?fuseaction=app.LicenseeDetails2&ID=86699'
    encoding = 'utf-8'
    # Download the raw page bytes (Python 2 urllib)
    f = urllib.urlopen(url)
    try: s = f.read()
    finally: f.close()
    ustr = s.decode(encoding)
    # Temporarily redirect stdout so the text written by wrapwrite() is captured
    b = StringIO()
    old = sys.stdout
    try:
        sys.stdout = b
        html2text.wrapwrite(html2text.html2text(ustr, url))
    finally: sys.stdout = old
    text = b.getvalue()
    b.close()
    print text

I think I need to supply a range and step through it, incrementing the ID by one each time and pulling down the data for each new URL.

Any ideas would be helpful...

 

RE: How do I move through a range of URL IDs

CODE

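start, stop = 86650, 87000
# %i is filled in with each ID in turn; named url_format to avoid shadowing the built-in format()
url_format = 'http://www.albme.org/index.cfm?fuseaction=app.LicenseeDetails2&ID=%i'
for i in range(start, stop+1):  # stop+1 so that the last ID is included
    print url_format % (i,)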

May I suggest that you go through the tutorials freely available on the web?
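
The loop above only prints the URLs. A minimal sketch of how it might be combined with the original fetch-and-convert step follows. It assumes Python 3 (the posted code is Python 2) and a current release of the third-party html2text package. The names `URL_TEMPLATE`, `fetch_html`, `html_to_text`, and `scrape_range` are hypothetical helpers introduced for illustration, not part of either post.

```python
import urllib.request

# Same URL as in the thread, with the ID left as a placeholder.
URL_TEMPLATE = 'http://www.albme.org/index.cfm?fuseaction=app.LicenseeDetails2&ID=%d'


def fetch_html(url, encoding='utf-8'):
    """Download one page and decode it to a string."""
    with urllib.request.urlopen(url) as response:
        # errors='replace' is an assumption: it avoids a crash on stray bytes,
        # whereas the original post decodes strictly.
        return response.read().decode(encoding, errors='replace')


def html_to_text(html, base_url=''):
    """Convert HTML to plain text using the third-party html2text package."""
    import html2text  # imported lazily so the loop can be tried without it
    return html2text.html2text(html, base_url)


def scrape_range(start, stop, fetch=fetch_html, convert=html_to_text):
    """Yield (record_id, text) for every ID from start to stop, inclusive."""
    for record_id in range(start, stop + 1):
        url = URL_TEMPLATE % record_id
        yield record_id, convert(fetch(url), url)
```

Iterating with `for record_id, text in scrape_range(86650, 87000): print(text)` would walk the example range one request at a time. Because `html2text.html2text()` returns the converted string, this sketch does not need the stdout redirection used in the original post. The `fetch` and `convert` parameters are there so the loop can be exercised with stand-in functions before pointing it at the live site.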
