Skip to content Skip to sidebar Skip to footer

How To Navigate Through HTMl Pages That Have Paging For Their Content Using Python?

I want to crawl all the table entries(table that describes the S/No. , Document No., etc.) from the following website and write it to excel. So far, I am able to crawl the data fro

Solution 1:

As I can see in this page, you need to interact with java script that is invoked by button Go or Next Page button. For Go button you need to fill the textbox each time. You can use different approaches to work around this:

1) Selenium - Web Browser Automation

2) spynner - Programmatic web browsing module with AJAX support for Python and also take look here

3) If you are familiar with c#, it also provide a webBrowser component that helps you to click on the html elements. (e.g. here). You save html content of each page and later on crawl them from offline pages.


Post a Comment for "How To Navigate Through HTMl Pages That Have Paging For Their Content Using Python?"