HTML

Extracting information from an HTML page is similar to the techniques described in the XML section. We use the same XPath selector syntax to extract HTML tags such as DIV, TD, etc. Example:

We have a local copy of a similar pricing table, and we simply want to scrape it and show it on our dashboard.

"url":"http://localhost/wireframes_org_website/test.php","selector":"//*[@id='pricing_home_table']/tbody/tr/*","columns":["c1","c2","c2","c2","c2"]

The ID of the table is "pricing_home_table", and you can use Firebug to get the XPath of any tag in the HTML page. This works exactly like the XML output, so please refer to the examples in the XML section.
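As a rough illustration of what the selector above picks out, the sketch below runs an equivalent XPath against a small sample page in Python. It is not InfoCaptor's implementation. The sample markup, its cell values, and the helper names select_cells and to_rows are hypothetical, and grouping the flat list of cells into rows by the number of entries in "columns" is an assumption about how the result is laid out. Python's standard library XPath support requires a relative path, so the leading "//" is written as ".//" here.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for the pricing page; the real test.php is not shown here.
# It is deliberately well-formed so the standard library XML parser can read it.
SAMPLE_PAGE = """
<html><body>
<table id="pricing_home_table">
  <tbody>
    <tr><th>Plan</th><th>Users</th><th>Dashboards</th><th>Support</th><th>Price</th></tr>
    <tr><td>Basic</td><td>1</td><td>5</td><td>Email</td><td>10</td></tr>
    <tr><td>Team</td><td>10</td><td>50</td><td>Phone</td><td>99</td></tr>
  </tbody>
</table>
</body></html>
"""

# Same selector as in the data source definition, made relative (".//")
# because ElementTree does not accept absolute paths on an element.
SELECTOR = ".//*[@id='pricing_home_table']/tbody/tr/*"


def select_cells(page, selector):
    """Return the text of every node matched by the selector, in document order."""
    root = ET.fromstring(page.strip())
    return ["".join(cell.itertext()).strip() for cell in root.findall(selector)]


def to_rows(cells, column_count):
    """Group a flat list of cell values into rows of column_count values each."""
    return [cells[i:i + column_count] for i in range(0, len(cells), column_count)]


cells = select_cells(SAMPLE_PAGE, SELECTOR)
rows = to_rows(cells, 5)  # 5 matches the number of entries in "columns"

for row in rows:
    print(row)
```

The selector ends in "/*", so it matches every child of each row (both TH and TD cells) and returns them as one flat list. A real page that is not well-formed XML would need an HTML parser instead of the XML parser used in this sketch.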
