HTML

Extracting information from an HTML page works much like the techniques described in the XML section.
 
We use the same XPath selectors to extract HTML tags such as DIV and TD.
 
Example:
 
We want to extract the pricing table from the https://www.wireframes.org home page.
 
We have a local copy of a similar pricing table, and we simply want to scrape it and show it on our dashboard:
 
 
 
 
"url":"http://localhost/wireframes_org_website/test.php",
"selector":"//*[@id='pricing_home_table']/tbody/tr/*",
"columns":["c1","c2","c2","c2","c2"]
 
 
The table's ID is "pricing_home_table". You can even use Firebug to get the XPath of any tag in the HTML page.
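To see what the selector above matches, here is a minimal Python sketch. It is not the dashboard's own implementation: the sample page, the helper names select_cells and group_rows, and the idea that the flat list of matched cells is grouped into rows by the number of columns are all assumptions made for illustration. It uses Python's standard xml.etree.ElementTree module, which supports only a subset of XPath and needs well-formed markup, so a real page would likely need a more forgiving HTML parser.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed stand-in for the local pricing page.
SAMPLE_PAGE = """
<html><body>
  <table id="pricing_home_table">
    <tbody>
      <tr><th>Plan</th><th>Price</th></tr>
      <tr><td>Basic</td><td>Free</td></tr>
      <tr><td>Pro</td><td>$10</td></tr>
    </tbody>
  </table>
</body></html>
"""


def select_cells(page, selector):
    """Return the text of every element matched by the selector."""
    root = ET.fromstring(page)
    # ElementTree only accepts relative paths, so the absolute
    # selector is prefixed with "." to search from the root element.
    return [(el.text or "").strip() for el in root.findall("." + selector)]


def group_rows(cells, width):
    """Assumed behaviour: split the flat cell list into rows of `width`."""
    return [cells[i:i + width] for i in range(0, len(cells), width)]


cells = select_cells(SAMPLE_PAGE, "//*[@id='pricing_home_table']/tbody/tr/*")
rows = group_rows(cells, 2)  # the sample table has two columns

for row in rows:
    print(row)
```

The trailing /* in the selector matches both TH and TD cells, so the header row comes back together with the data rows as one flat list in document order.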
 
 
This works exactly like the XML output, so please refer to the examples in the XML section.
 
 
 