Monday, 11 November 2013

WP Web Scraper

An easy to implement professional web scraper for WordPress. This can be used to display realtime data from any websites directly into your posts, pages or sidebar. Use this to include realtime stock quotes, cricket or soccer scores or any other generic content. The scraper is an extension of WP_HTTP class for scraping and uses phpQuery or xpath for parsing HTML. Features include:

    Can be easily implemented using the button in the post / page editor.
    Configurable caching of scraped data. Cache timeout in minutes can be defined in minutes for every scrap.
    Configurable Useragent for your scraper can be set for every scrap.
    Scrap output can be displayed thru custom template tag, shortcode in page, post and sidebar (through a text widget).
    Other configurable settings like timeout, disabling shortcode etc.
    Error handling - Silent fail, error display, custom error message or display expired cache.
    Clear or replace a regex pattern from the scrap before output.
    Option to pass post arguments to a URL to be scraped.
    Dynamic conversion of scrap to specified character encoding (using incov) to scrap data from a site using different charset.
    Create scrap pages on the fly using dynamic generation of URLs to scrap or post arguments based on your page's get or post arguments.
    Callback function to parse the scraped data.

For demos and support, visit the WP Web Scraper project page. Comments appreciated.

Tags: curl, html, import, page, phpquery, Post, Realtime, sidebar, stock market, web scraping, xpath   



Source: http://wordpress.org/plugins/wp-web-scrapper/

1 comment:

  1. Hey

    I agree with you all
    But why to waste time doing all this manually for LinkedIN
    I use a software that does that automatically for me and I am really happy
    You can get the free trial also
    It even scrap data and do all the functions that are available in LinkedIn
    Here it is where you can get that - http://goo.gl/ZUgqVD

    ReplyDelete