Various topics related to Web Scraper, Web Crawler and Data Processing development

Content Grabber with free proxy account integration for business directories scrape

content grabber free nohodoProfessional data extraction requires adequate proxying to keep anonymity of scraping robots. When attempting to extract large data sets (over 1M records, ex. business directories) reliable and fast proxy service is needed.

Sequentum has released the Nohodo proxy service integration for Content Grabber. Nohodo provides a free account for Content Grabber users (up to 5000 requests/monthly for free). The feature is available for both trial users and regular customers. Here’s how it works… more…

No CAPTCHA reCaptcha challenge

No CAPTCHA reCAPTCHA solvingSooner or later a new generation of spam protection methods will emerge to block all unwanted site visitors. The recently launched Google No CAPTHCA reCaptcha could just be such a method. This new “behaviour analysis” tool is getting more and more attention both from the site owners and from scraping engines who are trying to break it. Since Google does not reveal any secrets of its operation, we want to share with you the techniques used in this new smart analysis CAPTCHA that determines between bot and human. Let’s look inside. more…

Back to top