Miscellaneous

Other posts not belonging to any specific category

Hotel: scrape prices, Q&A

Question

I want to extract the hotel name and the current room price of some hotels daily from https://www.expedia.ca/Hotel-Search?#&destination=Quebec,%20Quebec,%20Canada&startDate=06/11/2016&endDate=07/11/2016&regionId=&adults=2

I am a small hotel owner and want those info quite often, and hope I can do it with codes automatically in someway.  You are expert in this field, what is the easiest ways to get those information?  Can you give me some example codes? more…

Is this a legal method of acquiring insurance leads?

Recently I received a question on insurance leads:

Is this a legal method of acquiring insurance leads [from the web]? Are there any agent testimonials on the efficiency of this type of service?

Legality issue in web scraping

With the matter of legality in web scraping, there should be a clear approach –  it depends on the website and its privacy policy. There could be at least 2 cases:

  1. Public info (prices, inventory info, public offers), i.e. everything that is not protected by copyright and available for scraping.
  2. The copyright protected info –  website Terms of Use or Terms of Service restrictions make copying and therefore web scraping illegal.

So far I have no insurance agent testimonies on the efficiency of any insurance lead scrape service. The web sites I searched [on the insurance leads] have given me the impression that the customer info they gather is highly secured (not viewable). I doubt that any sites are going to expose insurance leads. In most of them the leads are available by paid subscription plans.

If there are any such websites like insurance leads directories (public insurance quotes), we might develop a scraper that consistently grabs fresh or new info for further analysis. It does save the agent’s time for re-searching, re-visiting and so on. One scraper might work with multiple directory pages for scrape.

You might find it interesting to read about web page change tracking if you only need to see updates (no data storing applied).

Death By Captcha Updated API clients

Death By Captcha is a reputable CAPTCHA solving service with more than 7 years in the Captcha Solving business. They have recently updated all their API clients, so users can experience maximum efficiency and faster solving times.

They enthusiastically recommend that users and software developers visit the API page and update their DBC API implementation in order to get the most out of it (the API and docs are available for registered users only).  The free credits are provided for users to test or implement the new client API!

If you tell them you saw this info through the scraping.pro blog, they’ll give you a 1K free CAPTCHAs additional credit!
For further info, you may contact them directly.

Create and Manage WP Blog With CodeLobster IDE

wordpress at codelobster iedIf you plan to create and maintain large-scale projects using WordPress, you may be challenged with performance and security problems. How do you maintain a large project in WP? Everything depends on the programmer’s skills. To avoid problems with scaling and support, you should be able to write and maintain a well-documented code. An IDE accommodating a WP code might be good choice. more…

Edit and resend HTTP POST in a browser

Recently I’ve encountered a challenge to make a series of HTTP POST requests with different parameters. This has forced me to look for existing tooling in the marketplace; the features I am looking for are getting POST request,  editing the request and resending it. What I’ve found useful for this is the FireFox browser + dev tools – really quick and usable for this purpose. All other methods are either not full stack (only resend without edit) or require much soft to plug in. more…

Scrapinghub review

4 in 1 of scrapinghub

Scrapinghub is the developer-focused web scraping platform. It provides web scraping tools and services to extract structured information from online sources. The Scrapinghub platform also offers several useful services to collect organized data from the internet. Scrapinghub has four major tools – Scrapy Cloud, Portia, Crawlera, and Splash. We’ve decided to try the service. In this post we’ll review its main functionality and also share our experience with Scrapinghub. more…

Back to top