Data For The Taking: Using Website Terms And Conditions To Combat Web Scraping

Is it stealing to take data without permission from a public website, or is it simply making use of resources that are made available to you? "Web scraping" or "screen scraping" is the practice of extracting large amounts of data from public websites using bots.

A recent case in the European Court of Justice has focused attention both on the intellectual property infringement aspects of scraping practices and on the potential for website owners to use their sites' contractual terms and conditions to combat the scrapers.

Scraping is not new, but it has become increasingly widespread in recent years, fuelled by the rise in big data analytics and the popularity of price comparison websites. Indeed, in 2013, it accounted for 18% of site visitors and 23% of all Internet traffic. Scraping is not inherently bad: it can have legitimate uses, spur innovation and give companies with limited resources access to large amounts of data. However, unsurprisingly, many website operators do not like it. Not only are operators keen to protect their proprietary rights, but repeated scraping can also take a heavy toll on websites by using up bandwidth and leading to network crashes.

In the U.S., website operators have asserted various claims against scrapers, including copyright claims, trespass to chattels claims and contract-based claims alleging that scrapers violated their websites terms of use. In the EU, operators have tended to rely on intellectual property infringement claims against scrapers, but there has been little case law to provide guidance.

However, in January 2015, in a much anticipated decision, the European Court of Justice (CJEU) held that where a website operator cannot establish intellectual property rights in its database, an operator may still be able to rely on its website terms and conditions to prohibit scraping. This ruling may impact an increasing number of companies whose business models rely on mining data from websites and social media platforms without permission. On the other hand, it will be viewed positively by those data-rich businesses keen to protect and/or monetise their data.

RYANAIR LTD V PR AVIATION

The CJEU case involved PR Aviation which operates a price-comparison website for low-cost airlines. Consumers can book a flight on the website and PR Aviation receives a commission. The website relies on information obtained by screen scraping publicly available data from the websites of low cost airlines, including data...

To continue reading

Request your trial

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT