Posts Tagged ‘Scraping’

Attributor Signs Up Reuters

By Jonathan Bailey • Sep 17th, 2007 • Category: Articles, News, Products

In a press release dated today, content monitoring company Attributor announced that they have signed a deal with the British news service Reuters.
This deal closely mirrors a similar arrangement Attributor announced with the Associated Press in May of this year.
According to its press release, Attributor will “fingerprint original Reuters’ content and continuously monitor [...]



RSS Brief: Another Scraping/Spam Threat

By Jonathan Bailey • Sep 14th, 2007 • Category: Articles, Legal Issues, News

Yesterday, the makers of the controversial Pay Per Post service launched a new tool designed to make blog reading faster, RSS Brief.
The idea is that the service takes long posts, like what you might expect here on Plagiarism Today, and condenses them down into a few short sentences.
Though the service sounds convenient and [...]



Legal and Ethical Link Blogging

By Jonathan Bailey • Sep 12th, 2007 • Category: Articles, Legal Issues, News

In a recent post on TechCrunch, Duncan Riley sparked a controversy by saying that the blogs created by Google Reader’s linkblog feature “already break copyright and in a small way undermine blogs and content creators.”
That statement resulted in a flurry of comments with individuals falling on both sides of the debate. The debate then quickly [...]



The DMCA on 7 Blog Hosts

By Jonathan Bailey • Sep 6th, 2007 • Category: Articles, DMCA, DMCA Seven, Legal Issues, Personal Experiences

For the next chapter in the “DMCA Seven” series, we’re taking a look at one of the most common types of hosts out there, blog hosts.
Many of these hosts have been copyright headaches for Webmasters. They are prime targets for spam blogs and scrapers and some have played a huge role in rise of [...]



Autodiscovery and RSS Scraping

By Jonathan Bailey • Sep 5th, 2007 • Category: Articles, Prevention

Feed autodiscovery is one of the most powerful tools available for encouraging feed usage and subscription. Theoretically at least, by giving browsers and feed readers an easy way to identify the feed and users an intuitive way to subscribe to it, more people will take advantage of it.
However, when a reader of this site had [...]



Transcraping: Multi-Lingual Content Theft

By Jonathan Bailey • Aug 29th, 2007 • Category: Articles, Legal Issues, Prevention

When Sallie Goetsch received a Google Alert for her name, she was originally excited. It meant, most likely, that one of her free Web articles had been picked up and used by another site.
However, when she followed the link, she found something else. Rather than a properly attributed article, complete with bio and proper [...]



A Scrape of a Scrape

By Jonathan Bailey • Aug 7th, 2007 • Category: Articles, Personal Experiences, Prevention

I often get asked by reporters and bloggers alike exactly how bad scraping is on the Web. I discuss my past experiments on the topic and how, depending on your keywords, suspicious traffic starts showing up with the first post.
However, as I was searching for information on IE7 security flaws for another site I’m [...]



Plagiarism Today Turns Two

By Jonathan Bailey • Aug 2nd, 2007 • Category: Articles, Housekeeping, Personal

It was two years ago today, on August 2nd, 2005 that the first “live” post on Plagiarism Today was posted. Though I had been writing for almost two months prior to that, largely as an experiment to see if there was enough to warrant a blog on this topic, August 2nd was the day in [...]



Protecting the Comment Feed

By Jonathan Bailey • Aug 1st, 2007 • Category: Articles, Prevention

When bloggers and Webmasters look at the issue of RSS scraping, they typically think solely about their main RSS feed. It is the feed that contains their posts, the one that is most subscribed to and the one most prominently displayed.
When one thinks of “subscribing” to a site, most think exclusively about the main feed.
However, [...]



Five Practical Reasons for Fighting Plagiarism

By Jonathan Bailey • Jul 25th, 2007 • Category: Articles, DMCA, Legal Issues, Prevention, Products

For most, being plagiarized is an inherently emotional experience. Finding out that someone else copied and claimed something that took hours to produce is bound to produce some negative feelings.
This has lead some to wonder if fighting plagiarism is more about revenge than practicality. To some, it is better to simply forget about the plagiarists, [...]