Posts Tagged ‘Scraping’

RSS in the Mainstream Media

By Jonathan Bailey • Dec 6th, 2007 • Category: Articles, Prevention, Punditry

For all intents and purposes, RSS is still an extremely new technology. Bloggers, often viewed as being on the cutting edge of such things, are still struggling to determine how to best use the tool to distribute their works.
But as hot as the debate is among bloggers, it is even more heated in the [...]



Akismet and Spam Blogs

By Jonathan Bailey • Nov 29th, 2007 • Category: Articles, News, Personal Experiences, Products



Making the Switch: Going From Partial to Full Feeds

By Jonathan Bailey • Nov 28th, 2007 • Category: Articles, Prevention, Products

Content theft and RSS scraping is not going away. As more and more spammers get into the game and the tools they use improve in effectiveness, the problem is only going to get worse. Because of this, RSS feeds are going to remain vulnerable and bloggers will continue to seek out ways to protect them.
However, [...]



Google’s Shell Games

By Jonathan Bailey • Nov 16th, 2007 • Category: Articles, DMCA, Legal Issues, Punditry

Anyone who is a regular reader to this site knows that, in order to get Adsense removed from a scraper or plagiarist’s page, you are required to file a DMCA notice.
Adsense has its own DMCA policy and follows it very strictly. Though results can be obtained through that means, few bloggers actually use it.
Not only [...]



workFRIENDLY: An Accidental Scraper

By Jonathan Bailey • Nov 9th, 2007 • Category: Articles, DMCA, Legal Issues, News, Prevention

On the surface, workFRIENDLY is something of a novelty site.
The idea is pretty simple, you punch in a URL that you want to visit and workFRIENDLY pulls up the site in a format that resembles a Microsoft Word document (see Blog Herald on workFRIENDLY). The idea is that, if you use the site to [...]



Modified Scraping on the Rise

By Jonathan Bailey • Nov 8th, 2007 • Category: Articles, Legal Issues, News, Personal Experiences, Prevention

It appears that Google’s push to handle duplicate content may be having an unintended side effect.
Even though a recent report by Attributor indicates that the search engine has done a terrible job separating originals from copies, the spammers don’t seem to be taking any chances.
Spam bloggers are no longer content on scraping entries [...]



Attributor Signs Up Reuters

By Jonathan Bailey • Sep 17th, 2007 • Category: Articles, News, Products

In a press release dated today, content monitoring company Attributor announced that they have signed a deal with the British news service Reuters.
This deal closely mirrors a similar arrangement Attributor announced with the Associated Press in May of this year.
According to its press release, Attributor will “fingerprint original Reuters’ content and continuously monitor [...]



RSS Brief: Another Scraping/Spam Threat

By Jonathan Bailey • Sep 14th, 2007 • Category: Articles, Legal Issues, News

Yesterday, the makers of the controversial Pay Per Post service launched a new tool designed to make blog reading faster, RSS Brief.
The idea is that the service takes long posts, like what you might expect here on Plagiarism Today, and condenses them down into a few short sentences.
Though the service sounds convenient and [...]



Legal and Ethical Link Blogging

By Jonathan Bailey • Sep 12th, 2007 • Category: Articles, Legal Issues, News

In a recent post on TechCrunch, Duncan Riley sparked a controversy by saying that the blogs created by Google Reader’s linkblog feature “already break copyright and in a small way undermine blogs and content creators.”
That statement resulted in a flurry of comments with individuals falling on both sides of the debate. The debate then quickly [...]



The DMCA on 7 Blog Hosts

By Jonathan Bailey • Sep 6th, 2007 • Category: Articles, DMCA, DMCA Seven, Legal Issues, Personal Experiences

For the next chapter in the “DMCA Seven” series, we’re taking a look at one of the most common types of hosts out there, blog hosts.
Many of these hosts have been copyright headaches for Webmasters. They are prime targets for spam blogs and scrapers and some have played a huge role in rise of [...]