<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todayfeeds | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/feeds/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Five Years Later: Why RSS Scraping Still is Not OK</title>
		<link>http://www.plagiarismtoday.com/2011/08/17/five-years-later-why-rss-scraping-still-is-not-ok/</link>
		<comments>http://www.plagiarismtoday.com/2011/08/17/five-years-later-why-rss-scraping-still-is-not-ok/#comments</comments>
		<pubDate>Wed, 17 Aug 2011 16:27:57 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[feeds]]></category>
		<category><![CDATA[Implied-License]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[RSS scraping]]></category>
		<category><![CDATA[Scraping]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=10723</guid>
		<description><![CDATA[FIve years after first writing about RSS scraping, the legal realities of scraping haven't changed but the scrapers definitely have.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/08/icon_rss-250x250.png" alt="Sample RSS Icon" title="RSS Icon" width="250" height="250" class="alignleft size-medium wp-image-10732" />Five years ago I penned an article entitled &#8220;<a href="http://www.plagiarismtoday.com/2006/08/29/why-rss-scraping-isnt-ok/">Why RSS Scraping Isn&#8217;t OK</a>&#8220;. The goal of the article was to take a look at the arguments scrapers used, legal and ethical, and explain why the realities of the law were not on their side. </p>
<p>Basically, at that time, RSS scrapers were arguing that, by putting content into an RSS feed, one was giving permission to use it on other sites, essentially creating an implied license to republish it. However, as I talked about in the previous article, the legal realities are much different and RSS scraping without per mission is, almost certainly, a copyright infringement.</p>
<p>However, while the legal realities haven&#8217;t changed much in the past five years, the people doing the scraping have. Spammers and sploggers, now wary of duplicate content issues, have largely abandoned RSS scraping in favor of other techniques. Today, the scrapers are fewer but place themselves as editors, curators and collectors, people building moderated lists of great content.</p>
<p>This shift hasn&#8217;t done much to alter the legal realities of scraping nor has it done much to placate creators who still see this as one of the most common issues they face.</p>
<p>The truth is that, even with this new veneer, RSS scraping is still not legally or ethically acceptable. Whether it&#8217;s curators or spammers, those who scrape from RSS feeds are in a dubious position and one that seems to be getting worse every day.<span id="more-10723"></span></p>
<h4>The Past Five Years Of Law and Scraping</h4>
<p>The past five years of legal history have been strangely quiet on the issue of RSS scraping. Despite how common the behavior is, very few suits have dealt with the issue.</p>
<p>The best known of those cases was <a href="http://www.boston.com/business/ticker/2009/01/nyt_gatehouse_r.html">Gatehouse Media vs. The New York Times</a>. Which saw Gatehouse Media, the owners of &#8220;Wicked Local&#8221; brand sites as well as hundreds of smaller papers, sue the New York for the Times&#8217; scraping of their RSS feeds for inclusion on Boston.com&#8217;s &#8220;Your Town&#8221; section. </p>
<p>The suit only centered around the headlines and excerpts from the stories involved but the Times felt their position was weak enough to warrant settling the matter publicly and quickly. In the end, the New York Times agreed to stop scraping Gatehouse feeds and respect restrictions placed by Gatehouse Media on the content.</p>
<p>Related cases on the issue of data scraping, sometimes called data mining, have largely been equally negative for the scrapers. Though only at the summary judgment phase at last report, the <a href="http://blog.ericgoldman.org/archives/2010/04/court_denies_su_1.htm">Snap-on Business Solutions Inc. v. O&#8217;Neil &#038; Assocs., Inc</a>, highlights the other legal perils of scraping.</p>
<p>In that case, Snap-on produced and maintained a database of auto parts for Mitsubishi. After two years, Mitsubishi began to look at other vendors for the contract but Snap-on would not give up control over the data. Mitsubishi eventually hired an outside contractor, O&#8217;Neil, to scrape the content out of the database and bring it into a new system. When Snap-on learned of the scraping, they filed suit.</p>
<p>In the summary judgement phase of the case, the judge ruled that Snap-on likely had arguments regarding the Computer Fraud and Abuse Act (CFAA), Trespass to Chattels and Breach of Contract. The court rejected a copyright infringement argument, but only because the content copied did not qualify for copyright protection, unlike with RSS feeds.</p>
<p>The case shows, as I pointed out years ago, that <a href="http://www.plagiarismtoday.com/2006/08/24/linkworthy-scraping-as-a-legal-minefield/">scraping is a legal minefield</a>. Even cases that seem to go the way of the scraper, such as the <a href="http://blog.ericgoldman.org/archives/2010/09/antiscraping_la.htm">Cvent, Inc. v. Eventbrite, Inc. case</a>, are highly fact-specific and seem to hinge more on poor case preparation than the law itself. (Note: Even in that &#8220;victory&#8221; the copyright claims and the unjust enrichment claims survived dismissal.)</p>
<p>Instead, most seem to follow the route of the <a href="http://blog.ericgoldman.org/archives/2007/10/ticketmaster_wi.htm">Ticketmaster L.L.C. v. RMG Technologies, Inc.</a> case, a 2007 win for Tickemaster against a sniping service that was snatching up popular tickets using an automated process. In that case, the court ruled RMG was infringing copyright by merely browsing the relevant pages since they were doing so in violation of Tickemtaster&#8217;s &#8220;browserwrap&#8221; license.</p>
<p>In short, the legal realities for scraper are even more bleak than they were five years ago. The implied license argument that&#8217;s so popular among scrapers has been eroded and, all in all, it&#8217;s almost impossible to scrape legally, RSS or otherwise. </p>
<p>Yet, what&#8217;s changed in the last five years isn&#8217;t so much the law, but the scrapers themselves and that&#8217;s where things have truly gotten interesting.</p>
<h4>The Death of the Spammer Scraper</h4>
<p>Back in 2006, your &#8220;typical&#8221; RSS scraper was probably a spammer, someone seeking a quick, hands off way of filling a large number of sites with search engine friendly content to rise in the rankings and, eventually, usurp the original work for certain keywords.</p>
<p>Those days, however, are gone. Though scraping spammers still exist, most spammers moved on from this method as Google and the other search engines improved their duplicate content detection, making it a less effective technique. Methods such as content spinning, content generation and even cheap outsourcing have proved to be more effective and equally reliable.</p>
<p>This decline has largely mirrored the <a href="http://www.readwriteweb.com/archives/rss_reader_market_in_disarray.php">overall decline in traditional (reader-based) RSS usage</a>. RSS is falling out of vogue, at least as a tool for scraping and reading, but not as a tool for &#8220;curating&#8221;. </p>
<p>The reason is that tools for integrating RSS into existing websites have grown much more common and easier to use in the past five years. Though some were developed for the use of spamming, other tools were meant to allow authors to integrate all of their sites in one place. However, some authors have latched onto these tools as a way of bringing in the work of others without permission.</p>
<p>This has created a situation where the people doing the scraping are fewer in number, but likely much more dangerous. Where search engines were relatively effective at filtering out spammers, these sites tend to appear to be much more legitimate, increasing the likelihood they could be mistaken as originals.</p>
<p>Fortunately, the law doesn&#8217;t make a great distinction between spammers and those who scrape with less nefarious intentions, but many who engage in this practice have, according to emails I&#8217;ve seen, have claimed to have an ethical or even legal right to engage in the scraping, calling themselves &#8220;editors&#8221;.</p>
<p>This has set the stage for some ugly battles that, while they haven&#8217;t reached the courtroom yet, have certainly been heated on the Web.</p>
<p>Indeed, this argument seems to be one that&#8217;s moving out of the courtroom and into the court of public opinion, a place where it&#8217;s likely to stay given how straightforward the legal issues seem.</p>
<h4>Bottom Line</h4>
<p>In the end, consider that the New York Times Company, one of the most powerful media institutions on the planet, couldn&#8217;t or didn&#8217;t want to defend scraping of just headlines and summaries, there&#8217;s little hope for a successful defense of full RSS scraping. This is especially true in the light of other, related scraping cases.</p>
<p>However, those who want to scrape and those who are willing to allow their feeds to be scraped do have options. Creative Commons, for example, <a href="http://wiki.creativecommons.org/Syndication">has modules for RSS feeds</a> that enable applications to detect what they are allowed to do with a feed. </p>
<p>To those who don&#8217;t wish to allow it, I encourage you to put in your feed itself a notice stating that you do not wish to allow republishing and that the feed is for private personal use only. Though it shouldn&#8217;t be necessary under the law, it&#8217;s a wise move that blocks many of the potential arguments a scraper might raise. Furthermore, such footers can greatly help with the detection of scrapers.</p>
<p>All in all, RSS scraping has definitely changed in terms of who is using it and why, but the threat isn&#8217;t all that different and the legal realities have hardly changed at all. </p>
<p>This means that RSS scraping can be easily fought, just that the people you&#8217;re moving against may be a bit more vocal in their views. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2011/08/17/five-years-later-why-rss-scraping-still-is-not-ok/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>The FeedBurner Question</title>
		<link>http://www.plagiarismtoday.com/2007/12/05/the-feedburner-question/</link>
		<comments>http://www.plagiarismtoday.com/2007/12/05/the-feedburner-question/#comments</comments>
		<pubDate>Wed, 05 Dec 2007 18:44:14 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[FeedBurner]]></category>
		<category><![CDATA[feeds]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[RSS scraping]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/12/05/the-feedburner-question/</guid>
		<description><![CDATA[When I started Plagiarism Today approximately two and a half years ago, FeedBurner seemed like a Godsend. Not only was it a powerful way to track your feed&#8217;s usage, but it was also an easy way to customize the feed&#8217;s features, all while ensuring compatibility across the board. FeedBurner took the complicated black art of...]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/21374257@N02/2088757709" title="View 'fblogo.png' on Flickr.com"><img src="http://static.flickr.com/2095/2088757709_6f9bced237.jpg" hspace="10" alt="fblogo.png" border="0" width="" height="" align="left" /></a>When I started Plagiarism Today approximately two and a half years ago, <a href="http://www.feedburner.com">FeedBurner</a> seemed like a Godsend. Not only was it a powerful way to track your feed&#8217;s usage, but it was also an easy way to customize the feed&#8217;s features, all while ensuring compatibility across the board.</p>
<p>FeedBurner took the complicated black art of feed management/tracking and whittled down into an easy &#8220;set and forget&#8221; service.</p>
<p>The case for FeedBurner was further bolstered in March of last year when it <a href="http://www.plagiarismtoday.com/2006/03/02/feedburner-fights-splogging-and-scraping/">released its &#8220;Uncommon Uses&#8221; feature</a>. With that upgrade, FeedBurner also become the most powerful tool available for fighting scraping.</p>
<p>However, since then, plugins such as <a href="http://redalt.com/Resources/Plugins/AntiLeech">AntiLeech</a> and <a href="http://wordpress.org/extend/plugins/copyfeed/">CopyFeed</a> have become popular and widely used, giving new powers to those who host their own feed. Meanwhile, FeedBurner was <a href="http://blog.tmcnet.com/blog/tom-keating/google/google-acquires-feedburner.asp">acquired by Google</a> and the entire service seems to have shifted its focus.</p>
<p>Now, with those developments in mind, it is time to take another look at FeedBurner and decide if the trade off is worth it or not.<br />
<span id="more-746"></span><br />
<strong>The Case For FreedBurner</strong></p>
<p>Those who support FeedBurner note that the service has a set of powerful features for bloggers. They offer statistic tracking, customization and feed monitoring. It also offloads the feed to their servers, great for sites with a lot of feed subscribers, and ensures that the feed remains active even if the original site goes down.</p>
<p>Because of the appeal of those services, they have been picked to manage well over 1.1 million feeds and tens of millions of subscriptions. This has given them an almost God-like understanding of how feeds are used and where they are used.</p>
<p>That knowledge is put to use in their uncommon uses feature, which tracks uses of your feed that don&#8217;t fit the typical pattern. You can check your uncommon uses dialog on a regular basis and discover at least some cases of feed scraping.</p>
<p>FeedBurner also has the ability to perform many plugin-like functions from within their system. You can add copyright notices to your feed, affix a Creative Commons License or create a summary feed if you wish, all without modifying your original feed. </p>
<p>All of this is from a service that, since the purchase by Google, is completely free and is filled with other great features such as chicklets, feed counters and a robust API.</p>
<p>To fans of FeedBurner, there is no question, FeedBurner is a powerful service that provides important features at a price you can not beat. </p>
<p><strong>The Case Against FeedBurner</strong></p>
<p>When you offload your RSS feed to a third party, as with FeedBurner, it comes with drawbacks. The biggest is that you lose all ability to block any scraping of your feed. </p>
<p>WordPress users who have their own server and host their own feed can take advantage of plugins such as AntiLeech and CopyFeed, both of which make it possible to block abuse of the feed before it happens. Meanwhile, FeedBurner has <a href="http://forums.feedburner.com/viewtopic.php?p=43775">gone on the record</a> saying that they &#8220;cannot remove your feed from sites which have resyndicated the content.&#8221;</p>
<p>This means that, by in large, FeedBurner users have to focus on detecting scraping and stopping it, rather than blocking it at the source.</p>
<p>However, those detection tools seem to be somewhat flawed as well. Though their uncommon uses tool is powerful, mine seems to report my site as being the number one uncommon use.</p>
<p><a href="http://www.flickr.com/photos/21374257@N02/2088758539" title="View 'uncommonsues.png' on Flickr.com"><img src="http://farm3.static.flickr.com/2409/2088758539_aa488d7522.jpg" alt="uncommonsues.png" border="0" width="500" height="261" /></a></p>
<p>As scraping has moved away from traditional RSS republishing and into scraping search results and summaries from third parties, the power of the uncommon uses tool wanes. Simply put, scrapers just aren&#8217;t grabbing the original feed at the rate they used to.</p>
<p>Since most of FeedBurner&#8217;s features can be mimicked using other tools that don&#8217;t require you to offload your feed, many feel that it is time to abandon the service and take direct control of our RSS content.</p>
<p><strong>Conclusions</strong></p>
<p>There are no easy answers here. The only universal truth that I see is that FeedBurner is a must-have for anyone using a Blogspost, WordPress.com, or other free blogging service. Since you don&#8217;t have direct control over your feed to begin with, you sacrifice nothing by using FeedBurner and gain a wealth of information and tools.</p>
<p>But for those of us with our own servers, the choice is more difficult. FeedBurner might make sense on sites where the feed traffic is very high and could cause a drain on the service. It also makes sense if there are reliability issues that could negatively impact the user experience.</p>
<p>However, for most users, it would seem that you can get more features and better protection by learning the tools available to you and taking control of your own feed. That being said, those without a lot of computer experience may benefit from FeedBurner, but would still likely be better served by hitting the books.</p>
<p>Personally, I&#8217;ve had a growing case of &#8220;buyers remorse&#8221; in the past year since <a href="http://www.plagiarismtoday.com/2007/01/18/safer-without-feedburner/">I wrote about this issue</a>. I&#8217;ve been reading closely the <a href="http://blogs.feedburner.com/feedburner/archives/001251.html">FeedBurner exit strategy</a> and considering the move.  </p>
<p>I haven&#8217;t done it yet, largely because I fear losing subscribers and do enjoy many of the features of FeedBurner, including the reliability, but there is little doubt in my mind, and in the mind of others that I&#8217;ve talked with, that FeedBurner has stepped away from this issue for the most part and that has sped up greatly since the acquisition.</p>
<p>Still, I want to leave the question with you, is FeedBurner a service to use or a service to avoid? If you were starting a site today, what choice would you make?</p>
<p>Leave a comment below or <a href="http://www.plagiarismtoday.com/contact-pt/">drop me a line</a> with your thoughts. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/12/05/the-feedburner-question/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>CopyFeed Plugin Now Available in English</title>
		<link>http://www.plagiarismtoday.com/2007/05/24/copyfeed-plugin-now-available-in-english/</link>
		<comments>http://www.plagiarismtoday.com/2007/05/24/copyfeed-plugin-now-available-in-english/#comments</comments>
		<pubDate>Thu, 24 May 2007 18:29:49 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[feed]]></category>
		<category><![CDATA[feeds]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Plugin]]></category>
		<category><![CDATA[plugins]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Wordpress]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/05/24/copyfeed-plugin-now-available-in-english/</guid>
		<description><![CDATA[With my recent move this story fell through the cracks a bit, but I wanted to let everyone know that the CopyFeed WordPress Plugin, previously reported on here, has now been translated into English. I took the time today and installed the plugin here on my feed and am, so far, loving it. It is...]]></description>
			<content:encoded><![CDATA[<p>With my recent move this story fell through the cracks a bit, but I wanted to let everyone know that the <a href="http://wordpress.org/extend/plugins/copyfeed/">CopyFeed WordPress Plugin</a>, previously reported on <a href="http://www.plagiarismtoday.com/2007/05/04/wordpress-plugin-copyright-feed/">here</a>, has now been translated into English.</p>
<p>I took the time today and installed the plugin here on <a href="http://feeds.feedburner.com/PlagiarismToday">my feed</a> and am, so far, loving it. It is a very powerful plugin and seems to do everything that it advertises. Though I am not using the IP identification tool due to my use of <a href="http://www.feedburner.com">FeedBurner</a>, I am using it to place a small copyright notice and the digital fingerprint.</p>
<p>It also corrects other feed-related issues, such as WordPress artificially truncating entries that use the &#8220;MORE&#8221; tag, and adds the ability to insert comments and related posts into the feed.</p>
<p>The only frustration I&#8217;ve had is that you need to be careful to save your options before you update the preview, otherwise, you will likely lose all of your work.</p>
<p>Beyond that, if it works correctly with this post, it is a powerful plugin that combines functionality that was once spread across nearly a half dozen plugins.</p>
<p>I would encourage every WordPress using that is running their own installation to look at getting it.</p>
<p><strong>Update:</strong> I checked the output from this feed after I published the post and it caused an error with the feed, removing all paragraph breaks in the post itself. I&#8217;m going to contact the creator of the plugin and see if this is a problem he is aware of.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/05/24/copyfeed-plugin-now-available-in-english/feed/</wfw:commentRss>
		<slash:comments>31</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 05:49:39 -->
