<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism TodayReddit | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/reddit/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Eps 224 &#8211; Going Dark</title>
		<link>http://www.plagiarismtoday.com/2012/01/13/eps-224-going-dark/</link>
		<comments>http://www.plagiarismtoday.com/2012/01/13/eps-224-going-dark/#comments</comments>
		<pubDate>Fri, 13 Jan 2012 22:20:05 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Podcast]]></category>
		<category><![CDATA[anonymous]]></category>
		<category><![CDATA[capitol records]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[ddos]]></category>
		<category><![CDATA[finland]]></category>
		<category><![CDATA[netherlands]]></category>
		<category><![CDATA[ninjavideo]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[redigi]]></category>
		<category><![CDATA[sopa]]></category>
		<category><![CDATA[The-Pirate-Bay]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=12244</guid>
		<description><![CDATA[The Pirate Bay to be blocked in the Netherlands and Finland, SOPA sparks Reddit protest, Capitol Records sues "used" digital music celler and more!]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2012/01/reddit-logo-300x115.jpg" alt="Reddit Logo" title="Reddit Logo" width="300" height="115" class="alignleft size-medium wp-image-12246" /></p>
<p>It is Friday again and that means that it is time for another episode of the Copyright 2.0 Show.</p>
<p>This week&#8217;s a show about site blocking, specifically blocking pirate sites. Not only do we have an update on SOPA and PIPA, but we also have news of The Pirate Bay being blocked in both the Netherlands and Finland. However, once we&#8217;re done with that we take another look at the idea of selling &#8220;used&#8221; music tracks and even a glance at the overall opinion of file sharing in the U.S.</p>
<p>All in all, this is an episode you do not want to miss so start your (legal) downloads!</p>
<p>This week&#8217;s stories include:</p>
<ul id="null">
<li>Netherlands Moves to Block The Pirate Bay</li>
<li>Pirate Bay Block in Finland Spawns DDOS Attacks</li>
<li>Reddit Plans Major SOPA Protest</li>
<li>Capitol Records Sues ReDigi Over &#8220;Used&#8221; Audio Files</li>
<li>NinjaVideo.net Admin Sentenced</li>
<li>Much, Much More!</li>
</ul>
<p>You can <a href="http://recordings.talkshoe.com/TC-22590/TS-580058.mp3">download the MP3 file here</a> (direct download). Those interested in subscribing to the show can do so via <a href="http://www.copyright20.com/podcasts/rss">this feed</a>.</p>
<p><a href="http://www.diigo.com/list/plagiarismtoday/episode-224">Show Notes</a></p>
<h4>About the Hosts</h4>
<p><strong>Jonathan Bailey</strong></p>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/06/jonathan-box-150x150.png" alt="jonathan-box" title="jonathan-box" class="alignleft size-thumbnail wp-image-3842" height="150" width="150"></p>
<p>Jonathan Bailey (<a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>) is the Webmaster and author of Plagiarism Today (Hint: You&#8217;re there now) and works as a copyright and plagiarism consultant. Though not an attorney, he has resolved over 700 cases of plagiarism involving his own work and has helped countless others protect their work and develop strategies for making their content work as hard as possible toward their goals.</p>
<p><strong>Patrick O&#8217;Keefe</strong></p>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/06/patrick.jpg" alt="patrick" title="patrick" class="alignright size-full wp-image-3848" height="150" width="150"></p>
<p>Patrick O&#8217;Keefe (<a href="http://twitter.com/iFroggy">@iFroggy</a>) is the owner of the <a href="http://www.ifroggy.com">iFroggy Network</a>, a network of websites covering various interests. He&#8217;s the author of the book <a href="http://www.managingonlineforums.com/">&#8220;Managing Online Forums,&#8221;</a> a practical guide to managing online communities and social spaces. He maintains a blog about online community management at <a href="http://www.managingcommunities.com/">ManagingCommunities.com</a> and a personal blog at <a href="http://www.patrickokeefe.com/">patrickokeefe.com</a>.</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=9,0,0,0" id="LastFramePlayer" align="top" height="60" width="173"><param name="allowScriptAccess" value="always"><param name="allowFullScreen" value="false"><param name="movie" value="http://www.talkshoe.com/resources/talkshoe/images/swf/lastEpisodePlayer.swf?fileUrl=http://recordings.talkshoe.com/TC-22590/TS-580058.mp3"><param name="quality" value="high"><param name="bgcolor" value="#EEF9C1"><param name="wmode" value="transparent"><embed src="http://www.talkshoe.com/resources/talkshoe/images/swf/lastEpisodePlayer.swf?fileUrl=http://recordings.talkshoe.com/TC-22590/TS-580058.mp3" quality="high" bgcolor="#EEF9C1" play="true" loop="true" scale="exactfit" name="LastFramePlayer" salign="lt" allowscriptaccess="always" allowfullscreen="false" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" wmode="transparent" align="top" height="60" width="173"></object></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2012/01/13/eps-224-going-dark/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
<enclosure url="http://recordings.talkshoe.com/TC-22590/TS-580058.mp3" length="38976784" type="audio/mpeg" />
		</item>
		<item>
		<title>3 Count: Dutch Embargo</title>
		<link>http://www.plagiarismtoday.com/2012/01/11/3-count-dutch-embargo/</link>
		<comments>http://www.plagiarismtoday.com/2012/01/11/3-count-dutch-embargo/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 20:02:30 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[AP]]></category>
		<category><![CDATA[blocking]]></category>
		<category><![CDATA[brein]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[dutch]]></category>
		<category><![CDATA[isps]]></category>
		<category><![CDATA[netherlands]]></category>
		<category><![CDATA[New York Times]]></category>
		<category><![CDATA[newsright]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[sopa]]></category>
		<category><![CDATA[The-Pirate-Bay]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=12214</guid>
		<description><![CDATA[Dutch court orders ISPs to block The Pirate Bay, AP and New York Times open NewsRight and Reddit to go dark to protest SOPA. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/07/3count004-trim.png" alt="" title="3count004-trim" class="alignleft size-full wp-image-7303" height="162" width="175"></p>
<p><em>Have any suggestions for the 3 Count? Let me know via Twitter <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://www.npr.org/templates/story/story.php?storyId=145032010">Dutch Court Orders Companies To Block Pirate Bay</a></h4>
<p>First off today, a Dutch court has ordered two of the country&#8217;s biggest ISPs, Ziggo and XS4ALL, to block subscriber access to The Pirate Bay. The action comes after a judge ruled that the reciprocal nature of Bittorrent makes it so that users can not merely download pirated content but must upload as well. In the Netherlands, downloading unlicensed content is legal but uploading it is not. The lawsuit, which was brought by the local anti-piracy outfit BREIN, may be appealed though the ISPs will have to comply with the order as it goes through. BREIN has also said it will be taking similar action against other ISPs in the country.</p>
<h4>2: <a href="http://www.washingtonpost.com/business/ap-nytimes-mcclatchy-others-launch-newsright-online-rights-clearinghouse/2012/01/05/gIQAgBwxcP_story.html">AP, NYTimes, McClatchy, Others Launch NewsRight Online Rights Clearinghouse</a></h4>
<p>Next up today, the AP, New York Times and 26 other news organizations have formed a company named NewsRight to track, monitor and enforce copyright in news-related content. The company, which opened formally last week, is composed of 11 people and is said to be using technology to track news content, understand how it&#8217;s being used and, eventually, seek licensing agreements for it. According to a representative for the company, the first year will be spent mostly learning the market for this service with a focus on selling the data its content tracking can generate.</p>
<h4>3: <a href="http://arstechnica.com/tech-policy/news/2012/01/reddit-going-dark-for-a-day-to-protest-sopa-online-censorship-bill.ars?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=rss">Reddit Going Dark for a Day to Protest SOPA Online Censorship Bill</a></h4>
<p>Finally today, the popular social news site Reddit is planning to shut down for a period of 12 hours on January 18 in protest of the Stop Online Piracy Act. The act, which would make it possible for copyright holders to obtain court orders forcing ISPs to block access to &#8220;rogue&#8221; websites, has been the source of controversy online and the Reddit community has been instrumental in organizing opposition to it. Reddit is also attempting to get Google and other major sites to join them with the blackout.</p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today. We will be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.plagairsimtoday.com/podcast">every Wednesday evening at 5 PM ET for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Friday right here on Plagiarism Today</a>. </p>
<p><em>The 3 Count Logo was created by <a rel="nofollow" href="http://www.cloudjunkies.com/">Justin Goff</a> and is licensed under a <a rel="nofollow" href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</a>. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2012/01/11/3-count-dutch-embargo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DMCA Takedown Fail</title>
		<link>http://www.plagiarismtoday.com/2009/01/06/dmca-takedown-fail/</link>
		<comments>http://www.plagiarismtoday.com/2009/01/06/dmca-takedown-fail/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 18:15:56 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Chilling Effects]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[takedown]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=2396</guid>
		<description><![CDATA[When does a successful DMCA takedown not actually disable access to the work? When the notice is public and includes the full URL.]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2009/01/digg-logo.png"><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2009/01/digg-logo.png" alt="digg-logo" title="digg-logo" width="222" height="142" class="alignleft size-full wp-image-2418" /></a>One of my favorite projects on the Web regarding the DMCA is <a href="http://www.chillingeffects.org/">Chilling Effects</a>. Though the project has not been updated much in recent months, I&#8217;ve always felt that the idea of publicizing DMCA notices and making them a matter of record not only is important in understanding the impact of the law, but in reducing the number of false notices.</p>
<p>However, having DMCA notices a matter of public record does create a problem that I had not previously considered. As my good friend and co-host of the <a href="http://copyright20.com/">Copyright 2.0 Show</a> Patrick O&#8217;Keefe found out, it can actually hinder the takedown process.</p>
<p>Because, when the goal is to remove a link to a potentially infringing work, making that link public can only cause more problems down the road.<span id="more-2396"></span></p>
<h4>An Example</h4>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2009/01/managing-cover.png"><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2009/01/managing-cover.png" alt="managing-cover" title="managing-cover" width="157" height="184" class="alignright size-full wp-image-2421" /></a>In early December, O&#8217;Keefe noticed that a copy of his book was being made available for illegal download and was promoted on at least one likely spam blog. The spammer, as part of his promotion effort had also <a href="http://digg.com/educational/Managing_Online_Forums" rel="nofollow">submitted the article to Digg</a>, which not only propelled it toward the top of Google&#8217;s results for the book, but risked it reaching the front page.</p>
<p>Patrick took action and filed a DMCA notice with Digg. Since Digg is an &#8220;information location tool&#8221; as described in the DMCA, Digg was obligated to remove the link under the law (or be held potentially liable for the infringement) and did so. </p>
<p>However, Digg, as is their right, <a href="http://www.chillingeffects.org/notice.cgi?sID=9056" rel="nofollow">submitted the notice to Chilling Effects</a> and then linked tot he notice when it was posted. The only problem was that the posting on Chilling Effects came with the complete URL for the infringing site. Though it was not a &#8220;live&#8221; link, one could easily copy and paste it into their browser. </p>
<p>The end result of this was that instead of Digg linking to a page with the book for download, Digg now links to a page on Chilling Effects that has the full URL to the download page. Clearly, this doesn&#8217;t achieve the goal of the takedown nor does it do much to discourage spam from appearing on Digg. After all, the spammer is getting traffic from the site, just indirectly.</p>
<p>It&#8217;s clear that the system has broken down here, fortunately though, it should be pretty simple to fix.</p>
<h4>Simple Solutions</h4>
<p>Given <a href="http://searchengineland.com/digg-gives-in-to-user-revolt-are-those-dmca-takedown-notices-even-valid-11115">Digg&#8217;s history with the DMCA</a>, it is understandable that they would be uneasy about responding to notices and want to do everything they can to protect themselves, including using Chilling Effects. As I&#8217;ve said earlier, I am very fond of what Chilling Effects is trying to do and wish the site were updated more often.</p>
<p>However, it defeats the purpose of filing a DMCA notice from the search engines if the infringing URL is placed in full in the notice, making it so that the content can be trivially found. I want to encourage Chilling Effects to post notices, hosts to use the site and for them to link to the notices, but not if doing so negates the purpose of filing a takedown.</p>
<p>The solution is fairly simple, since Chilling Effects already redacts large portions of the notices is posts, namely the private information of the filer, it makes sense to also remove all or part of the infringing URL. Though it is an important part of the notice, leaving it in negates the purpose of a section 512(d) takedown (information location tools) by simply redirecting the user to the allegedly infringing site.</p>
<p>This is true with all DMCA notices to social news and search engine sites, including Google. </p>
<p>As long as Chilling Effects posts the full URL, it will be very difficult for copyright holders to correctly apply the DMCA and it will likely hinder many hosts, search engines and news sites from submitting the notices they receive. After all, what is the point of them removing a link and submitting a notice when it will just result in the link being reposted, albeit in a roundabout way.</p>
<p>Simply put, there has to be some middle ground that can display enough information in the URL, without hindering the DMCA process.</p>
<h4>Conclusions</h4>
<p>I&#8217;ve written before about the need for <a href="http://www.plagiarismtoday.com/2008/03/20/the-need-for-dmca-transparency/">greater DMCA transparency</a> and I think that Chilling Effects can be a large part of the solution. Howver, being part of the solution means observing and reporting on the DMCA process, not hindering it, even by accident.</p>
<p>I certainly don&#8217;t feel that Chilling Effects is malicious in this nor do I think they did any serious harm in this case. In the end, O&#8217;Keefe was able to get the PDF file removed from the file server it was stored on through another DMCA process, thus ensuring that it couldn&#8217;t be downloaded even if the spam blog remained up, which it has.</p>
<p>Though this story has a happy ending, it does highlight a strange problem with the DMCA process when it is move into the public light. </p>
<p><em>Note: I contacted <a href="http://cyber.law.harvard.edu/people/wseltzer">Wendy Seltzer</a>, the project leader at Chilling Effects on 12/23/08 but have not heard back from her. I will update this post if I receive a response. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/01/06/dmca-takedown-fail/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Review: The Plagiarism Checker</title>
		<link>http://www.plagiarismtoday.com/2008/12/16/review-the-plagiarism-checker/</link>
		<comments>http://www.plagiarismtoday.com/2008/12/16/review-the-plagiarism-checker/#comments</comments>
		<pubDate>Tue, 16 Dec 2008 17:45:07 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[plagiarism checking]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Social-News]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=2283</guid>
		<description><![CDATA[The rebirth of "The Plagiarism Checker" has made waves throughout social news sites and Twitter alike, but is the site worth the attention it has been getting?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-logo-300x39.png" alt="plagiarism-checker-logo" title="plagiarism-checker-logo" width="300" height="39" class="alignleft size-medium wp-image-2298" />Late last week, <a href="http://www.reddit.com/r/reddit.com/comments/7j4e6/the_plagiarism_checker_i_made_this_site_in_2002/">a post reached the front page of Reddit</a> that piqued the curiosity of copyright holders, teachers and professors alike. It was about a service called &#8220;<a href="http://www.dustball.com/cs/plagiarism.checker/">The Plagiarism Checker</a>&#8221; (dubbed by me the &#8220;Dustball&#8221; checker due to its domain), created by <a href="http://www.linkedin.com/in/brianklug">Brian Klug</a> in 2002, when he was a student at the University of Maryland at College Park, and abandoned until recently this year.</p>
<p>The site, according to Klug, was getting about 2,000 visits per day when it was forgotten but is almost certainly doing much better now as it has taken off, attracting countless Twitter Tweets and other social news attention. Librarians and teachers are especially captivated by this site.</p>
<p>But is &#8220;The Plagiarism Checker&#8221; worth using? Is it as powerful of a tool as some, although not the site itself, have made it to be? The sad answer is no, but it could, with a few simple tweaks, become a much more useful service for teachers and bloggers alike.<span id="more-2283"></span></p>
<h4>How it Works</h4>
<p>The basic premise of the minimalist site can be summed up by its instructions:</p>
<blockquote><p>Cut &#038; paste your students paper or homework assignment into the box below, and click the &#8220;check&#8221; button.  This free plagiarism detector will find plagiarized text in homework and other essays/reports.</p></blockquote>
<p>In short, you take an essay, article or other lengthy prose work, paste it into a textbox and hit &#8220;check&#8221;. From there, the site extracts several strings of text, runs them through Google and compiles the result, determining whether plagiarism is probable.</p>
<p>In that regard, the idea is actually very similar to Copyscape, which also uses Google via their API, to process results. However, where Copyscape&#8217;s keeps the &#8220;magic&#8221; hidden from the user, the &#8220;Dustball&#8221; plagiarism checker includes links to the Google results, encouraging users to click through and research the case for themselves.</p>
<p>That alone is a big part of the problem Webmasters, and many teachers, will have with the service. Where Copyscape, as well as academic tools such as TurnItIn, provide very simple and colorful results, The Plagiarism Checker is a very bare-bones approach, requiring the user to perform a large amount of research on their own.</p>
<p>Still, a bit of research will be welcomed if the service produces great results, unfortunately, it seems that the service performs only lukewarm, at best.</p>
<h4>My Tests</h4>
<p>To test the service, I decided to run it through a <a href="http://www.plagiarismtoday.com/2005/06/28/copyscape-not-ready-for-prime-time/">similar battery of tests</a> that I had run Copyscape through and then watched as they <a href="http://www.plagiarismtoday.com/2007/10/02/copyscape-improved-again/">improved upon the initial results</a>. </p>
<p>The first test was to run <a href="http://www.ravensrants.com/in-the-dark/print/">an old poem of mine</a> through the system, one that allegedly has over 300 matches in Google. However, that test was thwarted as The Plagiarism Checker refused to even look at the work, saying that it could not function with such short text strings.</p>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-error.png"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-error-300x83.png" alt="plagiarism-checker-error" title="plagiarism-checker-error" width="300" height="83" class="alignnone size-medium wp-image-2284" /></a></p>
<p>I then shifted gears and started using prose works, <a href="http://www.ravensrants.com/loner/print/">the first being one</a> that had <a href="http://www.google.com/search?q=%22there+is+always+a+person+sitting+alone+in+a+corner+not+engaging+in+conversation%22&#038;hl=en&#038;client=firefox-a&#038;rls=org.mozilla:en-US:official&#038;hs=0aI&#038;filter=0">36 matches in Google</a> at the time I did the search. The result was stunning. </p>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-none-found.png"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-none-found-300x138.png" alt="plagiarism-checker-none-found" title="plagiarism-checker-none-found" width="300" height="138" class="alignnone size-medium wp-image-2287" /></a></p>
<p>Despite the fact Google had reported three dozen matches on test snippets from the work itself, the &#8220;Dustball&#8221; checker was unable to find anything. To make matters worse, using some of the sample quotes from the test, I was able to locate other copies of the work, <a href="http://www.google.com/search?q=%22every+crowd+big+or+small+there+is+always+a+person%22&#038;ie=utf-8&#038;oe=utf-8&#038;aq=t&#038;rls=org.mozilla:en-US:official&#038;client=firefox-a">such as with the first quote</a>.</p>
<p>Clearly, The Plagiarism Checker was missing results that Google was finding, meaning it was discarding them for whatever reason.</p>
<p>A similar test for <a href="http://www.ravensrants.com/trees/print/">another prose work</a> only returned one sentence that was matched against anything and the results for it were all false positives. This work, in Google, <a href="http://www.google.com/search?q=%22trees+of+nature+that+I+hold+so+dear+will+soon%22&#038;hl=en&#038;client=firefox-a&#038;rls=org.mozilla:en-US:official&#038;hs=tLw&#038;filter=0">has six results</a>.</p>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-none-found4.png"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-checker-none-found4-300x146.png" alt="plagiarism-checker-none-found4" title="plagiarism-checker-none-found4" width="300" height="146" class="alignnone size-medium wp-image-2289" /></a></p>
<p>The only search using the service that seemed to work remotely well was when I ran the <a href="http://www.ushistory.org/Declaration/document/index.htm">Declaration of Independence</a> through it. Every search term, in this test, came back positive. </p>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-found.png"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/12/plagiarism-found-300x175.png" alt="plagiarism-found" title="plagiarism-found" width="300" height="175" class="alignnone size-medium wp-image-2293" /></a></p>
<p>It appears that text that is not widely distributed around the Web may or may not show up as plagiarized in this work, something that has me very worried as many are starting to rely on this plagiarism checker as their main tool for detecting both copyright infringement and the plagiarism of students.</p>
<h4>The Sad Truth</h4>
<p>Simply put, any and all of these search results should have come back as being plagiarized. Even if there were no other matches of the content, these works existed on my site and are available through Google there. There is no reason that any of these works should have come back as anything short of 100% plagiarized since this site can not know I was the one submitting them.</p>
<p>For teachers, this is not good news. Is a student plagiarizes material from obscure sources, they are likely to escape detection. Likewise, Webmasters and those that might want to use this tool to track their own content, will likely be disappointed that it doesn&#8217;t seem to pick up when the infringement is only a few dozen sites. </p>
<p>This can most likely be fixed through tweaks in the algorithm, but as it sits right now, it doesn&#8217;t appear that it has much to offer teachers or Webmasters, especially when <a href="http://www.copyscape.com">Copyscape</a> is relatively effective and cheap to use.</p>
<p>Simply put, at this moment, Copyscape is easier, more effective and faster than The Plagiarism Checker and, at only five cents a search, is affordable too.</p>
<p>However, the best technique still appears to be taking the time to select good phrases from a work and manually searching for those. It returns the most results and seems to work well nearly all of the time.</p>
<h4>The Big Picture</h4>
<p>My issue with The Plagiarism Checker has less to do with the service itself and more to do with how others have been promoting it. The site itself is actually fairly humble about what it can do, but bloggers and Twitter users have been advertising it as if it were a silver bullet to detect plagiarism. Clearly, that is not the case.</p>
<p>With a few tweaks and fixes to the algorithm, I don&#8217;t doubt that this service, much like Copyscape, could become a very powerful tool. However, even if the results were on par with Copyscape, the latter remains faster and easier to use, meaning that there will not be much reason to use the &#8220;Dustball&#8221; checker.</p>
<p>To make matters worse, most teachers and professors have access to services such as TurnItIn that are far more accurate and covers a much larger breadth of sources than &#8220;The Plagiarism Checker&#8221;. Considering the ease of us and added features, there is not much that can be gleaned from a Google-only search, that can&#8217;t be gleaned from the more automated service (Though <a href="http://www.plagiarismtoday.com/2008/11/04/copyscape-tops-plagiarism-checker-testing/">Copyscape did top Turnitin in a recent plagiarism detection study</a>). </p>
<p>In short, I don&#8217;t see much usefulness for this tool, even if its accuracy improves, and I and more than a little confused as to why so many seem to have promoted it so heavily.</p>
<h4>Conclusions</h4>
<p>More than anything, this is a case against the reliance on any one plagiarism checking service. Even the best services will let results slip through the cracks. Furthermore, just because a service is popular does not mean that it should be trusted above all.</p>
<p>However, I find it very difficult to fault The Plagiarism Checker for this confusion and these problems. It is clear that the service was as much an experiment as anything, it is promoted humbly and was actually abandoned for approximately six years. It was others, perhaps desperate for some way to more effectively detect plagiarism, that gave it an unjustified reputation.</p>
<p>If anything, this case shows the need and the potential market for such services and illustrates why some companies have made millions in this field. People are eager for a solution and are excited by any promise of one.</p>
<p>Sadly though, this site is not the one people are looking for.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/12/16/review-the-plagiarism-checker/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Fragmented Conversations and Blogging</title>
		<link>http://www.plagiarismtoday.com/2008/04/16/fragmented-conversations-and-blogging/</link>
		<comments>http://www.plagiarismtoday.com/2008/04/16/fragmented-conversations-and-blogging/#comments</comments>
		<pubDate>Wed, 16 Apr 2008 19:00:54 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[comments]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[shyftr]]></category>
		<category><![CDATA[Social-News]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogging]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=937</guid>
		<description><![CDATA[The notion of fragmented conversations has drawn a lot of attention over the past few weeks. However, what are the real implications solutions to the problem?]]></description>
			<content:encoded><![CDATA[<p><img class="picleft" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/04/shyftr-logo.jpg" border="0" alt="shyftr-logo.jpg" width="241" height="72" align="left" />One of the interesting side notes to the recent <a title="The Shyftr Controversy" href="http://www.blogherald.com/2008/04/14/the-shyftr-saga/">controversy over Shyftr</a> was that Shyftr, as a service, not only scraped and displayed the full RSS feed, but that it also encouraged users to comment on the blog post at Shyftr itself, rather than visiting the original site.</p>
<p>Some, including <a title="Eric Berlin on Shyftr" href="http://onlinemediacultist.com/2008/04/12/an-argument-against-shyftr-and-communities-built-around-full-text-rss-feeds/">Eric Berlin</a>, were very opposed to this while still others, including <a title="Louis Gray" href="http://www.louisgray.com/live/2008/04/should-fractured-feed-reader-comments.html">Louis Gray</a>, saw this as a natural evolution.</p>
<p>But while it is clear that bloggers are interested in reading what others have to say about their content and many work to actively encourage comments on their site, it is unclear where that conversation is going to take place in the next few years.</p>
<p>As more RSS readers, including those that do not republish content publicly, offer users the ability to comment to feeds, the dialog is spreading out.</p>
<p>However, this may not be a bad thing entirely, but rather, an opportunity for someone to help solve a very complicated problem.<br />
<span id="more-937"></span></p>
<h4>How Fractured Conversations Hurt</h4>
<p>The problem with fractured comments is that, most likely, there is only a finite number of people interested in a specific post.</p>
<p>If a post does well, it draw a dozen commenters. However, while a dozen people talking about a post in one place makes for a robust conversation, if they comment at twelve different places, then they are just twelve people shouting into deep space.</p>
<p>Though more commenting opportunities may actually draw more people to the dialog, since people typically do not like repeating themselves, it is unlikely that the number will offset how fragmented the conversation has become.</p>
<p>For example, even if the conversation with a dozen people draws in three times as many commenters, that will only leave an average of three people per location, hardly a robust dialog.</p>
<p>Worse, a fragmented conversation is much harder for the blog author to follow and participate in. If a conversation takes place on six different sites, the blog author has to check all of them and then, if interested, respond to each of them.</p>
<p>While certainly a possibility, it assumes that the blogger is even aware of all the conversations, something that is unlikely given how large the Web is. The result is that these conversations almost always exclude the actual author of the entry, just because they are unaware of the existence of the exchange.</p>
<p>For bloggers hungry for feedback, this is very frustrating and very worrisome. In many cases, the conversation is the reward for the post and having it taken away, understandably, very upsetting.</p>
<p>The problem, however, is not new and has been battled, quite literally, for centuries.</p>
<h4>An Ongoing Issue</h4>
<p>The problem of fragmented conversations, however, is almost as old as marketing itself.</p>
<p>If you&#8217;ve ever talked about a product with a friend, had a water cooler conversation about a TV show or called someone to talk about a movie, you&#8217;ve helped fragment a conversation.</p>
<p>The companies involved, almost certainly, would rather be able to hear what you had to say, both good and bad, but instead have to deal with millions of conversations taking place beyond their ears. This was a problem before the Web and it continues to be one afterward.</p>
<p>Likewise, with blogs, if you&#8217;ve written about a site on a message board, told a friend in person about something you read or email someone an interesting link, you&#8217;ve fragmented the conversation there as well. The owner of the site would love to know that you think their content is good enough to recommend or bad enough to hate, but they have little to no way of knowing what you said.</p>
<p>Conversations about your site have always been fragmented and this includes both public and private dialogs. Message boards, chat rooms, newsgroups and even Twitter have long offered a means for people to talk about content away from eye of the author.</p>
<p>What is new about these services is not that they are fragmenting the conversation, but that they are codifying the process. Rather than encouraging people to talk about a certain topic or general area of interest, they are encouraging people to talk specifically about your post and to do it away from your site.</p>
<p>To date, this process has not had a major impact on bloggers as the networks involved have not obtained enough traction. For example, even Shyftr, despite all of the attention, only has a few viable conversations taking place.</p>
<p>However, there are at least a few places that both fragment a conversation and have a strong community behind them. However, they are sites that people actively encourage their sites to appear on and many dream about getting attention from.</p>
<h4>The Digg and Reddit Problem</h4>
<p><img class="picright" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/04/digg-logo.jpg" border="0" alt="digg-logo.jpg" width="218" height="140" align="right" />Having been the survivor of two Diggs, I can say safely that, in both cases, more comments appeared on Digg than did my own site. The same is true for Slashdot and any other major social news &#8220;bumps&#8221; I&#8217;ve had here.</p>
<p>This is understandable as these sites all have strong communities within them that like to talk among themselves, but they are still the classic example of a fragmented conversation. After all, the people who commented to Digg did not, for the most part, comment here and if I had not followed the conversations there, I never would have been able to read their thoughts.</p>
<p>Yet, even Webmasters that dislike Shyftr for fragmenting the dialog encourage the use of Digg and Reddit on their site. The reason is that these sites provide a great deal of traffic to bloggers. Digg and Reddit both use content from the sites they link responsibly and encourage their readers to visit the site before commenting, often leading to the dreaded <a title="The Digg Effect" href="http://www.ndesign-studio.com/blog/updates/the-digg-effect/">Digg Effect</a>.</p>
<p>The problem with sites such as Shyftr is not that they fragment the conversation, but that they fragment the conversation and the audience. By displaying the full feed publicly and offering a means to comment they encourage their members to both read and comment on the feed in one place.</p>
<p>However, it is easy to imagine a situation where a feed reader, such as <a href="http://www.google.com/reader/view">Google Reader</a>, allows users to comment on a story from within their site and have it displayed only to other subscribers. This would be less of an audience fragmentation, since all of the readers would be tracked and still getting the content from the original source, but would be fragmenting the conversation.</p>
<p>This creates a problem for bloggers. Where Digg and Reddit fragment a conversation in exchange for increased exposure, this would not increase the audience but would still divide the conversation in two. The would hurt both the conversation itself and the original author.</p>
<p>That could, in the long run, hurt the site itself as many blogs thrive in large part to their conversations.</p>
<p>However, for readers, the convenience of commenting where they land is far too tempting. We can and should expect that RSS readers, as well as other kinds of sites, will attempt to fragment the conversation in order to provide a feature to their users and that many of those sites will not raise copyright red flags, like Shyftr did.</p>
<h4>Solving the Problem</h4>
<p>What we&#8217;re left with is a disjunct. Readers, the few who do regularly comment, want to do so in the easiest way possible while bloggers want to be sure that the conversation leads back to their site.</p>
<p>There have been many attempts to fix this problem, services such as <a title="Disqus" href="http://www.disqus.com/">Disqus</a> have attempted to take local conversations and make them global. However, the solutions have focused primarily on the commenter, not the blogger, and solving the issue of unifying the many sites the visitor has participated in.</p>
<p>But while that is important, it does little to help the conversations themselves.</p>
<p>This is an area ripe for a technological solution, a means to take a conversation that is happening in multiple places and unify it, preferably on the original site itself.</p>
<p>However, the barriers to this are obvious. Spam would likely be one of the first problems, along with obtaining a broad enough user-base to be viable. But the more long-term threat would be copyright issues involved in porting over one&#8217;s comments from one site to another. After all, a commenter owns the rights in his or her comments and, though an implied license is granted to use the work on the site it was posted, it does not cover moving it to another page.</p>
<p>A real solution to this problem would require a very high level of cooperation both technologically and legally. Ensuring that comment licenses allow cross-posting and that different interfaces work together, including with spam protection systems, is a big challenge.</p>
<p>Still, if such a system could be developed and bloggers could retain control over comments posted to their site, including both editorial control over what appears on their site and export control (in the event that a different system came along), it would likely be something that bloggers would jump upon.</p>
<p>After all, it would theoretically mean more comments and a chance to participate in conversations that never would have been available before.</p>
<h4>Conclusions</h4>
<p>The bitter truth is that, despite how fragmented conversations are on the Web, they are still more unified here than they ever have been in history.</p>
<p>Though we may think we&#8217;re sharing our thoughts with just a few readers on an obscure forum, they are almost instantly searchable and a blogger who is looking at their referrers or doing vanity searches has a good chance at finding these conversations.</p>
<p>Furthermore, though hundreds of sites attempt to fragment the conversation, only a few have enough of an audience to do so in a meaningful way. This means that, currently, dialog fragmentation is very predictable for the most part.</p>
<p>The bottom line is that while our conversations may be fragmented, bloggers have far less of a problem than television networks, newspapers and other businesses do. There is a reason why millions are spent every year on market research, it&#8217;s because companies want to know what people are thinking and what they are saying.</p>
<p>As bad as we have it, we are the lucky ones in this area. Furthermore, the problem of fragmented conversations is still nowhere near as severe as the problem of audience fragmentation, the problem created by plagiarism and widespread scraping.</p>
<p>Despite that, one does have to hope that a solution can be found. Just imagine the conversations that we can have if we can actually talk across our self-imposed borders and outside of our communities.</p>
<p>On second thought, given what I&#8217;ve seen on Usenet, that may not be such a great idea after all&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/04/16/fragmented-conversations-and-blogging/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Offering Embedding to Prevent Plagiarism</title>
		<link>http://www.plagiarismtoday.com/2008/03/26/embedding-as-protection-against-plagiarism/</link>
		<comments>http://www.plagiarismtoday.com/2008/03/26/embedding-as-protection-against-plagiarism/#comments</comments>
		<pubDate>Wed, 26 Mar 2008 16:37:41 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[embedding]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[screwattack]]></category>
		<category><![CDATA[xkcd]]></category>
		<category><![CDATA[YouTube]]></category>
		<category><![CDATA[zero punctuation]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2008/03/26/embedding-as-protection-against-plagiarism/</guid>
		<description><![CDATA[Embedding multimedia content is now a way of life on the Web. But while webmasters and bloggers love the ease of use, content creators are benefiting from less plagiarism and greater control.]]></description>
			<content:encoded><![CDATA[<p><img SRC="http://aycu36.webshots.com/image/48235/2000901282758032986_rs.jpg" alt="YouTube Logo" align="left" class="picleft"/>One of the hallmarks of the &#8220;New Web&#8221; has been the embedding of multimedia. Where <a href="http://altlab.com/hotlinking.html">hotlinking</a> was once a dirty word, now you can find services that allow you to embed <a href="http://allyoucanupload.webshots.com/">images</a>, <a href="http://www.boxstr.com">audio</a> and <a href="http://www.blip.tv">video</a>. </p>
<p>This has proved to be a boon for small Webmasters and bloggers. Where previously including multimedia on one&#8217;s site required a large amount of bandwidth and storage space, those services can now be offloaded and embedded after the fact using custom HTML. </p>
<p>However, it has had a secondary, unforeseen benefit. By making works easy to legitimately embed, it has reduced the amount of plagiarism while helping content creators keep their work at a central location, even as it is being viewed at dozens of sites across the Web.</p>
<p>More than just a great tool for webmasters, embedding has proved to be useful for multimedia content creators as well, providing their work with a high level of exposure while maintaining a resiliency against plagiarism that textual works simply do not enjoy.<br />
<span id="more-860"></span></p>
<h4>Path of Least Resistance</h4>
<p><img SRC="http://aycu20.webshots.com/image/48499/2003547457411031541_rs.jpg" alt="Boxstr Logo" align="right" class="picright"/>If you found a video and wanted to get it on your site, which of the two following methods would you choose?</p>
<ol>
<li>Extract/Download the video, edit it for size, format and possibly to remove credit/title slides. Then re-upload it to your server, or another service, and embed it into your site. </li>
<li>Simply copy a few lines of HTML and drop it into a new post or page. </li>
</ol>
<p>While both are practical solutions to anyone with a modest amount of computer knowledge, the latter is much easier, requiring only a few seconds of time, while providing consistent and quality results to your viewers.</p>
<p>If there is one thing that is well-understood about plagiarists, it is that they usually take the path of least resistance. They copy articles because it is easier than writing them. They ignore CC licenses because it is easier than following them. They scrape articles with scripts because it is easier and faster than doing it by hand. </p>
<p>Though plagiarists have many motivations, there is always an underlying thread of taking the easy route. If that is the case, than embedding is by far the easiest route of them all.</p>
<p>This is not to say that the people who use YouTube and other embeds are simply lazy plagiarists, but that would-be plagiarists are, in some cases, herded toward a more legitimate route simply by offering a means of copying that is far easier than taking it by force.</p>
<p>It is a novel concept, but one that seems to work at least fairly well.</p>
<h4>Benefits to Content Creators</h4>
<p>Though offering an image or a video up for embedding might seem to be a step in giving up control of the work, it is actually quite the opposite.</p>
<p>Consider the following:</p>
<ul>
<li>When offering an work for embedding, you can still remove it at any time.</li>
<li>You control the user experience, including how it is displayed.</li>
<li>All traffic to the work is tracked at the original source, despite appearing on multiple sites.</li>
<li>Likewise, all advertising revenue from the item still returns to the source.</li>
<li>It is possible, in most cases, to track what sites display the work and participate in discussions there.</li>
</ul>
<p>Though many would prefer that all views of the work take place in a set location on their site, it is generally not practical. Once a work reaches a certain popularity, it can be safely assumed that others are going to want to share it on their sites and profiles. </p>
<p>If they lack a legitimate way to do it, many will turn to copyright infringement and possibly even plagiarism, depending on their motivations. </p>
<p>This, in turn, results in a complete loss of control over the work, with multiple copies existing with different names affixed to them. This not only fragments the audience, but can hurt any advertising efforts and destroys any attempt to track viewership.</p>
<p>By giving up the exclusive right to display your work on your own site, you can actually better consolidate your audience in many cases, using the actual media file, not your site, as the touchstone.</p>
<h4>The Fly in the Ointment</h4>
<p><img SRC="http://aycu12.webshots.com/image/47171/2000959131389521001_rs.jpg" alt="Flickr Logo" align="left" class="picleft"/>The problem with this system is that it does not always work. </p>
<p>Images, particularly, are still vulnerable to plagiarism as it is easier in many cases to just save the work and re-upload it. Even though services such as Flickr make it trivial to embed images into other sites, image copying is still rampant.</p>
<p>A recent example of this occurred when a strip from the Webcomic XKCD appeared on college humor, initially without attribution and while bearing the College Humor watermark. The matter <a href="http://reddit.com/info/6ajj3/comments/c03bpr0">drew a great deal of attention</a> and was eventually resolved. </p>
<p>However, even videos are not entirely safe. Many people download and copy online videos to either to move them to their favorite service, such as taking clips from Revver and putting them on YouTube, or to profit from them on their own site.</p>
<p>In that regard, video game site <a href="http://www.screwattack.com">Screwattack</a> has had to <a href="http://forum.screwattack.com/viewtopic.php?t=5172">repeatedly ask users</a> not to download and post clips to other sites and encourage users to help &#8220;police&#8221; the matter. This is despite the fact that all Screwattack videos are easily embedded into other sites.</p>
<p>Likewise, <a href="http://www.escapistmagazine.com/profiles/posts/Yahtzee%20Croshaw">Yahtzee&#8217;s Zero Punctuation video series</a> on <a href="http://www.escapistmagazine.com/">the Escapist Magazine</a> has seen a <a href="http://arstechnica.com/journals/thumbs.ars/2007/11/02/zero-punctuation-links-and-the-trouble-with-exclusive-video">similar problem with people ripping out the video file</a> and uploading it to other sites without permission, removing all of the ads along the way. Once again, the player for Zero Punctuation offers an &#8220;embed&#8221; feature with every post.</p>
<p>Clearly, while ease of use is a major factor in determining how others will use your content, it is not the only one. Others will continue to be motivated by other elements, including a desire to use their favorite service, a wish to boost their own views, a need to take credit for the work and even simple monetary greed. </p>
<p>These factors may combine, in many cases, to ensure that a work is used inappropriately, no matter how easy it is to legitimately use it.</p>
<h4>Conclusions</h4>
<p>Clearly, making multimedia works available for embedding is not a cure-all for content theft. However, though hard numbers are not available, it does seem to lessen it and create new opportunities for content creators.</p>
<p>However, the one thing it does beyond any doubt is end one of the pro-copying arguments, that people just want to &#8220;share&#8221; the work with others. </p>
<p>While that argument is generally a stretch to begin with, when there is a simple and legitimate means to share the work in a way that continues to support the creator, it speaks volumes when it is not followed.</p>
<p>Much like Creative Commons helps to separate the sharers from the plagiarists, embedding has the same effect. </p>
<p>In that regard, it lets you focus your time and energy on fighting the true bad guys, not well-meaning, but misguided, fans. </p>
<p>That, in turn, frees you up to make more content while helping push your work to an even larger audience. </p>
<p>For most, making content available for embedding is a clear win-win. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/03/26/embedding-as-protection-against-plagiarism/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>The DMCA on Social News Sites</title>
		<link>http://www.plagiarismtoday.com/2007/05/01/the-dmca-on-social-news-sites/</link>
		<comments>http://www.plagiarismtoday.com/2007/05/01/the-dmca-on-social-news-sites/#comments</comments>
		<pubDate>Tue, 01 May 2007 16:39:01 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[DMCA Seven]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Fark]]></category>
		<category><![CDATA[Netscape]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[Safe-Harbor]]></category>
		<category><![CDATA[Slashdot]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/05/01/the-dmca-on-social-news-sites/</guid>
		<description><![CDATA[Previously, I reported about how plagiarized content can create unique challeges on social news sites such as Digg and Reddit. The article detailed a case involving the site Celebrity Hack, a celebrity gossip blog. The site had one of its articles, entitled &#8220;37 Famously Stupid Celebrity Quotes&#8221; submitted to Digg only to be buried for...]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/">Previously</a>, I reported about how plagiarized content can create unique challeges on social news sites such as Digg and Reddit.</p>
<p>The article detailed a case involving the site <a href="http://www.celebrityhack.com/">Celebrity Hack</a>, a celebrity gossip blog. The site had one of its articles, entitled &#8220;37 Famously Stupid Celebrity Quotes&#8221; <a href="http://digg.com/celebrity/37_Famously_Stupid_Celebrity_Quotes">submitted to Digg</a> only to be buried for unknown reasons. Then, a few weeks later, a lower quality copy was <a href="http://digg.com/celebrity/37_Famously_Stupid_Celebrity_Quotes_2">submitted to the site</a> and reached the front page without being buried at all.</p>
<p>It is obvious that, in that case, the traditional notion of social news failed. The usual means of preventing duplicate failed to stop the advancement of the plagiarized story. It would seem that, in that light, there is very little that a site such as Celebrity Hack can do to prevent scraped and plagiarized content from being promoted on social news sites. If the crowd fails to see the duplication and pushes a duplicate article forward, there is little a lone Webmaster or blogger can do.</p>
<p>However, the DMCA may provide the answer to that problem. It gives individuals who have had their content ripped off and posted to social news sites the ability to have those links removed. It can help webmasters mitigate against the damage that such plagiarism can do and even stop thieves before they see any benefit from the stolen work.</p>
<p><span id="more-481"></span><strong>Why Digg has a DMCA Policy</strong></p>
<p>The DMCA carves out safe harbor protection for many kinds of online service providers. One of those service providers, found under <a href="http://www.copyright.gov/title17/92chap5.html#512">section 512(d)</a> in Title 17, is what is called an &#8220;Information Location Tool&#8221;.</p>
<p>The law goes on to define an &#8220;information location tool&#8221; as one that is:</p>
<blockquote><p>referring or linking users to an online location containing infringing material or infringing activity, by using information location tools, including a directory, index, reference, pointer, or hypertext link</p></blockquote>
<p>That section covers search engines such as Google, directories such as DMOZ and any other site that links to Web pages where infringing material may be found.</p>
<p>The law grants those sites safe harbor, meaning they are not liable for the infringement, so long as they have no knowledge of the infringement, remove access to the material once notified and register a DMCA agent with the United States Copyright Office (USCO).</p>
<p>This is one area where the DMCA has done a great deal to assist the development of the Web. By eliminating most of the gray area that existed before the law was passed, the DMCA made sites like Digg easier to found without the fear of being held liable for linking to infringing material. In short, it removed a great risk factor from starting this kind of site.</p>
<p>However, because of this, social news sites have had to develop copyright policies and work to comply with the DMCA. Most larger social news sites have designated DMCA agents and processes in place to handle complaints of copyright infringement.</p>
<p>Here are just a few examples of that.</p>
<p><strong>Digg</strong></p>
<p><strong>Format</strong>: Email<br />
<strong>Email Address</strong>: abuse at digg dot com<br />
<strong>Location of Policy</strong>: <a href="http://www.digg.com/tos">Item Five, Terms of Use</a><br />
<strong>Registered with USCO</strong>: No<br />
<strong>Comments</strong>: Overall, Digg&#8217;s DMCA policy seems very well-rounded. It offers the necessary links to information for filing both a notice and a counter-notice under the DMCA. However, Digg&#8217;s contact information is somewhat lacking. Digg handles all DMCA-related matters through their abuse email address, which is normally the best way to handle such matters, but offers no additional contact information including name, phone number, fax number or physical address. Sites that list their <a href="http://www.copyright.gov/onlinesp/list/a_agents.html">DMCA agents with the United States Copyright Office</a> have to include such information. Digg may need to see about expanding their policy to include this information and formally registering with the USCO.</p>
<p><strong>Reddit</strong></p>
<p><strong>Format</strong>: Email, Fax, Snail Mail<br />
<strong>Email Address</strong>: copyright at sbandg dot com<br />
<strong>Location of Policy</strong>: <a href="http://reddit.com/help/useragreement">Item Four, User Agreement</a><br />
<strong>Registered with USCO</strong>: <a href="http://www.copyright.gov/onlinesp/list/r_agents.html">Yes</a><br />
<strong>Comments</strong>: Reddit apparently has taken on the services of the NY law firm <a href="http://pview.findlaw.com/view/2503381_1?noconfirm=0">Sabin, Bermant &amp; Gould</a> to handle their DMCA notices. It seems likely that the policy was written by them as well. The policy is both succinct and thorough, covering all of the required elements of a DMCA notice and offering multiple means of contact to get in touch with the agents. However, the policy completely lacks any mention of a counter-notice, which is relatively unusual. However, that information may be included in the contacts sent to users that have had DMCA notices filed against them.</p>
<p><strong>Slashdot</strong></p>
<p><strong>Format</strong>: Email, Snail Mail<br />
<strong>Email Address</strong>: dmca at ostg dot com<br />
<strong>Location of Policy</strong>: <a href="http://www.ostg.com/terms.htm">Item Thirteen, Terms of Use</a><br />
Registered with USCO: <a href="http://www.copyright.gov/onlinesp/list/s_agents.html">Yes</a><br />
<strong>Comments</strong>: DMCA complaints to Slashdot are handled by their parent company, the <a href="http://www.ostg.com/index.htm">Open Source Technology Group</a>. Their policy is very lengthy and covers all of the information needed to both submit a DMCA notice and file a counter-notice. The contact information is complete, though it is unclear if the number listed is a phone or fax number, and the policy itself spells out the steps needed, without referencing other sites. Users unfamiliar with the DMCA will likely find this policy very helpful.</p>
<p><strong>Netscape</strong></p>
<p><strong>Format</strong>: Email, Snail Mail<br />
<strong>Email Address</strong>: <a href="http://about.aol.com/aolnetwork/info_notify">See Form</a><br />
<strong>Location of Policy</strong>: <a href="http://about.aol.com/aolnetwork/copyright_infringement">Own Page</a><br />
<strong>Registered with USCO</strong>: <a href="http://www.copyright.gov/onlinesp/list/n_agents.html">Yes</a><br />
Comments: Netscape&#8217;s policy, which is handled by their parent AOL, is probably the most mature out of all of the copyright policies. Not only do they have a well-written policy that provides all of the needed information to file a DMCA notice, they even provide an <a href="http://about.aol.com/aolnetwork/info_notify">easy form to submit it</a>. Lacking from their site is any information about filing a counter-notice, once again, information likely included when notifying a user that has been the subject of a DMCA notice, and a fax number for sending in complaints that way. Still, they have the easiest and most robust policy overall and, in my experience with their Web host, have a very respectable reputation for handling infringing material.</p>
<p><strong>Fark</strong></p>
<p><strong>Format</strong>: Email?<br />
<strong>Email Address</strong>: <a href="http://cgi.fark.com/cgi/fark/feedback.pl">See Form</a><br />
<strong>Location of Policy</strong>: None<br />
<strong>Registered with USCO</strong>: No<br />
<strong>Comments</strong>: Of all the social networks I looked at, Fark is the only one completely lacking a formal DMCA policy. Though the contact form can be used to ask a &#8220;legal question&#8221; or report a bad headline, there is no clear place to file an abuse report nor is there an actual policy for receiving DMCA notices on the site. This isn&#8217;t the first time <a href="http://www.plagiarismtoday.com/2007/04/26/fark-claims-copyright-on-posted-works/">Fark&#8217;s copyright policy has been questioned</a>, but the first time was easily explained by poor writing and it seems likely that this is also an issue with a poorly-written terms of use. Hopefully, as Fark works on their copyright policy, they will also work on their DMCA policy as well. Otherwise, they risk losing protection under the DMCA safe harbor act.<br />
<strong>Update From Fark</strong>: I heard back from Drew earlier today, he said that he is going to work with his new attorney on this matter shortly after correcting the current copyright policy, which is now in final draft stage. Hopefully, this matter too will be cleared up easily.</p>
<p><strong>Conclusions</strong></p>
<p>Of all of the DMCA policies I reviewed, Netscape&#8217;s procedure was far and away the most robust and the most user-friendly. However, this is understandable as AOL has been a Web host for over a decade and has a lot of experience working with the DMCA. Also, they probably have the largest volume of complaints.</p>
<p>After Netscape, Slashdot and Reddit both have very solid policies, both are registered with the USCO and both provide all of the needed information. They are definitely both excellent role models in this area. Digg is also more than acceptable, however, their lack of registration with the USCO is worrisome since it is a requirement of the DMCA and their safe harbor status may be in jeopardy because of that. However, these are all easy things to fix.</p>
<p>At the bottom, right now, is Fark. Their lack of a formal policy not only puts their safe harbor status at risk, but also, can confuse and hinder anyone reporting links to any infringing material.</p>
<p>However, since I have not worked with any of these companies personally, save AOL, I can not make any judgments on them for the <a href="http://www.plagiarismtoday.com/host-report/">host report</a>, all of this is based purely upon my evaluation of their stated policies and information I have heard from others.</p>
<p>The bottom line, however, is that social news is supposed to be about rewarding people who produce high-quality content. That can&#8217;t happen as long as plagiarists are able to steal that content and promote it as their own.</p>
<p>When the crowd fails to catch a thief in time, as will happen sometimes, it is important for a site to have an effective and copyright policy and a strong will to enforce it.</p>
<p>Otherwise, social news sites risk becoming an even darker element of the &#8220;<a href="http://www.bloggersforhire.com/blogging-and-the-problem-of-the-echo-chamber/">echo chamber</a>&#8221; that some feel blogs already are.</p>
<p><em>Note: All of the above networks were contacted for this article and, as of this writing, had not responded. I will update this article should any reply after it is published.</em></p>
<p><em>Note 2: This story is in no way related to the HD-DVD story that has consumed Digg over the past 24 hours or so. That story, apparently, did not involve a DMCA notice, <a href="http://blog.digg.com/?p=74">but rather a cease and desist letter</a>, and this story was underway well before the HD-DVD controversy began to draw much attention. This was intended to be a <a href="http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/">follow up to the story mentioned above</a>. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/05/01/the-dmca-on-social-news-sites/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Farked, BoingBoinged and More</title>
		<link>http://www.plagiarismtoday.com/2007/04/27/farked-boingboinged-and-more/</link>
		<comments>http://www.plagiarismtoday.com/2007/04/27/farked-boingboinged-and-more/#comments</comments>
		<pubDate>Fri, 27 Apr 2007 16:17:18 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Housekeeping]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Fark]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[Slashdot]]></category>
		<category><![CDATA[Social-News]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/04/27/farked-boingboinged-and-more/</guid>
		<description><![CDATA[Yesterday was a very eventful day for this site. The article about Fark&#8217;s Copyright Policy made an appearance on the front page of Fark.com. Shortly after that, it was featured as an update to the BoingBoing article on the subject and even appeared on Wired.com&#8217;s Epicenter Blog. Needless to say, traffic was extremely high yesterday...]]></description>
			<content:encoded><![CDATA[<p>Yesterday was a very eventful day for this site. The article about <a href="http://www.plagiarismtoday.com/2007/04/26/fark-claims-copyright-on-posted-works/">Fark&#8217;s Copyright Policy</a> made an <a href="http://forums.fark.com/cgi/fark/comments.pl?IDLink=2765367">appearance on the front page of Fark.com</a>. Shortly after that, it was featured as an update to the <a href="http://www.boingboing.net/2007/04/26/farks_copyright_poli.html">BoingBoing article on the subject</a> and even appeared on <a href="http://blog.wired.com/business/2007/04/farks_new_copyr.html">Wired.com&#8217;s Epicenter Blog</a>.</p>
<p>Needless to say, traffic was extremely high yesterday and remains very high today. However, I wanted to welcome all of the new readers of the site and encourage those of you interested in Web-related copyright issues to <a href="http://feeds.feedburner.com/PlagiarismToday">subscribe to this site&#8217;s feed</a>. There will be a lot more coming on these issues, especially as they pertain to social news, very soon.</p>
<p>Second, I would like to thank my host, <a href="http://www.mediatemple.net">Media Temple</a>, for keeping this site alive during the storm. I moved to MT after my last host crashed twice under similar burdens, once following a <a href="http://www.plagiarismtoday.com/2006/05/22/slashdotted/">Slashdot</a>, the other following a <a href="http://www.plagiarismtoday.com/2006/12/11/welcome-boingboing-digg-10zenmonkey-visitors/">Digg</a>.  MT kept swimming right along, exactly as designed, and hopefully won&#8217;t hit me with too much of an overage.</p>
<p>Finally, a thank you also has to go out to everyone who helped with the article, especially Drew Curtis for writing me back so promptly despite his busy schedule.</p>
<p>It was a crazy day yesterday and I&#8217;ll be recovering much of the day today. But I am looking forward to returning to my normal schedule this weekend.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/04/27/farked-boingboinged-and-more/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Social News: A New Plagiarism Concern</title>
		<link>http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/</link>
		<comments>http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/#comments</comments>
		<pubDate>Thu, 19 Apr 2007 19:50:31 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[Slashdot]]></category>
		<category><![CDATA[Social-Media]]></category>
		<category><![CDATA[Social-Networking]]></category>
		<category><![CDATA[Social-News]]></category>
		<category><![CDATA[Web-2.0]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/</guid>
		<description><![CDATA[If you need another reason to worry about content theft, consider the recent case involving the gossip blog Celebrity Hack. About two weeks ago, an article on the site entitled &#8220;37 Famously Stupid Celebrity Quotes&#8221; was submitted to Digg. The story reached the front page briefly but was buried for unknown reasons. Yesterday, a much...]]></description>
			<content:encoded><![CDATA[<p>If you need another reason to worry about content theft, consider the recent case involving the gossip blog <a href="http://www.celebrityhack.com/">Celebrity Hack</a>.</p>
<p>About two weeks ago, an <a href="http://www.celebrityhack.com/">article on the site</a> entitled &#8220;37 Famously Stupid Celebrity Quotes&#8221; was <a href="http://digg.com/celebrity/37_Famously_Stupid_Celebrity_Quotes">submitted to Digg</a>. The story reached the front page briefly but was buried for unknown reasons.</p>
<p>Yesterday, a much lower-quality scraped version of the same article was <a href="http://digg.com/celebrity/37_Famously_Stupid_Celebrity_Quotes_2">submitted to Digg</a>. The article, which has since been removed, was promoted to the front page and not buried, earning over 800 Diggs.</p>
<p>This has lead many, <a href="http://www.pronetadvertising.com/articles/social-media-exacerbating-the-contenttheft-problem24882.html">including Muhammad Saleem at Pronet Advertising</a>, to worry that social networking is making the problem of content theft issue worse.</p>
<p>Sadly, Saleem seems to be right. Even as social news sites like Digg are increasing the rewards for creating original and interesting content, they are also rewarding those who steal it.</p>
<p><span id="more-475"></span></p>
<p><strong>Content Theft Goes Social</strong></p>
<p>For years Webmasters have been worried about the duplicate content penalty. The fear is that, since search engines place a higher value on unique content and intentionally bump down duplicate pages, that the search engine might bump down a site simply because it is being scraped and reposted elsewhere, even it is taking place without permission.</p>
<p>Though <a href="http://www.plagiarismtoday.com/2006/12/21/google-addresses-duplicate-content/">Google has addressed that</a> and said that it should not be a major concern, I have worked with several businesses that have had their marketing material scraped, only to have the plagiarist rank as high or higher they.</p>
<p>Even if there is no &#8220;penalty&#8221;, scraping is a cheap and fast way for sites to compete for the same keywords with the same strategy and, with good linking such as comment spam, achieve similar or greater results.</p>
<p>But with social networking and social news begins a new problem. If search engines, with their advanced algorithms and overhead view of the Web, have a difficult time telling duplicate content from original material, individual users have at least an equal challenge.</p>
<p>Sadly, it is often a challenge that they fail to meet, especially when they are in a rush to submit an article to their favorite social networking site.</p>
<p><strong>A Ripe Target</strong></p>
<p>With sites like Digg, Slashdot and Reddit capable of directing incredible volumes of traffic to a site (Note: This site has been both Dugg and Slashdotted at varying points), they&#8217;ve become very ripe targets for spammers.</p>
<p>The desire for placement on Digg has even <a href="http://www.techcrunch.com/2005/12/27/gaming-digg/">generated a few start ups</a>, including the most recent <a href="http://www.techcrunch.com/2007/04/02/subvert-and-profit-next-service-to-try-gaming-digg/">Subvert and Profit</a>. These companies, reviled by most Digg users, let submitters purchase Diggs and pay members for Digging up stories.</p>
<p>The popularity of these social news sites has also attracted spammers who, often times, copy high-quality and legitimate content, post it to their site and then actively promote it on these networks.</p>
<p>Legitimate Webmasters, who passively promote these social networks with badges, links and buttons, often don&#8217;t get as much push as the sites created almost exclusively for the networks. Legitimate users of the network, genuinely interested in the content, unwittingly promote the scraped version until it reaches the front page, thus rewarding the plagiarist for his acts.</p>
<p>Sometimes the theft is caught in time, either by an astute reader or the original author. Other times, as with the case in the Celebrity Hack story, the plagiarized work goes straight to the front page and the plagiarist enjoys the traffic and revenue that the site brings in.</p>
<p>The beauty of this approach from the angle of the spammer is that it only takes one story to take off to justify the effort. <a href="http://www.plagiarismtoday.com/2007/03/27/the-economics-of-spam-blogging/">Much like with spam blogging</a>, many attempts can fail so long as one succeeds. The rewards for success far exceed the penalties for failure.</p>
<p><strong>Reasons for Concern</strong></p>
<p>This type of plagiarism is especially damaging to Webmasters as it can impact them very deeply and very directly.</p>
<ol>
<li><strong>Only the Best Content:</strong> Unlike spam bloggers, who scrape indiscriminately, plagiarists targeting social network sites are only interested in the best-quality content. They select articles, copy them by hand and paste them to their own site without attribution or under their own name.</li>
<li><strong>A Public Plagiarism:</strong> This plagiarism is especially damaging as it takes place in the most public forum on the Web and in the most deceptive way. The plagiarists involved often use hand-crafted sites, not just easily-detected computer-generated sites like spam bloggers, and many viewers may genuinely believe the plagiarized copy to be the original work, even if there is significant evidence to the contrary.</li>
<li><strong>Closing The Door:</strong> Worse still, it detracts from one of the major incentives to post original, high-quality content on the Web. By taking their content to the social news sites, the original authors lose that ability to get their work to that large audience. It greatly impacts the original author&#8217;s ability to use the social news sites to build their reputation and readership.</li>
</ol>
<p>In short, this type of plagiarism doesn&#8217;t just risk replacing the original in the eyes of the SERPS or a few readers, but in the minds of tens of thousands of people and on some of the largest, most popular sites in the world.</p>
<p><strong>Reasons Not To Worry</strong></p>
<p>On the flip side of the coin, there are several reasons why spammers may not be drawn to the social networking sites over the long run.</p>
<ol>
<li><strong>Limited Ad Revenue</strong>: Users on social networking sites tend not to promote sites that have many ads and <a href="http://chitika.com/blog/?p=162">studies have shown</a> that traffic from social networking is much less likely to result in ad clicks than via search engines.</li>
<li><strong>Limited Search Engine Benefit</strong>: The SEO benefit of getting a page listed on Digg or another social news site is dubious a best. A single link that scrolls off the home page quickly will not carry as much weight as multiple links on more static sites.</li>
<li><strong>Higher Costs</strong>: Surviving a Digg or Slashdot effect can be difficult, as I have personally found out. Doing so requires either investing more into hosting or setting up shop on services such as Blogspot that have less credibility on the sites.</li>
</ol>
<p>In short, while these sites can definitely generate a large amount of traffic, they don&#8217;t always generate a lot of money for the people they link to. Professional spammers will likely be seeking out other methods that should result in larger checks.</p>
<p>The more likely candidate for this type of plagiarism is a new Webmaster trying to grow his blog or blog network and is very misguided on how to do so. They are frustrating and annoying, but are generally easily stopped once they are spotted.</p>
<p>Simply put, in these cases the shame of being discovered a plagiarist is usually, in and of itself, enough to send the plagiarist into hiding.</p>
<p><strong>What Webmasters Can Do</strong></p>
<p>Despite the drawbacks, it is obvious that the problem is both real and ongoing. Bloggers and Webmasters, especially those that wish to leverage social news sites, need to consider taking a few steps to guard against this.</p>
<ol>
<li><strong>Track Popular Articles</strong>: Monitoring and protecting the RSS feed will stop spam bloggers, but not plagiarists who manually select content. You need to track your best articles to ensure they aren&#8217;t copied without permission. Consider using <a href="http://www.google.com/alerts">Google Alerts</a> to monitor those articles automatically.</li>
<li><strong>Monitor the Social News Sites</strong>: Sometimes the plagiarized articles can hit the social news sites before Google or even Technorati picks them up. Follow the social news sites in relevant categories, you probably should be regardless, and catch plagiarists early.</li>
<li><strong>Report Infringement to the Social News Sites</strong>: In addition to the usual steps of getting the works removed from the plagiarist site and, possibly, getting their ad revenue severed, report them to the social news site they submitted to. Most sites will ban domains and sites that host plagiarized content.</li>
</ol>
<p>Though these steps can not guarantee that a plagiarist will not slip through and get an article of yours on the front page of a social news site, they can help thwart that vast majority of those who might try.</p>
<p><strong>Conclusions</strong></p>
<p>The important thing to remember is that, even with blogs, plagiarism goes well beyond just RSS scraping. As social news sites have bolstered the rewards for hosting and creating content that stands out, many plagiarists will start taking only that content, often bypassing the RSS feed completely.</p>
<p>It&#8217;s another concern for Webmasters and bloggers, many of whom spend hours a day creating content only to have it ripped off by others almost immediately. It&#8217;s also a concern that is difficult to deal with.</p>
<p>It is also an issue that we will be talking about this more in depth over the next few weeks as it doesn&#8217;t appear to be going away any time soon.</p>
<p>In fact, it seems likely that we&#8217;ve just now seen the beginning of it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/04/19/social-news-another-plagiarism-concern/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Six Apart/Rojo: Now Spam Bloggers?</title>
		<link>http://www.plagiarismtoday.com/2007/04/03/six-apartrojo-now-spam-bloggers/</link>
		<comments>http://www.plagiarismtoday.com/2007/04/03/six-apartrojo-now-spam-bloggers/#comments</comments>
		<pubDate>Tue, 03 Apr 2007 17:09:51 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Digg]]></category>
		<category><![CDATA[Livejournal]]></category>
		<category><![CDATA[Nooz]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Reddit]]></category>
		<category><![CDATA[Rojo]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Six-Apart]]></category>
		<category><![CDATA[Spam-Blogging]]></category>
		<category><![CDATA[Splogging]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/04/03/six-apartrojo-now-spam-bloggers/</guid>
		<description><![CDATA[- Article Updated &#8211; See Below - Six Apart was one of the first rock stars of the blogging world. Propelled to fame on the back of its Movable Type blogging platform, it quickly became one of the most recognized names in the blogging world. Though Movable Type has largely been replaced by newer blogging...]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.plagiarismtoday.com/2007/04/03/six-apartrojo-now-spam-bloggers/sixapart-logo/" rel="attachment wp-att-461" title="SixApart Logo"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2007/04/sixapart_small.png" title="SixApart Logo" alt="SixApart Logo" align="left" hspace="5" vspace="5" /></a><strong>- Article Updated &#8211; See Below -<br />
</strong></p>
<p>Six Apart was one of the first rock stars of the blogging world. Propelled to fame on the back of its <a href="http://www.movabletype.com/">Movable Type</a> blogging platform, it quickly became one of the most recognized names in the blogging world.</p>
<p>Though Movable Type has largely been replaced by newer blogging applications, including <a href="http://www.wordpress.org">WordPress</a>, Six Apart has remained very active in the blogging world, not only offering <a href="http://www.sixapart.com/typepad/">Typepad</a>, a popular blogging service, but also <a href="http://gigaom.com/2005/01/04/six-apart-to-buy-live-journal/">purchasing several other blogging comapnies</a>, including <a href="http://www.sixapart.com/livejournal/">LiveJournal</a> and <a href="http://www.rojo.com/" rel="nofollow">Rojo</a>.</p>
<p>However, some of these subsidaries have begun engaging in practices that many bloggers consider unethical. One of the sites under Six Apart&#8217;s control even engages in <a href="http://www.plagiarismtoday.com/2006/09/25/bitacle-debacle/">behavior akin to Bitacle</a>.</p>
<p>This has left some to wonder why Six Apart, a company largely respected in the Blogging world, has begun to play fast and loose with RSS feeds and copyrighted content.  Worse still, why have they begun using tactics largely reserved for spam bloggers?</p>
<p>Sadly, the answers are not very clear.</p>
<p><span id="more-462"></span><strong> LiveJournal Syndication</strong></p>
<p>The least worrisome of Six Apart&#8217;s scraping activities revolves around their LiveJournal service.  There, paid members can take advantage of their &#8220;Syndication&#8221; feature. It allows users to select an RSS feed and LiveJournal then creates a specialized page for the feed. The feed can then be added as a &#8220;friend&#8221;, the same as if it were an actual LiveJournal member, and can appear in friend lists.</p>
<p>The Syndication feature is worrisome because it creates an &#8220;account&#8221; with duplicate content from the feed. The site displays the entire contents of the feed (<a href="http://syndicated.livejournal.com/officialgaiman/" rel="nofollow">see sample</a> using <a href="http://www.neilgaiman.com/journal/index.html">Neil Gaiman&#8217;s Journal</a>) and allows users to post comments without returning to the original site.</p>
<p>However, with the LiveJournal Syndication service, attribution is very clear and all synidcated accounts are on a separate subdomain (syndicated.livejournal.com). Also, the LiveJournal team has, historically, been very responsive about removing feeds that their owners don&#8217;t want to be scraped. Furthermore, results from the Syndication service <a href="http://www.google.com/search?hl=en&amp;safe=off&amp;client=firefox-a&amp;rls=org.mozilla%3Aen-US%3Aofficial&amp;hs=uBk&amp;q=site%3Asyndicated.livejounal.com&amp;btnG=Search">do not appear in Google</a> eliminating most of the major concers one has with scraping.</p>
<p>Still, many bloggers are likely to be concerned that a duplicate of their blog exists, that users can and do comment to it and that LiveJournal users no longer need to subscribde to the feed directly or visit their site.</p>
<p><strong>Rojo Front Page</strong></p>
<p><a href="http://www.plagiarismtoday.com/wp-content/uploads/2007/04/rojo.png" title="Rojo Screenshot"><img src="http://www.plagiarismtoday.com/wp-content/uploads/2007/04/rojo.thumbnail.png" title="Rojo Screenshot" alt="Rojo Screenshot" align="left" hspace="5" vspace="5" /></a>When Six Apart <a href="http://www.sixapart.com/about/press/2006/09/six_apart_acqui_1.html">aquired RSS reader Rojo in September 2006</a>, it also aquired some of Rojo&#8217;s bad habits.</p>
<p>Rojo&#8217;s home page functions almost exactly like a rapidly-updating spam blog. It features the full content of the most popular feed items of the day, all next to Google Adsense ads (see screenshot above). The site is then further sub-divided into new categories, including &#8220;politics&#8221; &#8220;Web 2.0&#8243;, etc., it is also possible to view the original feed on Rojo without visiting the original site (<a href="http://www.rojo.com/feed/c0ft_NCxBCFNH03_">see PTs feed on Rojo</a>) and those feeds are also surrounded by ads.</p>
<p>Attribution on Rojo is prominent and the headlines do link back to the original story. However a &#8220;Rojolink&#8221; feature encourages others to use the Rojo permalink for the article rather than link to the original site.</p>
<p>At the very least, <a href="http://calacanis.com/">Jason Calacanis</a> will likely be upset by this. He has repeatedly stated that he will not allow his full feeds to be placed next to ads, <a href="http://www.rojo.com/feed/V1WuPMhWMOkiZv9A">something that Rojo does</a>.</p>
<p>Though most people expect RSS readers to make money off of other people&#8217;s content, generally it is also expected that they will add value to the feed by making it easier for people to subscribe. Instead, Rojo has just created a valueless duplicate of the feeds, and surrounded the content with ads.</p>
<p><strong>All The Nooz</strong></p>
<p>Worst of all Six Apart&#8217;s properties though is the Rojo-owned site <a href="http://www.nooz.com" rel="nofollow">Nooz.com</a>. Nooz is designed to function like Digg for Myspace.  Nooz users pick articles from the Web, vote on them and add them to their special Nooz widgets that they they place on their Myspace profiles.</p>
<p>The problem with Nooz, however, is not the widgets but the way the content is obtained. Rather than letting users select their own articles from the Web, like Digg or Reddit, Nooz forces users to select from versions of the blog that it has scraped and reposted on its own site (<a href="http://www.nooz.com/feed/c0ft_NCxBCFNH03_" rel="nofollow">see Plagiarism Today on Nooz</a>). Once again, as with Rojo itself, Nooz offers &#8220;Noozlinks&#8221; to encourage people to link to Nooz&#8217;s scraped copy, rather than the original.</p>
<p>Though no ads appear on Nooz at this time, Nooz.com is accessible by the search engines, <a href="http://www.google.com/search?q=site%3Anooz.com&amp;ie=utf-8&amp;oe=utf-8&amp;aq=t&amp;rls=org.mozilla:en-US:official&amp;client=firefox-a">Google estimates</a> that about 150,000 pages have been indexed already. Even worse, all of the contact addresses for Nooz, <a href="http://www.nooz.com/about/policies/copyright-policy/" rel="nofollow">including the copyright agent</a>, all bounced back.</p>
<p>Nooz is not only scraping and reposting feeds without permission, but it is being irresponsible in doing so. There is no means to ask Nooz to stop reusing the content.</p>
<p>If you don&#8217;t like the way Nooz uses your content, quite frankly, you are out of luck at the moment.</p>
<p style="font-weight: bold">A Murmured Outcry</p>
<p>Six Apart is no stranger to blogging, as discussed above, they helped ignite the blogging movement with their software. They are not unfamiliar with the ettiquite of blogging and should realize, at least on some level, that some bloggers will not ba happy to see their feeds scraped and republished on someone else&#8217;s site, all the while surrounded by ads.</p>
<p>The reasons Six Apart allows this to continue are dubious at best. Legal scholars have already agreed that <a href="http://www.plagiarismtoday.com/2007/01/29/twil-discusses-implied-licenses-on-rss-feeds/">there is no implied license with RSS feeds</a>, this use, as long as it is executed without permission, is basically copyright infringement. Unless a CC license or a direct agreements permits the use, what Six Apart is doing in all three cases is, most likely, illegal.</p>
<p>To my knowledge, no one has complained about these three uses for the following reasons. Why is a mystery, but the reasons may include the following:</p>
<ol>
<li>Very few people seem to be affected by the LiveJournal Syndication feature. Since only paid members can take advantage of it, severely limiting the pool, only very large blogs are scraped. Also, LiveJournal has been very cooperative in removing people that don&#8217;t want to participate. Furthermore, since the Syndicated blogs are not picked up by search engines, it&#8217;s unlikely most bloggers know that they exist.</li>
<li>Few bloggers want to upset Rojo since many readers use the feed reader service to subscribe to blogs. Currently, about 5% of all Plagiarsim Today subscribers use Rojo.</li>
<li>Nooz seems to have flown under the radar, targeted mostly at Myspace users, generally a separate group from bloggers, and still a relatively new creation (its current incarnation starting some time this year).</li>
</ol>
<p>No matter the reasons though, these issues are not going away. RSS scraping and reuse issues will likely be around for a very long time, that is, until a licensing scheme emerges that resolves the issue once and for all.</p>
<p><strong>Conclusions </strong></p>
<p>What Six Apart is doing is wrong. Though I have no major issues with their use of my content, save perhaps on Rojo where the use is more commercial (and thus a violation of <a href="http://creativecommons.org/licenses/by-nc-sa/2.5/">my Creative Commons License</a>), Six Apart is taking content from thousands of blogs, without permission, and reposting them on various sites. That is copyright infringement and there is little way around that.</p>
<p>Though some might argue that Six Apart&#8217;s scraping would qualify for protection under the DMCA (section 512(b)) protection for caching services. However, <a href="http://www.plagiarismtoday.com/2007/01/16/debunking-the-dmca-caching-loophole/">as discussed earlier</a>, that is not likely the case.</p>
<p>All of Sixapart&#8217;s sites modify the content and create permanent files, both violations of the caching provision. It also does not follow accepted practices (as there are no accepted practices for scraping and republishing RSS feeds) and it is not automated, seemingly relying at every step on users to submit the original feed.</p>
<p>It is unlikely, at best, that Six Apart would obtain the same kind of protection that was <a href="http://www.searchenginejournal.com/google-cache-is-ruled-legal-fair-use/2837/">afforded the Google Cache</a>, especially considering both the commercial nature of the use and the apparent intent of setting up the copy as a substitute for the original. The latter is shown by the new permalinks and location of cached material (placed before the link to the original).</p>
<p>Six Apart desperately needs to look at its policy for reusing others content. In that regard, it should look toward sites such as <a href="http://www.digg.com">Digg</a> and <a href="http://www.reddit.com">Reddit</a> that have built great communities without infringing on copyright.</p>
<p>In short, there&#8217;s no reason for a social news site to scrape and repost content like Rojo and Nooz currently do.  Links and snippets are perfectly adequate.</p>
<p>When it&#8217;s all said and done, Six Apart seems to have nothing to gain by scraping and reposting content as it does. Successful news sites have, for a very long time, worked well with content creators and there seems to be no reason for Six Apart to try and change that, especially in a way that is both legally dubious and likely to cause outrage.</p>
<p>Hopefully they will reevaluate their policies soon and come up with a more fair approach to its sites. In the meantime, they are treading on very thin legal ice and dealing with a very wary public.</p>
<p><em><strong>Hat tip:</strong> Thanks to <a href="http://www.typetive.com/">Cybele of Typetive</a> for the heads up about Nooz.com </em></p>
<p><em>Note: During the course of writing this article, which started Thursday, I made several attempts to contact Six Apart by both email and phone. I was able to get in touch with Jane Anderson, Six Apart&#8217;s <a href="http://www.sixapart.com/about/press/">press contact</a>. We scheduled a time for an interview on Monday but, when I called in there was no answer. Subsequent attempts to contact Six Apart via both office phone and cell phone have produced no answer. I will update this article when and if I get further information from them.</em></p>
<p><strong>Update:</strong> I&#8217;ve gotten back in touch with Jane Anderson, she is speaking with her counterparts at Six Apart and will be back in touch with me soon. They have scheduled a meeting for tomorrow to discuss these issues. I will report back after I hear from them.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/04/03/six-apartrojo-now-spam-bloggers/feed/</wfw:commentRss>
		<slash:comments>41</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 07:02:39 -->
