<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todaywikipedia | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/wikipedia/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>3 Count: Wiki-Darkness</title>
		<link>http://www.plagiarismtoday.com/2012/01/17/3-count-wiki-darkness/</link>
		<comments>http://www.plagiarismtoday.com/2012/01/17/3-count-wiki-darkness/#comments</comments>
		<pubDate>Tue, 17 Jan 2012 16:53:54 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[acslaw]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[crossley]]></category>
		<category><![CDATA[i have a dream]]></category>
		<category><![CDATA[martin luther king jr. mlk]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[protest]]></category>
		<category><![CDATA[sopa]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=12270</guid>
		<description><![CDATA[Wikipedia to go dark to protest SOPA, ACS:Law attorney suspended for two years and why you have to pay to watch MLK's "I Have a Dream"]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/07/3count004-trim.png" alt="" title="3count004-trim" class="alignleft size-full wp-image-7303" height="162" width="175"></p>
<p><em>Have any suggestions for the 3 Count? Let me know via Twitter <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://www.cbsnews.com/8301-205_162-57359773/wikipedia-to-join-web-blackout-protesting-sopa/">Wikipedia to Join Web Blackout Protesting SOPA</a></h4>
<p>First off today, Wikipedia has announced that they will be joining Reddit and a slew of other sites in going &#8220;dark&#8221; on January 18th in opposition to the controversial Stop Online Piracy Act (SOPA) and it&#8217;s sister act, Protect IP (PIPA). The 24-hour blackout will affect only the English language version of the online encyclopedia and is designed to protest the acts&#8217; alleged censorship of the Web. However, the move comes as both acts are being rewritten to remove the controversial site blocking provisions and SOPA is currently on hold following a condemnation by the Obama administration. </p>
<h4>2: <a href="http://www.thelawyer.com/sdt-suspends-acslaw-founder-for-two-years/1010952.article">SDT Suspends ACS:Law Founder for Two Years</a></h4>
<p>Next up today, Andrew Crossley, the founder of the controversial UK law firm ACS:Law has been suspended from the practice of law for two years and ordered to pay £76,326.55 in costs at a Solicitors Disciplinary Tribunal (SDT). The firm rose to prominence, and controversy, in 2009 for its bulk lawsuits against suspected pirates, where it demanded settlements from them to avoid further litigation. Crossley admitted to six of the charges before him, including acting against the best interest of his clients, but contested a seventh, that he had failed to adequately protect client data. The latter charge stemmed from a data leak from his server that resulted in defendant information being leaked on the Web. The tribunal rebutted his claims and found him guilty of that charge as well.</p>
<h4>3: <a href="http://motherboard.vice.com/2012/1/16/copyright-king-why-the-i-have-a-dream-speech-still-isn-t-free">Copyright King: Why the &#8220;I Have a Dream&#8221; Speech Still Isn&#8217;t Free</a></h4>
<p>Finally today, as the U.S. celebrated Martin Luther King Jr. day yesterday, many were surprised to learn that they were unable to find a free version of his famous &#8220;I Have a Dream&#8221; speech. The reason is that the speech itself, which was given 48 years ago, is still protected by copyright and will continue to be until 2038, 70 years after MLK&#8217;s death. Currently, EMI records is the company responsible for the licensing of the work though, with EMI&#8217;s recent auction, that right will, likely, soon go to another record company.</p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today. We will be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.plagairsimtoday.com/podcast">every Wednesday evening at 5 PM ET for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Friday right here on Plagiarism Today</a>. </p>
<p><em>The 3 Count Logo was created by <a rel="nofollow" href="http://www.cloudjunkies.com/">Justin Goff</a> and is licensed under a <a rel="nofollow" href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</a>. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2012/01/17/3-count-wiki-darkness/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Cpedia: A Spam Blog Disguised as an Encyclopedia</title>
		<link>http://www.plagiarismtoday.com/2010/06/09/cpedia-a-spam-blog-disguised-as-an-encyclopedia/</link>
		<comments>http://www.plagiarismtoday.com/2010/06/09/cpedia-a-spam-blog-disguised-as-an-encyclopedia/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 16:21:04 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[copyright infirngement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[cpedia]]></category>
		<category><![CDATA[cuil]]></category>
		<category><![CDATA[encyclopedia]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[spam-blog]]></category>
		<category><![CDATA[splog]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=6818</guid>
		<description><![CDATA[CPedia is a new automatically-generated encyclopedia from the makers of Cuil. So why is it throwing thousands of pages of duplicate content into Google?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/06/cpedia-logo.jpg" alt="" title="cpedia-logo" width="255" height="88" class="alignleft size-full wp-image-6825"></p>
<p>Last week, <a href="http://twitter.com/melebeth">@melebeth</a> introduced me to <a rel="nofollow" href="http://cpedia.com">CPedia</a>, a new &#8220;encyclopedia&#8221; by the makers of <a href="http://cuil.com">Cuil</a>, a search engine that was initially greeted with much fanfare before <a href="http://techcrunch.com/2008/12/27/cuil-fail-traffic-nearly-hits-rock-bottom/">seemingly flaming out</a>. </p>
<p>Cpedia is not an encyclopedia in the strictest sense as it is not written by human beings. Unlike traditional encyclopedias, which are written by paid experts, or Wikipedia, which is written largely by volunteers, Cpedia is written automatically from the search result pages creating an automated, and <a href="http://gigaom.com/2010/04/16/cpedia-founder-errors/">often wildly inaccurate</a> encyclopedia-like page. </p>
<p>For example, <a rel="nofollow" href="http://www.cpedia.com/wiki/Jonathan_Bailey_of_Plagiarism_Today_(all_pages)">I have a Cpedia, page</a> as well as <a rel="nofollow" href="http://www.cpedia.com/wiki?q=Plagiarism+Today">one for this site</a>, though my personal page doesn&#8217;t actually say anything about me and the one for PT seems to discuss random people/items only tangentially related to the site.</p>
<p>However, the concern Melebeth approached me with was not just about the accuracy of Cpedia, but about the way it used content from other sources. According to her, the search engine was lifting text directly from third-party sites but not properly quoting or citing it.</p>
<p>So, I delved into Cpedia and found, unfortunately, that her fears were largely founded.<span id="more-6818"></span></p>
<h4>How CPedia Works</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/06/pt-cpedia-sample-217x300.jpg" alt="" title="pt-cpedia-sample" width="217" height="300" class="alignright size-medium wp-image-6827"></p>
<p>The basic idea behind CPedia is that it combs through the search results for a relevant term and tries to build out an encyclopedia entry automatically. The results, visually, are very similar to Wikipedia but the content is generally more jumbled and difficult to read. </p>
<p>Cpedia does attribute the content it uses, but in a very strange way. If you click or hover your mouse over the text within an article, but not the link, you will be given a sidebar that shows the text that&#8217;s been quoted and a link to the source in the sidebar. If you click the inline text link, you are instead taken to a references page that then links to the original source. </p>
<p>Usually, the individual copied passages are very short though, sometimes, the word count of the passage approached 100 words, especially when the source package was broken up into multiple parts.</p>
<p>CPedia seems to have a nearly unlimited number of topics covered, likely aided by the fact it is automatically generating results, and has many pages that Wikipedia does not, including one for me.</p>
<p>All in all, CPedia is fairly straightforward but that does not mean it isn&#8217;t a problem. In fact, in this case, it means quite the opposite.</p>
<h4>Problems With Cpedia</h4>
<p>Apart from the questionable accuracy of Cpedia, the entire operation, to me, seems highly suspect. The idea of creating new pages of content using snippets from dozens, even hundreds of other pages seems to be a very poor way to do business. </p>
<p>But even discarding the way the content is created, there are several issues with the attribution issue alone. Consider the following two problems:</p>
<ol>
<li><strong>Always One Step From Source Link:</strong> Whether you hover over the text or click to the references page, you are always one action away from the source link. This means users and other search engines alike are always two steps from the source site even though it would be trivial to make it one.</li>
<li><strong>Lack of Clear Quotes:</strong> The entire entry is made up of short verbatim quotes from various sources but it is not clear where the quotes begin and end without hovering over the text. The goal is to make the entire work seem like an original creation, an actual encyclopedia entry, without much in the way of visible quotes, just traditional footnote citations.</li>
</ol>
<p>However, the bigger problem is actually very simple. There are already many sites that build thousands and thousands of entries using snippets from various other pages. They&#8217;re called spam blogs and they use a variety of article generation and spinning technology to build new articles out of hodgepodges of existing ones.</p>
<p>And Cpedia is acting very much like a spam blog. Entries from CPedia are appearing in Google, <a href="http://www.google.com/search?hl=en&#038;safe=off&#038;client=safari&#038;rls=en&#038;q=site%3Acpedia.com&#038;aq=f&#038;aqi=&#038;aql=&#038;oq=&#038;gs_rfai=">which currently has about 177,000 entries indexed</a>, and though, <a rel="nofollow" href="http://cpedia.com/robots.txt">Cpedia&#8217;s robots.txt</a> disallows the wiki directory, it doesn&#8217;t seem to be stopping search engines from indexing the entries.</p>
<p>When you factor all of this together, it becomes clear that Cpedia is acting exactly like a spam blog and less like an encyclopedia. Was the intention? Probably not. But it is how the site is functioning, pumping thousands of pages of poorly-written duplicate content into the major search engines.</p>
<p>If that is not the hallmark of a spam blog, I&#8217;m not sure what is.</p>
<h4>Making it Stop</h4>
<p>To be clear, what Cpedia is doing isn&#8217;t, most likely, illegal. Fair use would likely protect their very limited use of the content from each individual source. This is one of the reasons this technique is so common among spam blogs is that it makes them almost immune to copyright disputes as a means of closure.</p>
<p>In short, even though the ethics of Cpedia can be hotly debated, most likely they are on the right side of the law.</p>
<p>That being said, if you want your work removed from Cpedia, all you have to do is remove it from Cuil and that can be done by <a rel="nofollow" href="http://www.cpedia.com/info/webmaster_info/">using robots.txt to block &#8220;twiceler&#8221;</a>.</p>
<p>Also, you can block the IP range that Cuil uses for crawling, which is also listed on the link above.</p>
<p>It is a fairly simple change to make and one that is relatively easy to make. (Note: I have not and will not make it on PT, I keep my robots.txt open intentionally to help observe various issues, like these).</p>
<p>All in all, though I disagree strongly with what Cpedia is doing, they do have the right to do it. This makes fighting back trickier, but far from impossible.</p>
<h4>Bottom Line</h4>
<p>What Cpedia is doing, in my opinion, is unethical. They are using quotes from various sites without adequate clarity or attribution. They are pumping thousands of pages of admittedly duplicate content into other search engines and are producing and encyclopedia that, by their own admission, is wildly inaccurate. </p>
<p>Though copyright may not be a viable litigation route, I have to wonder how libel will work in this case as repeating libel is, generally, <a href="http://www.dancingwithlawyers.com/freeinfo/libel-slander-mis-information.shtml">the same as making the libelous statement</a>. In short, those admitted inaccuracies in Cpedia could, in theory, come back to bite the company at a later date.</p>
<p>Considering that search engine liability in cases of libel is still being settled around the world, <a href="http://newsinfo.inquirer.net/breakingnews/infotech/view/20100425-266422/Google-fined-for-pedophile-libel-against-priest">Google won such a claim in the UK</a> but republishing this information on your own site and admitting it is inaccurate seems to be opening up new avenues for liability.</p>
<p>Would this be a likely claim against Cuil/Cpedia? Probably not. But only because the audience for the site is so small that it seems unlikely many will care. The fact that Cuil/Cpedia has seen so little success is a big part of why webmasters haven&#8217;t noticed the spammy nature of the issue and taken up arms.</p>
<p>To be certain, Cpedia flew under my radar until Melebeth asked me about it. I can imagine it is doing the same for many others right now as well.  </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2010/06/09/cpedia-a-spam-blog-disguised-as-an-encyclopedia/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>3 Count: Empire Building</title>
		<link>http://www.plagiarismtoday.com/2009/07/17/3-count-empire-building/</link>
		<comments>http://www.plagiarismtoday.com/2009/07/17/3-count-empire-building/#comments</comments>
		<pubDate>Fri, 17 Jul 2009 16:05:51 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[global gaming factory]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[public-domain]]></category>
		<category><![CDATA[The-Pirate-Bay]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4048</guid>
		<description><![CDATA[This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at @plagiarismtoday. 1: New Pirate Bay Will Become a Pay Site...]]></description>
			<content:encoded><![CDATA[<p><em>This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://torrentfreak.com/new-pirate-bay-will-become-a-pay-site-090716/">New Pirate Bay Will Become a Pay Site</a></h4>
<p>First off today, in a move that is almost certain to rile up existing Pirate Bay users, the new owners have announced that they will converting the site into a paid service so they can pay copyright holders when their work is downloaded.</p>
<p>The announcement came from Wayne Rosso, who was appointed by the new owners of the site, Global Gaming Factory (GGF) to head up the operation. Rosso has experience with other P2P service, including Grokster, where he was the president of the company. </p>
<p>GGF agreed to purchase The Pirate Bay for approximately $7.8 last month. However, it appears that the sale is far from final as they first have to raise the money with which to make the purchase and need shareholder approval. Since shares in the company have gone down since the announcement, the latter part could be a pretty difficult step.</p>
<h4>2: <a href="http://news.bbc.co.uk/2/hi/technology/8156268.stm">Wikipedia painting row escalates</a></h4>
<p>Next up today, the dispute between Wikipedia and the National Portrait Gallery of London has been heading up. The dispute when a Wikipedia member, who has now been identified as David Coetzee uploaded over 3,000 images taken by the gallery of its works. Though the paintings themselves are in the public domain, under British law, photos and scans of them are not. </p>
<p>According to the gallery, the dispute centers more around Wikipedia harming their ability to sell prints and books based upon the images. That posting high-resolution versions on their site, for free, could hurt their efforts to recoup the £1 million in digitization fees they spent. They have reportedly offered medium-resolution images for Wikipedia to use.</p>
<p>Wikipedia, on the other hand, has accused the gallery of &#8220;empire building&#8221;, saying that the gallery is working to make claims on works now owned by the public and is looking to further their bottom line by restricting public access to works. </p>
<h4>3: <a href="http://www.stuff.co.nz/business/industries/2600744/Copyright-law-still-riles">Copyright law still riles</a></h4>
<p>Finally today, we have some of our first &#8220;reviews&#8221; of New Zealand&#8217;s new copyright proposal. The first one, which called for ISPs to disconnect users after two warnings, was scuttled after widespread user protest. However, the new one reworks the &#8220;three strikes&#8221; system by instead sending the alleged infringer to a tribunal where a variety of punishments can be levied, including fines and disconnections.</p>
<p>The reviews from the groups who lead the protest against the first draft say that the new one is an &#8220;improvement&#8221; but that there is still the danger for it to punish innocent users, especially in business and education settings.</p>
<p>It is unclear how the law would handle disconnections where there are multiple users involved, including many who likely do not share files, but it does seem that the new system would be in a better position to address those situations.</p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today, we&#8217;ll be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.talkshoe.com/tc/22590">every Saturday morning for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Monday morning right here on Plagiarism Today</a>. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/07/17/3-count-empire-building/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>3 Count: Round 2</title>
		<link>http://www.plagiarismtoday.com/2009/07/13/3-count-round-2/</link>
		<comments>http://www.plagiarismtoday.com/2009/07/13/3-count-round-2/#comments</comments>
		<pubDate>Mon, 13 Jul 2009 15:16:44 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[national portrait gallery]]></category>
		<category><![CDATA[new zealand]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[public-domain]]></category>
		<category><![CDATA[section 92a]]></category>
		<category><![CDATA[stephen fry]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4000</guid>
		<description><![CDATA[This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at @plagiarismtoday. 1: Round 2 in copyright fight First off today,...]]></description>
			<content:encoded><![CDATA[<p><em>This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://www.stuff.co.nz/technology/digital-living/2586890/Round-2-in-copyright-fight">Round 2 in copyright fight</a></h4>
<p>First off today, New Zealand is gearing up for a second round of its copyright fight over its controversial Section 92a legislation. The first round, which saw the government attempt to pass laws that would call for the disconnection of file sharers. was scuttled due in large part to protests by Internet users who felt that the punishments were disproportionate to the offense. </p>
<p>Recommendations for the new draft of 92a are due out later this week and there is widespread speculation as to what will be in the law. Though most don&#8217;t expect a full on repeat of the first round, either in terms of the protests or proposal, there is still talk that there may be a disconnection criteria in place and also the possibility of simply fining alleged infringers has been raised.</p>
<p>New Zealand&#8217;s move comes as France itself is going through round two of its &#8220;three strikes&#8221; litigation, the first round being scuttled by the country&#8217;s Constitutional Court. </p>
<h4>2: <a href="http://www.examiner.com/x-14795-Page-One-Examiner~y2009m7d12-Wikipedia-might-get-sued-by-the-National-Portrait-Gallery">Wikipedia might get sued by the National Portrait Gallery</a></h4>
<p>Next up, in a case that highlights an interesting difference between U.S. and English law, Wikipedia, or more specifically the Wikipedia user &#8220;Dcoetzee&#8221; may be sued by the National Portrait Gallery in London. The reason, the user uploaded several photographs of now-public domain portraits to the Wikipedia site, images that were taken by the gallery itself. The gallery has since demanded removal of the images from the site, a request that Wikipedia has not obliged.</p>
<p>Those familiar with U.S. law will know that simply taking a picture of a public domain work does not produce a copyrighted work as there is no requisite creativity. However, that is not so in England. Combine that with the fact the images are being &#8220;published&#8221; in the UK due to their presence on the servers and you have a recipe where this user could, very likely, be sued successfully in the UK.</p>
<p>The problem being, of course, that if the user is in the U.S., or any country other than the UK, claiming any damages won could prove to be very difficult.</p>
<p>Despite this, the case does highlight the kind of nuances that make &#8220;international copyright&#8221; a figment of our collective imagination. </p>
<h4>3: <a href="http://www.bbc.co.uk/blogs/technology/2009/07/fry_on_copyright.html">Stephen Fry on copyright</a></h4>
<p>Finally today, British actor and Twitter celebrity Stephen Fry spoke his mind a little bit on copyright at the iTunes Music Festival. </p>
<p>Though he said that he didn&#8217;t want to be misunderstood as a &#8220;&#8221;help yourself and be a pirate&#8217; advocate,&#8221; he did say that he didn&#8217;t approve of the litigation tactics by the movie and record industries (the former being an industry he&#8217;s a large part of), said that The Pirate Bay admins were unjustly pursued and that he had used Bittorrent himself to get access to an episode of House, which he had legitimately purchased earlier (and stars his old friend and partner Hugh Laurie).</p>
<p>In the end, Fry did admit that much of his income was generated by copyright protection and he did try to temper his comments some after the fact, however, the BBC article says that Fry was &#8220;preaching to the converted&#8221; with his talk and received a very warm reception at the festival.  </p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today, we&#8217;ll be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.talkshoe.com/tc/22590">every Saturday morning for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Monday morning right here on Plagiarism Today</a>. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/07/13/3-count-round-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Chris Anderson Plagiarism Controversy</title>
		<link>http://www.plagiarismtoday.com/2009/06/24/the-chris-anderson-plagiarism-controversy/</link>
		<comments>http://www.plagiarismtoday.com/2009/06/24/the-chris-anderson-plagiarism-controversy/#comments</comments>
		<pubDate>Wed, 24 Jun 2009 20:19:45 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[cc]]></category>
		<category><![CDATA[citation]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Creative-Commons]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[publishing]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=3878</guid>
		<description><![CDATA[Wired Editor Chris Anderson has been accused of plagiarism in his recent book, but do the accusations hold up?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/06/imageDB.cgi.jpeg" alt="imageDB.cgi" title="imageDB.cgi" width="120" height="182" class="alignleft size-full wp-image-3881" /></p>
<p><a href="http://www.ted.com/speakers/chris_anderson_ted.html">Chris Anderson</a> is the editor for Wired Magazine, a prominent tech journalist and a <a href="http://www.wired.com/techbiz/it/magazine/16-03/ff_free">long-time advocate of the &#8220;free&#8221; business model</a>. His upcoming book, enttield &#8220;Free: The Future of a Radical Price&#8221; has been the talk of many in the tech sectors as it deals with the idea of giving content away for free and then earning a living through other means.</p>
<p>However, even before the book has been officially released, it has become mired in controversy. Waldo Jaquith of the Virginia Quarterly Review (VGR), a literary review journal associated with the University of Virginia, <a href="http://www.vqronline.org/blog/2009/06/23/chris-anderson-free/">noticed similarities between some passages in Anderson&#8217;s book and other sources</a>. After putting some passages through Google, he found over a dozen instances where it appears Anderson copied content from Wikipedia as well as other sources.</p>
<p>These allegations were furthered by <a href="http://www.edrants.com/chris-anderson-plagiarist/">Edward Champion on his site</a>, who compared several more passages from other books and blog posts.</p>
<p>This has sparked a firestorm of controversy on both Twitter and the comments section of the VGR post.</p>
<p>Without actually looking at the work (I haven&#8217;t seen a copy of the book) or being able to do a formal plagiarism analysis, I can&#8217;t offer too much commentary, but I do want to briefly recap what has been said and what I think it means.<span id="more-3878"></span></p>
<h4>Both Sides</h4>
<p>According to Jaquith, he performed a &#8220;cursory&#8221; search of the book after he, &#8220;checked by hand several dozen suspect passages in the whole of the 274-page book.&#8221; He repeatedly emphasizes that this is not an exhaustive search of the work as he did not have an electronic copy at his disposal.</p>
<p>Jaquith listed eight different examples where at least some text duplicated verbatim. The lengthiest case involved pages 41-42 in Anderson&#8217;s book, which appears to copy heavily from a Wikipedia entry about the origin of the expression &#8220;Free Lunch&#8221;. Both of the works cite an article in the New York Times about &#8220;Free Lunch&#8221; counters but there is copying that goes beyond that including several sentences that are copied verbatim and other passages that appear to be close paraphrases.</p>
<p>These accusations were furthered by Champion, who listed five more examples on his blog, each involving passages that appeared to have at least some level of copying. </p>
<p>In all of the cases, the amount of pure verbatim plagiarism is fairly small, however, in many cases the amount of editing and rewriting appeared to be very small, either changing a few words or removing passages for brevity.</p>
<p>Anderson, for his part, doesn&#8217;t deny having copied from the sources, but claims that he had intended to cite the sources involved. In a statement he released he said:</p>
<blockquote><p>All those are my screwups after we decided not to run notes as planned, due to my inability to find a good citation format for web sources…</p>
<p>This all came about once we collapsed the notes into the copy. I had the original sources footnoted, but once we lost the footnotes at the 11th hour, I went through the document and redid all the attributions&#8230;</p></blockquote>
<p>He went on to say that he &#8220;missed&#8221; some of the attributions, especially some of those from Wikipedia where there was no individual author to credit, and that he plans on publishing those notes online before the book is released.</p>
<p>In a separate statement, Anderson&#8217;s publisher, <a href="http://hyperionbooks.com/">Hyperion</a>, said the following:</p>
<blockquote><p>We are completely satisfied with Chris Anderson’s response. It was an unfortunate mistake, and we are working with the author to correct these errors both in the electronic edition before it posts, and in all future editions of the book.</p></blockquote>
<p>This, however, has not stopped the war of words taking place on various social news sites and in the comments of the original post. The debate has also caught the attention of the mainstream media, including mentions in <a href="http://www.guardian.co.uk/books/2009/jun/24/wired-free-wikipedia-credit">The Guardian</a>, <a href="http://weblogs.baltimoresun.com/entertainment/books/blog/2009/06/free_by_chris_anderson_plagiar.html">The Baltimore Sun</a> and the <a href="http://latimesblogs.latimes.com/jacketcopy/2009/06/chris-andersons-free-borrows-freely-from-wikipedia-and-other-sources.html">L.A. Times</a>.</p>
<h4>My Thoughts</h4>
<p>Without having the work in front of me in an electronic format and only having these very limited samples, it is very hard for me to perform a thorough analysis. Given the fact that most of the plagiarism involved at least some rewriting, I would normally encourage both a thorough electronic analysis of the entire work and a close human one on suspect parts.</p>
<p>However, I can&#8217;t do that right now and all I have are the passages presented and Anderson&#8217;s side of the story. There is not enough here, by itself, to draw solid conclusions from, especially in the limited time I&#8217;ve had.</p>
<p>Indeed, Anderson&#8217;s story may have merit. If he&#8217;s able to produce the footnotes easily, it would indicate that he was keeping them and did, likely, intended them to be in the book (why keep footnotes if you aren&#8217;t planning on publishing them?).</p>
<p>But what I find curious is his claim that he and/or his publisher &#8220;decided not to run notes as planned, due to my inability to find a good citation format for web sources.&#8221;</p>
<p>This statement strikes me as odd for many reason. First, there is a well-established citation style for Web pages. Both <a href="http://owl.english.purdue.edu/owl/resource/557/09/">MLA</a> and <a href="http://www.liu.edu/cwis/cwp/library/workshop/citapa.htm">APA</a> styles have one. Wikipedia also provides links to each individual edit, making the argument about having to preserve the sources a difficult one. Of course, even without that there are <a href="http://webcitation.org/">plenty of on-demand caching services that could have helped</a>.</p>
<p>Second, I don&#8217;t fully understand why Anderson or his publisher would decide to do away with the footnotes at the last minute. It&#8217;s unclear if this was a technical issue, a design choice or something other issue. Removing footnotes from a book and rewriting the body to include inline citations &#8220;at the 11th hour&#8221; is very unorthodox in my experience and it would seem to almost be an invitation for mistakes and problems.</p>
<p>Finally, and perhaps most worrisome, as I read through the amount of the copying in some of the sections and passages, there is far more verbatim copying and close paraphrasing than one would usually use with a simple footnote. Typically a footnote is designed more to cite the source of information, not a direct quote and Anderson&#8217;s mention of doing a &#8220;write-through&#8221; of some of the citations seems to be a very odd way to handle the problem. </p>
<p>In short, the attribution has to match the use and, in at least some of these instances, it doesn&#8217;t appear to me to be the case.</p>
<p>It is important to remember that this is not like the <a href="http://www.plagiarismtoday.com/2009/05/18/the-maureen-dowd-plagiarism-scandal/">Maureen Dowd case</a> where we are talking about one quote with omitted attribution, but about at least a dozen and, by Anderson&#8217;s own admission, there was a problem with the footnotes for the whole book that, clearly was not straightened out adequately.</p>
<p>There is no nice way to say this, at the very least this is very sloppy editing and research and, unfortunately, it paints a very negative picture of the book. Even though it doesn&#8217;t appear Anderson intended to plagiarize, his liberal copying and rewriting of other texts combined with the lack of thoroughness in attribution is, at best, extremely sloppy research.</p>
<p>Unfortunately, this issue severely undermines the message of his book, parts of which it seems like I was going to agree with.</p>
<h4>Bottom Line</h4>
<p>In the end, it doesn&#8217;t really matter whether Anderson&#8217;s copying truly rises to the level of malicious or even lazy plagiarism. The proper thing for the publisher to do would be to not release the book until these errors can be fixed. I recognize that it would probably mean destroying thousands of printed copies, but there is a very simple and serious problem with releasing it as is.</p>
<p>Wikipedia, operates under a <a href="http://creativecommons.org/licenses/by-sa/3.0/">Creative Commons BY-SA license</a>, the same license as this site. The license not only has an attribution requirement, but the <a href="http://creativecommons.org/licenses/by-sa/3.0/legalcode">legal code</a> requires the person using the content to &#8220;keep intact all copyright notices for the Work and provide, reasonable to the medium or means.&#8221;</p>
<p>One could very easily, and likely successfully, argue that posting the attribution on a Web site separate from the book does not complete the CC license as it is not &#8220;reasonable to the medium or means&#8221;. In short, if one is able to show that this unattributed copying goes beyond fair use, it could be a copyright liability.</p>
<p>Other books, <a href="http://www.plagiarismtoday.com/2006/05/02/kaavya-scandal-update/">including Kaavya Viswanathan&#8217;s</a> were cancelled for less (in Anderson&#8217;s defense, Viswanathan&#8217;s book was a work of fiction, leading to a higher expectation of originality) and a person claiming to be an editor in the comments said that he has had to kill more than a few books for similar offenses.</p>
<p>It would be both prudent legally and much more respectful to the original creators for the book to be held back until these issues can be fixed and the book be thoroughly checked for other possible un-cited quotes.</p>
<p>Though looking at the evidence I have in front of me, I can not honestly tell if this was a case of malicious plagiarism, sloppy editing or a very bad mistake, but it is clear that there are issues with the citations in this book that need to be fixed and waiting to correct the electronic version or make edits to the future editions is not enough.</p>
<p>The bottom line is that, if we accept Anderson&#8217;s story at face value (and we have no evidence to make us doubt it), whoever made the decision to drop the footnotes at the last minute made a very serious error, one that almost invited this kind of problem. Without looking at the original citations, I can&#8217;t make any hard judgements about whether the citation was appropriate for copying, but if we assume that it was, the decision to change systems at the last minute was an unmitigated disaster.</p>
<p>I would love to do a thorough plagiarism analysis of the book and would gladly do so if I could be provided a suitable copy but, in lieu of that, I would encourage the publisher to perform one of their own. There are plenty of tools out there that can perform one cheaply and quickly. Doing so could save the publisher a great deal of headaches down the road. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/06/24/the-chris-anderson-plagiarism-controversy/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>3 Count: Turnabout is Fair Use</title>
		<link>http://www.plagiarismtoday.com/2009/05/22/3-count-turnabout-is-fair-use/</link>
		<comments>http://www.plagiarismtoday.com/2009/05/22/3-count-turnabout-is-fair-use/#comments</comments>
		<pubDate>Fri, 22 May 2009 15:46:56 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[4chan]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Creative-Commons]]></category>
		<category><![CDATA[fair-use]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[real]]></category>
		<category><![CDATA[realdvd]]></category>
		<category><![CDATA[wikimedia]]></category>
		<category><![CDATA[wikipedia]]></category>
		<category><![CDATA[YouTube]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=3573</guid>
		<description><![CDATA[This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at @plagiarismtoday. 1: Pornographic videos flood YouTube Yesterday, YouTube was raided...]]></description>
			<content:encoded><![CDATA[<p><em>This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://news.bbc.co.uk/2/hi/uk_news/8061979.stm">Pornographic videos flood YouTube</a></h4>
<p>Yesterday,  YouTube was raided by members of the 4Chan message boars, who uploaded score of pornographic clips to the site, many of which were labeled as children&#8217;s videos and begin with such content before cutting to graphic content. Some of the affected videos were for music groups among kids, including the Jonas Brothers.  </p>
<p>Why would a group of people work tirelessly to expose children to pornography? According to at least one poster it was because &#8220;YouTube keeps deleting music&#8221;. Whether this is the cause for the entire raid, remains to be seen.</p>
<p>Though YouTube worked quickly to remove the videos, thumbnails for many of them lingered long after the videos were pulled down and were available through a variety of searches. As of this writing, YouTube appears to have eliminated the videos for the most part and searches are working normally.</p>
<h4>2: <a href="http://news.cnet.com/8301-1023_3-10246638-93.html">At RealDVD hearing, MPAA says copying DVDs never legal</a></h4>
<p>It appears that Real&#8217;s claims about about fair use in the RealDVD case might have a new wrinkle. The MPAA has requested an estoppel ruling form the judge in the case that would bar Real from arguing fair use in the case. The reason? Real had argued the exact opposite about ten years ago in a similar case and won on those grounds.</p>
<p>The previous case dealt with Streambox VCR, a product that allowed the downloading and storing of Real&#8217;s streaming music and video, saying in part that, &#8220;The DMCA does not have a fair use exception allowing individuals to circumvent access and copy protection measures.&#8221;</p>
<p>Real, for their part, compared DVDs to CDs, where the copying of the data onto your hard drive (for example, iTunes) is an understood fair use and standard practice.</p>
<p>The judge heard closing arguments on the hearing and did not issue a ruling. That will be handed down at a later date.</p>
<h4>3: <a href="http://www.readwriteweb.com/archives/wikimedia_foundation_adds_creative_commons_license.php">Wikimedia Foundation to Add Creative Commons License for All Content</a></h4>
<p>Finally today, The Wikimedia Foundation, which includes Wikipedia, has added the Creative Commons BY-SA license to all of their content. Under the current system, existing content will be dual-licensed and new content will be CC-BY-SA licensed only. </p>
<p>The CC license was chosen because of requirements of GFDL, the former license, that made reusing content in Wikipedia difficult. Those requirements included the requirement to copy the full license along with any reuse of the content, which could have hindered many uses of the work.</p>
<p>This change was made possible in November of 2008 when the Free Software Foundation updated the GFDL license to allow double licensing and portability. This decision has not been approved by the board of directors at Wikimedia but was supported by over 75% of all voters in a recent Wikipedia survey.</p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today, we&#8217;ll be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.talkshoe.com/tc/22590">every Saturday morning for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Monday morning right here on Plagiarism Today</a>. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/05/22/3-count-turnabout-is-fair-use/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>3 Count: Kindle Bricks</title>
		<link>http://www.plagiarismtoday.com/2009/04/15/3-count-kindle-bricks/</link>
		<comments>http://www.plagiarismtoday.com/2009/04/15/3-count-kindle-bricks/#comments</comments>
		<pubDate>Wed, 15 Apr 2009 13:53:04 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Copyright News]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[camming]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Creative-Commons]]></category>
		<category><![CDATA[hannah montanna]]></category>
		<category><![CDATA[Kindle]]></category>
		<category><![CDATA[piracy]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=3235</guid>
		<description><![CDATA[This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at @plagiarismtoday. 1: Wikipedia community vote on migration to CC BY-SA...]]></description>
			<content:encoded><![CDATA[<p><em>This is daily column on Plagiarism Today where the site brings you three of the days biggest, most important copyright and plagiarism news links. If you want to offer your feedback on the column, use the contact form or just follow me on Twitter at <a href="http://twitter.com/plagiarismtoday">@plagiarismtoday</a>.</em></p>
<h4>1: <a href="http://creativecommons.org/weblog/entry/13967">Wikipedia community vote on migration to CC BY-SA begins now</a></h4>
<p>First off today, in a move that could change the future of both the free culture movement and Wikipedia, there is a vote underway regarding converting the online encyclopedia from the Free Documentation License, a license that was aimed at instructional documents, to a Creative Commons BY-SA license, its rough equivalent of the CC licenses.</p>
<p>Though the two licenses are founded on most of the same principles, differences in the specific requirements prevent them from being completely compatible. As such, Wikipedia is holding a vote, in which anyone with over 25 edits can participate, The Creative Commons Organization, obviously, supports this initiative.</p>
<p>The move is motivated by two main factors. The first that the CC licenses have become the de facto standard for copyleft licensing by authors on the Web, making full compatibility an issue, and that the Free Documentation License comes with several restrictions, including the the need to distribute the copy of the license with every use.</p>
<h4>2: <a href="http://www.chicagotribune.com/entertainment/chi-hannah-pirate-14apr14,0,6438908.story">Busted for taking camera to Hannah Montana movie</a></h4>
<p>Next up, a Chicago-area couple was arrested this week for attempting to record the new Hannah Montana movie, apparently as part of a larger-scale DVD piracy operation. Police arrested the couple for balancing a video camera during a showing of the movie and, after a search of their home, found 44,000 CDs and DVDs containing pirated movies.</p>
<p>The couple, however, denies that they were involved with any piracy scheme, saying that the DVDs belonged to the mother of the husband. </p>
<p>Both the husband and the wife have been charged with criminal misuse of a motion-picture facility and the husband also faces charges of unlawful use of a sound-recording device, computer fraud and online sale of stolen property.</p>
<h4>3: <a href="http://www.crn.com/retail/216500680">Returning Product To Amazon Could Brick Your Kindle</a></h4>
<p>Finally today, if you are an Amazon customer and have a Kindle, you may want to be aware that returning too many items to Amazon could result in your account being banned and, if that happens, your Kindle, which is attached to your account, may become a nice paperweight.</p>
<p>The problem stems from the fact that the Kindle is tied to an Amazon account, where all of the purchased books are stored. If the account is banned for returning too many physical goods, it loses access to its electronic data as well.</p>
<p>Though one customer who had this happen to them won on an appeal and had their account and their Kindle restored, it is unnerving to say the least that Amazon could turn the Kindle into a brick.</p>
<h4>Suggestions</h4>
<p>That&#8217;s it for the three count today, we&#8217;ll be back tomorrow with three more copyright links. If you have a link that you want to suggest a link for the column or have any proposals to make it better. Feel free to leave a comment or send me an email. I hope to hear from you. </p>
<h4>Want the Full Story?</h4>
<p>Tune in <a href="http://www.talkshoe.com/tc/22590">every Saturday morning for the live recording of the Copyright 2.0 Show</a> or wait and get the edited version <a href="http://www.plagiarismtoday.com/category/podcast/">Monday morning right here on Plagiarism Today</a>. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/04/15/3-count-kindle-bricks/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Why Google Knol Has Failed</title>
		<link>http://www.plagiarismtoday.com/2008/09/23/why-google-knol-has-failed/</link>
		<comments>http://www.plagiarismtoday.com/2008/09/23/why-google-knol-has-failed/#comments</comments>
		<pubDate>Tue, 23 Sep 2008 16:51:03 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Punditry]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[google knol]]></category>
		<category><![CDATA[knol]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[wiki]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1787</guid>
		<description><![CDATA[Google's Knol service seems to have already fallen on hard times, and is wrestling with junk content, plagiarized articles and enough negative buzz to kill any goodwill the project ever had. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/google-knol-logo.jpg" alt="google-knol-logo.jpg" border="0" width="223" height="55" align="left" class="picleft" />It may be too early to say with 100% certainty that Google Knol has utterly failed. The site <a href="http://www.plagiarismtoday.com/2008/07/30/google-knol-the-future-of-spam/">has been out just shy of two months</a> and has actually attracted a great deal of content. </p>
<p>However, as a <a href="http://www.slate.com/id/2200401">recent article on The Slate has shown</a>, Knol has not exactly emerged into the Wikipedia-killer it was hyped up to be. The article sites rampant plagiarism, poorly-written articles and poorly-sourced facts as some of Knol&#8217;s signs of inferiority.</p>
<p>So, even if it is not time yet to truly call Knol a failure, we&#8217;re clearly to a point where we can start talking about what has gone wrong and what Google can do to fix it. </p>
<p>Unfortunately, the problems with Knol seem to be fundamental ones with the service itself and are not things that Google can fix easily. In short, if Google Knol is to become competitive with Wikipedia, or even blogs in general, it can not be the Knol we all know and despise. <span id="more-1787"></span><br />
<h4>Knowledge Gone Astray</h4>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/google-knol-2.jpg" alt="google-knol-2.jpg" border="0" width="226" height="79" align="right" class="picright" />The Slate article points to two critical differences between Wikipedia and Knol that seems to hinder Knol&#8217;s success.</p>
<ol>
<li><strong>Lack of Community Support:</strong> In order to edit an existing Knol, you have to get approval from the original author, this reduces community involvement and puts anonymous editors at the mercy of the &#8220;experts&#8221; that wrote the article.</li>
<li><strong>Repetition:</strong> Since articles cannot be easily edited, people tend to upload their own articles rather than try to build off ones that exist, this leads to topics with dozens of competing articles.</li>
<li><strong>Advertising Reward:</strong> Google allowed submitters to earn money from their content by means of advertising. This attracted people with bad intentions to the site.</li>
</ol>
<p>Strangely, I have to agree with all of these problems. When I first wrote about the site, I noted that it was a service practically asking to be spammed and, <a href="http://www.plagiarismtoday.com/2008/08/14/knol-spam-two-weeks-later/">according to my two-week follow-up</a>, I was right.</p>
<p>However, none of these problems with the site really cut to the core. Google has made some tactical mistakes with Knol but it has made a much larger one in its concept that will sink it no matter what other changes they make.</p>
<h4>The Problem is QUALITY</h4>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/google-knol-3.jpg" alt="google-knol-3.jpg" border="0" width="250" height="211" align="left" class="picleft" />A three-word summary of Knol would likely read &#8220;Quantity over Quality&#8221;.</p>
<p>Google, as a search engine company, approached the problem of creating an encyclopedia much like you would expect. They set up the rules of Knol in such a way that it would encourage as much content as possible to be uploaded to the service. </p>
<p>Then, according to the theory, a combination of human ratings and search algorithms would pluck the best results, rewarding those who knew wrote the most authoritative articles.</p>
<p>Unfortunately, the theory is flawed beyond all recognition. Consider these problems:</p>
<ul>
<li><strong>Search is Stupid:</strong> As Google&#8217;s traditional Internet search has shown, algorithms are mediocre, at best, at locating the authoritative sites. Google Knol, despite the controlled environment, does not seem to improve upon this.</li>
<li><strong>Humans Aren&#8217;t Much Better:</strong> Humans that use Knol are trying to find out information they don&#8217;t know. They are in almost no position to judge how authoritative an article is. Most humans will not check sources or verify what they read, meaning an authoritative-sounding pile of lies has a better chance of success than a more meek, factual article. </li>
<li><strong>Spammers Can Game the System:</strong> Not only can spammers easily upload articles to Google Knol, but they can game the system to vote for themselves and feign broad user support. They do this already with blogs using inbound links but the controlled environment of Knol actually makes it easier.</li>
<li><strong>Terms are More Important Than Being Number One:</strong> As with the Internet at large, you can gain more reward being number 10 for a popular keyword than being number one on a lesser-known one. Though this increases competition on popular keywords, it rewards mediocre content on popular phrases more than robust knowledge on all topics.</li>
<li><strong>The 95% Problem:</strong> The biggest problem, however, is that even if the theory worked perfectly, most of the content on Knol would be complete and utter garbage. Though the good stuff might be at the top, it doesn&#8217;t take much for users to sink into the cesspool that is below. This colors the experience, hurts the reputation of the good articles and makes people distrust Knol as a source of information.</li>
</ul>
<p>The end result of this is not that Google has created its own Wikipedia, instead, Google has created its own miniature Internet where they set the rules.</p>
<p>The problem is that this new Web offers no compelling reasons for creators or visitors to access it. Creators can make more money and get more traffic running a traditional blog while visitors have the same access to quality information outside of Google&#8217;s walled garden.</p>
<p>The end result is that Knol has created a service that benefits no one but the spammers that abuse it. As such, Knol has virtually seals its own fate unless it can turn that around.</p>
<h4>Fixing Knol</h4>
<p>I would argue that, with its current mission statement, Knol can&#8217;t be fixed. The problems that plague Knol are ones that exist on the Web at large, but Knol&#8217;s close proximity to Google&#8217;s heart amplifies all of these issues and makes it easier for those that wish to abuse the service.</p>
<p>In order to fix Knol, Google needs to step away from this idea that Knol is meant to be the anti-Wikipedia. Though Wikipedia has its problems and certainly isn&#8217;t a reliable source, it is both more useful and more reliable than Knol.</p>
<p>For all that Wikipedia has done wrong, it has done a fair amount right including build a huge community, produce an incredibly large number of entries, work to enforce a quality standard and keep itself moving forward. Sure, it makes mistakes, has vandals and can be a cesspool in places, it isn&#8217;t spinning its wheels with endless repetition of the same information.</p>
<p>Google, to beat Wikipedia cannot throw out what it does well in a bid to oppose it. Beating something does not mean being its foil. It means doing what it does better.</p>
<p>Knol&#8217;s premise is simply not as good as Wikipedia&#8217;s. It is time for Google to go back to the drawing board and figure out not what it can do different, but what it needs to do the same and then what it can improve.</p>
<h4>Conclusions</h4>
<p>For those of us who don&#8217;t use Knol, the failure of the service is actually good news. After all, it was our content that was making up so many of the plagiarized pages on the service. Knol&#8217;s failure to gain any traction in the past two months is a positive sign indicating that, while it is certainly an ideal for spammers, it is unlikely that they will get much out of it. </p>
<p>This means that Google&#8217;s failure could, in the long run, work to our benefit. However, until Knol has been proved to be completely useless for the spammers, it is likely that they will continue to pound away on it. </p>
<p>After all, if there is even a drop of benefit to be gained, the spammers will continue to try. They don&#8217;t invest anything in their efforts, other than setting up their computers and letting them go. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/09/23/why-google-knol-has-failed/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Knol Spam: Two Weeks Later</title>
		<link>http://www.plagiarismtoday.com/2008/08/14/knol-spam-two-weeks-later/</link>
		<comments>http://www.plagiarismtoday.com/2008/08/14/knol-spam-two-weeks-later/#comments</comments>
		<pubDate>Thu, 14 Aug 2008 15:59:48 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[google knol]]></category>
		<category><![CDATA[knol]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1529</guid>
		<description><![CDATA[It has been two weeks since Google Knol made its controversial debut. But are the predictions of a spam haven coming true? I took a look to find out. ]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/08/knol-logo.png" alt="knol-logo.png" border="0" width="224" height="50" align="left" class="picleft" />A little bit over two weeks ago I, as well as many others, <a href="http://www.plagiarismtoday.com/2008/07/30/google-knol-the-future-of-spam/" title="Google Knol Spam">expressed concern Google Knol would become a haven for Web spam</a>.</p>
<p>At that time, these predictions were mere speculation. Though a video would quickly come online showing that the spammers could already post automatically to Knol, there was little reason to suspect a spam attack beyond the nature of the service and Google&#8217;s history in this area.</p>
<p>This raises the question, now that the headlines have faded, has Google Knol become a haven for spam? Though it is hard to tell how serious things are given the short time span that has passed, it seems clear that at least some of the predictions were eerily correct.<span id="more-1529"></span><br />
<h4>Methodology</h4>
<p><img src="http://www.plagiarismtoday.comwp-content/uploads/2008/08/knol-results-20080814-110233.png" alt="knol-results.png" border="0" width="262" height="280" align="right" class="picright"/>In order to get a rough picture for how much spam existed on Google Knol and how it was affecting the rankings within the service, I decided to perform a very small study. </p>
<p>I searched for six keywords, three that are spammy in nature, three that are not. I then looked at the first page of the results and tried to find the highest-ranked case of duplicate content I could find. I did this using only string matching, so article synonymizers might have escaped detection.</p>
<p>This was just a cursory check and was designed to only locate articles that were clearly duplicate content. Also, this check was ONLY designed to catch duplicate content, not necessarily spam and, as you can see in many cases, the duplicate content was uploaded by the author.</p>
<p>Finally, if a search term did not produce at least one full page of results, ten items or more, I did not use it because such searches likely favored spammers. </p>
<p>However, despite my attempts to give Knol the benefit of the doubt, I had little trouble finding junk content.</p>
<p>Note: All links to Google Knol have been nofollowed just in case.</p>
<h4>Keyword: Poker</h4>
<p><strong><a href="http://knol.google.com/k/dafydd-lewis/poker/tc8qjfuo0q7b/2#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 6<br />
<strong>Number of Stars:</strong> 1<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 2<br />
<strong>Original Article:</strong>: <a href="http://www.getrichcasino.co.uk/pokergames.html" rel="nofollow">British Casino</a><br />
<strong>Commentary:</strong> Appears to be a manual copy/paste job taking only a relevant portion of an article about different types of poker. The content also appears on many spam blogs. In short, this appears to be the result of a spam attack on Knol, at least at this time.</p>
<h4>Keyword: Viagra</h4>
<p><strong><a href="http://knol.google.com/k/ron-hashmonay/sildenafil-citrate/22er9vywcckoz/4#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 2<br />
<strong>Number of Stars:</strong> 5<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 2<br />
<strong>Original Article:</strong>: <a href="http://www.viagra.com/content/about-viagra-ed-treatment.jsp" rel="nofollow">Viagra Web Site</a><br />
<strong>Commentary:</strong>  This is a tricky one as it appears to have been passed around the Web, especially via spam blogs, for a very long time. Most of the content is from the Viagra site with changes made to eliminate formatting. Once again, this appears to be the likely result of a spam attack, probably from an article database.</p>
<h4>Keyword: Mortgage</h4>
<p><strong><a href="http://knol.google.com/k/anonymous/mortgage/3in7a24ehakx6/3#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 2<br />
<strong>Number of Stars:</strong> 5<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 2<br />
<strong>Original Article:</strong>: <a href="http://flexamortgage.com/english/index.php?action=page_display&#038;PageID=5" rel="nofollow">Mortgage Site?</a><br />
<strong>Commentary:</strong> This one is almost impossible to tell who has the original article. However, lengthier versions of this piece have been passed around the Web for a long time by all accounts. One of the most likely candidates is a mortgage site linked above. However, this Knol appears to be a likely spam attack as well. </p>
<h4>Keyword: Apple</h4>
<p><strong><a href="http://knol.google.com/k/guy-doenyas/apple/3lnaay85v8bst/2#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 1<br />
<strong>Number of Stars:</strong> 5<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 2<br />
<strong>Original Article:</strong>: <a href="http://www.freebase.com/view/en/apple_inc" rel="nofollow">Freebase &#038; Wikipedia</a><br />
<strong>Commentary:</strong> The first non-spam keyword is the first case of clearly duplicate content being number one. Worst of all, this content was lifted from the Wikipedia entry on Apple, which has since been changed. </p>
<h4>Keyword: Movie</h4>
<p><strong><a href="http://knol.google.com/k/jay-johnson/outdoor-projectors-for-outdoor-movies/t3dak06d404w/3#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 4<br />
<strong>Number of Stars:</strong> 5<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 3<br />
<strong>Original Article:</strong>: <a href="http://www.openaircinema.us/outdoorprojector.htm" rel="nofollow">Company Web Site</a><br />
<strong>Commentary:</strong> This isn&#8217;t a case of copyright infringement or scraping, the company itself is repurposing it&#8217;s work for Google Knol in a bid to get links to its site up. The entire article has been put up on Knol, complete with a link back to the site&#8217;s home page. Not necessarily unethical, but still somewhat against the spirit of Knol.</p>
<h4>Keyword: Dog</h4>
<p><strong><a href="http://knol.google.com/k/saman-rashid/dogs/3ilq1lmt7jxaq/5#" rel="nofollow">First Questionable Result</a></strong></p>
<p><strong>Rank:</strong> 1<br />
<strong>Number of Stars:</strong> 5<br />
<strong>Comments:</strong> 0<br />
<strong>Revisions:</strong> 3<br />
<strong>Original Article:</strong>: <a href="http://allpetsinformation.blogspot.com/2008/06/getting-to-know-your-new-dog-and.html" rel="nofollow">Blog</a><br />
<strong>Commentary:</strong> Another case of an author reposting their own work. Though the original blog appears to be drowning in ads, it still seems to be legitimate. Since the content could not be found anywhere else and the names match, I am forced to assume that the original author uploaded it to both places.</p>
<h4>Results</h4>
<p>When it was all said and done, none of the searches went more than six results without duplicate content and four of the six had duplicate material in the first two items.</p>
<p>This paints a very grim picture of Knol but it appears that much of the problem is not spammers, but authors seeking extra exposure for older works. </p>
<p>While this is clearly not a copyright violation and also not against Google&#8217;s policy on Knol, it does little to help the site. If Knol is to succeed, it needs a large volume high-quality original content.</p>
<p>However, at this point, Google has done a very poor job keeping duplicate content out of Knol and/or reducing its ranking. Also interesting is that the spam content and other duplications almost always have high ratings, indicating the possibility that spammers have started to game the voting system as well.</p>
<p>Things don&#8217;t look good for Knol. Considering it has barely been two weeks and things already look this bad, it seems likely that they are only going to get worse.</p>
<h4>Google Hits Back</h4>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/08/similar-content-knol.png" alt="similar-content-knol.png" border="0" width="234" height="98" align="left" class="picleft" />One thing I noticed as I was performing this &#8220;study&#8221; was that Google has added an element to the sidebar of every Knol entry entitled &#8220;Similar Content on the Web&#8221;.</p>
<p>This section provides links to other copies of the work that Google has detected on the Web along with percentages to indicate how much of the content has been duplicated. </p>
<p>It is unclear at this time if this section updates to point out possible cases of plagiarism for authors posting original content to Knol or if it is simply designed to indicate where the content may have come from and is a one-time check.</p>
<p>It is also interesting that Google&#8217;s search engine often detected many other results not found in the sidebar, including, at least a few times, the original site. </p>
<p>I will have more on this later.</p>
<h4>Conclusions</h4>
<p>The bottom line right now is that Google Knol is already a haven for spam and duplicate content. Whether it is authors republishing their work in hopes of getting a little extra Adsense revenue or spammers pushing out junk content, the junk results are high in the search and only get uglier the lower down you go.</p>
<p>Though <a href="http://tsdwebsitedesign.blogspot.com/2008/08/wikipedia-fulle-of-spam.html">Wikipedia has had its issues with spam</a> in the past, I can not recall it ever looking quite like this.</p>
<p>If Google Knol is to ever have a chance to become anything other than a spam haven, it needs to hit back now and do its part to keep duplicate content out. Google clearly has the tools already, but has been timid about using them, even to adjust ranking.</p>
<p>Knol needs original content to thrive but if it does not start encouraging and rewarding it now, it may create a situation where the few original authors are buried under a pile of duplicate content and searchers have no motivation to look to Knol for new information as almost all of it will be available elsewhere.</p>
<p>Hopefully, this can be the beginning of Google taking these issues seriously and using its tools to hit spammers hard. </p>
<h4>Related Links</h4>
<p><a href="http://mashable.com/2008/07/29/google-knol-splogs/">Mashable on Knol Spam</a><br />
<a href="http://www.problogger.net/archives/2008/07/24/googles-knol-a-wikipedia-killer-or-a-blog-killer/">Problogger on Knol</a><br />
<a href="http://www.problogger.net/archives/2008/07/24/googles-knol-a-wikipedia-killer-or-a-blog-killer/">Demerzel on Begging of Knol Spam</a><br />
<a href="http://natenead.com/google-knol-and-nofollow-or-dofollow-links-spam/">Nate Nead on Knol Dofollow/Nofollow</a><br />
<a href="http://usefularts.us/2008/08/06/pax-google-gives-the-future-of-spam-a-name-knol/">Usefularts on Knol Spam</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/08/14/knol-spam-two-weeks-later/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>PT Mentioned in Guardian</title>
		<link>http://www.plagiarismtoday.com/2008/08/07/pt-mentioned-in-guardian-again/</link>
		<comments>http://www.plagiarismtoday.com/2008/08/07/pt-mentioned-in-guardian-again/#comments</comments>
		<pubDate>Thu, 07 Aug 2008 16:00:54 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Housekeeping]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[google knol]]></category>
		<category><![CDATA[plagiarim]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[wiki]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1475</guid>
		<description><![CDATA[Plagiarism Today was honored this morning with a mention by the Guardian for its coverage of the recent Google Knol spam controversy.]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/08/guardian-logo.png" alt="guardian-logo.png" border="0" width="300" height="54" align="left" class="picleft" />I just wanted to quickly say welcome to the new readers who are arriving from the article published in <a href="http://www.guardian.co.uk/technology/2008/aug/07/searchengines.spam" title="UK Guardian on Knol">today&#8217;s Gardian regarding Google Knol</a>. It was a happy surprise this morning to wake up and find that they had included both a link and a mention about Plagiarism Today in their article.</p>
<p>For those who are interested in this topic, I will be writing and posting a follow up on Knol sometime in the coming week. I am still following the service. If you wish, you can <a href="http://feeds.feedburner.com/PlagiarismToday" title="Plagiarism Today RSS">subscribe to the RSS feed</a> and be notified when the article goes live, as well as get updates on other copyright and plagiarism matter.</p>
<p>My thanks goes out to the Guardian for thinking of me and I want to again welcome all of the new readers. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/08/07/pt-mentioned-in-guardian-again/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 12:32:04 -->
