<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todayduplicate-content | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/duplicate-content/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How Accurate is Google with Detecting Duplicates?</title>
		<link>http://www.plagiarismtoday.com/2012/01/09/how-accurate-is-google-with-detecting-duplicates/</link>
		<comments>http://www.plagiarismtoday.com/2012/01/09/how-accurate-is-google-with-detecting-duplicates/#comments</comments>
		<pubDate>Mon, 09 Jan 2012 19:11:47 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=12183</guid>
		<description><![CDATA[Google says that it's rare for duplicates to rank above original works in their search engine. But how accurate is that? One site aims to find out.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2012/01/analyzethis-logo-sample.jpg" alt="Analyze This Logo" title="Analyze This Logo" width="193" height="43" class="alignleft size-full wp-image-12185" />Recently, <a href="http://www.fonerbooks.com/selfpublishing/">Morris Rosenthal</a> brought to my attention a Russian site entitled <a href="http://analyzethis.ru/?&#038;location=ru&#038;lang=en">Analyzethis.ru</a>. It&#8217;s a site that analyzes the results of various popular search engines from around the world and looks at various metrics including how well they filter out adult content, how much spam they contain and so on. </p>
<p>One of the factors they look at is how well the search engines are at ranking original texts over the duplicates. This, of course, is of great interest to anyone who posts original content online as, inevitably, it will be copied (legitimately and otherwise) and it&#8217;s important that the search engines do a reasonable job favoring the original author. </p>
<p>However, <a href="http://analyzethis.ru/?analyzer=original&#038;location=ru&#038;lang=en&#038;interval=year">the results from AnalyzeThis.ru</a> are, simply put, less than inspiriting. Google, according to the site, gets it right about 57% of the time according to the most recent survey. Even scarier, this is after a year of drastic improvement, up from under 10% in June of last year. Even more discouraging is that Google is easily the best, way ahead of Bing, which is currently hovering at 7%. </p>
<p>To be clear, I take these numbers with a good deal of skepticism. The site doesn&#8217;t clearly explain <a href="http://analyzethis.ru/?&#038;location=ru&#038;lang=en">how it obtains its data</a> but, even if the tests aren&#8217;t perfect, they highlight the point that Google isn&#8217;t either. Even after it&#8217;s updates, in these tests, Google still managed to barely get half of the cases right. Even if we assume this test generated 90% false positives in this area, that still leaves just under 5% of all works being mislabeled by Google as duplicates.</p>
<p>It&#8217;s easy to see how, with so many content creators and so many different works, this can become a very big problem very quickly.</p>
<h4>Google&#8217;s Problem</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2012/01/graph-sample1-300x109.jpg" alt="AnalyzeThis RU Graph" title="AnalyzeThis RU Graph" width="300" height="109" class="alignright size-medium wp-image-12187" />For Google (and other search engines) the problem of duplicate content is a thorny one. When a user enters a search query. They don&#8217;t want to see ten pages with the same content, they usually want a variety of links to choose from. So, if a lot of pages on the Web have the same content, Google has to decide which one is the original and give it preferential treatment. </p>
<p>However, finding the original is easier said than done.</p>
<p>Google uses a variety of factors to determine which is the original such as the trust it has with the domain, the age of the site/page and the number of inbound links. However, these metrics aren&#8217;t perfect and, inevitably, Google gets it wrong from time to time.</p>
<p>Spammers, in turn, count on this. They often copy large amounts of content hoping that Google will believe at least some of it is original. Sadly, it&#8217;s a numbers game and the spammers, despite improvements, are still winning.</p>
<p>If these numbers are remotely accurate, it&#8217;s easy to see how, while Google has definitely made things harder on the spammers, they can still find great success copying content.</p>
<p>This further emphasizes how important it is for webmasters and content creators to be vigilant with their content and not put their faith in Google to keep them ahead of the bad guys.</p>
<h4>Bottom Line</h4>
<p>Just to be clear, I&#8217;m still very skeptical about these exact numbers. Without knowing more about the methodology, it&#8217;s impossible to be sure that they are accurate.</p>
<p>However, even with imperfect results, they highlight just how flawed Google is, even after the recent (and significant) improvements. </p>
<p>This makes it clear that, despite what Matt Cutts said, the scenario of a duplicate outranking an original work is not &#8220;highly unlikely&#8221; and seems to happen fairly regularly.</p>
<p>Obviously, this is a tough problem for search engines to crack and, until they do, webmasters need to be vigilant with their content.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2012/01/09/how-accurate-is-google-with-detecting-duplicates/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>The SEO Benefits and Drawbacks of Creative Commons</title>
		<link>http://www.plagiarismtoday.com/2011/09/22/the-seo-benefits-and-drawbacks-of-creative-commons/</link>
		<comments>http://www.plagiarismtoday.com/2011/09/22/the-seo-benefits-and-drawbacks-of-creative-commons/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 18:00:00 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Creative-Commons]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[linking]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[search engine optimization]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=10924</guid>
		<description><![CDATA[As bloggers and other webmasters mull over the issue of SEO many wonder if and how Creative Commons fits in with their SEO strategies.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/09/web-browser-sample-image-300x224.jpg" alt="Web Browser Image" title="Web Browser Image" width="300" height="224" class="alignleft size-medium wp-image-11152" />When it comes to content reuse, both legitimate and unlawful, one of the biggest and most pervasive concerns has been search engine optimization (SEO) issues. Basically, if and how the content reuse affects the way Google and the other search engines rank the content </p>
<p>The problem largely centers around the issue of what is known as duplicate content, or content that can be found at more than one place online. This is a problem because search engines typically like to display a variety of results for every query and routinely hide or drop down pages that are too similar to each other.</p>
<p>That being said, SEO is also often about linking, something that those who use Creative Commons works are usually obliged to do (when appropriate for the medium). This means that, while licensing your content via Creative Commons may raise a spectre of duplicate content, it also opens the doors to increased search engine trust.</p>
<p>So how does Creative Commons impact SEO? There&#8217;s no simple answer but there is definitely a lot of theory to pour over.<span id="more-10924"></span></p>
<h4>The SEO of Licensing Your Work</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/09/cc.logo_.large_-300x71.png" alt="CC Logo" title="Creative Commons Logo" width="300" height="71" class="alignright size-medium wp-image-11153" />When you license your work under a Creative Commons License, or any open license for that matter, you&#8217;re putting it out there with the intention that others will use it. That, in turn, means multiple copies of your work floating around on the Web, each competing for the same attention from the search engines.</p>
<p>That is what brings about the aforementioned duplicate content issue. Google doesn&#8217;t want to give the same article (or a very similar one) two slots on the same results page so one has to either be pushed into the supplemental index or to a much lower ranking. </p>
<p>Usually, Google attempts to make a determination about which article is the original and which is the duplicate. However, as they recently admitted, <a href="http://www.plagiarismtoday.com/2011/09/13/can-twitterfacebook-help-fight-online-plagiarism/">they aren&#8217;t perfect at doing that</a> as they sometimes index sites out of order. So, even though <a href="http://www.webpronews.com/google-busts-the-duplicate-content-myth-2009-09">Google claims claim that the duplicate content &#8220;penalty&#8221; doesn&#8217;t exist</a>, there&#8217;s still a real risk of being hurt in the rankings due to multiple copies of a work floating around.</p>
<p>However, Creative Commons uses of a work are typically supposed to come with a link back to the source as well as an acknowledgement of the license the content was used under. This is part of why <a href="http://www.plagiarismtoday.com/2007/06/05/using-creative-commons-to-stop-scraping/">Creative Commons can be such a powerful force to stop scraping</a>. </p>
<p>These signs are pretty clear indications to the search engines as to which site is the original and that is why the SEO risk of legitimate Creative Commons use is much less than with scraping and/or plagiarism.</p>
<p>Also, since Creative Commons reuse is most often done by humans and not automated scrapers, the original has much more time online before the duplicates appear, increasing the likelihood it has been indexed and discovered as the original, further reducing the dangers. </p>
<p>For images, this risk is significantly less. The reason is that Google doesn&#8217;t do quite as good of a job spotting duplicate content in image search results and doesn&#8217;t put as much emphasis on originality. A simple image search will often turn up many different versions of the same image, often just different sizes.  </p>
<p>That being said, Google is getting somewhat better at this and that may become an increasing fear as time goes on.</p>
<p>However, on the positive side, both images and text uses of CC-Licensed works can be used to generate inbound links and many SEO experts <a href="http://www.seobythesea.com/2006/10/building-links-with-creative-commons/">view Creative Commons as a great way to generate such links to your sites and profiles</a>.</p>
<p>This assumes, of course, that everyone using CC-Licensed work is playing by the rules but, those that aren&#8217;t are committing copyright infringement the same as if the work wasn&#8217;t Creative Commons at all.</p>
<p>In the end, the risks are likely minimal but the value of the reward will depend on how much you can gain from the inbound links, something that will vary wildly from site to site.</p>
<h4>The SEO of Using Creative Commons Content</h4>
<p>On the flip side of the coin, you know instantly when you use a Creative Commons article or blog post that you&#8217;re getting something that&#8217;s been posted elsewhere and probably for some length of time. </p>
<p>As such, there&#8217;s probably little expectation that the article will be of much SEO value as Google will spot it as a duplicate and treat it as such. <a href="http://www.seroundtable.com/google-farmer-advice-13090.html">This is especially true since the Farmer/Panda updates</a> that came down hard on content farms and other sites seen has having &#8220;low quality&#8221; and duplicate content. </p>
<p>That being said, images may provide some SEO benefit as they can often be picked up despite being duplicates. This is especially true if they are resized prior to being reposted.</p>
<p>However, if you use Creative Commons content on your site, SEO benefit probably should not be your primary motivation. Typically, it&#8217;s more about filling blank spots in your posting schedule and getting useful content out to your existing readers, rather than trying to obtain some SEO benefit.</p>
<p>For the purpose of SEO, original and compelling content is almost always best.</p>
<p>This isn&#8217;t to say <a href="http://www.seo-gold.com/google-duplicate-content-penalty.html">you can&#8217;t rank well with duplicate content</a>, but the odds are slim and the next shift in the winds at Google could blow away your efforts.</p>
<p>Instead, if you want SEO benefit from Creative Commons content, your best bet is, most likely, to find content you can create derivative works from and use the content as a building block to make something new. </p>
<p>That, in turn, will be much more compelling to the search engines than a mere copy/paste.</p>
<h4>Bottom Line</h4>
<p>In the end, the SEO risks of licensing your content via Creative Commons are fairly low. Though duplicate content is real concern, legitimate CC-licensed use is far less likely to harm your site than scraping, plagiarism and other forms of non-attributed use. </p>
<p>That being said, Google isn&#8217;t perfect and mistakes will happen, but most likely there&#8217;s not much harm that can come from licensing your content this way, at least from an SEO-only standpoint, and given the potential for inbound links there is at least the potential for some gain.</p>
<p>Still, the choice of using a Creative Commons License is a personal one and, even if the SEO risks are low it still isn&#8217;t right for everyone. </p>
<p>Basically, this is just one factor of many to weigh and, even then, for some the risks may not outweigh the rewards, especially for those who already have a ton of inbound links and have little use for a few more. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2011/09/22/the-seo-benefits-and-drawbacks-of-creative-commons/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Can Blekko Help Detect Copied Content?</title>
		<link>http://www.plagiarismtoday.com/2010/11/30/can-blekko-help-detect-copied-content/</link>
		<comments>http://www.plagiarismtoday.com/2010/11/30/can-blekko-help-detect-copied-content/#comments</comments>
		<pubDate>Tue, 30 Nov 2010 18:07:59 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[blekko]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Search-Engines]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=8426</guid>
		<description><![CDATA[The new search engine Blekko offers a new way to detect plagiarism and other duplicate content, but how well does it work?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/11/blekko-logo.jpg" alt="Blekko Logo Image" title="Blekko Logo Image" width="181" height="53" class="alignleft size-full wp-image-8448" /><a href="http://blekko.com/">Blekko</a> is a new search engine that is aiming at the search leaders, including Google, by offering a more open and more spam-free search experience. With a tagline &#8220;Slash the Web&#8221; <a href="http://blekko.com/ws/+/about">Blekko has laid down an Internet searcher&#8217;s bill of rights</a> that encourages users to create &#8220;slashes&#8221; and that will customize what appears in their results.</p>
<p>For example, if you search for &#8220;Phones&#8221; and add the /android slash you&#8217;ll only get results for related to the Android operating system.  Likewise, you can use slashed to manipulate the results in various ways, including selecting a date range, a political slant or only certain kinds of sites (forums, blogs, etc.).</p>
<p>Two of the more interesting slashes are <a href="http://www.labnol.org/internet/check-online-plagiarism/18120/">/duptext and /domainduptext</a>, which supposedly will check either a page or a domain&#8217;s content to find where it is being duplicated and how it is being misused. For webmasters, this could mean a powerful new tool for tracking duplicate content on the Web and tracking down those who are misusing their work.</p>
<p>So, as with other systems, I put it to the test and was, in a word, disappointed with the results. Though I think Blekko has a lot of potential in other areas, it doesn&#8217;t seem that duplicate content detection is one of its better uses, at least not at this time.<span id="more-8426"></span></p>
<h4>How Blekko&#8217;s Plagiarism Checker Works</h4>
<p>Using Blekko&#8217;s duplicate content detection system is actually fairly easy. All one has to do is search for the URL they want to check the content of and then add the /duptext tag to the end of the URL.</p>
<p>For example:</p>
<blockquote><p><a href="http://blekko.com/ws/http://www.plagiarismtoday.com/2010/11/23/5-rules-for-the-next-plagiarism-scandal/+/duptext">http://www.plagiarismtoday.com/2010/11/23/5-rules-for-the-next-plagiarism-scandal/ /duptext</a></p></blockquote>
<p>You can do this with any page on the Web and the results are usually presented in a few seconds.</p>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2010/11/blekk-example.jpg" alt="Blekko Example" title="Blekko Example" width="511" height="192" class="alignnone size-full wp-image-8427" /></p>
<p>As you can see, it breaks out the information by hosts and URLs and, from there, based on those that are on-site and off-site. Below the chart is a list of links where the duplicate content is present.</p>
<p>You can also check an entire domain for duplicate content by looking for just the domain and adding &#8220;/domainduptext&#8221; to the end. For example.</p>
<blockquote><p><a href="http://blekko.com/ws/plagiarismtoday.com+/domainduptext">plagiarismtoday.com /domainduptext</a></p></blockquote>
<p>However, with this slash you get significantly less information, basically just a list of domains where your duplicate content is suspected of appearing and links to their SEO pages. </p>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2010/11/blekk-example2-500x193.jpg" alt="Blekko Example 2" title="Blekko Example 2" width="500" height="193" class="alignnone size-large wp-image-8428" /></p>
<p>The question, however, is &#8220;How well does it work?&#8221; Unfortunately, after a few searches, the answer appears to be a disappointing one.</p>
<h4>Testing it Out</h4>
<p>As is typical with my tests, I decided to have Blekko do a duplicate content check on several works with a relatively known amount of plagiarism, 2 poems, one short story and one post on Plagiarism Today.</p>
<p>Here are the results of those tests:</p>
<h4>Test 1: Poem 1</h4>
<p>I tried out Blekko on an old poem of mine that I knew had seen widespread copying, both plagiarized and attributed. However, after performing the search, <a href="http://blekko.com/ws/http://www.ravensrants.com/friends-or-lovers/+/duptext">Blekko failed to find a single copy of the poem on any other site</a>, even though <a href="http://www.google.com/search?sourceid=chrome&#038;ie=UTF-8&#038;q=%22Though+our+voices+could+call+out+in+comfort%22">a simple Google search finds about 40 results</a>, though many are admittedly duplicates.</p>
<p><strong>Blekko Results:</strong> 0 <strong>Google Results:</strong> 40</p>
<h4>Test 2: Poem 2</h4>
<p>Testing with another poem produced very similar results. However, this time <a href="http://blekko.com/ws/http://www.ravensrants.com/ghost-within-my-mind/+/duptext">Blekko didn&#8217;t even find duplicates on my site</a> and instead simply indicated that there were no duplicates at all. However, once again, <a href="http://www.google.com/search?sourceid=chrome&#038;ie=UTF-8&#038;q=%22like+vapors+disappearing+before+my+eyes%22#q=%22like+vapors+disappearing+before+my+eyes%22&#038;hl=en&#038;safe=off&#038;prmd=iv&#038;ei=Y-DzTJH7A8H_lgeqxOmBDQ&#038;start=60&#038;sa=N&#038;filter=0&#038;fp=5831956345d34357">a simple Google search turned up about forty results though</a> though, as with before, many were duplicates or copies on my domain.</p>
<p><strong>Blekko Results:</strong> 0 <strong>Google Results:</strong> 39</p>
<h4>Test 3: Story</h4>
<p>Following the lack of luck with the two poems, I then tried an old short story of mine that had seen a small amount of copying. However, once again, <a href="http://blekko.com/ws/http://www.ravensrants.com/soulripper/+/duptext">Blekko failed to find any results that were not on my domain</a> and <a href="http://www.google.com/search?sourceid=chrome&#038;ie=UTF-8&#038;q=%22However,+by+lunchtime+the+sun+had+come+out+and+most+of+the+snow+had+melted+away.%22#q=%22However%2C+by+lunchtime+the+sun+had+come+out+and+most+of+the+snow+had+melted+away.%22&#038;hl=en&#038;safe=off&#038;prmd=iv&#038;filter=0&#038;fp=5831956345d34357">a quick Google search turned a duplicate of the story</a> on a DeviantArt account.</p>
<p><strong>Blekko Results:</strong> 0 <strong>Google Results:</strong> 1</p>
<h4>Test 4: PT Post</h4>
<p>Finally, I tried an old, popular post from Plagiarism Today to see how well its content was detected. However, once again, <a href="http://blekko.com/ws/http://www.plagiarismtoday.com/2006/12/07/what-porn-can-teach-us-about-piracy/+/duptext">Blekko failed to return any results</a> and <a href="http://www.google.com/search?sourceid=chrome&#038;ie=UTF-8&#038;q=%22No+one+seems+to+be+able+to+go+agree+on+exactly+how+much+of+the+traffic+on+file+sharing%22">Google found a duplicate version of the piece</a> on what appears to be a BlogSpot spam blog (one I was previously unaware of too).</p>
<p><strong>Blekko Results:</strong> 0 <strong>Google Results:</strong> 1</p>
<h4>Test 5: Whole Domain</h4>
<p>Finally, in a bid to see what would happen if I ran my entire old literature domain through Blekko using the /domainduptext slash, it <a href="http://blekko.com/ws/ravensrants.com+/domainduptext">found only 6 offsite domains and 11 offsite URLs</a>, even though many individual pieces see more reuse than that. It was missing many domains with widespread reuse of my work (legitimate and plagiarized) including blogspot.com, myspace.com and deviantart.com to name just a few.</p>
<p>Worse still, I couldn&#8217;t examine any of the individual links as clicking the link provided by Blekko just took me to the SEO page for that domain, not to a list of suspect URLs on the site or even to the domain itself.</p>
<p>In short, if I wanted to find out exactly how my content was used on these sites, it was up to me to find it.</p>
<p><strong>Blekko Results:</strong> 11 <strong>Google Results:</strong> N/A</p>
<h4>Other Issues</h4>
<p>It became pretty clear that Blekko was missing a lot of duplicate content with its searches. My suspicion is that its because it tries to hone in only what it considers the best sites and cuts out spam blogs and other sites it deems to be of low value.</p>
<p>While this may be great for searchers, it creates a real problem when checking for duplicate content as these are often the exact sites you need to find. </p>
<p>However, that can&#8217;t be the only cause of the problem. <a href="http://blekko.com/ws/%22Though+our+voices+could+call+out+in+comfort%22">If you use Blekko to do search for quotes from the relevant pieces</a>, you get much more respectable results. Though the results aren&#8217;t nearly as good as Google in this area, they are definitely much more useful than via either of the slashes.</p>
<p>But the biggest problem is what one does after they find content reuse via Blekko. With the /domainduptext slash you can&#8217;t even access the individual URLs to investigate further. Using the /duptext slash is a much more robust tool, taking you to a page where the duplicate content is highlighted, but in the pages I did check the results were hit and miss, as many as half of the pages linked had no duplicate content at all.</p>
<p>All in all, as useful as Blekko is for other kinds of searches, or at least as useful as it might be, it doesn&#8217;t handle duplicate content searches very well, certainly no better than Copyscape or even regular Google.</p>
<h4>Bottom Line</h4>
<p>None of this is meant to be a slight against Blekko in any other regard. The other searches I did with it were actually pretty useful and, though I wasn&#8217;t swayed enough to change my default search engine, I did enjoy a lot of what Blekko had to offer and can see myself making some slashes for my use.</p>
<p>In the end though, it just isn&#8217;t a good tool for detecting plagiarism, copyright infringement or other kinds of duplicate content. Though the idea is solid and its integration with other SEO functions very appealing to some, it just isn&#8217;t accurate or complete enough at this time.</p>
<p>Still, as with  other tools I&#8217;ve reviewed, there is hope for the future. But it remains to be seen if this will be a priority for Blekko, which is clearly targeting a more generic search audience. Duplicate content detection is a high-specialized skill and the tools to find a keyword on the Web aren&#8217;t the same as the ones to find an article on every site it appears. </p>
<p>As such, this will most likely remain a nice idea by a decent search engine that just isn&#8217;t practical.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2010/11/30/can-blekko-help-detect-copied-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Stopping Self Content Theft</title>
		<link>http://www.plagiarismtoday.com/2010/02/03/stopping-self-content-theft/</link>
		<comments>http://www.plagiarismtoday.com/2010/02/03/stopping-self-content-theft/#comments</comments>
		<pubDate>Wed, 03 Feb 2010 16:00:14 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[self-plagiarism]]></category>
		<category><![CDATA[Social-Networking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=5488</guid>
		<description><![CDATA[Though we should always be diligent with monitoring how others use our content, we should be equally carefully about how how use our own work. ]]></description>
			<content:encoded><![CDATA[<table align="left" cellspacing="15">
<tbody>
<tr>
<td><a href="http://www.flickr.com/photos/35052582@N04/3307764859/" title="Office Space" target="_blank"><img src="http://farm4.static.flickr.com/3654/3307764859_14a6c03892_m.jpg" alt="Office Space" border="0"></a><br /><small><a href="http://creativecommons.org/licenses/by-sa/2.0/" title="Attribution-ShareAlike License" target="_blank"><img src="http://www.plagiarismtoday.comwp-content/uploads/2010/02/cc.png" alt="Creative Commons License" border="0" width="16" height="16" align="absmiddle"></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a href="http://www.flickr.com/photos/35052582@N04/3307764859/" title="WallTea" target="_blank">WallTea</a></small></td>
</tr>
</tbody>
</table>
<p>Feeding Google&#8217;s insatiable appetite for content is on of the main reasons why infringers scrape and plagiarize content and also one of the biggest reasons why it is important to monitor and, in many cases, defend against it.</p>
<p>The logic is simple enough, the more copies of a work that appear, especially without proper attribution, the less likely that the search engines will give credit to the original source. This can erode search engine performance, especially for smaller and less-established sites or those in highly-competitive fields.</p>
<p>However, duplicate content doesn&#8217;t just come from plagiarists and spammers, it also comes from oneself and our own actions when dealing with our own content. Some of it is errors within our site, some of it is in how we approach social networking and social news.</p>
<p>So even as we are enforcing our rights elsewhere, we have to be careful about how we use our own works. Though it might not be infringement, it can certainly have a very negative impact on you and your site and is worth dealing with all the same.<span id="more-5488"></span></p>
<h4>Starting at Home</h4>
<p>The first steps to dealing with duplicate content have to start on your own site or blog. Many people don&#8217;t realize how many opportunities there are to create duplicate content on your site, even by pure accident. </p>
<p>Consider the following examples from a simple blog:</p>
<ol>
<li><strong>Tag Pages:</strong> Tag pages have much of the same content as individual post pages and are generated by most blogging applications.</li>
<li><strong>Archive Pages:</strong> Monthly, yearly and other archive pages, similar to tag pages, have the same content, or significant portions of it, repeated.</li>
<li><strong>Category Pages:</strong> As with Archives and Tags, category pages repeat content.</li>
<li><strong>Printable Pages:</strong> Many themes include printable versions of content pages that can be indexed as duplicates.</li>
<li><strong>Comment Pages:</strong> Finally, depending on the way comments are set up, a separate page with duplicate content can be created for the version with comments.</li>
</ol>
<p>Depending on how your blog is set up, it is entirely possible that your article appears six times or more on your site. Google, and other search engines, have to make a decision about which page is the best page and link to it. However, it doesn&#8217;t always make the right decision and, in extreme cases, can even decide that the site is spamming and either lower its ranking or remove it.</p>
<p>Thus, it is important to make sure that you keep this duplicate content to a minimum and do your part to let the search engines know what you want them to link to. Here are a few tips:</p>
<ol>
<li><strong>Show Summaries:</strong> When possible, only use article summaries and link to the full article. There is no reason for your tag, archive or category pages to display the full text of every entry.</li>
<li><strong>Use Robots.txt:</strong> Use your robots.txt file to prevent search engines from indexing unneeded pages, such as your printable pages. However, <a href="http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html">use caution with this method</a>.</li>
<li><strong>Use Canonical Tag:</strong> <a href="http://searchengineland.com/canonical-tag-16537">Google, Yahoo and Bing all support the canonical tag</a>, which tells search engines which page is the best to include in the index.</li>
<p>In short, be very clear what versions of your content are ideal and try to keep the duplicates to an absolute minimum. Doing so will greatly help search engines tell which page to link to, helping both you provide a better service.</p>
<h4>Away from Home</h4>
<p>The other problem with self-defeating content use lies away from the home site. Where once an entire person&#8217;s presence was in their home page, now it can be scattered all over the Web, including other sites they run and social networking sites that they integrate with and use.</p>
<p>While it may seem like a great idea to post your content on every site you take part in, it can confuse the search engines. You want your efforts in social media to support your search engine strategy, not replace your original site. However, many people unwittingly do exactly that.</p>
<p>Here are a few ways to avoid that:</p>
<ol>
<li><strong>Unique Content for Each Site:</strong> If you run multiple sites, you need to have unique content for each. You can use snippets of content to cross promote and certainly link between them, but don&#8217;t repost everything. It confuses search engines and readers alike.</li>
<li><strong>Use Snippets:</strong> When posting your content on other sites, use snippets and link to the original works. The likelihood of this replacing your content, in human or search engine eyes, is slim to none.</li>
<li><strong>Require Links:</strong> Whenever any content of yours appears on another site, even in snippet form, request links back to the original, specifically SE-friendly ones.</li>
</ol>
<p>In short, be careful how you use your content. Though linked use isn&#8217;t likely to hurt you with the search engines, if you aren&#8217;t careful you can really eat up your own site by spreading your work too thin, too carelessly.</p>
<h4>Bottom Line</h4>
<p>When we think of content reuse, we think of what others do with our work. However, the fact is we are all the biggest reusers of our own work and, perhaps, the most important.</p>
<p>Though we can and should track how others use our content, as well as prevent uses that are against our wishes, it is also important to keep an eye on ourselves and make sure that our actions are working for us and with our strategy.</p>
<p>As with anything else in life, the best place to start your content strategy is by looking at yourself and your own actions, after all, you are your own biggest customer.</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2010/02/03/stopping-self-content-theft/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Workfriendly Goes Offline</title>
		<link>http://www.plagiarismtoday.com/2008/07/10/workfriendly-goes-offline/</link>
		<comments>http://www.plagiarismtoday.com/2008/07/10/workfriendly-goes-offline/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 14:36:24 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[plagiarim]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[search spam]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[workfriendly]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1295</guid>
		<description><![CDATA[Safe-surfing site and "accidental scraper" Workfriendly is now offline after more than two years of pushing duplicate content into Google. ]]></description>
			<content:encoded><![CDATA[<p><IMG SRC="http://www.plagiarismtoday.com/images/workfriendlylogo1-20080710-093524.png" alt="Workfriendly Logo" align="left" class="picleft">Workfriendly, a site previously reported on Plagiarism Today <a href="http://www.plagiarismtoday.com/2007/11/09/workfriendly/" title="Workfriendly an Accidental Scraper">back in November 2007</a> and again in <a href="http://www.plagiarismtoday.com/2008/04/08/workfriendly-yet-another-issue/" title="Another Workfriendly Issue">April of this year</a>, stopped functioning sometime within the past few days, bringing an end to the problems it created for many Webmasters.</p>
<p>The site currently is just a &#8220;parked&#8221; domain page running ads for the domain&#8217;s registrar, GoDaddy. According to the <a href="http://whois.domaintools.com/workfriendly.net" title="Workfriendly Whois">whois information for the site</a>, the domain was &#8220;updated&#8221; on the eighth, indicating that it possibly expired and was transferred to another owner. </p>
<p>Workfriendly attempted to disguise Web surfing as a Microsoft Word document by formatting Web pages to appear as text in a Word file while bordering the site content with a fake border designed to look like the application. This was supposed to make it &#8220;safer&#8221; to surf at work as it would raise less suspicion should anyone see your monitor.</p>
<p>The site created problems, however, when it allowed search engines to index its modified pages, injecting many thousands of of pages worth of duplicate content into Google. It also created headaches by not obeying certain meta tags, causing links to break on some sites and for Google to report those errors as broken links on the original domain.</p>
<p>It is unclear at this time if the outage is temporary or permanent, however, the site has been down for at least two days, making a temporary outage increasingly unlikely. </p>
<p><strong>Hat tip:</strong> Special thanks to <a href="http://www.sciencebase.com/">David Bradley of Sciencebase</a> (stupid typos, thanks for the catch!) for letting me know that Workfriendly is not working</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/07/10/workfriendly-goes-offline/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Modified Scraping on the Rise</title>
		<link>http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/</link>
		<comments>http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/#comments</comments>
		<pubDate>Thu, 08 Nov 2007 22:33:21 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Blogspot]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Search-Engine-Spam]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogs]]></category>
		<category><![CDATA[synonymized]]></category>
		<category><![CDATA[thesaurus]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/</guid>
		<description><![CDATA[It appears that Google&#8217;s push to handle duplicate content may be having an unintended side effect. Even though a recent report by Attributor indicates that the search engine has done a terrible job separating originals from copies, the spammers don&#8217;t seem to be taking any chances. Spam bloggers are no longer content on scraping entries...]]></description>
			<content:encoded><![CDATA[<p>It appears that Google&#8217;s <a href="http://www.plagiarismtoday.com/2006/12/21/google-addresses-duplicate-content/">push to handle duplicate content</a> may be having an unintended side effect. </p>
<p>Even though a <a href="http://www.plagiarismtoday.com/2007/11/06/attributor-launches-content-monitoring-service/">recent report by Attributor</a> indicates that the search engine has done a terrible job separating originals from copies, the spammers don&#8217;t seem to be taking any chances. </p>
<p>Spam bloggers are no longer content on scraping entries and republishing, but are modifying them in a variety of ways. On <a href="http://www.blogherald.com">The Blog Herald,</a> another site I write for, <a href="http://www.blogherald.com/2007/11/07/sploggers-get-craftier-or-should-i-say-sploggers-son-cada-vez-mas-complicado/">editor Tony Hung wrote about</a> a site that seemed to either be synonymizing or double translating content from the site. On tenforty, blogger Deb <a href="http://tenforty.blogspot.com/2007/10/basic-blogging-etiquette-dos-and-donts.html">wrote about a case</a> where her story was translated into another language before being reposted on a spam Blogspot blog.</p>
<p>What started out as a rare phenomenon is now turning into a regular occurrence. Unfortunately, as the tactics of spammers change, so must the tactics of those who seek to protect their content and this calls for a new look at how we protect our works on the Web.</p>
<p><span id="more-724"></span><strong>Background</strong></p>
<p>The technology behind modified scraping has been around for several years. I <a href="http://www.plagiarismtoday.com/2005/12/05/synonymized-plagiarism-a-new-threat/">first wrote about it</a> in December of 2005. Back then the problem was fairly rare and the concept was still somewhat new. However, it seems as if more and more spammers are catching on to the trick and, possibly, that new spam blog networks are cropping up to take advantage of the technique.</p>
<p>The idea is that posting verbatim copied works is dangerous. Not only are you likely to get caught and shut down, but Google and other search engines assign penalties to duplicate content, making it harder to get those search results needed to make spamming profitable.</p>
<p>Since editing content by hand takes too long and defeats the purpose of scraping, spammers have started creating ways of modifying or &#8220;spinning&#8221; content before reposting it in order to fool the search engines into thinking that spam site is actually both legitimate and original.</p>
<p>To achieve that, they use one of several techniques including, but not limited to, the following:</p>
<ol>
<li><strong>Synonymized Content:</strong> The most basic approach takes an article and swaps out occasional words for synonyms according to a built-in thesaurus. Such a system can actually create hundreds or thousands of articles from a single source by using different combinations of synonyms.</li>
<li><strong>Translated Content:</strong> This approach runs the content through an automatic translation program similar to what you find on <a href="http://babelfish.altavista.digital.com/babelfish/tr">Babelfish</a>. Though the translations are far from perfect and leave the work in a foreign language, it is usually intelligible by humans and search engines alike. This will likely become a more popular technique as the Web gains more of an audience in non-English-speaking countries and those markets become more valuable for spammers.</li>
<li><strong>Double Translated Content:</strong> The same as translated content, but this kind translates the translation back into English. This produces a heavily modified and often unintelligible outcome that bears little resemblance to the original in many cases. This type of scraper is purely theoretically at this time but very likely does exist.</li>
</ol>
<p>In all of the above cases, the outcome is the same, the scraped article bears little resemblance to the original, making it much more difficult to detect and to stop, both for the victims and for the search engines.</p>
<p><strong>Changing Strategies</strong></p>
<p>The problem for bloggers when dealing with this type of plagiarism is that the typical methods of detection simply don&#8217;t work. <a href="http://www.copyscape.com">Copyscape</a>, though <a href="http://www.plagiarismtoday.com/2007/10/02/copyscape-improved-again/">drastically improved</a>, will struggle with this kind of scraping as will all other plagiarism checkers that work by looking for verbatim copying. This includes <a href="http://www.turnitin.com">high-end academic solutions</a>.</p>
<p>Even <a href="http://google.com/alerts">Google Alerts</a> can be thwarted by this if the phrase being searched for is modified in the process of spinning the article. Though <a href="http://www.plagiarismtoday.com/2006/06/05/product-preview-blogwerx-sentinel/">Blogwerx</a> is working on a product that can detect synonymized scraping, it is clear that any system to search for this kind of abuse is going to require a great deal of power and, most likely, some expense to the user.</p>
<p>The focus then becomes not abandoning the old ways of detecting plagiarism, but on adding new ways to guard against this threat. Those methods should include the following:</p>
<ol>
<li><strong>Digital Fingerprinting:</strong> I&#8217;ve been beating on the <a href="http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/">digital fingerprinting wardrum</a> for some time but such fingerprints are the most natural defense against modified scraping. Fingerprints, if done right, have no synonyms and no translations. They will remain intact no matter how the article is spun and can easily be searched.</li>
<li><strong>Uncommon Uses:</strong> Since FeedBurner doesn&#8217;t rely on text detection to <a href="http://www.plagiarismtoday.com/?p=183">determine who is using your feed and where</a>, their tools will remain effective even if the post is modified, that is, so long as FeedBurner&#8217;s code is not removed.</li>
<li><strong>Using Names:</strong> If you can&#8217;t use the digital fingerprint plugin you can create your own by either entering your own fingerprint in the footer of all your posts, editing your RSS template or simply using your name, if very unique, at the top of your works. Like fingerprints, names do not have easy translations or synonyms and are unlikely to be altered. Even better, vanity searches can let you know who else is talking about your work.</li>
</ol>
<p>In short, it is important to find elements that do not have easy translations or synonyms and focus on searching for those. These methods can, and should, be used in addition to other searching techniques to ensure that more human plagiarists or other kinds of scrapers, such as search engine scrapers, are also detected.</p>
<p>Even though these methods do not provide a perfect defense to modified scraping, it is a step in the right direction.</p>
<p><strong>Conclusions</strong></p>
<p>The good news with this kind of spam blog is that the risk being penalized in the search engines for being a victim of scraping goes down drastically. Since spammers avoid any potential duplicate content penalty, you do too.</p>
<p>However, none of this says that the scrapers won&#8217;t target keywords similar to your own and then use your own content to beat you in the results. That type of abuse might, in fact, be more likely than ever considering that Google will also not recognize the spam blog as junk and discard it.</p>
<p>As a result, even if we discount the emotional reasons for fighting plagiarism, there is still a great deal of need to monitor our content and ensure that those who make use of it, do so in an acceptable way.</p>
<p>Unfortunately, that will likely remain a game of cat and mouse for many years to come as the plagiarists and scrapers are rapidly changing their techniques to adapt to new situations. Clearly, we have to do the same.</p>
<p>Fortunately, at this phase, the adaptations are not that difficult but the future remains much less certain. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Update: Copyscape Drastically Improved</title>
		<link>http://www.plagiarismtoday.com/2007/09/18/update-copyscape-drastically-improved/</link>
		<comments>http://www.plagiarismtoday.com/2007/09/18/update-copyscape-drastically-improved/#comments</comments>
		<pubDate>Tue, 18 Sep 2007 17:27:50 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[copyscape]]></category>
		<category><![CDATA[duplicate-content]]></category>
		<category><![CDATA[duplicate-content-filter]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[google-api]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/09/18/update-copyscape-drastically-improved/</guid>
		<description><![CDATA[Since my previous review of Copyscape Premium, I&#8217;ve been communicating with Gideon Greenspan, one of the co-founders of Copyscape, about the issues I experienced with the service. He, and the others who work on Copyscape, have been very interested in my results and in improving their service. They repeated my tests with roughly the same...]]></description>
			<content:encoded><![CDATA[<p>Since my <a href="http://www.plagiarismtoday.com/2007/08/28/copyscape-premium-an-epic-letdown/">previous review of Copyscape Premium</a>, I&#8217;ve been communicating with Gideon Greenspan, one of the co-founders of <a href="http://www.copyscape.com">Copyscape</a>, about the issues I experienced with the service. He, and the others who work on Copyscape, have been very interested in my results and in improving their service. </p>
<p>They repeated my tests with roughly the same results and discovered that there is an issue with the <a href="http://code.google.com/">Google API</a>, which they use as their backend, that was limiting its usefulness at detecting plagiarism in works that had been lifted many different times.</p>
<p>I received an email from Greenspan yesterday telling me that they&#8217;ve made adjustments on their end and they have greatly improved Copyscape&#8217;s ability to detect widely plagiarized works.</p>
<p>Eager to see if it worked, I opened up my Copyscape Premium account and decided to give it another try. The differences were immediately clear.</p>
<p><span id="more-657"></span><strong>A Definite Change</strong></p>
<p>To retest Copyscape Premium, I decided to try the service again with two of the poems I used in my original experiment.</p>
<p>In my first run, neither poem produced any results and subsequent tests since then only turned up one or two copies. This is despite the fact that both works are widely available on the Web, both as legitimately licensed copies and as plagiarized works.</p>
<p>When I analyzed the first poem, using the print page to avoid scanning comments (at Greenspan&#8217;s suggestion), Copyscape picked up ten results. The second one, using the same method, picked up fifteen.</p>
<p>To call the difference noticeable would be an understatement. Copyscape had gone from being completely ineffective in my experiments to producing some very powerful results.</p>
<p>First Run:</p>
<p><img src='http://www.plagiarismtoday.com/wp-content/uploads/2007/09/ccbefore1.png' alt='ccbefore1.png' /></p>
<p>Second Run:</p>
<p><img src='http://www.plagiarismtoday.com/wp-content/uploads/2007/09/ccafter1.png' alt='ccafter1.png' /></p>
<p><strong>Limitations</strong></p>
<p>As great as the improvements to Copyscape are, the results do have a few caveats.</p>
<p>First, a text search for one of the poems only produced four results. Still, far more than previous, but far less than using the printable page. The reasons for this are unclear.</p>
<p>The second, and largest, caveat is that the results were still only a fraction of the actual copies available. When I did a Google search for the two poems, the first produced 25 results, the second produced 36.</p>
<p>All in all, that leaves over half of all the copies of the test works undetected by Copyscape.</p>
<p>However, according to Greenspan, the fact that my works were so heavily plagiarized and copied was, and still is, part of the problem. Since Google tends to put duplicates of a work in their duplicate content filter, which in turn seems to hide it from the Google API, works that are copied less or are paired with other content may fare much better.</p>
<p>Unfortunately, this appears to be an issue on Google&#8217;s end and is not something Copyscape can easily correct.</p>
<p><strong>Conclusions</strong></p>
<p>It is clear that the steps Copyscape took to tweak their service had a drastic impact. Though it is still somewhat limited in how it handles widely-plagiarized works, it may be effective enough for many to consider using.</p>
<p>After all, even though Copyscape does not seem to detect every single use of the work, it does offer greater convenience and tools to help monitor and track plagiarism cases. Whether or not this trade off is worth the cost involved in using the premium service is a decision best left up to individual Webmasters.</p>
<p>Personally, I&#8217;m going to continue to watch the service and test it. Greenspan has said there may be future adjustments that could improve it even more. For me, Copyscape is reaching the tipping point where the benefits start to outweigh the drawbacks, however, I still have to hold back from giving an unrestrained recommendation since so many copies were still undetected.</p>
<p>The best thing to do right now is consider the content you post, the nature of the plagiarism you&#8217;ve faced and see if Copyscape Premium is a good fit. </p>
<p>It may not be right for everyone, but there are now many people who it may be right for. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/09/18/update-copyscape-drastically-improved/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 11:24:51 -->
