<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todayspinning | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/spinning/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Spinning, Spamming and Twitter</title>
		<link>http://www.plagiarismtoday.com/2009/06/16/spinning-spamming-and-twitter/</link>
		<comments>http://www.plagiarismtoday.com/2009/06/16/spinning-spamming-and-twitter/#comments</comments>
		<pubDate>Tue, 16 Jun 2009 17:43:05 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[sammers]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[spinning]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=3789</guid>
		<description><![CDATA[Twitter spams for a new set of "Spinning" have been popping up all over the Web. They claim to help keep your content safe, but is it all just spin. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/06/spinner-ick-300x190.jpg" alt="spinner-ick" title="spinner-ick" width="300" height="190" class="alignleft size-medium wp-image-3790" /></p>
<p>If you follow copyright or related keywords on Twitter, you&#8217;ve probably been seeing a decent amount of Twitter spam related to a series of &#8220;Spinner&#8221; apps (I am not linking to any of them in this article) that promise to, among other things, protect your content from being misused.</p>
<p>The problem is that these applications do nothing of the sort. Not only are the creators of this product (it appears to be one product for different platforms) spamming Twitter, but the product they are using is more likely to be used for the purpose of misusing content than ever protecting it.</p>
<p>The reason is because of the way spinning applications work, what they actually do and the market they are actually targeted for.</p>
<p>To explain that, we have to take a deeper look at spinning applications and how they work.<span id="more-3789"></span></p>
<h4>What Spinning Is</h4>
<p>The idea behind content spinning is very simple. In the English language, most words have synonyms that mean something similar, though not necessarily the exact same thing. Thus, by using combinations of synonyms, you can spin the same section of content into several unique items.</p>
<p>Take for example this simple sentence:</p>
<blockquote><p>The cat rode on the boat.</p></blockquote>
<p>If we add synonyms for &#8220;cat&#8221;, &#8220;rode&#8221; and &#8220;boat&#8221; we can get several different variations including:</p>
<blockquote><p>The feline used the ship.<br />
The tabby floated on the trawler.<br />
the calico sailed on the vessel.</p></blockquote>
<p>You can also get about a dozen other sentences using combinations of those three synonyms. For example:</p>
<blockquote><p>The tabby sailed on the ship.<br />
The feline floated on the vessel.<br />
Ect. Etc. Etc.</p></blockquote>
<p>Every one of these sentences mean roughly the same thing, but by replacing a few words with close synonyms, they appear to be unique.</p>
<p>If you apply this over the course of an entire article, say a 500 word post, you can get thousands of iterations the same work, each of which are unique to both the reader and to the search engines. </p>
<p>The latter audience is the reason most use spinning software. Due to the way search engines detect duplicate content, most would not be able to see any measurable similarity between the sentences above. In short, spinning allows you to take one article and covert it into thousands of new ones, each of which, in the eyes of Google, will be relatively unique.</p>
<p>The problem with spinning is that it rarely results in good writing. When we create a post, we usually choose what we feel are the best words among the synonyms. Altering those words, either automatically using a thesaurus, which may spinners do, or by hand, as the ones above do, results in lower-quality words being used.</p>
<p>This actually turns away human visitors and, for the most part, makes spinning only useful for for spam operations. It&#8217;s a fast, easy way to create thousands of low quality, but unique, pages for Google.</p>
<h4>Why it won&#8217;t Help</h4>
<p>Spinning won&#8217;t help anyone protect their content. According to the marketing material, these spinning apps can assist by letting you quickly rewrite an article or produce multiple versions of it in the event it is lifted. So, for example, if a scraper were to grab this article, I could just spin it, repost it and the damage is negated. </p>
<p>This has three major problems. First, rather than dealing with the infringement, either through legal means or by simply producing more content and trouncing the copycat the search results, you&#8217;re giving them the better version of the content and the keywords that came with it. Second, even if you don&#8217;t concede the original and keep it up, you&#8217;re now spamming Google with thinly veiled duplicate content. If it is detected, it will be dealt with.</p>
<p>The biggest problem is that, if you want to rewrite an article to avoid duplicate content issues, you can do so easily without the aid of a spinner program. The effort required to set up an article for spinning is only justified if you plan on creating dozens, hundreds or thousands of variations of it.</p>
<p>In short, it is only useful if you plan on spamming.</p>
<h4> Why Spinning is Bad NewsI</h4>
<p>Spammers love spinning programs because it lets them take a very small amount of original content and turn it into enough to fill many sites. However, search engines hate it because it doesn&#8217;t add a significant amount of original content to the index but is difficult to detect as duplicate.</p>
<p>Content creators worry about it because it is a way for spammers to use their content that will likely both avoid detection and outnumber the original content many times over. Since spammers usually leave keywords intact when spinning a post, some have found themselves competing with poorly-reworded copies of their own entries and articles in results critical to their brand.</p>
<p>If those who use spinners either use their own content or articles where such use is permitted, such as public domain content, then they are only gaming the search engines. However, many do seem to base their articles on work that has been written by others, especially legitimate sites in their desired spam field.</p>
<p>In short, these apps, which are being advertised via Twitter spam, are for the purpose of generating Web spam. Though their marketing claims other benefits, there is only one practical use for these tools, generating thousands of near-duplicate articles to fool the search engines.</p>
<p>It is that simple.</p>
<h4>Bottom Line</h4>
<p>The good news in all of this is that Webmasters do not have a great deal to fear from spinning spam. Though <a href="http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/">it has been on the rise for years now</a>, it has proved to be a fairly ineffective form of spamming. </p>
<p>The reason, from what I have gathered, is that even though you can create thousands of articles from one, the time it takes to set up an article for such spinning is prohibitive. It requires a great deal of work on the front end, especially if the application doesn&#8217;t have a thesaurus. </p>
<p>The bigger reason, however, seems to be that these types of articles, due to their dubious quality, don&#8217;t seem to attract many links. As such, the search engines don&#8217;t take much notice of them. </p>
<p>Still, it is important for Webmasters to be aware that this is going on and to be looking out for it. The good news is that, if it is detected, it is an infringement the same as if it were verbatim, as it is a derivative work (unless an ordinary observer could not tell that one was based on the other).</p>
<p>The fact that these products are now being marketed as means of preventing content theft might be a sign that they have not performed well in the spamming world. With easier, cheaper and more effective techniques for generating Web spam, it doesn&#8217;t seem as if these spinning programs are a good deal in any regard, especially at the prices they routinely command.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/06/16/spinning-spamming-and-twitter/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Spotting Spam Blogs</title>
		<link>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/</link>
		<comments>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/#comments</comments>
		<pubDate>Tue, 15 Jul 2008 15:57:02 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[search spam]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[spinning]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1302</guid>
		<description><![CDATA[Spammers are making it harder and harder to separate their creations from those of amateur bloggers. However, by understanding various ways to spot spam blogs and how spammers try to beat those methods, you can better detect junk sites yourself. ]]></description>
			<content:encoded><![CDATA[<p><img class="picleft size-full wp-image-1305" title="splogspot_logo" align="left" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/splogspot_logo.gif" alt="SplogSpot.com Logo" width="258" height="133" /></p>
<p>When people find out that their content is being copied without permission, how they seek to handle it is often determined, in part, by whether or not the site is a spam blog.</p>
<p>Where many might be willing to forgive copying by a novice blogger, especially with the promise of a link back, most are not prepared to have their content used so a spammer can trick the search engines and sell questionable items.</p>
<p>This means that, very often, I am forced to make snap judgments about whether a site is a spam blog or not, something that is becoming increasingly difficult as spammers have improved their techniques.</p>
<p>So how does one tell if a blog is a spam blog? The answer is not as simple as it once was but there are still ways one can detect a spammy site.</p>
<p><span id="more-1302"></span></p>
<h4>The Spammer Dilemma</h4>
<p>Spammers, over the years, have gotten better and better at making their blogs look human-edited. Though they still can not make their sites appear to be &#8220;good&#8221; blogs, they, in many cases, can pass off as the efforts of novice bloggers or of non-native English speakers.</p>
<p>This can create quite a problem when approaching a suspected spam blog. Is it a spammer using the default Blogspot template or is it someone new to blogging that doesn&#8217;t know how to change the template? Is the strange word choice the result of <a title="Spinning Spammers" href="http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/">automated spinning</a> or someone learning English? If the spam blog did its job, it can be difficult to say.</p>
<p>However, most would agree that being heavy-handed with humans who copy, especially those who make some attempt to provide attribution, is counter-productive. Especially when you consider that the person struggling with English may either grow into an important blogger or, worse yet, already be a major figure in their part of the world, it becomes clear why telling humans from machines is important.</p>
<p>But how to do it? There are several different ways, but unfortunately none of them seem to work 100% of the time.So it is important to take all of the methods below into account, look at how spammers beat them, and develop an informed opinion.</p>
<h4>PageRank Check</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/pagerank.png" alt="" title="pagerank" width="118" height="97" align="right" class="picright alignright size-full wp-image-1310" />One of my sneakier tricks was to check the site&#8217;s PageRank and see if Google had given it either a n/a or a 0. Either would indicate that the site was either very new or had been deemed spam by Google. Either way, it certainly warranted suspicion.</p>
<p><strong>How Spammers Beat It:</strong> Tricking Google. This method has become less effective as Google seems to be assigning PageRank to more and more obvious spam blogs. That is a subject for another article.</p>
<p><strong>Turning the Tide:</strong> PageRank is still a decent indicator of spamminess, but it is no longer as reliable as it was. It is best to ignore PageRank if you have other reasons to be suspicious of a blog.</p>
<h4>&#8220;About&#8221; Page</h4>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/aboutpage.png" alt="" title="aboutpage" width="144" height="163" align="left" class="picleft alignleft size-full wp-image-1312" />Since spammers that use WordPress installs typically spend as little time as possible setting up their blogs, they routinely leave the &#8220;About&#8221; page, which is created as part of the install, with its default text. Very few human-generated sites have this problem.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have started either deleting or filling in the about page. However, those that fill in the page often use it as an opportunity to keyword stuff, often further tipping their hand as a spam blog.</p>
<p><strong>Turning the Tide:</strong> If an about page does not have actual information about the site or the owner, it is very likely spam. Some spammers are starting to include fake information, but few seem to be able to resist the opportunity to keyword stuff and link.</p>
<h4>Posting Rate</h4>
<p>The goal of a spam blog is to get as much junk content into it as possible, as such, spammers routinely have extremely high posting frequency, often well over 100 posts per day. It would be physically impossible for a human to post so much content without the aid of a machine, creating a dead giveaway that the site is spam.</p>
<p><strong>How Spammers Beat It:</strong> Some spammers have begun to show restraint, only having their blogs update a few times per day and at irregular intervals, to more closely mimic a human blogger.</p>
<p><strong>Turning the Tide:</strong> The content is more telling than the frequency, unless the posting frequency is outrageous. Consider an extremely high posting volume to be a dead spam giveaway but don&#8217;t write off a site because it has a reasonable rate.</p>
<h4>Formulaic Posting</h4>
<p>We&#8217;ve all seen the spam blogs that start out with something like &#8220;I saw an interesting post today about&#8230;&#8221; and then proceeds to inject a few keywords and quote from the scraped article. By themselves, these posts may appear semi-legitimate, especially with trackbacks, but are clearly spam when you look at them in group.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have started to use multiple post templates in the same blog. However, the limited set means that, if this method is chosen, it is still easily detected over the course of about ten posts.</p>
<p><strong>Turning the Tide:</strong> Check and see if the posts have the same pattern, are roughly the same length or all contain quoted material. These are all signs of a spam blog.</p>
<h4>Ugly Templates</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/uglylayout.png" alt="" title="uglylayout" width="162" height="123" align="right" class="picright alignright size-full wp-image-1316" />Sometimes the first sign a blog is spam is the template that it is in. If the template is the default WordPress theme or a stock BlogSpot theme without modifications, it&#8217;s a likely tip off of spam content.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have been getting better about mixing up their themes. Most spam software applications come with a variety of themes that are rotated and, given the ease with which most blogs can be skinned, spam blogs can be amazingly varied.</p>
<p><strong>Turning the Tide:</strong> Fortunately, spammer themes still don&#8217;t have any elements of hand-crafting. There are very rarely custom images (or contain only very crude ones), the CSS often looks off, the color scheme is often jarring and the elements many times do not fit together correctly. If you see a glaring mistake that would be caught by anyone looking at the site, it is likely spam.</p>
<h4>Domain Names</h4>
<p>Spam blogs are typically restricted to three types of domains, 1) .us, .info and other strange extensions 2) domains stuffed with keywords (and often hyphens) 3) Free blog hosts (primarily Blogspot still).</p>
<p><strong>How Spammers Beat It:</strong> Spammers are participating in the domain aftermarket, snatching up expired domains that have had sites on them previously. This helps them carry both the PageRank of the old site, in some cases, and obtain a more &#8220;honest&#8221; name. Spammers are also spreading to other free blog services, including little-known ones, as well as social networking sites.</p>
<p><strong>Turning the Tide:</strong> If you are unsure about a domain, use <a title="Domain Tools" href="http://www.domaintools.com">Domain Tools</a> to investigate it. Look specifically for false whois information or other irregularities. Still, most spam blogs are hosted on spam domains. Better ones are too expensive for spammers to buy in bulk and are more profitable at auction than as spam tools.</p>
<h4>Ad Excess/Spam Blogroll</h4>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/longblogroll.png" alt="" title="longblogroll" width="194" height="189" align="left" class="picleft alignleft size-full wp-image-1314" />Many spam blogs earn their money by framing the content in a slew of ads, generally from one of the public advertising networks. If not, then they often times use the blogroll to put out obviously spammy links in hopes of building PageRank and search engine position for those domains.</p>
<p><strong>How Spammers Beat It:</strong> The formula is simple, fewer ads, fewer links, more spam blogs. Spammers have begun to show restraint with both their ads and their outbound links but are creating larger and larger spam farms to compensate. Spammers are also turning to alternate sources of revenue, such as Amazon afiiliate IDs, to better hide their activities. Others will mix &#8220;good&#8221; links with &#8220;spam&#8221; ones in their blogroll to further hide the nature of the site.</p>
<p><strong>Turning the Tide:</strong> One spam link is too many. Hover over the URLs in the Blogroll and check for any that are suspicious or out of place. When checking for ads, look not so much as quantity, but for the appearance that they were simply &#8220;stuck in&#8221;. Spammers don&#8217;t have time to integrate ads with their site usually.</p>
<h4>Conclusions</h4>
<p>When looking through these elements, any one of these would make me suspicious of a site&#8217;s origin, save perhaps if the site were hosted on a free blog host. Two, in turn, would make it a likely spam blog and three or above would make it a virtual lock.</p>
<p>The bottom line is that, while spammers are not making it any easier to spot their handiwork, it can still be detected by a careful eye (or a not-so-careful eye in many cases).</p>
<p>Though the spammer&#8217;s survival depends on staying under the radar and fooling humans and search engines alike, the nature of creating tens of thousands of junk blogs means that sacrifices have to be made and the results will have limitations.</p>
<p>By exploiting those weaknesses, we can continue to detect and stop spam and separate the spammers from those who are just getting started.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 05:24:47 -->
