<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism TodaySplogs | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/splogs/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Changing Face of RSS</title>
		<link>http://www.plagiarismtoday.com/2010/09/13/the-changing-face-of-rss/</link>
		<comments>http://www.plagiarismtoday.com/2010/09/13/the-changing-face-of-rss/#comments</comments>
		<pubDate>Mon, 13 Sep 2010 16:24:21 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[bloglines]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[RSS scraping]]></category>
		<category><![CDATA[Spam-Blogging]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=7774</guid>
		<description><![CDATA[As the closure of Bloglines illustrates, RSS may be shifting away from from being a destination and transitioning into a very different role.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/09/bloglines-logo.jpg" alt="" title="bloglines-logo" width="277" height="62" class="alignleft size-full wp-image-7777" /></p>
<p>This weekend saw dueling announcements that painted a sharp contrast about the future of RSS. First, the big news from a slow weekend. was that <a href="http://mashable.com/2010/09/11/bloglines-discontinued/">Ask.com is shuttering it&#8217;s Bloglines Reader</a>. Once one of the most popular RSS readers, Bloglines has maintained a strong following but has largely taken a backseat to Google Reader as a Web-based RSS reader.</p>
<p>Still, its closure comes as surprise and is <a href="http://www.blogher.com/rss-dead-bloglines-close-october-1?wrap=blogher-topics/blogging-social-media-0&#038;crumb=10">being viewed as a sign that RSS is about to meet its maker</a>.</p>
<p>However, a separate announcement came from Automattic, <a href="http://mashable.com/2010/09/10/wordpress-subscriptions/">which has added a new RSS-based subscription feature to its WordPress.com offering</a>. The idea is to make RSS usable and approachable to people who aren&#8217;t as tech savvy as your average RSS user and compile RSS feeds into a Facebook-like news stream.</p>
<p>What does this mean for RSS as one Internet company enters the field and another makes an exit? There&#8217;s no clear answers but one thing is certain, content creators need to pay close attention.<span id="more-7774"></span></p>
<h4>A Changing Relationship</h4>
<p>Bloggers and other content creators have always had a love/hate relationship with RSS. On one hand, it has been a powerful and compelling way to engage and stay in contact with readers. Everything from RSS readers to RSS-to-email services have kept readers connected and reading more content. </p>
<p>On the other hand, RSS has been accused by some of robbing sites of page views, <a href="http://www.plagiarismtoday.com/2006/09/26/why-my-feeds-are-long/">a notion FeedBurner disputes</a>, and has <a href="http://www.plagiarismtoday.com/2006/08/29/why-rss-scraping-isnt-ok/">opened the door to RSS scraping by spam bloggers and other garbage sites</a>.</p>
<p>However, RSS adoption never really took off. In 2008, <a href="http://www.micropersuasion.com/2008/10/rss-adoption-at.html">RSS adoption was at 11%</a> and it largely peaked there. The years since have seen widespread growth in social networking, including Facebook and Twitter, but limited growth in RSS use. </p>
<p>This is a story I&#8217;ve seen first hand here at Plagiarism Today. My RSS subscribers, according to FeedBurner, have been flat (though FeedBurner stats have not been very reliable) even though I&#8217;ve seen growth in pageviews, <a href="http://www.facebook.com/plagiarismtodayfans">Facebook</a> and <a href="http://twitter.com/plagiarismtoday">Twitter</a> subscriptions. Though RSS is still a major way to access this site&#8217;s content, now more people view the site on Twitter and Facebook than RSS.</p>
<p>Simply put, mainstream users never really understood or made use of RSS, finding it too complicated and hating having a &#8220;second inbox&#8221;. RSS may never be widely used for its intended function, but that also doesn&#8217;t mean that it&#8217;s dead, <a href="http://www.techcrunchit.com/2009/05/05/rest-in-peace-rss/">contrary to what TechCrunch may say</a>, just an indication that the function is changing.</p>
<h4>Changing Faces</h4>
<p>RSS may not be a destination much longer but it is still an important tool and an important means. Feeding Facebook, Twitter and other social news sites requires a standard to distribute the content, something RSS provides.</p>
<p>But as RSS&#8217; relationship with readers change, so it will with content creators. Slowly, it&#8217;s possible content creators will abandon public RSS feeds and favor private ones they feed to their various channels. The idea of a partial feed will become less egregious as RSS&#8217;s function is pushed to fit into 140 characters and other short status updates.</p>
<p>Where once RSS was viewed as the ideal way to follow a site, it is slowly becoming the way to feed the new ideal ways to follow a site. In short, it&#8217;s the tool that enables realtime, not the destination for it.</p>
<h4>Looking Forward</h4>
<p>Many creators, myself included, have been downplaying RSS on their sites for some time but the question is what will you be doing? Will you be looking at making your RSS private, switching to a truncated feed or staying the same? Will you be shifting focus to other methods for readers to connect or continue with RSS?</p>
<p>I don&#8217;t have any firm answers for myself, only what I&#8217;ve done in the past (almost without realizing what I was doing), but I can&#8217;t see myself ditching RSS completely or truncating my feed. There are still several thousand who read this site via RSS and I have no desire to cut them off or impede their reading.</p>
<p>But it is clear RSS won&#8217;t be the way of the future for me, or likely many other bloggers, at least not as a destination.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2010/09/13/the-changing-face-of-rss/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>CrowdSourcing Spam Blogging?</title>
		<link>http://www.plagiarismtoday.com/2010/03/25/crowdsourcing-spam-blogging/</link>
		<comments>http://www.plagiarismtoday.com/2010/03/25/crowdsourcing-spam-blogging/#comments</comments>
		<pubDate>Thu, 25 Mar 2010 19:26:27 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogging]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=6131</guid>
		<description><![CDATA[Want to run a spam blog without the pesky copyright problems or actual work? Here's one idea that probably won't work.]]></description>
			<content:encoded><![CDATA[<table align="left" cellspacing="15">
<tbody>
<tr>
<td><a href="http://www.flickr.com/photos/44258887@N04/4461086890/" title="Musubi mold" target="_blank"><img src="http://farm3.static.flickr.com/2443/4461086890_aa49f2e738_m.jpg" alt="Musubi mold" border="0"></a><br /><small><a href="http://creativecommons.org/licenses/by-sa/2.0/" title="Attribution-ShareAlike License" target="_blank"><img src="http://www.plagiarismtoday.com/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" border="0" width="16" height="16" align="absmiddle"></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a href="http://www.flickr.com/photos/44258887@N04/4461086890/" title="lovelihood" target="_blank">lovelihood</a></small></td>
</tr>
</tbody>
</table>
<p>For spam bloggers, or sploggers as they are often known, copyright is one of the most daunting challenges. It only takes one or two copyright complaints to bring down a spam blog network by alerting the host, destroying a significant amount of work. Likewise, a few complaints to advertisers can strip a splogger of a large percentage of their income.</p>
<p>Because of this, splogs have been working on finding ways to feign legitimacy. This helps them both stay online longer as hosts are more reluctant to take them down, helps them better establish a rapport with the search engines, their end goal in most cases, and appeal more to human visitors.</p>
<p>They&#8217;ve used many tactics to meet this goal including truncating content use to comply with fair use, <a href="http://www.plagiarismtoday.com/2009/06/16/spinning-spamming-and-twitter/">spinning content</a> so that it is unrecognizable and even skipping on borrowing content at all and <a href="http://www.plagiarismtoday.com/2006/10/20/why-scrape/">simply using automatic content generation</a>.</p>
<p>However, several readers have drawn my attention to a new kind of spam site, one that, according to their site, gets its readers to submit RSS feeds for inclusion and instead tries to hide behind a veil of user-generated content. This idea of crowd-sourcing spam is a relatively new one to me, one that actually closely mirrors YouTube&#8217;s &#8220;wild west&#8221; early days, but is almost certainly going to upset many bloggers who have had their content used without permission.<span id="more-6131"></span></p>
<h4>The Example</h4>
<p><em><strong>Note:</strong> All links to the site have been nofollowed. Please visit those links carefully and note that you do so at your own risk. The links are included purely for demonstration purposes.</em></p>
<p><a href="http://thebloghub.com" rel="nofollow">TheBlogHub</a> is, by all appearances, a very large and prolific spam blog network. It republishes the full RSS feeds from roughly 50,000 sites without truncation and while hotlinking the original source images.</p>
<p>This includes many of the Web&#8217;s most popular blogs including <a href="http://thebloghub.com/pages/TechCrunch" rel="nofollow">TechCrunch</a> (which appears to be out of date), <a href="http://thebloghub.com/pages/Mashable-_45_-All-That's-New-on-the-Web" rel="nofollow">Mashable</a> and <a href="http://thebloghub.com/pages/Engadget" rel="nofollow">Engadget</a> (Also out of date).</p>
<p>However, according to the site, all the RSS feeds are submitted by users of the service. The exact nature of this service is unclear beyond the <a href="http://thebloghub.com/About-Us">site&#8217;s mission statement</a> of &#8220;to provide quick and easy access to relevant blogs and articles for our guests and members, whilst promoting the respective blogs and their authors.&#8221;</p>
<p>But the site does very little to actually promote authors. Not only is the full content used, but the <a href="http://thebloghub.com/robots.txt" rel="nofollow">site&#8217;s robots.txt</a> file encourages search engines to read the content, thus making it a direct competitor with the original articles and there is no link back to the individual posts, just a small link back to the home page at the top of a site&#8217;s content. </p>
<p>The site also accepts comments on its service, which has the potential to further fragment the audience and conversation for the blogs involved.</p>
<p>To make matters even worse, though the site does offer a means to <a href="http://thebloghub.com/Report-Content" rel="nofollow">request removal of content</a>, you are required to give some form of verification that it is your content. However, to add a feed into the service there appears to be no such need. The site does, however, offer a means to file a DMCA-like notice buried in their terms of service but the email address bounces mail as undeliverable.</p>
<p>Hosted on <a href="http://web24.com.au">Web24</a>, the site appears to be based out of Australia and has ties to an Australian company, other sites of which are advertised heavily on the site.</p>
<p>In short, despite the fact that this site proudly proclaims not to be a spam blog network, it at the very least bears all the signs of being as such. If its goal is to truly be a legitimate service, is has many steps that it should take to be more cooperative with the original authors.</p>
<p><em><strong>Note:</strong> An attempt to email the creators of the site via both the listed email address and the address listed to receive DMCA complaints were returned as undeliverable. </em></p>
<h4>Why This is a Problem</h4>
<p>If these sites are truly crowdsourcing the locating and addition of RSS feeds, which is up for debate, it can create challenges for content creators whose works are being reused without permission.</p>
<p>First, in some cases, the sites may qualify for safe harbor. If the content is actually provided by the direction of users and they can show they did not profit directly from the infringement, they may be able to claim safe harbor. However, this is heavily muddled by the Grokster ruling which holds companies can be held liable for &#8220;inducing&#8221; copyright infringement. However, this only applies to the U.S. and the issue becomes further muddled when other nations become involved.</p>
<p>Second, hosts will be much less likely to take down such sites if they seem legitimate. Instead, they will more likely pass on any infringement notices to the owners of the site, allowing them the chance to remove it and continue on with the other content.</p>
<p>Finally, content creators will be more inclined to treat these sites as legitimate and contact the owners directly, if possible, to resolve these matters. Even if the site is intentionally or tacitly encouraging infringement and benefiting from it, copyright holders will treat them as if they were other legitimate hosts.</p>
<p>The problem with all of this, however, is that it seems unlikely to me that users would, willingly, crowdsource a spam blog network. Contributing RSS feeds to a service for &#8220;centralization&#8221; seems like an unlikely service to attract thousands of visitors. Instead, it seems to me much more likely that these sites merely attempt to give the appearance of legitimacy by feigning as if the content is submitted from 3rd parties.</p>
<h4>Bottom Line</h4>
<p>Still, I have no way of knowing with any certainty what is going on in this particular case. But whether they are actually receiving the feeds from users who are agreeing to their terms of service or simply pretending, the result is the same, scraped content from many thousands of sites, the majority of which almost certainly never gave permission. </p>
<p>A spam blog is a spam blog. Whether it is created intentionally, through recklessness or even simple mistake, the outcome is the same.</p>
<p>As such, the spam blogs need to be dealt with accordingly. Though contacting the owner might be best in cases where it seems to be a simple mistake, such as with an RSS reader that was accidentally exposed to the broader Web, in other cases it is most likely best to go with the hosts or advertisers if possible.</p>
<p>Though I typically encourage people to try and sort disagreements over copyright face-to-face. However, with spammers it is usually a waste of time. As with the case in this site, two letters seeking comment bounced back, including one to the email address supposedly set up to receive notices of copyright infringement.</p>
<p>If your feed is republished on the above site, for example, and you want it removed. You would likely be better off reaching out to their host, especially since all of their contact addresses no longer work. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2010/03/25/crowdsourcing-spam-blogging/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>How to Report Abuse to WordPress.com</title>
		<link>http://www.plagiarismtoday.com/2009/08/18/how-to-report-abuse-to-wordpress-com/</link>
		<comments>http://www.plagiarismtoday.com/2009/08/18/how-to-report-abuse-to-wordpress-com/#comments</comments>
		<pubDate>Tue, 18 Aug 2009 16:36:28 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[spam spam blogs]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4393</guid>
		<description><![CDATA[If you've ever need to report abuse to WordPress.com admins, a new post by myself on Lorelle on WordPress will help you get the resolution that you want.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/08/lorelle-logo.png" alt="lorelle-logo" title="lorelle-logo" width="190" height="103" class="alignleft size-full wp-image-4394" /></p>
<p>Today&#8217;s Plagiarism Today post is actually on the site of my good friend Lorelle VanFossen. She&#8217;s away from her blog for a bit and has asked me to post a few things to keep the blog going until she returns. </p>
<p>Specifically she requested a guide on filling complaints (copyright, spam, etc.) to WordPress.com as there appears to be a great deal of confusion about it. So I&#8217;ve written a post clarifying the process and providing links to the relevant policy pages on Automattic&#8217;s sites. </p>
<p>The hope is that, anyone who needs to file a terms of service violation regarding a WordPress.com blog should have no trouble knowing where to go to now.</p>
<p>So, without further ado, <a href="http://lorelle.wordpress.com/2009/08/18/how-to-report-abuse-to-wordpress-com/">go check out the post!</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/08/18/how-to-report-abuse-to-wordpress-com/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>4 WordPress Plugins I Would Love</title>
		<link>http://www.plagiarismtoday.com/2009/07/31/4-wordpress-plugins-i-would-love/</link>
		<comments>http://www.plagiarismtoday.com/2009/07/31/4-wordpress-plugins-i-would-love/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 18:45:36 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Punditry]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[plugins]]></category>
		<category><![CDATA[RSS scraping]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogs]]></category>
		<category><![CDATA[Wordpress]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4218</guid>
		<description><![CDATA[As an avid WordPress user, there are a few plugins I'd like to see developed to help people protect their content.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/07/plugin-logo.png" alt="plugin-logo" title="plugin-logo" width="284" height="56" class="alignleft size-full wp-image-4222" /></p>
<p>I&#8217;m a big fan of WordPress and one of the key reasons has been the <a href="http://wordpress.org/extend/plugins/">plugins</a>. There&#8217;s a great community of WordPress developers out there that create very powerful plugins that do everything from make your site run faster to keeping the comment spammers at bay.</p>
<p>However, one area that has been a bit of a disappointment has been when it comes to tracking and content protection. Though <a href="http://wordpress.org/extend/plugins/copyfeed/">Copyfeed</a> was a <a href="http://www.plagiarismtoday.com/2007/05/24/copyfeed-plugin-now-available-in-english/">veritable Swiss army knife for RSS tracking and protection</a>, it hasn&#8217;t been updated in over a year and doesn&#8217;t work with current versions. </p>
<p><a href="http://www.plagiarismtoday.com/2006/10/05/update-digital-fingerprint-plugin-beta-2/">Likewise, the Digital Fingerprint plugin</a> has fallen on hard times as well.</p>
<p>WordPress plugins are in a great position to help bloggers track and protect their content. Not only do they have direct access to the server and its files, including the feed, but they operate largely in the administration area, where most WordPress users go for all of their blogging information.</p>
<p>With that in mind, here are four WordPress plugins I have not been able to find but would like to see created. If there are any developers looking for a weekend project, maybe they&#8217;ll consider one. Likewise, if I&#8217;ve overlooked a plugin, please let me know so I can update this post.<span id="more-4218"></span></p>
<h4>4. Proper Licensing</h4>
<p>There seems to be about a thousand WordPress plugins that let you add a Creative Commons License to your site. This seems odd to me as it is pretty trivial to just add the CC license yourself via your themes editor, but if you prefer to use a plugin, that is perfectly fine.</p>
<p>However, there seems to be no plugins that help people license the content correctly, including completing the license. <a href="http://www.plagiarismtoday.com/2008/04/09/photodropper-creative-commons-made-easy/">PhotoDropper</a> is an excellent example of correctly licensing content as it goes into the blog, but I want something to help others as they license the content from my site.</p>
<p>Specifically, I&#8217;d like a &#8220;reuse this article&#8221; link to appear on my site and provide the visitor with HTML code that they can use to paste the work into their site. That will include proper attribution and, if appropriate, a link to the CC license. Ideally, I&#8217;d like to have this plugin remove images (or at least the option to) so to avoid hotlinking issues.</p>
<p>This could be used with CC licenses, which would be ideal, but could also be used for those who want specific &#8220;author boxes&#8221; on their reused articles. Either way, the end result would be that, with a click, a copy and a paste, a visitor would be able to republish my articles and complete my license terms.</p>
<p>Note: There is already a plugin for licensing RSS feeds called <a href="http://sourcedfrom.com/">SourcedFrom</a> that I will be talking about more next week.</p>
<h4>3. Non-Repudiation Integration</h4>
<p>One of the more difficult problems on the Web is knowing who published something first. Though not likely useful in a court of law, it can be VERY useful in the court of public opinion. To help with that, non-repudiation services such MyFreeCopyright, Numly and Registered Commons have formed to record when a post is saved.</p>
<p>It would be nice if the process of submitting to these services would be automatic so that, when I hit &#8220;Publish&#8221; the work is automatically uploaded, timestamped and stored. Technically both <a href="http://numly.com">Numly</a> and Registered Commons have WordPress plugins but both are out of date and <a href="http://registeredcommons.org">Registered Commons</a> was too difficult to ever be practical.</p>
<p>However, the limitation here is likely due to the lack of robust APIs. <a href="http://myfreecopyright.com">MyFreeCopyright</a> and <a href="http://www.safecreative.org/">SafeCreative</a>, the two leading free services, both lack publicly available APIs at this time (though both say they are working on it).</p>
<p>This may be a situation where someone has to create the service first and the plugin second&#8230;</p>
<h4>2. FairShare Integration</h4>
<p><a href="https://fairshare.attributor.com/fairshare/">FairShare</a> is a free service by <a href="http://attributor.com">Attributor</a> that follows your feed and produces a second feed for you to subscribe to that locates copies of your works on the Web. It works very well for bloggers and uses the same matching technology as the main Attributor service, which is used by many major publishers.</p>
<p>Integrating this into WordPress makes a lot of sense. A simple version might just subscribe to the FairShare feed in the admin panel where more advanced ones could look at the content of the entries and prioritize them in some way. </p>
<p>This might be tricky as the FairShare feed is just a regular RSS feed with the content laid out in a table. Also, the feed structure could change at any time as changes are made to the service. Still, given how frequent content reuse is and how much of the potential audience is on other sites, it seems like it could be a worthwhile addition.</p>
<h4>1. CopyFeed Replacement</h4>
<p>CopyFeed, when it worked, was a powerhouse against RSS feed scraping let you track where your feed was being used and then block the spammers from accessing it. No need to send cease and desist letters or takedown notices unless you wanted the old content removed or the spammer constantly worked to circumvent your blocks. </p>
<p>It required careful use, especially since you could block legitimate visitors as well, but it was a powerful plugin that was great for those who did not use FeedBurner. However, it doesn&#8217;t appear to work with current WordPress versions and seems to be dead in its development</p>
<p>That being said, the plugin is GPL, so there may be an opportunity for another developer to revive and fix it.</p>
<h4>Bottom Line</h4>
<p>Though I don&#8217;t expect all or even any of the plugins above to materialize, I hope that by tossing these ideas out a conversation will start about the role such plugins could play in licensing, tracking and protecting content. Maybe then one of these ideas, or an offshoot of them, will catch the eye of a developer who will take it up.</p>
<p>Even just one of these plugins could be a huge asset for bloggers. I&#8217;m hoping that someone else will see that and consider at least one to be a worthwhile venture&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/07/31/4-wordpress-plugins-i-would-love/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>6 Steps to Find a Host&#8217;s DMCA Contact</title>
		<link>http://www.plagiarismtoday.com/2009/07/16/6-steps-to-find-a-hosts-dmca-contact/</link>
		<comments>http://www.plagiarismtoday.com/2009/07/16/6-steps-to-find-a-hosts-dmca-contact/#comments</comments>
		<pubDate>Thu, 16 Jul 2009 17:49:47 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[abuse]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[copyright inffringement]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogs]]></category>
		<category><![CDATA[takedown notice]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4036</guid>
		<description><![CDATA[Once you've found an infringer and identified their host, the real quest often begins. Finding out who to contact can be the hardest part of all.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/07/domain-tools-logo-1-300x68.png" alt="domain-tools-logo-1" title="domain-tools-logo-1" width="300" height="68" class="alignleft size-medium wp-image-4039" /></p>
<p>You&#8217;ve found someone infringing your content and you want them to stop. However, after contacting the infringer directly, they are either unresponsive outright refuse to take any action. You decide that it is time to contact the host but, after performing your due diligence and <a href="http://www.plagiarismtoday.com/2009/05/14/video-finding-the-host-2/">finding the host of the site</a>, you find yourself lost on how to actually get ahold of the right person. </p>
<p>The problem is compounded by a reluctance on the part of hosts to make this information readily available. Some of it is a desire to discourage false notifications, however, much of it is just a general desire to keep complaints to a minimum while still complying with the various laws. After all, every complaint costs them money both in terms of time and, most likely, lost customers.  </p>
<p>The good news is that there are many places that you can look for an email address or other contact information. It is just a matter of knowing where to look.<span id="more-4036"></span></p>
<h4>Step 1: Look on the Site Itself</h4>
<p>The first step for finding a DMCA agent to file a notice of copyright infringement is to look on the host&#8217;s site itself. Online service providers, as part of the DMCA, are required to post this information on their site and the vast majority do so.</p>
<p>However, this doesn&#8217;t mean that it is easily visible. This information is very often heavily buried. </p>
<p>The best place to start is at the bottom of the host&#8217;s home page. There look for a &#8220;legal&#8221; section as it is most likely to have the contact information you need. </p>
<p>Failing that, you can skim through the site&#8217;s terms of service and other policies to see if there is any relevant mention of copyright or, if needed, reporting abuse. If that doesn&#8217;t work, look at the site&#8217;s &#8220;contact us&#8221; page to see if they included it there. Finally, if that fails, take a look at the site&#8217;s privacy policy as it usually comes with an email address aimed at the company&#8217;s legal department.</p>
<h4>Step 2: The Copyright Office</h4>
<p>If you can&#8217;t find the DMCA contact information on the host&#8217;s site directly, it might be worthwhile to take a few moments to visit the <a href="http://www.copyright.gov/onlinesp/list/index.html">Copyright Office&#8217;s directory of designated agents</a>. Though this list is difficult to use (based on image-only PDFs) <a href="http://www.plagiarismtoday.com/2007/08/21/the-need-to-modernize-the-dmca-agent-list/">and tends to be out of date</a>, it is a solid backup in the event you can&#8217;t find a DMCA contact on the site itself. </p>
<p>Be sure to check the age on any documents you open and be very careful to open the right file as many companies have very similar names.</p>
<h4>Step 3: Ask</h4>
<p>If neither of these sources produce any useful information, but you have an address that you know is valid, just wrong for the purpose, such as a &#8220;support&#8221; address, send an email there and ask where to direct the issue. It might add an extra step to the process but it is very effective when hosts do respond.</p>
<p>However, don&#8217;t give bogged down waiting for a reply, some will discard all non-related questions. So, once a day or two has passed, it is time to move on.</p>
<h4>Step 4: Check for Another Host</h4>
<p>If you&#8217;ve gotten this far and haven&#8217;t had any luck, it is worthwhile to take a moment and see if there is another host involved that it might be better to contact. For example, some Web hosts lease their servers from a larger company. Depending on how you found your host, you may want to contact the larger company or the smaller one instead.</p>
<p>The way to check for this is simple. Perform your <a href="http://www.domaintools.com">Domain Tools</a>Domain Tools</a> check a second time, look at the two lines marked &#8220;Name Servers&#8221; and make a note of the domain. Then proceed to the IP Whois information, as instructed in the video, and compare the two. If there is a discrepency, it could be a sign that there are two hosts involved. </p>
<p>Try the domain that you are not looking at, most likely the domain of the name servers, and repeat steps 1 through 3 to see if you can find contact information for them. </p>
<h4>Step 5: The IP Whois Records</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/07/abuse-lt-300x162.png" alt="abuse-lt" title="abuse-lt" width="300" height="162" class="alignright size-medium wp-image-4041" /></p>
<p>If you used <a href="http://www.domaintools.com">Domain Tools</a> to locate the host of the site, you most likely have already looked at the whois information for the IP address. Though the initial purpose behind this was to find out who the host of a site is, or at least who owns the IP address, this record also contains some potentially valuable contact information as well.</p>
<p>Specifically, there will usually be an email account specified to receive abuse complaints. If you can find that account, it&#8217;s likely a safe bet that it&#8217;s a valid starting point for your contact (though they might not be the last).</p>
<h4>Step 6: Guess</h4>
<p>If you&#8217;ve reached this point, you&#8217;re probably at your wits end. If nothing else has produced a valid contact or the other addresses you have written haven&#8217;t panned out, the best thing you can do is simply guess. The most common addresses for DMCA-related questions, that I see, are as follows:</p>
<ul>
<li>abuse@</li>
<li>dmca@</li>
<li>legal@</li>
<li>copyright@</li>
<li>dmca-agent@</li>
<li>support@</li>
</ul>
<p>Odds are that at least one of those will get through to a human being. Though sending additional messages to webmaster@ and mail@ might not be a bad idea. </p>
<p>However, if you do send an email to all of these addresses, do expect the majority to bounce. The hope is that one or two will get through. </p>
<h4>Bottom Line</h4>
<p>In the end, finding out who to contact about a DMCA agent takes a bit of sleuthing, especially if I don&#8217;t already have the site on my <a href="http://www.plagiarismtoday.com/dmca-contact-information/">DMCA Contact List</a>. However, if you&#8217;ve found the correct host, the hard part has already been done.</p>
<p>All you have do now is keep looking and, hopefully, find the information that you need. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/07/16/6-steps-to-find-a-hosts-dmca-contact/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Is Lijit Creating Spam Blogs?</title>
		<link>http://www.plagiarismtoday.com/2009/02/12/is-lijit-creating-spam-blogs/</link>
		<comments>http://www.plagiarismtoday.com/2009/02/12/is-lijit-creating-spam-blogs/#comments</comments>
		<pubDate>Thu, 12 Feb 2009 17:02:12 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[aggregation]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[lijit]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogging]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=2823</guid>
		<description><![CDATA[A new service by Lijit has raised eyebrows in the content community. Does their new aggregation service provide a new solution for content licensing or just push the boundaries of what is spam?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2009/02/logo_250.jpg" alt="logo_250" title="logo_250" width="250" height="179" class="alignleft size-full wp-image-2827" /></p>
<p>Lijit is a popular and well-respected search application for bloggers. Many sites, including this one, use it because it not only drastically improves upon the default WordPress search, but also because it allows searchers to pull content from sites within the blogger&#8217;s network, including social network sites and other blogs.</p>
<p>However, a new service of Lijit has been causing some controversy. It&#8217;s <a href="http://www.lijit.com/content_networks/learn_more">new content networks service</a> has already been <a href="http://mashable.com/2009/02/11/lijit-content-networks/">accused of being similar to scraper and spam blogs</a>.</p>
<p>So I decided to take a look and see for myself. However, what I found very worrisome and gave me a great deal of reason to caution bloggers to pause and think before they made the jump to join or create a network on Lijit.<span id="more-2823"></span></p>
<h4>The Big Idea</h4>
<p>The idea behind the content networks service is one that should be familiar to bloggers and Webmasters. In these networks, a group of bloggers in a similar subject come together and pool their content into a single site (or as Lijit calls it, &#8220;publication&#8221;) and, in exchange, content providers get their sites linked in the blogroll of the hub site and have the articles linked back to their source.</p>
<p>In short, content networks are not &#8220;networks&#8221; so much as &#8220;aggregators&#8221;. These sites act as a hub for all of the content on the member blogs, in some cases republishing the full content from the RSS feeds.</p>
<p>The theory is that, if member sites link to this hub, visitors will explore the network site, find new content to follow and this will increase readership for all of the blogs involved. It is very similar, in that regard, to other networks and traffic-generating schemes that have been tried over the years, usually without success.</p>
<p>However, the question remains, is this service a spam blog and should bloggers consider signing up? </p>
<h4>The Spam Question</h4>
<p>Determining whether or not these content networks push the boundaries of spam blogs or scraper blogs is pretty difficult as the definition itself itself is hard to nail down. What we can do is look at how the sites operate, the good and the bad, and make a determination. For this purpose, we will be looking at the network blog for their new <a href="http://www.securitybloggers.net/">Security Blog Network</a>. </p>
<p>The first thing that most will notice is that the network site is scraping the whole content of the RSS feeds, including any footers, such as Feed Flares, that are added. Though other network sites, such as the <a href="http://cycling.lijitnetworks.com/">Cycling Bloggers Network</a>, use truncated feeds it is clear that the service is both capable of and often does use the full content. </p>
<p>The good news is that, once you look past the full content reuse, it is clear that the site does follow many of the content reuse best practices. On the site in question, clicking the headline of any given story will take you to the original story and the &#8220;Comments&#8221; link also goes to the original page as well. I was unable to find a &#8220;permalink&#8221; that did not reference back to the original source. Furthermore, all of the links to the original content are &#8220;dofollow&#8221; links, ensuring that the search engines will pick up the original sites.</p>
<p>On the flip side, images are hotlinked from the original source and the entire site can, and is, easily indexed by the search engines. Currently, <a href="http://www.securitybloggers.net/robots.txt">the robots.txt file</a> does not ban any search engines from any on the site (though only tag pages s<a href="http://www.google.com/search?client=safari&#038;rls=en-us&#038;q=site:securitybloggers.net&#038;ie=UTF-8&#038;oe=UTF-8">eem to be indexed right now</a> as the permalinks point to the original content). </p>
<p>Though it is clear that these network sites could do more to minimize the negative impact they could have on the bloggers that join them, it is also clear that they are not trying to outright rip off the content of hapless Webmasters who sign up.</p>
<p>The end result is that, while I don&#8217;t think Lijit is doing something that is outright spammy, I also can&#8217;t advise anyone to consider joining one of these networks. Between duplicate content issues, network imbalance and oversaturation of content, it seems likely that many will find more drawbacks than benefits.</p>
<h4>Conclusions</h4>
<p>I recognize that many will disagree with me when I say that Lijit&#8217;s content networks aren&#8217;t outright spam. As the Mashable article pointed out, the appearance, presence of unwanted RSS content (such as Feed Flares) and bold advertising blocks seem to further that notion. </p>
<p>Indeed, if I found that one of these sites were scraping my content I would likely assume it was a spam blog and react accordingly. These sites look like spam blogs, scrape content like a spam blogs and will likely rank better than most spam blogs due to the large number of legitimate sites linking to them.</p>
<p>To some, they might not be spam, they are something worse.</p>
<p>However, it is important to remember that these networks are purely opt-in. Sites have to register and agree to let their content be used in this manner. If a blogger doesn&#8217;t want their material to appear on one of these sites, all they have to do is not register.</p>
<p>If bloggers want to allow their content to be used by a site that shares many features with a spam blog and provides what would seem to be little hope of an equitable return, this becoming more true the larger the original blog grows, then it is not my place to say no.</p>
<p>Though I would feel better if Lijit would add extra protections to ensure that search engines are not confused, such as mandating partial feeds or blocking the search engines from indexing the site, the end decision is the blogger&#8217;s.</p>
<p>Personally, I find that these sites border a little too closely to spam blog behavior for me to seriously consider joining. Unlike other content networks, such as BlogBurst, which add editorial value and a human element to the reuse, Lijit&#8217;s service is more about straight scraping and republishing. </p>
<p>However, the service, at this time, doesn&#8217;t sour me enough on the Lijit name and product to drop their search tool. I don&#8217;t think that Lijit is &#8220;evil&#8221; or &#8220;bad&#8221;, but that their new service, while well-intended, has some potentially ugly side effects.  </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/02/12/is-lijit-creating-spam-blogs/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>The Worst DMCA Response I&#8217;ve Seen</title>
		<link>http://www.plagiarismtoday.com/2009/02/04/the-worst-dmca-response-ive-seen/</link>
		<comments>http://www.plagiarismtoday.com/2009/02/04/the-worst-dmca-response-ive-seen/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 17:06:53 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Villains]]></category>
		<category><![CDATA[aggregation]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Hosting]]></category>
		<category><![CDATA[joyent]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogs]]></category>
		<category><![CDATA[takedown]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=2695</guid>
		<description><![CDATA[I've sent hundreds of takedown notices and seen a wide variety of replies, but Joyent has taken the cake by not just failing to act, but actively defending a potential spam blogger. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2009/02/joyent-logo.png" alt="joyent-logo" title="joyent-logo" width="150" height="42" class="alignleft size-full wp-image-2699" /><em><strong>Update:</strong> This story has been updated. See below for the new information.</em> When you send in a DMCA notice, a spam blog complaint or any other kind of abuse report to a Web host, there are a lot of different responses that you might receive.</p>
<p>The vast majority of hosts seem to take these issues seriously and pull down any spammy or infringing content quickly. Others seem to ignore any such requests and others reply with polite, but clearly stock, letters saying that they can&#8217;t help.</p>
<p>And then there is <a href="http://www.joyent.com/">Joyent</a>.</p>
<p>Joyent takes things to a new level. Not only does the site not help when someone is misusing your content, but they actually support scrapers and defend them in email responses.</p>
<p>Though many hosts have a policy of inaction against infringement, something that is legally dangerous and ethically dubious, none that I have seen have spent the time and energy Joyent&#8217;s representative did to protect them and defend them.<span id="more-2695"></span></p>
<h4>What Happened</h4>
<p>Sunday, during some of my testing of <a href="http://www.fairshare.cc">FairShare</a><a href="http://www.plagiarismtoday.com/2009/02/03/attributor-announces-fairshare-service/">, a system officially launched yesterday</a>, I was notified about a site on a .info domain that had used nearly 100% of <a href="http://www.plagiarismtoday.com/2009/01/28/you-think-us-copyright-law-is-bad/">my recent article about U.S. Copyright Law</a>. </p>
<p>I looked at the site in question and it appeared to either be a spam blog or a misguided aggregator. It was displaying the full content of dozens of entries on one page, using hotlinked images and the only original content was a pair of &#8220;thumbs up&#8221; and &#8220;thumbs down&#8221; buttons with each article that seemed to do nothing. </p>
<p>Though it linked to Plagiarism Today in the headline, it was not linking to or referencing my CC license nor was it referencing the &#8220;Share Alike&#8221; attribute. When combined with the hotlinked images, I decided to take action, filing a DMCA notice with their host, Joyent.</p>
<p>However, that was no simple process. Never having encountered this host, I searched their site for information about their DMCA agent. However, all I found was <a href="http://www.joyent.com/about/legal/safe-harbor">a postal address</a> (Note: Link is to a privacy safe harbor, not copyright safe harbor). I checked for a registration with the <a href="http://www.copyright.gov/onlinesp/list/j_agents.html">U.S. Copyright Office</a>, but found nothing. Since all other abuse requests seemed to be routed through their &#8220;Support&#8221; account, I emailed it and, since it was a weekend and they had <a href="http://help.joyent.com/">limited support hours listed</a>, I did not expect to hear back.</p>
<p>However, I heard back relatively quickly (within an hour), especially amazing considering it was also Super Bowl Sunday. The letter was from Linda Derezinski, who identified herself as the &#8220;Director of Support&#8221;. Her letter was short and said in part that:</p>
<blockquote><p>&#8220;That is just a RSS feed aggregation site which is properly identifying as such, it is not taking your work and showing it as theirs&#8230;.  If you do not wish them to show your RSS feed you of course are able to block them from reading it.&#8221;</p></blockquote>
<p>In short, Derezinski was telling me that it was my responsibility as the provider of the RSS to &#8220;block&#8221; this site from republishing my feed wholesale on their site. Even though that is impossible with FeedBurner and impractical without knowing more about how the site is obtaining the feed.</p>
<p>I&#8217;m used to working with and educating hosts about copyright issues, but never have I had such a strong defense for a spam blogger. I responded back with some basic information about the DMCA and links to information on the law, including the <a href="http://chillingeffects.org/dmca512/faq.cgi">Chilling Effects FAQ</a>. </p>
<p>There was a slightly greater delay in this response but, after about an hour, Derezinski wrote back a second time saying the following: </p>
<blockquote><p>&#8220;You have your site identified as  creative commons&#8230;. They properly identify that the article is from your site.  This is not DMCA issue.&#8221;</p></blockquote>
<p>I wrote back a second time, this go around linking to my article about <a href="http://www.plagiarismtoday.com/2007/06/05/using-creative-commons-to-stop-scraping/">using Creative Commons to Stop Scraping</a>, which includes a discussion with then-VP Mike Linksvayer. I pointed out that the site did not complete the license, neither linking to the license or mentioning the share-alike element.</p>
<p>That was the last I heard of the matter. As of this writing, the article in question is still up on the page in question (though it may cycle off any time now, after being up for a week) and dozens of other articles from different sources are still up, all with full text and hyperlinked images.</p>
<p>To recap what happened. I filed a DMCA notice in what seemed to be a straightforward scraping case, Joyent first defended the scraper saying that it was my responsibility to block them and then, after I showed that was not the case, then tried to tell me there wasn&#8217;t a copyright issue since I had a CC license. Once I showed that the site was in violation of my license, there was nothing but silence.</p>
<p>Needless to say this has me frustrated, but not defeated.</p>
<h4>Offering a Defense</h4>
<p>In the interest of fairness, I&#8217;m going to offer some potential defenses here. </p>
<ol>
<li><strong>Splitting Hairs</strong>: I&#8217;ll admit openly that the representative was correct that I do have a CC license and the site in question did link back to PT. Though the use was a violation of the license, something I feel very comfortable saying, it is understandable why someone unfamiliar with the details of CC might think that the use was compliant.</li>
<li><strong>Questionable Spam Blog:</strong> There is room for debate if this site was a spam blog or an aggregator. Though it clearly violates the best practices for aggregation, it does seem to attempt to add some value. This doesn&#8217;t make it less of an infringement, for a host that specializes in applications, it might have been seen as a defense.</li>
<li><strong>Incorrect Person:</strong> Though it was the only email address provided, it is clear that Derezinski is neither familiar with nor accustomed to DMCA complaints. I could have sent the complaint via postal mail (I find it stunning that they don&#8217;t even offer a fax) but it is likely that the article would have scrolled off or changed URLs in that time.</li>
</ol>
<p>I have no doubt in my mind that this is not an appropriate way to handle a copyright complaint (as I said in the title, the worst that I had seen) and that this company violates many of the best practices of the DMCA (Not registering their agent with the USCO, not having an email address, not having a direct copyright policy, etc.) but I do want to give them the benefit of the doubt.</p>
<p>I hope that this was a misunderstanding and nothing more.</p>
<h4>Conclusions</h4>
<p>One of the jobs that I do in my consulting practice is work with hosts on these kinds of issues. I&#8217;m often recruited to check for any flaws in their DMCA system and even do test runs of their takedown process. I have a lot of experience in this area. </p>
<p>This is the worst response that I have seen in seven plus years of handling DMCA issues. Though I applaud hosts for looking at DMCA notices and considering copyright issues, this was not a case where the notice was clearly false, the use raised fair use questions or had other common issues that arise with erroneous DMCA notices.</p>
<p>What makes it worse is that the host came down clearly on the side of wholesale RSS scraping and republishing. Meaning that, if you are a spam blogger reading this that engages in RSS scraping, Joyent is likely a natural choice for you as their staff, at least judging from this correspondence, feels that you are doing nothing wrong or illegal.</p>
<p>My hope is that this was just a miscommunication but it has been over 48 hours since my last correspondence with them and no action has been taken on my ticket. If I do not hear more within the next 24 hours, I will look at calling them or filing the notices with Google, provided that the work hasn&#8217;t scrolled off or naturally disappeared. </p>
<h4>Update</h4>
<p>At about 5 PM central time the day this story was posted, I was contacted by Peter Watridge, who said that Derezinski has &#8220;escalated&#8221; the ticket it to him and he had not been able to get it in a timely matter. He is going to work with the owner of the domain to get the work removed. </p>
<p>Though this is great news and certainly puts Joyent in a better light. The initial response was still not what I would characterize as appropriate. I will update again when/if the content is removed. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/02/04/the-worst-dmca-response-ive-seen/feed/</wfw:commentRss>
		<slash:comments>50</slash:comments>
		</item>
		<item>
		<title>Attributor Analyzes TrueAudience</title>
		<link>http://www.plagiarismtoday.com/2008/11/19/attributor-analyzes-trueaudience/</link>
		<comments>http://www.plagiarismtoday.com/2008/11/19/attributor-analyzes-trueaudience/#comments</comments>
		<pubDate>Wed, 19 Nov 2008 18:08:15 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Attributor]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Spamming]]></category>
		<category><![CDATA[Splogs]]></category>
		<category><![CDATA[study]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=2123</guid>
		<description><![CDATA[A recent study by content tracking service Attributor has found that, for many publishers, their audience off their site completely dwarfs the pageviews they can count.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/04/attributor-logo.jpg" alt="" title="attributor-logo" width="206" height="77" class="alignleft size-full wp-image-1000" />A recent study by content tracking company Attributor attempted to determine the true audience of a Web publisher by analyzing both the viewership the site&#8217;s content gets on its own site and what it gets on other sites where it is copied onto, usually without a license.</p>
<p>The results were stunning. <a href="http://www.attributor.com/blog/trueaudience/">According their report</a>, on average the sites that they studied had 140% more views of their content on other sites than the original. This meant that well over half of all views of the content took place on sites other than the creator&#8217;s and were unavailable for either monetization or, in many cases, attribution.</p>
<p>Though the results are interesting, they likely are not at all surprising to many who deal with copyright and plagiarism issues on the Web. With human copying, RSS aggregation (both good and bad) and other republication as common as it is, many had already suspected that the audience of a content was much larger off the site than on it. Attributor is simply one of the first to conduct a study that shows it.</p>
<p>However, there are several elements of the study that are interesting beyond the initial findings and may offer clues as to what Webmasters are most at risk of having their content misused.<span id="more-2123"></span></p>
<h4>How it Was Done</h4>
<p><a href="http://www.attributor.com/docs/TrueAudience.pdf">According to the report</a> (PDF), they analyzed 100 publishers from the <a href="http://lists.compete.com/">Compete Top 1000</a>, first discarding their existing customers and sites with partial feeds, and added some publishers into the list on top of that to ensure a good mix of different topics.</p>
<p>They then ran the sites through the content matching service and analyzed all of the copies that had higher than a 50% match and more than 125 words the same. After removing known licensees, they looked at the sites that had information on and used traffic estimates from Compete to get an approximate idea of how large the viewership was on these match sites.</p>
<p>After that, they then broke down the results by broad category to see which kinds of publishers had the largest &#8220;Audience Multiplier&#8221;, meaning viewership of their content on other sites. </p>
<p>The results were that, on average, the publishers tested had nearly 60% of their content viewership on other sites. This leads to missed opportunities both for linkbuilding and for monetization as well as possible causes for removal requests.</p>
<p>This obviously will be of great interest to many Web publishers, who are looking for ways to maximize their audience in the face of an economic slow down, but may not come as a surprise to those that have studied these issues.</p>
<p>However, other findings of the study are potentially even more useful to Webmasters, especially those in high-risk fields.</p>
<h4>High-Risk Topics</h4>
<p>The study, in addition to looking at publishers in general, broke down their results by content category and the results there were staggering.</p>
<p>Of the sites listed, automotive sites fared the worst. For them, they had nearly seven times the audience on other sites than they did on their own. Travel sites also had a high multiplier, over five times the amount and movie reviews had just under five. </p>
<p>In each of the cases above, the sites have audiences on other parts of the Web that easily dwarf their own traffic, meaning they are experiencing the highest level of unlicensed copying and the sites that are copying them have the highest amount traffic levels.</p>
<p>The topics that fared best were politics and health, both of which had barely over one. However, in both cases, the audience is still larger on the rest of the Web, only in these cases it is by a very small margin.</p>
<p>Why there is such a wide divide between the different topics is very difficult to say. Without the full statistics, which were not available in the report, it could be due to a variety of reasons. Though it seems unlikely that one site would be scraped and republished significantly less than another, especially since nearly all of the topics have a strong spammer following, it could be a sign that copying and pasting has had a higher degree of success in certain categories.</p>
<p>For example, with political sites, most people visit the one or two sites that they trust rather than performing blind searches. However, when automotive problems arise, people tend to put queries into Google and trust the search results. Even though both sites likely have many copies of their content, one is able to rely on their brand loyalty to keep much of their audience close by.</p>
<p>However, this is only a guess, but it is clear that it is time to think about the sites in danger slightly differently. Technology and health were two categories with very low multipliers, though they both have a very high tendency to attract spammers.</p>
<h4>Some Caveats</h4>
<p>It is worth noting that the study, while useful, should not be considered scientifically valid. The sampling size is too small and the traffic statistics, though likely about as good as possible, leaves room for error.</p>
<p>It is very difficult to imagine a more thorough study being performed without the backing of a major university, but any study in this area is likely to face similar challenges and limitations. </p>
<p>The other element is that this study focuses on large publishers and not regular bloggers. Whether this means that bloggers would have a much higher audience multiplier due to their smaller initial audience or a smaller one due to less copying and scraping remains to be seen.</p>
<p>Though, likely, the results on the different categories of content and their relative risk may transfer well to smaller publishers, a separate study would likely be needed for smaller bloggers to see how their audience compares.</p>
<p>Still, the purpose of the study is not to necessarily achieve these goals, but to illustrate the possibility of a much larger audience outside of the original site. This is something many have suspected but, to my knowledge, this is the first study to attempt to discuss the issue.</p>
<h4>Conclusions</h4>
<p>The bottom line is simple, most likely the audience for your content is much larger off your site than it is on it. What you do about that is completely up to you. Whether you attempt to monetize it, turn it into promotion or request removal of it (or a combination of all three) is up to the individual author and the course they want to take.</p>
<p>No matter what though, it is clear that this audience and its potential (both for harm and for good) is too big to ignore and it is important to start tracking and understanding what is going on. Whether it is through a professional service such as Attributor, one targeted at individuals such as <a href="http://www.copyscape.com">Copyscape</a> or even simple Google searches, the time to understand these uses is now.</p>
<p>What the Attributor study illustrates, more than anything, is the need for an even deeper understanding of how this copying takes place, what it means for publishers and what strategies could they use to maximize their benefit from it.</p>
<p>This is an area ripe for exploration moving forward and one that will require a great deal of creativity and work.</p>
<p><em>Disclosure: I work as a consultant for Attributor.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/11/19/attributor-analyzes-trueaudience/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Email a DMCA to Google</title>
		<link>http://www.plagiarismtoday.com/2008/07/22/how-to-email-a-dmca-to-google/</link>
		<comments>http://www.plagiarismtoday.com/2008/07/22/how-to-email-a-dmca-to-google/#comments</comments>
		<pubDate>Tue, 22 Jul 2008 16:01:57 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Blogger]]></category>
		<category><![CDATA[Blogspot]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[DMCA-notice]]></category>
		<category><![CDATA[email]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[pdf]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[Splogging]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1369</guid>
		<description><![CDATA[Google has traditionally been one of the toughest hosts to work with, Its requirement of a scanned signature and limited contact information makes them hard to reach, unless you know where to look.]]></description>
			<content:encoded><![CDATA[<p><IMG SRC="http://www.plagiarismtoday.com/images/google-logo-20080722-110104.png" alt="Google Logo" align="left" class="picleft">Google is one of the most frustrating hosts to work with and it is because of steps six and seven in <a href="http://www.google.com/blogger_dmca.html" title="Google DMCA">their process to send them a notice of copyright infringement</a>.</p>
<p>Step six specifically instructs you to &#8220;sign the paper&#8221; and step seven only provides two means of sending it in. The first is via postal mail, the second is via fax.</p>
<p>The problem with these steps is that the vast majority of hosts accept, and even encourage, DMCA notices via email. Email is by far faster and easier to send for such complaints and, according to the <a href="http://www.ftc.gov/os/2001/06/esign7.htm" title="ESIGN Act">ESIGN Act</a>, should be able to deliver a perfectly acceptable electronic signature.</p>
<p>However, Google&#8217;s limited contact information and requirement of a physical signature seems to turn a digital matter into one that needs pen and paper to resolve. This has caused many to give up on sending notices to Google and just tolerate any misuse that takes place on Google&#8217;s servers, including Blogger.</p>
<p>Fortunately, there is a way around this and a means to successfully email your DMCA notices to Google. All one has to do is be a little bit creative. <span id="more-1369"></span><br />
<h4>Step 1: Scan Your Signature</h4>
<p><IMG SRC="http://www.plagiarismtoday.com/images/sig-20080722-104805.png" alt="SIgnature"align="right" class="picright">The first step is to get your handwritten signature into your computer some way. This is because Google, even when accepting notices via email, still requires a handwritten signature most of the time.</p>
<p>There are several ways you can get your signature into an image file. First, you can simply sign a sheet of paper and scan it. Second, you can use a program such as Paint and draw your signature (best done with a stylus). Finally, you can take a picture of your signature with a good digital cam.</p>
<p>The end goal is to get your signature in clear black lettering on a white background.</p>
<h4>Step 2: Get your DMCA Notice Ready</h4>
<p>Next, obtain a copy of a good DMCA notice. If you wish, you can use the <a href="http://www.plagiarismtoday.com/stock-letters/" title="DMCA Notice">stock letters</a> available on this site.</p>
<p>Once you have your notice, paste it into a word Processor that is capable of both importing images and exporting to PDF. You can use Microsoft Word, OpenOffice or many other applications for this purpose.</p>
<p>Then simply fill in the needed information as if you were just emailing it out and make sure that all of your information is accurate. </p>
<h4>Step 3: Import Your Signature</h4>
<p>Once you have the notice in your word processor, scroll down to where the signature goes and import your handwritten signature as an image file into the document. </p>
<p>Take a moment to make sure that it looks right and is ready for export.</p>
<h4>Step 4: Export File to PDF</h4>
<p><img src="http://www.plagiarismtoday.comwp-content/uploads/2008/07/primopdf-logo-20080722-110449.png" alt="Primo PDF Logo" align="left" class="picleft"/>Export the file to a PDF using the best tools available. Though most word processors can do this export directly, if you need to print the file to a PDF you can use a <a href="http://www.primopdf.com/" title="Primo PDF">free PDF creator</a>. </p>
<p>Most of the details about the quality of the PDF are unimportant, just make sure that the PDF is clear enough to be read and small enough to be emailed.</p>
<h4>Step 5: Email the PDF</h4>
<p>Though Google does not advertise their email address on their site, they did file a designation with the U.S. Copyright Office that <a href="http://www.copyright.gov/onlinesp/agents/google.pdf" title="Google PDF">provides an email for submitting DMCA notices</a>. </p>
<p>You can send the PDF to that email address. Simply include it as an attachment and paste the text of the notice into the email.</p>
<p>There is no word if that email address will work after the designated agent returns to the office.</p>
<h4>Conclusions</h4>
<p>For most Web hosts, sending a DMCA notice is simply a matter of having a good stock letter, dropping it into an email, filling out the necessary information and then sending it on. It is only Google, to my knowledge, that requires this kind of effort. </p>
<p>However, Google receives such a large number of DMCA notices that it is important to be prepared to work with them. Not only do they host Blogger, which remains one of the most common hosts for spam blogs, but they host Google Groups and other services that could host infringing content.</p>
<p>They are also one of the top advertising networks with Adsense and their <a href="http://www.google.com/adsense_dmca.html" title="Adsense DMCA">policy for reporting infringers using Adsense</a> closely mirrors the one for Blogger. This technique should also work for that.</p>
<p>Most importantly though, they are also the number one search engine in the world and the <a href="http://www.google.com/dmca.html" title="Google DMCA">search DMCA policy</a> also closely mirrors the other two, complete with physical signature requirement and fax/snail mail only filing. This technique should work with it as well.</p>
<p>Until new systems come online to help us report spam and copyright infringement to Google, this is the best technique to file a DMCA notice via email. </p>
<p>It is a bit complicated and convoluted, but it is certainly preferable to the delays of postal mail and the hassle of a fax.  </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/07/22/how-to-email-a-dmca-to-google/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>Spotting Spam Blogs</title>
		<link>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/</link>
		<comments>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/#comments</comments>
		<pubDate>Tue, 15 Jul 2008 15:57:02 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Scraping]]></category>
		<category><![CDATA[search spam]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[Spam-Blogs]]></category>
		<category><![CDATA[spinning]]></category>
		<category><![CDATA[Splogs]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1302</guid>
		<description><![CDATA[Spammers are making it harder and harder to separate their creations from those of amateur bloggers. However, by understanding various ways to spot spam blogs and how spammers try to beat those methods, you can better detect junk sites yourself. ]]></description>
			<content:encoded><![CDATA[<p><img class="picleft size-full wp-image-1305" title="splogspot_logo" align="left" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/splogspot_logo.gif" alt="SplogSpot.com Logo" width="258" height="133" /></p>
<p>When people find out that their content is being copied without permission, how they seek to handle it is often determined, in part, by whether or not the site is a spam blog.</p>
<p>Where many might be willing to forgive copying by a novice blogger, especially with the promise of a link back, most are not prepared to have their content used so a spammer can trick the search engines and sell questionable items.</p>
<p>This means that, very often, I am forced to make snap judgments about whether a site is a spam blog or not, something that is becoming increasingly difficult as spammers have improved their techniques.</p>
<p>So how does one tell if a blog is a spam blog? The answer is not as simple as it once was but there are still ways one can detect a spammy site.</p>
<p><span id="more-1302"></span></p>
<h4>The Spammer Dilemma</h4>
<p>Spammers, over the years, have gotten better and better at making their blogs look human-edited. Though they still can not make their sites appear to be &#8220;good&#8221; blogs, they, in many cases, can pass off as the efforts of novice bloggers or of non-native English speakers.</p>
<p>This can create quite a problem when approaching a suspected spam blog. Is it a spammer using the default Blogspot template or is it someone new to blogging that doesn&#8217;t know how to change the template? Is the strange word choice the result of <a title="Spinning Spammers" href="http://www.plagiarismtoday.com/2007/11/08/modified-scraping-on-the-rise/">automated spinning</a> or someone learning English? If the spam blog did its job, it can be difficult to say.</p>
<p>However, most would agree that being heavy-handed with humans who copy, especially those who make some attempt to provide attribution, is counter-productive. Especially when you consider that the person struggling with English may either grow into an important blogger or, worse yet, already be a major figure in their part of the world, it becomes clear why telling humans from machines is important.</p>
<p>But how to do it? There are several different ways, but unfortunately none of them seem to work 100% of the time.So it is important to take all of the methods below into account, look at how spammers beat them, and develop an informed opinion.</p>
<h4>PageRank Check</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/pagerank.png" alt="" title="pagerank" width="118" height="97" align="right" class="picright alignright size-full wp-image-1310" />One of my sneakier tricks was to check the site&#8217;s PageRank and see if Google had given it either a n/a or a 0. Either would indicate that the site was either very new or had been deemed spam by Google. Either way, it certainly warranted suspicion.</p>
<p><strong>How Spammers Beat It:</strong> Tricking Google. This method has become less effective as Google seems to be assigning PageRank to more and more obvious spam blogs. That is a subject for another article.</p>
<p><strong>Turning the Tide:</strong> PageRank is still a decent indicator of spamminess, but it is no longer as reliable as it was. It is best to ignore PageRank if you have other reasons to be suspicious of a blog.</p>
<h4>&#8220;About&#8221; Page</h4>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/aboutpage.png" alt="" title="aboutpage" width="144" height="163" align="left" class="picleft alignleft size-full wp-image-1312" />Since spammers that use WordPress installs typically spend as little time as possible setting up their blogs, they routinely leave the &#8220;About&#8221; page, which is created as part of the install, with its default text. Very few human-generated sites have this problem.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have started either deleting or filling in the about page. However, those that fill in the page often use it as an opportunity to keyword stuff, often further tipping their hand as a spam blog.</p>
<p><strong>Turning the Tide:</strong> If an about page does not have actual information about the site or the owner, it is very likely spam. Some spammers are starting to include fake information, but few seem to be able to resist the opportunity to keyword stuff and link.</p>
<h4>Posting Rate</h4>
<p>The goal of a spam blog is to get as much junk content into it as possible, as such, spammers routinely have extremely high posting frequency, often well over 100 posts per day. It would be physically impossible for a human to post so much content without the aid of a machine, creating a dead giveaway that the site is spam.</p>
<p><strong>How Spammers Beat It:</strong> Some spammers have begun to show restraint, only having their blogs update a few times per day and at irregular intervals, to more closely mimic a human blogger.</p>
<p><strong>Turning the Tide:</strong> The content is more telling than the frequency, unless the posting frequency is outrageous. Consider an extremely high posting volume to be a dead spam giveaway but don&#8217;t write off a site because it has a reasonable rate.</p>
<h4>Formulaic Posting</h4>
<p>We&#8217;ve all seen the spam blogs that start out with something like &#8220;I saw an interesting post today about&#8230;&#8221; and then proceeds to inject a few keywords and quote from the scraped article. By themselves, these posts may appear semi-legitimate, especially with trackbacks, but are clearly spam when you look at them in group.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have started to use multiple post templates in the same blog. However, the limited set means that, if this method is chosen, it is still easily detected over the course of about ten posts.</p>
<p><strong>Turning the Tide:</strong> Check and see if the posts have the same pattern, are roughly the same length or all contain quoted material. These are all signs of a spam blog.</p>
<h4>Ugly Templates</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/uglylayout.png" alt="" title="uglylayout" width="162" height="123" align="right" class="picright alignright size-full wp-image-1316" />Sometimes the first sign a blog is spam is the template that it is in. If the template is the default WordPress theme or a stock BlogSpot theme without modifications, it&#8217;s a likely tip off of spam content.</p>
<p><strong>How Spammers Beat It:</strong> Spammers have been getting better about mixing up their themes. Most spam software applications come with a variety of themes that are rotated and, given the ease with which most blogs can be skinned, spam blogs can be amazingly varied.</p>
<p><strong>Turning the Tide:</strong> Fortunately, spammer themes still don&#8217;t have any elements of hand-crafting. There are very rarely custom images (or contain only very crude ones), the CSS often looks off, the color scheme is often jarring and the elements many times do not fit together correctly. If you see a glaring mistake that would be caught by anyone looking at the site, it is likely spam.</p>
<h4>Domain Names</h4>
<p>Spam blogs are typically restricted to three types of domains, 1) .us, .info and other strange extensions 2) domains stuffed with keywords (and often hyphens) 3) Free blog hosts (primarily Blogspot still).</p>
<p><strong>How Spammers Beat It:</strong> Spammers are participating in the domain aftermarket, snatching up expired domains that have had sites on them previously. This helps them carry both the PageRank of the old site, in some cases, and obtain a more &#8220;honest&#8221; name. Spammers are also spreading to other free blog services, including little-known ones, as well as social networking sites.</p>
<p><strong>Turning the Tide:</strong> If you are unsure about a domain, use <a title="Domain Tools" href="http://www.domaintools.com">Domain Tools</a> to investigate it. Look specifically for false whois information or other irregularities. Still, most spam blogs are hosted on spam domains. Better ones are too expensive for spammers to buy in bulk and are more profitable at auction than as spam tools.</p>
<h4>Ad Excess/Spam Blogroll</h4>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2008/07/longblogroll.png" alt="" title="longblogroll" width="194" height="189" align="left" class="picleft alignleft size-full wp-image-1314" />Many spam blogs earn their money by framing the content in a slew of ads, generally from one of the public advertising networks. If not, then they often times use the blogroll to put out obviously spammy links in hopes of building PageRank and search engine position for those domains.</p>
<p><strong>How Spammers Beat It:</strong> The formula is simple, fewer ads, fewer links, more spam blogs. Spammers have begun to show restraint with both their ads and their outbound links but are creating larger and larger spam farms to compensate. Spammers are also turning to alternate sources of revenue, such as Amazon afiiliate IDs, to better hide their activities. Others will mix &#8220;good&#8221; links with &#8220;spam&#8221; ones in their blogroll to further hide the nature of the site.</p>
<p><strong>Turning the Tide:</strong> One spam link is too many. Hover over the URLs in the Blogroll and check for any that are suspicious or out of place. When checking for ads, look not so much as quantity, but for the appearance that they were simply &#8220;stuck in&#8221;. Spammers don&#8217;t have time to integrate ads with their site usually.</p>
<h4>Conclusions</h4>
<p>When looking through these elements, any one of these would make me suspicious of a site&#8217;s origin, save perhaps if the site were hosted on a free blog host. Two, in turn, would make it a likely spam blog and three or above would make it a virtual lock.</p>
<p>The bottom line is that, while spammers are not making it any easier to spot their handiwork, it can still be detected by a careful eye (or a not-so-careful eye in many cases).</p>
<p>Though the spammer&#8217;s survival depends on staying under the radar and fooling humans and search engines alike, the nature of creating tens of thousands of junk blogs means that sacrifices have to be made and the results will have limitations.</p>
<p>By exploiting those weaknesses, we can continue to detect and stop spam and separate the spammers from those who are just getting started.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/07/15/spotting-spam-blogs/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 07:10:02 -->
