<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Digital Fingerprints to Detect RSS Scraping</title> <atom:link href="http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/feed/" rel="self" type="application/rss+xml" /><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/</link> <description>Content Theft, Plagiarism, Copyright Infringement</description> <lastBuildDate>Thu, 18 Mar 2010 15:21:04 +0000</lastBuildDate> <generator>http://wordpress.org/?v=abc</generator> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <item><title>By: My Favorite Plagiarism-Fighting Tools &#124; PlagiarismToday</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-125903</link> <dc:creator>My Favorite Plagiarism-Fighting Tools &#124; PlagiarismToday</dc:creator> <pubDate>Wed, 08 Jul 2009 19:32:13 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-125903</guid> <description>[...] Runner Up: Google Alerts + Digital Fingerprints [...]</description> <content:encoded><![CDATA[<p>[...] Runner Up: Google Alerts + Digital Fingerprints [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; Legal and Ethical Link Blogging</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-64042</link> <dc:creator>PlagiarismToday &#187; Legal and Ethical Link Blogging</dc:creator> <pubDate>Wed, 12 Sep 2007 22:05:26 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-64042</guid> <description>[...] Digital Fingerprints: We&#8217;ve discussed many times using digital fingerprints to track RSS usage. This is another reason to do that. If you are comfortable with the reuse of your feed, this will [...]</description> <content:encoded><![CDATA[<p>[...] Digital Fingerprints: We&#8217;ve discussed many times using digital fingerprints to track RSS usage. This is another reason to do that. If you are comfortable with the reuse of your feed, this will [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; Transcraping: Multi-Lingual Content Theft</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-63414</link> <dc:creator>PlagiarismToday &#187; Transcraping: Multi-Lingual Content Theft</dc:creator> <pubDate>Wed, 29 Aug 2007 20:00:20 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-63414</guid> <description>[...] is a name or a site address, both of which will go unmodfied. But another option includes using digital fingerprints, which should be nonsense to a translation program, and will also remain [...]</description> <content:encoded><![CDATA[<p>[...] is a name or a site address, both of which will go unmodfied. But another option includes using digital fingerprints, which should be nonsense to a translation program, and will also remain [...]</p> ]]></content:encoded> </item> <item><title>By: I&#8217;m a FART! - MacBros&#8217; Place - Everybody&#8217;s Entitled To My Opinion</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-47394</link> <dc:creator>I&#8217;m a FART! - MacBros&#8217; Place - Everybody&#8217;s Entitled To My Opinion</dc:creator> <pubDate>Mon, 12 Mar 2007 22:38:26 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-47394</guid> <description>[...] here here and here it is! Well actually here they are. But one of my favorite pages is this one by Lorelle, [...]</description> <content:encoded><![CDATA[<p>[...] here here and here it is! Well actually here they are. But one of my favorite pages is this one by Lorelle, [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; Laziest Spam Blog Ever</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-47335</link> <dc:creator>PlagiarismToday &#187; Laziest Spam Blog Ever</dc:creator> <pubDate>Mon, 12 Mar 2007 15:01:33 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-47335</guid> <description>[...] Alerts remains a useful tool in combating plagiarism, especially when used in conjunction with the Digital Fingerprint Plugin and other statistically improbable [...]</description> <content:encoded><![CDATA[<p>[...] Alerts remains a useful tool in combating plagiarism, especially when used in conjunction with the Digital Fingerprint Plugin and other statistically improbable [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; The Six Worst Ways to Protect Content</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-33216</link> <dc:creator>PlagiarismToday &#187; The Six Worst Ways to Protect Content</dc:creator> <pubDate>Fri, 05 Jan 2007 11:41:35 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-33216</guid> <description>[...] If a full feed meets your site&#8217;s goals, whatever they may be, truncating it just to prevent content theft is shooting yourself in the foot. Most will be better off tracking use of their feed and then following through on infringement that way. [...]</description> <content:encoded><![CDATA[<p>[...] If a full feed meets your site&#8217;s goals, whatever they may be, truncating it just to prevent content theft is shooting yourself in the foot. Most will be better off tracking use of their feed and then following through on infringement that way. [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; Google Alerts Adds Blog Search</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-22314</link> <dc:creator>PlagiarismToday &#187; Google Alerts Adds Blog Search</dc:creator> <pubDate>Fri, 27 Oct 2006 18:39:14 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-22314</guid> <description>[...] However, with the addition of the blog search, it can be used easily in conjunction with Maxtor&#8217;s Digital Fingerprint Plugin (original article) to easily create an alert to let you know any time your work is automatically scraped and reposted on another site. [...]</description> <content:encoded><![CDATA[<p>[...] However, with the addition of the blog search, it can be used easily in conjunction with Maxtor&#8217;s Digital Fingerprint Plugin (original article) to easily create an alert to let you know any time your work is automatically scraped and reposted on another site. [...]</p> ]]></content:encoded> </item> <item><title>By: PlagiarismToday &#187; Five Essential Wordpress Content Protection Plugins</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-18561</link> <dc:creator>PlagiarismToday &#187; Five Essential Wordpress Content Protection Plugins</dc:creator> <pubDate>Mon, 09 Oct 2006 16:13:05 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-18561</guid> <description>[...] Though still in beta, Maxpower&#8217;s plugin has already proved to be an invaluable tool in detecting content theft. It works by putting a unique phrase or string of characters into each entry of the feed and then performing searches for that term. Any hits are potential scrapers.   Though the plugin has some minor weaknesses, it has come a very long way since beta one and is a much-needed layer of protection for anyone running Wordpress. It is completely Feedburner compatible and can be used in conjunction with other RSS feed plugins. It&#8217;s also one of the most convenient plugins available, operating everything from within the Wordpress administration area [...]</description> <content:encoded><![CDATA[<p>[...] Though still in beta, Maxpower&#8217;s plugin has already proved to be an invaluable tool in detecting content theft. It works by putting a unique phrase or string of characters into each entry of the feed and then performing searches for that term. Any hits are potential scrapers.   Though the plugin has some minor weaknesses, it has come a very long way since beta one and is a much-needed layer of protection for anyone running Wordpress. It is completely Feedburner compatible and can be used in conjunction with other RSS feed plugins. It&#8217;s also one of the most convenient plugins available, operating everything from within the Wordpress administration area [...]</p> ]]></content:encoded> </item> <item><title>By: Fresh links &#171; Stop bitacle.org</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-18026</link> <dc:creator>Fresh links &#171; Stop bitacle.org</dc:creator> <pubDate>Fri, 06 Oct 2006 12:31:51 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-18026</guid> <description>[...] Blogs Are Stupid: Thieving Bastards MotherPie: Bitacle Steals Content&#8230; Miguel Angel Mata: ¡Paremos a Bitacle! //engtech: Bitacle emphasizes the problem with RSS PlagiarismToday: Digital Fingerprints to Detect RSS Scraping PlagiarismToday: Is Adsense Broken? E-Mercadeo.com: Stop Bitacle LiveJournal: Anyone know a good intellectual property lawyer? DiarioIP: Bitacle y los parásitos Kevin Burton&#8217;s Feed Blog: Crush Bitacle Blog Music save my life: Bitacle (10月3日) ALT1040: Stop Bitacle spam+blogs=trouble: what to do about bitacle Movie Addiction: Stop bitacle.org [...]</description> <content:encoded><![CDATA[<p>[...] Blogs Are Stupid: Thieving Bastards MotherPie: Bitacle Steals Content&#8230; Miguel Angel Mata: ¡Paremos a Bitacle! //engtech: Bitacle emphasizes the problem with RSS PlagiarismToday: Digital Fingerprints to Detect RSS Scraping PlagiarismToday: Is Adsense Broken? E-Mercadeo.com: Stop Bitacle LiveJournal: Anyone know a good intellectual property lawyer? DiarioIP: Bitacle y los parásitos Kevin Burton&#8217;s Feed Blog: Crush Bitacle Blog Music save my life: Bitacle (10月3日) ALT1040: Stop Bitacle spam+blogs=trouble: what to do about bitacle Movie Addiction: Stop bitacle.org [...]</p> ]]></content:encoded> </item> <item><title>By: JB</title><link>http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/comment-page-1/#comment-17856</link> <dc:creator>JB</dc:creator> <pubDate>Thu, 05 Oct 2006 14:59:42 +0000</pubDate> <guid
isPermaLink="false">http://www.plagiarismtoday.com/?p=347#comment-17856</guid> <description>MaxPower,I&#039;m glad that you liked the article, after re-reading it I realized that my tone was more harsh than intended. I really do love the plugin and, though it has some weaknesses, I think it&#039;s VERY valuable, especially considering it was just at Beta 1 when I wrote about it.&lt;i&gt;While this plugin is only as strong as search engines’ ability to index splogs, sites that don’t get indexed probably aren’t that important in the big picture. Can it be easy to make money splogging if you aren’t in the major indexes? I think not, but I could be wrong.&lt;/i&gt;Sadly, yes. Other alternatives do exist including comment/trackback spam and spam pinging. Granted, you are correct that a splog would be MUCH less effective without a good search engine presence, I still don&#039;t want my content appearing on links pointed to by comment spammers.You are right that those without a search engine presence are much less important, I&#039;m just not quite ready to completely ignore the ones without. Besides, the best solution is to detect theft before it hits the search engines, especially Google, to avoid potential penalties.&lt;i&gt;As to adding a tracking image, the main splogs I am familiar with remove links and images (A HREF and IMG) leaving only the text. Wouldn’t this render tracking images useless?&lt;/i&gt;Very true. I guess I should have thought that one through better. The ideal solution would probably be tracking the feed page itself. Since most sploggers host their apps on their server, it could easily be distinguished from normal uses.&lt;i&gt;Its not clear to me if FeedBurner does anything at all other than report ‘uncommon usage’. I can’t block IPs using FeedBurner, and I lose a substantial amount of control over the feed&lt;/i&gt;I talked with FB about this shortly after my article on cloaking to stop scraping. I was told that they were going to look into something similar for them. No word yet. It takes a while to develop these kinds of features.As far as the splog goes, email FeedBurner tech support about it. Let them know and they&#039;ll track it down. If you can do that, you&#039;ll improve the metrics for everyone. I, personally, have not encountered this problem (I&#039;ve had a few scrape my summaries via Technorati feeds) so I can&#039;t comment.I don&#039;t think the solution is to quit FeedBurner, but rather, to use it in tandem with your plugin. The more layers, the better.&lt;i&gt;Awareness is always the first step.&lt;/i&gt;Well put! I agree there one hundred percent.Thank you for the wonderful plugin and the update!</description> <content:encoded><![CDATA[<p>MaxPower,</p><p>I&#8217;m glad that you liked the article, after re-reading it I realized that my tone was more harsh than intended. I really do love the plugin and, though it has some weaknesses, I think it&#8217;s VERY valuable, especially considering it was just at Beta 1 when I wrote about it.</p><p><i>While this plugin is only as strong as search engines’ ability to index splogs, sites that don’t get indexed probably aren’t that important in the big picture. Can it be easy to make money splogging if you aren’t in the major indexes? I think not, but I could be wrong.</i></p><p>Sadly, yes. Other alternatives do exist including comment/trackback spam and spam pinging. Granted, you are correct that a splog would be MUCH less effective without a good search engine presence, I still don&#8217;t want my content appearing on links pointed to by comment spammers.</p><p>You are right that those without a search engine presence are much less important, I&#8217;m just not quite ready to completely ignore the ones without. Besides, the best solution is to detect theft before it hits the search engines, especially Google, to avoid potential penalties.</p><p><i>As to adding a tracking image, the main splogs I am familiar with remove links and images (A HREF and IMG) leaving only the text. Wouldn’t this render tracking images useless?</i></p><p>Very true. I guess I should have thought that one through better. The ideal solution would probably be tracking the feed page itself. Since most sploggers host their apps on their server, it could easily be distinguished from normal uses.</p><p><i>Its not clear to me if FeedBurner does anything at all other than report ‘uncommon usage’. I can’t block IPs using FeedBurner, and I lose a substantial amount of control over the feed</i></p><p>I talked with FB about this shortly after my article on cloaking to stop scraping. I was told that they were going to look into something similar for them. No word yet. It takes a while to develop these kinds of features.</p><p>As far as the splog goes, email FeedBurner tech support about it. Let them know and they&#8217;ll track it down. If you can do that, you&#8217;ll improve the metrics for everyone. I, personally, have not encountered this problem (I&#8217;ve had a few scrape my summaries via Technorati feeds) so I can&#8217;t comment.</p><p>I don&#8217;t think the solution is to quit FeedBurner, but rather, to use it in tandem with your plugin. The more layers, the better.</p><p><i>Awareness is always the first step.</i></p><p>Well put! I agree there one hundred percent.</p><p>Thank you for the wonderful plugin and the update!</p> ]]></content:encoded> </item> </channel> </rss>

<!-- W3 Total Cache: Minify debug info:
Engine:             disk
Group:              single
JavaScript info:
   Location     |    Last modified    |         Size | Path
  include-nb    | 2010-03-18 19:49:42 |          422 | http://files.plagiarismtoday.com/scripts/dropdowns.js (/home/plagiari/public_html/wp-content/w3tc/min/minify_eee85f9a0229376e1be9e983e2ebaced.js)
-->

<!-- W3 Total Cache: CDN debug info:
Engine:             cf
-->

<!-- W3 Total Cache: Db cache debug info:
Engine:             disk
Total queries:      19
Cached queries:     2
Total query time:   0.018
SQL info:
    # | Time (s) |    Caching (Reject reason)     |   Status   | Query
    1 |    0.001 |  disabled (query is rejected)  | Not cached | SELECT option_name, option_value FROM wp_options WHERE autoload = 'yes'
    2 |        0 |  disabled (query is rejected)  | Not cached | SELECT option_value FROM wp_options WHERE option_name = 'aiosp_post_title_format' LIMIT 1
    3 |    0.001 |  disabled (query is rejected)  | Not cached | SHOW TABLES LIKE 'wp_feedfooter_rss_map'
    4 |    0.002 |            enabled             |   Cached   | SELECT comment_date_gmt FROM wp_comments WHERE comment_approved = '1' ORDER BY comment_date_gmt DESC LIMIT 1
    5 |    0.001 |            enabled             | Not cached | SELECT   wp_posts.* FROM wp_posts  WHERE 1=1  AND YEAR(wp_posts.post_date)='2006' AND MONTH(wp_posts.post_date)='10' AND DAYOFMONTH(wp_posts.post_date)='4' AND wp_posts.post_name = 'digital-fingerprints-to-detect-rss-scraping' AND wp_posts.post_type = 'post'  ORDER BY wp_posts.post_date DESC
    6 |    0.001 |            enabled             | Not cached | SELECT wp_comments.* FROM wp_comments  WHERE comment_post_ID = '347' AND comment_approved = '1'  ORDER BY comment_date_gmt DESC LIMIT 10
    7 |    0.002 |            enabled             | Not cached | SELECT t.*, tt.*, tr.object_id FROM wp_terms AS t INNER JOIN wp_term_taxonomy AS tt ON tt.term_id = t.term_id INNER JOIN wp_term_relationships AS tr ON tr.term_taxonomy_id = tt.term_taxonomy_id WHERE tt.taxonomy IN ('category', 'post_tag') AND tr.object_id IN (347) ORDER BY t.name ASC
    8 |        0 |            enabled             | Not cached | SELECT post_id, meta_key, meta_value FROM wp_postmeta WHERE post_id IN (347)
    9 |        0 |            enabled             |   Cached   | SELECT post_id, meta_value FROM wp_postmeta WHERE meta_key='_pprredirect_url' and post_id in (SELECT post_id FROM wp_postmeta WHERE meta_key='_pprredirect_active')
   10 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2009-07-08 19:32:13'
   11 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2007-09-12 22:05:26'
   12 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2007-08-29 20:00:20'
   13 |    0.002 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2007-03-12 22:38:26'
   14 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2007-03-12 15:01:33'
   15 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2007-01-05 11:41:35'
   16 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2006-10-27 18:39:14'
   17 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2006-10-09 16:13:05'
   18 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2006-10-06 12:31:51'
   19 |    0.001 |            enabled             | Not cached | SELECT COUNT(comment_ID) FROM wp_comments WHERE comment_post_ID = 347 AND comment_parent = 0 AND comment_approved = '1' AND comment_date_gmt < '2006-10-05 14:59:42'
-->

<!-- W3 Total Cache: Page cache debug info:
Engine:             disk
Key:                w3tc_38f83a8a91137b00579cf1daa7aafb07_page_4526ac859c3be11f6c74ee5308db7f62
Caching:            disabled
Reject reason:      user agent is rejected
Status:             not cached
Creation Time:      0.543s
Header info:
X-Powered-By:       W3 Total Cache/0.8.5.2
X-Pingback:         http://www.plagiarismtoday.com/xmlrpc.php
Last-Modified:      Thu, 18 Mar 2010 15:21:04 GMT
ETag:               "c2f9cd86c24cda3b43d77e39b7d48669"
Content-Type:       text/xml; charset=UTF-8
-->