<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todaydigital fingerprints | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/digital-fingerprints/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 17:55:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>5 Fast Google Hacks for Finding Plagiarism</title>
		<link>http://www.plagiarismtoday.com/2011/01/31/5-fast-google-hacks-for-finding-plagiarism/</link>
		<comments>http://www.plagiarismtoday.com/2011/01/31/5-fast-google-hacks-for-finding-plagiarism/#comments</comments>
		<pubDate>Mon, 31 Jan 2011 18:49:15 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[digital fingerprints]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[hack]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[translation]]></category>
		<category><![CDATA[tweak]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=8843</guid>
		<description><![CDATA[Here are five fast, easy and fun ways to hack Google to make it an even better plagiarism detector.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2010/08/gis-logo.jpg" alt="Google Image Search Logo" title="Google Image Search Logo" width="294" height="122" class="alignleft size-full wp-image-7475" />If you are looking for your content online or trying to find plagiarism in content you&#8217;ve been handed, Google can be both your best friend and your worst enemy. Though it&#8217;s frustrating when a spammer or plagiarist outranks you in Google for your own work and Google&#8217;s removal policies have, at least in the past, <a href="http://www.plagiarismtoday.com/2010/08/03/popup-pirates-who-profits-from-piracy/">left a lot to be desired</a>, it&#8217;s also one of the best tools around for discovering copied content online.</p>
<p>However, Google has a lot of &#8220;under the hood&#8221; tweaks and features that make such searches a lot easier but it seems that a lot of people don&#8217;t know about them. </p>
<p>With that in mind, here are five simple and fast Google hacks to help you find your content on the Web, understand how it is being used and, when appropriate, put a stop to its abuse.<span id="more-8843"></span></p>
<h4>1. Set Result Date Range</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/01/date-range-sample.png" alt="" title="date-range-sample" width="136" height="248" class="alignright size-full wp-image-8846" /></p>
<p>If you regularly search for your content, such as you have static marketing copy that you routinely look for, you&#8217;ll probably find that, after a while, the search results become less and less valuable as they are cluttered with either older, lawful uses of your work, false positives or older cases you didn&#8217;t/can&#8217;t deal with.</p>
<p>However, Google has a feature that lets you see only the most recent results. Simply by clicking &#8220;Show Search Tools&#8221; on the left-hand side column, you will be able to choose from a variety of date ranges including the past day, week, month, year or enter a custom range.</p>
<p>This makes it easy to only see results from the last time you&#8217;ve searched. As an alternative, if you stay logged in to the same Google account, you can also use the &#8220;Not Yet Visited&#8221; link to see only results you haven&#8217;t been to, making it even easier.</p>
<h4>2. Digital Fingerprints</h4>
<p>Tracking dynamic content, such as blog content, is difficult because it is constantly changing and the shelf life of most of it is so short that searching for all of it is impractical, if not impossible. However, using a digital fingerprint, <a href="http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/">a trick I&#8217;ve talked about before</a>, makes it easy to locate at versions that were scraped from your RSS feed. </p>
<p>Basically, you just edit your RSS feed to include a semi-random string of letters and numbers, something that is unique to you and your site, and routinely search for it. Wherever it appears, you know someone, most likely, scraped your RSS feed and you&#8217;ll want to follow up.</p>
<p>You can even pair this up with <a href="http://google.com/alerts">Google Alerts</a> and be notified via email when your fingerprint appears online.</p>
<h4>3. Google Similar Image Search</h4>
<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/01/similar-sample-273x300.png" alt="Google Similar Image Search" title="Google Similar Image Search" width="273" height="300" class="alignleft size-medium wp-image-8847" /></p>
<p>As great as Google is for finding text it is sometimes rather lackluster for finding images. The reason is the text contents of a webpage are easily machine-readable where the contents of an image are not, thus why CAPTCHAS work to keep bots out.</p>
<p>However, Google does offer one neat trick for detecting image plagiarism and copyright infringement. If you visit Google Image Search and hover over an image, you can select the &#8220;Similar&#8221; link and Google will take you to results that look, to Google at least, to be the same.</p>
<p>It works best with images that are broadly used on the Web but even those with a limited amount of reuse may still find it to be a practical alternative to dedicated image search engines, such as <a href="http://tineye.com">Tineye</a>. </p>
<h4>4. Detect Translated Plagiarism (kind of)</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/01/translated-pages.png" alt="Google Translated Search Image" title="Google Translated Search Image" width="145" height="241" class="alignright size-full wp-image-8848" /></p>
<p>Google has a new, somewhat experimental, new technique for detecting content that has been translated from one language to another. You can use it by performing a traditional search for a statistically improbable phrase (using quotes) and then, as with the date range search, click &#8220;Show Search Tools&#8221;. Then, at the very bottom of the list, click &#8220;Translated Foreign Pages&#8221;</p>
<p>To be honest, the translation is a bit crude and, since it is automated, will not likely catch many plagiarisms, especially if you choose phrases with difficult-to-translate words. However, the feature does check multiple languages at once, including letting you choose which languages to add, and shows a great deal of promise for the future. </p>
<p>Right now, however, it isn&#8217;t much more than a toy that might get lucky once in a while.</p>
<h4>5. Wildcard Game</h4>
<p>If you are having a difficult time finding a good, unique phrase in the content you are searching for, you may be able to find a workable solution by using Google&#8217;s wildcard function. </p>
<p>Basically, if you use an &#8220;*&#8221; in the middle of a phrase, Google will treat it as a wildcard and plug in any word. So, for example, if you have a phrase that is relatively unique but sometimes people like to substitute a word or two, you can easily wildcard those words out and get all the results.</p>
<p>This is especially useful when searching for academic plagiarism. All one has to do is find a suspect phrase, put quotes around it and asterisk out the word or words that seem out of place and then let Google do the searching.</p>
<p>Remember, however, the wildcard only works for whole words, not for partial words.</p>
<h4>Bottom Line</h4>
<p>Hopefully these quick Google hacks can help you be a better Google Ninja when looking for your content or checking other work for plagiarism. They by no means make Google a one-stop shop for plagiarism fighting, but they do make it a bit more powerful and help give you a edge when you&#8217;re looking for copied content.</p>
<p>In the end, plagiarism detection is still very much a game for humans and Google is just a tool to help with it. It will always take the judgement of human eyes and human minds to make the determination of what is right and wrong, both legally and ethically.</p>
<p>Still it is nice to get a little help along the way.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2011/01/31/5-fast-google-hacks-for-finding-plagiarism/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Image Detection: Watermarking vs. Fingerprinting</title>
		<link>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/</link>
		<comments>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 21:57:57 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[digimarc]]></category>
		<category><![CDATA[digital fingerprints]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[picscout]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[watermark]]></category>
		<category><![CDATA[watermarking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=5027</guid>
		<description><![CDATA[There's a boom taking place in the field of image detection. If you're an artist, now is as great time to learn about the technology so you can take advantage of falling prices and new technology.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/digimarc-logo.png" alt="digimarc-logo" title="digimarc-logo" width="269" height="70" class="alignleft size-full wp-image-5030" /></p>
<p>The image detection field is changing rapidly. <a href="http://www.plagiarismtoday.com/2008/12/30/2008-the-year-in-content-theft/">For years I&#8217;ve been predicting the growth in image searching</a> and it appears that is finally coming to be. Not only has <a href="http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/">PicScout has begun public testing on its newest offerings</a> and other announcements are in the works.</p>
<p>As such, over the next few weeks and months, it seems almost certain that I&#8217;ll be posting articles about new technologies and advancements in this field as it continues to heat up and prices begin to drop. This means that many photographers, including smaller ones, will be confronted with the question about how to best protect their images and track their usage.</p>
<p>Three years ago, this was almost unthinkable. Limited options and high prices made such tracking attainable only to large companies but now we&#8217;re preparing to enter a very different age for visual artists when it comes to following their work on the Web.</p>
<p>With that in mind, I wanted to take a moment to <a href="http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/">briefly revisit the two schools when it comes to image detection</a>, watermarking and fingerprinting, discuss their pluses and look at which solution may be right for you. </p>
<p>Hopefully, by the end of this, you&#8217;ll be better prepared to decide on your own image protection scheme and know which technology to watch for.<span id="more-5027"></span></p>
<h4>The Two Schools</h4>
<p>The challenge in finding copies of images on the Web has always been, to put it bluntly, that they aren&#8217;t text. Search engines, including image search engines, look for text as it is what is easiest to search for. Computers don&#8217;t understand what is inside an image (the reason CAPTCHAs are reasonably effective) and they have a hard time comparing one image to another. Scale that problem to the billions of images on the Web, and this issue becomes clear.</p>
<p>Solving the problem hasn&#8217;t been easy nor has it been perfect. However, there have been two different approaches that have been tried, both with great success but different ideal uses.</p>
<ol>
<li><strong>Watermarking:</strong> Though most think of image watermarking as the placing of a visual mark over an image, usually a logo or name, Watermarking in this sense involves placing either an invisible or nearly-invisible watermark over an image that can only be detected by a machine. This method is used by <a href="https://www.digimarc.com/">Digimarc</a> and <a href="http://c-registry.us">C-Registry.us</a> (Note: The latter uses both for various purposes).</li>
<li><strong>Fingerprinting:</strong> Fingerprinting is an automated process that takes an existing image file, hashes it and converts it to a unique fingerprint. That string is, in turn, compared to other fingerprints for potential matches. This method is favored by <a href="http://www.picscout.com">PicScout</a> and <a href="http://tineye.com">Tineye</a>.</li>
</ol>
<p>As you can see, the two systems are fundamentally different in terms of how they work and what they can do, these differences have an impact both in terms of how they are used and the situations they are best suited for.</p>
<h4>Watermarks</h4>
<p>The advantages of using a digital watermark are many-fold. First, they provide much more reliable detection. Where a very similar image by a different artist can trigger a fingerprint match, only copies of your actual image will trigger a watermark match. This also greatly helps with providing evidence for a potential court case since it can be shown exactly which image was copied from. </p>
<p>Watermarks can, typically, also survive great modification by users. Often images with watermarks can be detected even after cropping, rotation and distortion that would have caused a fingerprinting system to miss it (though fingerprints are getting much smarter about this as well). Some need only a few pixels to remain intact.</p>
<p>Also, additional information about the creator can be hidden in the watermark or stored in a database. This can help greatly with the worries surrounding the <a href="http://www.plagiarismtoday.com/tag/orphan-works/">orphan works bills</a>, which may allow certain uses of works if the author can not be reasonably identified.</p>
<p>In short, it works like image metadata in that it is invisible and is buried within the image, but it is much harder to remove and travels with the image even after it is modified and resaved.</p>
<p>However, watermarks can&#8217;t do anything to protect an image that&#8217;s already &#8220;in the wild&#8221;. Though you can re-release an image with a watermark, any copies of the non-watermarked image will not be detected. Also, watermarking systems tend to be expensive to use and, since one company usually controls the watermarks and the system for detecting them, they&#8217;re vulnerable to being rendered useless if the company closes.</p>
<h4>Fingerprinting</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/tineye-logo-1-300x72.png" alt="tineye-logo-1" title="tineye-logo-1" width="300" height="72" class="alignright size-medium wp-image-5033" /></p>
<p>The biggest asset of fingerprinting is that it works on any image, including one already on the Web. There&#8217;s no need to watermark the image or do any preparation work at all before publishing it, the technique works from the image itself.</p>
<p>Fingerprinting has become more and more accurate over the years, now competing with watermarks in that area, and is largely cheaper, including a free, effective offering from Tineye. </p>
<p>Also, since there are many fingerprinting systems in place and all work roughly the same way, its likely that you&#8217;ll be able to continue to match your images this way for many, many years to come. One company closing does not destroy your copyright protection system.</p>
<p>That being said, Fingerprinting does not, by itself, address the orphan works issue nor provide any proof of ownership, which is why PicScout recently released the Image IRC, nor is it always 100% reliable. Similar images, such as photos of similar subjects, have been known to trip fingerprinting matches, though the problem is nowhere near as severe was it was.</p>
<h4>So Which to Use?</h4>
<p>When deciding which to use, the easiest answer is to just say &#8220;both&#8221;. Since you can fingerprint a watermarked image, there&#8217;s nothing, save perhaps cost, that will prevent you from using both systems. Many already view watermarks as more of a copyright verification tool than an image detection one, making the two a natural combination. </p>
<p>However, duplicating efforts doesn&#8217;t make much sense. Due to refinements, both systems can do an extremely good job at finding matches, what is more important than the method is the individual system and how well it works. If it can find good matches and work as advertised, it&#8217;s probably a keeper, regardless of the system.</p>
<p>That being said though, you can&#8217;t even consider using a watermarking system if your images area already on the Web unprotected. It is only used, ideally, for new images as much of the usefulness is lost with unprotected copies being passed around. However, the added protections provided by watermarks make them worthwhile in many cases, especially when there is tight control over distribution and the images haven&#8217;t been leaked.</p>
<p>In short, if you have images that you are waiting to put on the Web or routinely put up new ones (and can justify the expense both in money and time), watermarking is likely a good solution due to its added benefits. For those who have their works online already or don&#8217;t wish to invest the time/money required, fingerprint matching, via Tineye, is free.</p>
<p>However, even that distinction is being blurred by PicScout&#8217;s Image IRC, which is a database of fingerprinted images, which provides something of a &#8220;best of both worlds&#8221; solution. The problem though is that it, right now, is targeted solely at large stock agencies. We&#8217;ll have to see later how approachable it is by smaller artists once it is finally released.</p>
<h4>Bottom Line</h4>
<p>In the end, any robust image protection system will likely use at least some of both. If your images are valuable, trusting any one solution by itself to handle detection and orphan works prevention is risky. Multiple tools are, most likely, the way to go when practical.</p>
<p>But the most important thing, as mentioned above, is not that the system works, not the technology behind it. If it works well, it doesn&#8217;t matter if it uses fingerprints or watermarks. That is where the real challenge lies for these companies. </p>
<p>So, with that in mind, I&#8217;m looking forward to, over the next few months, talking more about developments in this field and how they might impact visual artists of all kinds and all backgrounds.<img style="border: 0pt none; width: 0pt; height: 0pt; display: none;" src="http://tokentracker.com/token.gif?id=3aSc44adb" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Google Alerts to add RSS</title>
		<link>http://www.plagiarismtoday.com/2008/10/10/google-alerts-to-add-rss/</link>
		<comments>http://www.plagiarismtoday.com/2008/10/10/google-alerts-to-add-rss/#comments</comments>
		<pubDate>Fri, 10 Oct 2008 15:21:40 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[content detection]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[digital fingerprints]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Alerts]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search-Engines]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1935</guid>
		<description><![CDATA[A recent article in the Wall Street Journal has given reason for many Google Alerts users to rejoice, the famous email alert service will soon be getting RSS support. ]]></description>
			<content:encoded><![CDATA[<p><IMG SRC="http://www.plagiarismtoday.com/images/google-alerts-20081010-100845.png" alt="Google Alerts Logo" align="left" class="picleft">A recent article in the Wall Street Journal by Walter Mossberg about <a href="http://online.wsj.com/article/SB122281243658792073.html">how to use alerts to keep track of the Web</a> dropped something of a bombshell for those of us who use <a href="http://www.google.com/alerts">Google Alerts</a> every day. According to Mossberg, Google Alerts will begin adding RSS alerts in addition to email ones &#8220;in about a month&#8221;.</p>
<p>Google Alerts, which is a service that sends out notices when content carrying the alert search term appears on the Web, currently only sends out its alerts via email. It is commonly used for vanity searches, for keeping on top of who mentions a person or site, and for keeping track of content, either through searches for <a href="http://www.plagiarismtoday.com/2005/11/07/tips-for-using-google-alerts/">statistically improbable phrases</a> or <a href="http://www.plagiarismtoday.com/2006/10/04/digital-fingerprints-to-detect-rss-scraping/">digital fingerprints</a>. </p>
<p>What this means to you will probably depend on how heavily you use RSS and how much use you make of Google Alerts. If you are not currently using Google Alerts and want to get started, I&#8217;ve <a href="http://www.plagiarismtoday.com/2008/01/24/video-how-to-use-google-alerts/">created a screencast to help you understand the basics</a>.</p>
<p>Obviously, I&#8217;ll have more to say on this once the new feature is made public. </p>
<p>However, at this time, I don&#8217;t see myself making heavy use of the RSS feature. I literally have years of experience meshing Google Alerts with email filters and creating a workflow around it. Though such a system could be moved to RSS easily, I don&#8217;t see how much is gained in my case.</p>
<p>Clearly though, this feature is not for people like myself and other current heavy users of Google Alerts, instead, it is for those who don&#8217;t because they can&#8217;t get the alerts in the format they want. This will change that and let them receive their alerts in a variety of places including their RSS reader, their Google home page and through a variety of mashup services.</p>
<p>Needless to say, this opens up a lot of new doors for Google Alerts but, personally, I&#8217;m just happy to hear that the service is still receiving some attention. After being so long without a significant upgrade, it is nice to see that Google is still working on their Google Alerts product. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/10/10/google-alerts-to-add-rss/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 18:10:22 -->
