<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todayfingerprinting | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/fingerprinting/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 17:55:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>5 Copyright Wastes of Time</title>
		<link>http://www.plagiarismtoday.com/2011/10/06/5-copyright-wastes-of-time/</link>
		<comments>http://www.plagiarismtoday.com/2011/10/06/5-copyright-wastes-of-time/#comments</comments>
		<pubDate>Thu, 06 Oct 2011 19:00:00 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[Copyright-Office]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[registration]]></category>
		<category><![CDATA[USCO]]></category>
		<category><![CDATA[watermarking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=11258</guid>
		<description><![CDATA[Though no two artists are in the same position on matters of copyright, here are five things that are a definite waste of time.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://www.plagiarismtoday.com/wp-content/uploads/2011/10/clock-sample-image-300x231.jpg" alt="Image of Clock" title="Clock Image" width="300" height="231" class="alignleft size-medium wp-image-11370" />When it comes to matters of copyright, everyone is looking for answers. With such a confusing climate online, everyone is trying to find how to best maximize the benefit they get from their work, trying new licensing, enforcement and sharing techniques.</p>
<p>However, no two artists are in the same position. A strategy that works for Metallica is not going to work for an upstart blogger. So, nearly every tool out there, no matter how insane it might seem, probably does at least some good for someone.</p>
<p>But then there are the copyright ideas that just don&#8217;t do anything or, in extreme cases, actually do much more harm than good.</p>
<p>These are ideas that need to be shot down and put away so better ideas can take their place. However, even though some of these have been around (and debunked) for decades, they continue to linger.</p>
<p>So what are those ideas? There are many out there but here are the ones I&#8217;m seeing the most and would like to see finally done away with. </p>
<h4>1. Badges and Buttons Everywhere</h4>
<p>It seems that nearly every copyright-related service out there is offering badges, banners and buttons for others to use. Though many of these companies do great work and provide a valuable service, the badges and buttons are a complete, or at least near-complete, waste of time.</p>
<p>The problem is simple: Few, if any, would-be plagiarists are going to be deterred by such buttons (if they even notice them) and with so much of the copying taking place via automated means, especially RSS scraping, your infringer likely never even saw the site.</p>
<p>These buttons are just a means to generate links back to the companies. Nothing more.</p>
<p><strong>Better Approach:</strong> Instead of using badges and buttons, <a href="http://www.plagiarismtoday.com/2009/09/09/5-free-copyright-steps-every-blogger-should-take-today/">make sure your copyright notice is in check</a> and that it has all of the required elements.</p>
<h4>2. No Right-Click/No Select Text Scripts</h4>
<p>There are hundreds of scripts that block right clicks and text selection on a site and they, for the most part, do their job. However, the issue with these scripts isn&#8217;t that they don&#8217;t stop at least some infringement, but that they do much more harm than good. </p>
<p><a href="http://www.plagiarismtoday.com/2009/03/27/tracer-3-weeks-later/">My own experiments with Tynt</a> have shown that the vast majority of copying is for use that&#8217;s clearly non-infringing. Far more people copy content to share legitimately than to infringe.</p>
<p><a href="http://www.plagiarismtoday.com/2010/01/26/why-the-no-right-click-script-must-die/">Couple that with how easily these systems are defeated</a> and there is literally no reason to use them.</p>
<p><strong>Better Approach:</strong> <a href="http://www.tynt.com/">Tynt</a>, if used correctly, can track your copied content and help you understand what&#8217;s being used. Be careful though not to use it to mess with the copy/paste function, only to track it.</p>
<h4>3. Self-Written Licenses</h4>
<p>I covered this in greater detail previously but it bears repeating: <a href="http://www.plagiarismtoday.com/2009/05/28/why-to-not-write-your-own-license/">Never write your own copyright license</a>.</p>
<p>Not only is it legally risky unless you are an attorney, but such licenses rarely get read and almost never change behavior. </p>
<p>Besides, you don&#8217;t need a license to require others to ask permission to use your work. Also, if you want to give permission, <a href="http://creativecommons.org">there are a slew of professionally-written and free licenses</a> that you can apply at your leisure.</p>
<p><strong>Better Approach:</strong> Either stick to &#8220;all rights reserved&#8221; and handle permission requests one at a time or, if you want, choose any one of the other, professionally-written licenses out there.</p>
<h4>4. Fake &#8220;Copyright Registration&#8221; Services</h4>
<p>Though there are some great non-repudation services out there such as <a href="http://myows.com">Myows</a> and <a href="http://safecreative.org">Safe Creative</a>, there are a slew of other services with insane pricing schemes and scare-tactic marketing strategies.</p>
<p>Some of these services will cost up to $20 or more to register one file. Worst of all, these services don&#8217;t even register the file with the U.S. Copyright Office (USCO), which is a requirement in the U.S. in order to have full legal protection. </p>
<p>Instead, these services just accept a copy of your file, print you a certificate as to when they got it and take your money.</p>
<p><strong>Better Approach:</strong> Non-repudiation services can be helpful but Safe Creative and Myows are both free and provide a slew of additional features. Even advanced certification services, such as <a href="http://easytimestamping.com">EastTimestamping</a> charge well less than a dollar per registration. Never pay more.</p>
<h4>5. EXIF Data</h4>
<p>Theoretically, this <em>should</em> be a great idea. EXIF, or EXchangeable Image File format, lets you embed text-based information in a jpg file. This includes information about the camera, the date it was taken and even the GPS coordinates. It can also include copyright holder and author information.</p>
<p>The problem isn&#8217;t the format itself, which works very well, <a href="http://www.plagiarismtoday.com/2008/11/05/facebook-flickr-strip-copyright-data-from-images/">but that the data is stripped almost everywhere it&#8217;s uploaded</a>. In short, nearly every time that you or a plagiarist posts the photo online, including Facebook and Flickr, that information is ripped out as part of the compression process. </p>
<p>Basically, most sites remove the information in a bid to make file size smaller and reduce bandwidth. An unfortunate decision, to say the least.</p>
<p><strong>Better Approach:</strong> <a href="http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/">Image watermarking</a>, either visisble or <a href="http://www.plagiarismtoday.com/2010/08/02/signmyimage-cheap-invisible-watermarking/">invisible</a>, is your best approach to protect and track your content. EXIF data just doesn&#8217;t get passed along reliably enough to be useful.</p>
<h4>Bottom Line</h4>
<p>Though I would love to live in a world where I could give the same, simple copyright advice to everyone, that&#8217;s not the reality we face. However, there are a few things that I can definitely tell creators to avoid using or, at the very least, avoid relying on.</p>
<p>Whether it&#8217;s snake oil salesmen trying to separate you from your money or people with good intentions dealing with unfair realities, some ideas just don&#8217;t work.</p>
<p>It&#8217;s best to move away from those as quickly as possible so we can find the ideas that do work and focus our energies or productive ways to protect ones work.</p>
<p><em><strong>Disclosure:</strong> I have done paid consulting for both Safe Creative and Myows. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2011/10/06/5-copyright-wastes-of-time/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ImageExchange: An Early Review</title>
		<link>http://www.plagiarismtoday.com/2009/12/17/imageexchange-an-early-review/</link>
		<comments>http://www.plagiarismtoday.com/2009/12/17/imageexchange-an-early-review/#comments</comments>
		<pubDate>Thu, 17 Dec 2009 21:32:05 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[Firefox]]></category>
		<category><![CDATA[firefox extension]]></category>
		<category><![CDATA[picscout]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[stock photography]]></category>
		<category><![CDATA[stock photos]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=5168</guid>
		<description><![CDATA[PicScout's new ImageExchange Firefox extension aims to make every search and every Web page visit a chance to discover stock images, but how well does it work?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/imageexchange-sample-3.jpg" alt="imageexchange-sample-3" title="imageexchange-sample-3" width="166" height="197" class="alignleft size-full wp-image-5175" /></p>
<p><a href="http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/">As previously covered</a>, image matching company PicScout recently launched its Image IRC, a database of fingerprinted images with licensing data that is publicly accessible, and ImageExchange, a user-facing program that accesses the Image IRC and finds matched images on the Web.</p>
<p>Currently, <a href="http://www.picscout.com/solutions/imageexchange.html">ImageExchange is only available as a Firefox extension</a> (though other extensions are planned) and is in the very early stages of public testing. Aimed mostly at buyers of stock images, such as designers, ImageExchange and the Image IRC currently have access to millions of images, including from many of the largest microstock and stock photo agencies.</p>
<p>With all of that in mind, I wanted to take a look and see how well ImageExchange performed both to see what it was doing well and to see what might need improvement in later versions.</p>
<p>Here&#8217;s a look at my experience with the extension and what my early impressions are.<span id="more-5168"></span></p>
<h4>The Experiment</h4>
<p>For my limited test I decided to simulate what a potential searcher for stock photos might do, I visited Google Image Search and experimented with different search terms to see how many images were found and to make notes about the usefulness of the extension.</p>
<p>All in all, I conducted 35 searches. The first 25 were for a variety of semi-random terms a stock photo searcher might look for, ranging from vague to very specific. Five searches were for terms in the first 25 but with the words &#8220;stock photo&#8221; added. The final five were for the same terms but with the words &#8220;clip art&#8221; added.</p>
<p>From there, I checked the first three pages of Google and counted how many matches were found. The results are below (click to enlarge):</p>
<p><a href="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/imageexchange-table.png"><img src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/imageexchange-table-500x531.png" alt="imageexchange-table" title="imageexchange-table" width="500" height="531" class="alignnone size-large wp-image-5171" /></a></p>
<h4>Matching</h4>
<p>As you can see in the chart, the matching isn&#8217;t exactly going to blow anyone away. Many of the searches produced no results with Google Image Search, in my test, showing 18 results per page, the highest any one search found on one page was 6, or 1/3. None generated more than 17 matches out of the 54 total images.</p>
<p>That being said, the results were interesting. The more specific the search, the better ImageExchange did. Searches with &#8220;stock photo&#8221; added also did better most of the time.</p>
<p>However, to be clear, when it was struggling to find matches it isn&#8217;t because ImageExchange wasn&#8217;t matching images correctly, but because the top pages within Google were filled with user-generated content and other images that aren&#8217;t indexed at this time. That will likely change as the database grows.</p>
<p>On that note, I fully expect these numbers to go up in the coming weeks in months as PicScout adds both more stock image providers, new sources of user-generated content and the existing sources continue to add images.</p>
<p>In short, the matching isn&#8217;t great at this time but it shows a lot of promise and, considering that it is a free service that runs in the background, it is still a worthwhile install.</p>
<h4>Usability</h4>
<p><a href="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/imageexchange-settings.jpg"><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/imageexchange-settings-300x199.jpg" alt="imageexchange-settings" title="imageexchange-settings" width="300" height="199" class="alignright size-medium wp-image-5169" /></a></p>
<p>ImageExchange is pretty much a &#8220;set and forget&#8221; extension. The installation is nothing special if you&#8217;re familar with Firefox extensions and, as you can see in the image to the right, the settings are pretty basic, the most important being the ability to manually search for just the images you ask.</p>
<p>My biggest gripe on the usability front is that the &#8220;i&#8221; that indicates whether an image is a match or not is a bit difficult to see, though it would be easier in regular Web surfing, it got lost in Google Image Search&#8217;s already cluttered results pages. I had to be careful to go through image by image when counting to make sure I didn&#8217;t miss any.</p>
<p>However, once clicked, the &#8220;i&#8221; did a good job directing me to the correct licensing page. In a sample of the found images, I never had an issue finding the correct licensor and understanding how to obtain the image.</p>
<p>All things considered, it&#8217;s a simple extension that seems to work well and not require any real intervention from the user. It&#8217;s a &#8220;set and forget&#8221; application that does its job well.</p>
<h4>Speed</h4>
<p>I wouldn&#8217;t call ImageExchange lightning fast but I wouldn&#8217;t call it slow either. On my computer, it took about 4-5 seconds after the page loaded to complete its matching.</p>
<p>The timeframe isn&#8217;t outrageous but there is no clear indication as to when ImageExchange is done working other than the icon in the toolbar. Since many of the &#8220;i&#8221;s loaded before ImageExchange had finished checking all of the images, it was very easy to try and click to the next page too soon sometimes.</p>
<p>The time wasn&#8217;t nearly as long during regular Web surfing. It is only during these images searches that the delay was as noticeable as it was.</p>
<h4>Potential Uses</h4>
<p>Considering that ImageExchange is both zero-cost and zero-thought, there&#8217;s really no reason not to install it. Yes, it has flaws but they are all ones that can either be addressed or are being addressed now. It is important to note though that, right now now, it is not aimed at laypeople, but at buyers of stock images, a crowd that will be well-served by this service.</p>
<p>My hope is that the service is able to grow in both scope and usability to reach out to a broader market, something PicScout has said they are working on. It would be nice to see average bloggers and Web designers using this tool to find licensable images on the Web as they surf. </p>
<p>It certainly is an improvement over those who <a href="http://www.plagiarismtoday.com/2009/12/16/the-google-image-search-clipart-ad/">use Google Images as a clip art library</a>.</p>
<h4>Bottom Line</h4>
<p>The $10,000 question is whether or not you should install this extension. To that, the answer depends on what you intend to do with it. If you&#8217;re a buyer of stock images or routinely do searches for reusable photos, the answer is probably &#8220;Yes&#8221;. It&#8217;s free, easy and, even though it isn&#8217;t  as effective as it could be, it works well enough to probably help you find what you need.</p>
<p>On the other hand, if you&#8217;re a blogger or someone that only looks for images more casually, especially if you&#8217;re looking for ones for free use, it&#8217;s not the right tool yet. That being said, do keep an eye on it as improvements could easily change that.</p>
<p>All in all, ImageExchange is really a no-cost service to use and, if you&#8217;re the type of user who might gain any benefit from it, I would consider signing up to use it. Though there are imperfections, they don&#8217;t outweigh the benefits and they especially do not outweigh the potential.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/12/17/imageexchange-an-early-review/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Image Detection: Watermarking vs. Fingerprinting</title>
		<link>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/</link>
		<comments>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 21:57:57 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[digimarc]]></category>
		<category><![CDATA[digital fingerprints]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[picscout]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[watermark]]></category>
		<category><![CDATA[watermarking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=5027</guid>
		<description><![CDATA[There's a boom taking place in the field of image detection. If you're an artist, now is as great time to learn about the technology so you can take advantage of falling prices and new technology.]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/digimarc-logo.png" alt="digimarc-logo" title="digimarc-logo" width="269" height="70" class="alignleft size-full wp-image-5030" /></p>
<p>The image detection field is changing rapidly. <a href="http://www.plagiarismtoday.com/2008/12/30/2008-the-year-in-content-theft/">For years I&#8217;ve been predicting the growth in image searching</a> and it appears that is finally coming to be. Not only has <a href="http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/">PicScout has begun public testing on its newest offerings</a> and other announcements are in the works.</p>
<p>As such, over the next few weeks and months, it seems almost certain that I&#8217;ll be posting articles about new technologies and advancements in this field as it continues to heat up and prices begin to drop. This means that many photographers, including smaller ones, will be confronted with the question about how to best protect their images and track their usage.</p>
<p>Three years ago, this was almost unthinkable. Limited options and high prices made such tracking attainable only to large companies but now we&#8217;re preparing to enter a very different age for visual artists when it comes to following their work on the Web.</p>
<p>With that in mind, I wanted to take a moment to <a href="http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/">briefly revisit the two schools when it comes to image detection</a>, watermarking and fingerprinting, discuss their pluses and look at which solution may be right for you. </p>
<p>Hopefully, by the end of this, you&#8217;ll be better prepared to decide on your own image protection scheme and know which technology to watch for.<span id="more-5027"></span></p>
<h4>The Two Schools</h4>
<p>The challenge in finding copies of images on the Web has always been, to put it bluntly, that they aren&#8217;t text. Search engines, including image search engines, look for text as it is what is easiest to search for. Computers don&#8217;t understand what is inside an image (the reason CAPTCHAs are reasonably effective) and they have a hard time comparing one image to another. Scale that problem to the billions of images on the Web, and this issue becomes clear.</p>
<p>Solving the problem hasn&#8217;t been easy nor has it been perfect. However, there have been two different approaches that have been tried, both with great success but different ideal uses.</p>
<ol>
<li><strong>Watermarking:</strong> Though most think of image watermarking as the placing of a visual mark over an image, usually a logo or name, Watermarking in this sense involves placing either an invisible or nearly-invisible watermark over an image that can only be detected by a machine. This method is used by <a href="https://www.digimarc.com/">Digimarc</a> and <a href="http://c-registry.us">C-Registry.us</a> (Note: The latter uses both for various purposes).</li>
<li><strong>Fingerprinting:</strong> Fingerprinting is an automated process that takes an existing image file, hashes it and converts it to a unique fingerprint. That string is, in turn, compared to other fingerprints for potential matches. This method is favored by <a href="http://www.picscout.com">PicScout</a> and <a href="http://tineye.com">Tineye</a>.</li>
</ol>
<p>As you can see, the two systems are fundamentally different in terms of how they work and what they can do, these differences have an impact both in terms of how they are used and the situations they are best suited for.</p>
<h4>Watermarks</h4>
<p>The advantages of using a digital watermark are many-fold. First, they provide much more reliable detection. Where a very similar image by a different artist can trigger a fingerprint match, only copies of your actual image will trigger a watermark match. This also greatly helps with providing evidence for a potential court case since it can be shown exactly which image was copied from. </p>
<p>Watermarks can, typically, also survive great modification by users. Often images with watermarks can be detected even after cropping, rotation and distortion that would have caused a fingerprinting system to miss it (though fingerprints are getting much smarter about this as well). Some need only a few pixels to remain intact.</p>
<p>Also, additional information about the creator can be hidden in the watermark or stored in a database. This can help greatly with the worries surrounding the <a href="http://www.plagiarismtoday.com/tag/orphan-works/">orphan works bills</a>, which may allow certain uses of works if the author can not be reasonably identified.</p>
<p>In short, it works like image metadata in that it is invisible and is buried within the image, but it is much harder to remove and travels with the image even after it is modified and resaved.</p>
<p>However, watermarks can&#8217;t do anything to protect an image that&#8217;s already &#8220;in the wild&#8221;. Though you can re-release an image with a watermark, any copies of the non-watermarked image will not be detected. Also, watermarking systems tend to be expensive to use and, since one company usually controls the watermarks and the system for detecting them, they&#8217;re vulnerable to being rendered useless if the company closes.</p>
<h4>Fingerprinting</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/12/tineye-logo-1-300x72.png" alt="tineye-logo-1" title="tineye-logo-1" width="300" height="72" class="alignright size-medium wp-image-5033" /></p>
<p>The biggest asset of fingerprinting is that it works on any image, including one already on the Web. There&#8217;s no need to watermark the image or do any preparation work at all before publishing it, the technique works from the image itself.</p>
<p>Fingerprinting has become more and more accurate over the years, now competing with watermarks in that area, and is largely cheaper, including a free, effective offering from Tineye. </p>
<p>Also, since there are many fingerprinting systems in place and all work roughly the same way, its likely that you&#8217;ll be able to continue to match your images this way for many, many years to come. One company closing does not destroy your copyright protection system.</p>
<p>That being said, Fingerprinting does not, by itself, address the orphan works issue nor provide any proof of ownership, which is why PicScout recently released the Image IRC, nor is it always 100% reliable. Similar images, such as photos of similar subjects, have been known to trip fingerprinting matches, though the problem is nowhere near as severe was it was.</p>
<h4>So Which to Use?</h4>
<p>When deciding which to use, the easiest answer is to just say &#8220;both&#8221;. Since you can fingerprint a watermarked image, there&#8217;s nothing, save perhaps cost, that will prevent you from using both systems. Many already view watermarks as more of a copyright verification tool than an image detection one, making the two a natural combination. </p>
<p>However, duplicating efforts doesn&#8217;t make much sense. Due to refinements, both systems can do an extremely good job at finding matches, what is more important than the method is the individual system and how well it works. If it can find good matches and work as advertised, it&#8217;s probably a keeper, regardless of the system.</p>
<p>That being said though, you can&#8217;t even consider using a watermarking system if your images area already on the Web unprotected. It is only used, ideally, for new images as much of the usefulness is lost with unprotected copies being passed around. However, the added protections provided by watermarks make them worthwhile in many cases, especially when there is tight control over distribution and the images haven&#8217;t been leaked.</p>
<p>In short, if you have images that you are waiting to put on the Web or routinely put up new ones (and can justify the expense both in money and time), watermarking is likely a good solution due to its added benefits. For those who have their works online already or don&#8217;t wish to invest the time/money required, fingerprint matching, via Tineye, is free.</p>
<p>However, even that distinction is being blurred by PicScout&#8217;s Image IRC, which is a database of fingerprinted images, which provides something of a &#8220;best of both worlds&#8221; solution. The problem though is that it, right now, is targeted solely at large stock agencies. We&#8217;ll have to see later how approachable it is by smaller artists once it is finally released.</p>
<h4>Bottom Line</h4>
<p>In the end, any robust image protection system will likely use at least some of both. If your images are valuable, trusting any one solution by itself to handle detection and orphan works prevention is risky. Multiple tools are, most likely, the way to go when practical.</p>
<p>But the most important thing, as mentioned above, is not that the system works, not the technology behind it. If it works well, it doesn&#8217;t matter if it uses fingerprints or watermarks. That is where the real challenge lies for these companies. </p>
<p>So, with that in mind, I&#8217;m looking forward to, over the next few months, talking more about developments in this field and how they might impact visual artists of all kinds and all backgrounds.<img style="border: 0pt none; width: 0pt; height: 0pt; display: none;" src="http://tokentracker.com/token.gif?id=3aSc44adb" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/12/02/image-detection-watermarking-vs-fingerprinting/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>PicScout Announces New Image Matching Tools</title>
		<link>http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/</link>
		<comments>http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 20:30:38 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[image irc]]></category>
		<category><![CDATA[imageexchange]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[licensing]]></category>
		<category><![CDATA[picsout]]></category>
		<category><![CDATA[Plagiarism]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4754</guid>
		<description><![CDATA[Image tracking company PicScout unveiled a potential game-changing product, but can it live up to the hype?]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/new_PICSCOUT_logo-300x53.jpg" alt="new_PICSCOUT_logo" title="new_PICSCOUT_logo" width="300" height="53" class="alignleft size-medium wp-image-4767" /></p>
<p>Imagine if you could open up your browser, search for an image using Google and instantly know who produced every image that came up and how you could license it for reuse. Likewise, as an artist, imagine that you could index your works into a database and everyone would know who the creator was and be able to license it from you, no matter where they saw it on the Web.</p>
<p>That&#8217;s the future that <a href="http://picscout.com/news-and-events/picscout-technology-supports-ubiquitous-online-image-accreditation-and-transaction.html">Picscout is trying to bring about with a pair of new initiatives</a> , <a href="http://picscout.com/solutions/image-irc.html">Image IRC</a> and <a href="http://picscout.com/solutions/imageexchange.html">Image Exchange</a>.</p>
<p>But while this brave new world likely sounds very enticing to visual artists, the framework of it has just been announced and it has many hurdles to overcome before it becomes a part of most artist&#8217;s daily lives. </p>
<p>Still, it is an interesting idea with a great deal of promise.</p>
<h4>How it Works</h4>
<p>The system consists of two parts which are both required for it to function:</p>
<ol>
<li><strong>Image IRC:</strong> This component is essentially a very large database of Images that PicScout has collected information on. The images are fingerprinted to make them easily identified and are then paired up with their metadata, which includes information about who owns the image and where it can be licensed.</li>
<li><strong>ImageExchange:</strong> This is the component that is end user facing, allowing Web surfers to automatically locate and license images contained in the Image IRC. One example is a planned <a href="http://picscout.com/solutions/image-recognition-platform.html">Firefox extension</a> that will be released that allows users to automatically find out if images they see on the Web are licensed via the Image IRC.</li>
</ol>
<p>The result is that, if a user is surfing the Web with the ImageExchange addon running, they will see an &#8220;i&#8221; over images that are in the Image IRC database.</p>
<p><img src="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/picapp-sample-3-500x147.jpg" alt="picapp-sample-3" title="picapp-sample-3" width="500" height="147" class="alignnone size-large wp-image-4764" /></p>
<p>Users can then click on one of the &#8220;i&#8221;s and will get a popup like the one below with more information about the image.</p>
<p><img src="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/picapp-sample-2-500x310.jpg" alt="picapp-sample-2" title="picapp-sample-2" width="500" height="310" class="alignnone size-large wp-image-4759" /></p>
<p>From there, they will be given the ability to correctly license the image, including the ability to click through and purchase a license if needed.</p>
<p>Currently the Image IRC includes &#8220;tens of millions&#8221; of images, including some 7 million CC-licensed images from Flickr. PicScout is also opening up to agencies, professional photographers and photo-sharing sites that can provide bulk image delivery.</p>
<p>PicScout hopes to open up the platform to smaller copyright holders and sites in the near future.</p>
<h4>Why It&#8217;s Important</h4>
<p>What is interesting about these new tools is that image matching and fingerprinting has typically been used as a one-way street. Creators have used the technology, often PicScout&#8217;s, to locate copies of their own works. The idea is to take the process and put it in reverse.</p>
<p>Rather than scanning one image to find its copies on the Web, one now automatically scans the images they see to find which are available for use. It is the same technology but, rather than starting from a known source and locating unknown matches, you&#8217;re starting with unknown sources trying to locate known matches.</p>
<p>This has the potential to be a boon for those who regularly license images. Rather than going through stock photo libraries to find the right image, they can have their browser (or a third-party site using the API) passively scan images as they surf the Web, allowing them to use the broader Web and better search tools.</p>
<p>These tools could also, foreseeably, have an impact on the <a href="http://www.plagiarismtoday.com/tag/orphan-works/">orphan works issue</a>, or works with unknown authors. By creating a database of easily-identified images, it decreases the likelihood an image&#8217;s creator might become unknown. Though PicScout is careful to not guarantee its service will be part of a &#8220;reasonably diligent search&#8221;, which was the language of the previous orphan works bill when determining whether a work was an orphan or not, it is easy to see how it could be useful in such searches. </p>
<p>However, this service does have a lot of obstacles in its way and there may be at least one problem that it can not solve with technology alone.</p>
<h4>Problems and Concerns</h4>
<p>Obviously, since the service has not been unveiled publicly, there isn&#8217;t much that I can say about it. However, there are several areas of concern that need to be addressed before this system can be called a success.</p>
<ol>
<li><strong>Database Size:</strong> Though the database should swell in size as testing begins, tens of millions of images is not an impressive sample. Tineye has over a billion images and is growing very rapidy and, even then, <a href="http://www.plagiarismtoday.com/2008/08/19/tineye-protecting-images-preventing-orphans/">it doesn&#8217;t escape criticism from me for not catching all infringements</a>. While it is true that the Image IRC database doesn&#8217;t have to be as large as Tineye&#8217;s as it is only indexing originals of images, not their copies, if the database is too small, it will not be useful for performing image searches and that will make it useless for everyone.</li>
<li><strong>Licensing Limitations:</strong> All of the images in the Image IRC must be made available for licensing either as a commercial work or under one of a few CC licenses (according to a representative from PicScout, the image must be licensed for non-commercial use at this time). There&#8217;s no way to use the Image IRC to say that you do not want your images used. This limits the usefulness to photographers who have no interest in image licensing.</li>
<li><strong>The Wrong Audience:</strong> While this will be seen as a great product for those who routinely purchase photos online, it remains to be seen how well it will be adopted by those who misuse images, either through ignorance or malice. Those who don&#8217;t know or don&#8217;t care if they are violating copyright are unlikely to download a Firefox plugin to start buying images they see on the Web. If that&#8217;s the case, the actual impact on getting users to license images could be small.</li>
</ol>
<p>However, the gravest concern I have is that this tool could create even more confusion when it comes to licensing. Given the current level of copyright understanding on the Web, it is conceivable some could use this tool and think think not that they can license the images with the &#8220;i&#8221;, but that those are the ones they have to pay for and the others are free.</p>
<p>In short, without proper education about how the tool is to be used and without proper groundwork on copyright education, it is conceivable that this product could actually backfire, especially against those not in the Image IRC system.</p>
<p>That being said, these are issues that can and should be addressed as the product development rolls along. Right now we have a very brief glimpse at a promising new tool that could, if it is able to grow to a large enough size, could revolutionize the way people buy stock images and the way people find works for their various projects.</p>
<h4>Bottom Line</h4>
<p>There is no doubt that PicScout&#8217;s matching technology is among the best in the industry and its ability to use fingerprints of images rather than watermarks (visible or invisible) to track works enable it to track a much wider range of material. There is also no doubt that this application of their technology can have a lot of benefits to copyright holders of all kinds.</p>
<p>The questions that remain unanswered is how will this system be developed and how will it be used. If PicScout is judicious and wise with how they build this service, it could become an incredibly powerful tool for users and a major coup for visual artists, who have struggled with being found since the dawn of the Web.</p>
<p>If it is developed unwisely, it will likely remain a niche product, used primarily by those already in the business, that does little to improve the overall copyright climate. At worst, it could even introduce additional confusion and headaches.</p>
<p>Without a doubt, this will be an offering to follow over the next year or so. No matter what, it seems destined to make an impact.</p>
<h4>Full Sample Image</h4>
<p>Click for full size.</p>
<p><a href="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/PR_Image_All-1.jpg"><img src="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/PR_Image_All-1-366x1024.jpg" alt="PR_Image_All (1)" title="PR_Image_All (1)" width="366" height="1024" class="alignnone size-large wp-image-4762" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/10/08/picscout-announces-new-image-matching-tools/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Elaine Scott&#8217;s Copyright Folly</title>
		<link>http://www.plagiarismtoday.com/2009/10/02/elaine-scotts-copyright-folly/</link>
		<comments>http://www.plagiarismtoday.com/2009/10/02/elaine-scotts-copyright-folly/#comments</comments>
		<pubDate>Fri, 02 Oct 2009 17:43:45 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Punditry]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[elaine scott]]></category>
		<category><![CDATA[filtering]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[iparadigms]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[scribd]]></category>
		<category><![CDATA[turnitin]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4707</guid>
		<description><![CDATA[Elaine Scott is suing Scribd. However, her copyright folly could make it harder for you to protect your own work in the future. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/10/scribd-logo-full.png" alt="scribd-logo-full" title="scribd-logo-full" width="143" height="42" class="alignleft size-full wp-image-4712" /></p>
<p>What do you call someone who not only sues someone trying to help them, but does so with terminally flawed legal arguments? An idiot.</p>
<p>There&#8217;s really no &#8220;nice&#8221; way to say it. Using the courtroom as a club against people trying to help you is just plain stupid, even if you have a disagreement with them.</p>
<p>However, that is exactly what children&#8217;s book author <a href="http://www.elainescott.com/">Elaine Scott</a> is doing to document-sharing site <a href="http://www.scribd.com">Scribd</a>, <a href="http://www.guardian.co.uk/technology/2009/sep/21/scribd-lawsuit">suing them for trying to help her better protect her work</a>.</p>
<p>It is a strange and bizarre case not only tests the limits of one&#8217;s imagination, but also one that bears a closer examination as it may have an impact on the rights of smaller content creators everywhere.<span id="more-4707"></span></p>
<h4>Background</h4>
<p>The case began when Scott found one of her earlier books, financial advice work entitled &#8220;Stocks and Bonds, Profits and Losses&#8221;, available illegally on Scribd. She filed a DMCA takedown notice and the work was removed but Scott was not satisfied. She filed suit against Scribd with two different accusations.</p>
<p>First, according to Scott, the site itself is a violation of copyright because it &#8220;Shamelessly profits from the stolen copyrighted works of innumerable authors&#8221; and the creators &#8220;Have built a technology that&#8217;s broken barriers to copyright infringement on a global scale and in the process have built one of the largest readerships in the world.&#8221;</p>
<p>However, it is the second claim that has become the most attention-grabbing. Scribd employs a &#8220;takedown stay down&#8221; system that prevents infringing works from being reuploaded to the site and her book, once the notice was filed, was added to that database. Scott now also claims that this <a href="http://blogs.wsj.com/law/2009/10/01/the-latest-copyright-head-scratcher-courtesy-of-kiwi-camara/">copyright filtering service is an infringement and that</a> “Scribd doesn’t have the authority to ‘help’ me with anything…. The filtering system is Scribd’s way of asking for forgiveness, rather than permission.”</p>
<p>What ends up tying this entire lawsuit together into a big ball of copyright strangeness is that Scott is being represented by Kiwi Camara, who became famous in copyright circles for his defense of Jammie Thomas in her retrail, which saw her being found liable for $1.92 million in damages for sharing some 24 songs.</p>
<p>When Ashby Jones <a href="http://blogs.wsj.com/law/2009/10/01/the-latest-copyright-head-scratcher-courtesy-of-kiwi-camara/">at the Wall Street Journal called this case a &#8220;head scratcher&#8221;</a>, he was being polite. This case is, without a doubt, one of the strangest and most befuddling I&#8217;ve ever seen.</p>
<h4>Analysis</h4>
<p>Legally, the case seems to be a non-starter. On the first complaint, the facts of the case line up very closely with the Universal v. Veoh case where the video sharing site Veoh was sued by Universal for enabling users to publish copyrighted content. However, the court <a href="http://www.pcmag.com/article2/0,2817,2352870,00.asp">threw out the case in September</a> after finding that Veoh had complied with the DMCA and qualified for its safe harbor protection.</p>
<p>Though the ruling in the Veoh case was very specific to the company itself, the facts largely seem to line up. I can attest that Scribd has always responded swiftly to DMCA notices and its copyright filtering system is a further sign of good faith. Scribd is not a popular site among authors and publishers, but that doesn&#8217;t make it illegal. </p>
<p><a href="http://www.plagiarismtoday.com/2009/09/23/red-flags-takedowns-and-copyright-law/">Given that red flag takedowns are virtually meaningless and useless</a>, it is almost certain the courts will favor Scribd on this case. That is, barring a surprise in discovery that shows Scribd had some unexpected level of knowledge or ignored DMCA notices.</p>
<p>The second argument, the one involving the copyright filtering system, also looks to be equally doomed. If the Scribd copyright filter is anything like every other such filter I&#8217;ve seen, it operates on fingerprints and not the work itself (doing otherwise requires exponentially more computing power). As such, the facts of that case <a href="http://www.plagiarismtoday.com/2008/03/25/iparadigms-wins-turnitin-lawsuit/">mirror closely the Turnitin lawsuit</a>, which iParadigms, the makers of Turnitin, won handily.</p>
<p>Turnitin is a plagiarism checking service commonly used by high schools and colleges to look for matching text in student&#8217;s paper. As part of the service, it stores a fingerprint of every essay scanned to match against future works. Four students in Virginia objected to this and sued the company but lost with the judge saying that the use of their work was transformative and a fair use.</p>
<p>Even Camara&#8217;s argument that &#8220;They are building up a valuable asset that nobody else is going to have by taking work from authors without sharing with them the profits,&#8221; falls flat. iParadigms is actively selling access to this database and does so to many thousands of institutions. In short, Camara&#8217;s worst fears have already been realized in the iParadigms case and the judge still found it to be a fair use.</p>
<p>Whether you agree with that verdict or not, it is clear that it seems to put the Scribd case in a very bad spot. </p>
<p>To be clear, I am not an attorney but these counter-arguments only took me a minute or two to think up. They are patently obvious, especially considering the Veoh case was thrown out mere days before this suit was filed, and it makes this case look doomed to fail. However, that doesn&#8217;t mean it won&#8217;t have an impact on the copyright climate.</p>
<h4>The Real Problem</h4>
<p>Copyright filtering services are for the good of both the hosts that offer them and for copyright holders. Hosts don&#8217;t have to spend manpower and time removing works via DMCA that can instead be blocked at the gate and copyright holders don&#8217;t have to file notices for the same works over and over again.</p>
<p>Though there are some cases where a copyright holder would not want to be in such a database, for example if they only wanted to stop plagiarized uses, not attributed ones, this wasn&#8217;t the case for Scott. Since the only use a fingerprinting system has is matching one work against another, for example plagiarism detection or copyright filtering, pretty much any commercial use Scribd had found for its database would have also been beneficial.</p>
<p>However, Scribd has taken no such steps and expressed no intent to do so. It&#8217;s matching database, much like YouTube&#8217;s, is proprietary and closely guarded. The mere fact that they could do something doesn&#8217;t warrant a preemptive strike, especially when what they are accused of possibly doing was found to be a fair use in another case.</p>
<p>What this case might do, however, is make other companies more skittish about developing &#8220;takedown stay down&#8221; systems. This would likely be a major blow to the smaller artists who lack the resources to file takedown notices on every single misuse of their work but still license their works in a way where such a system would be practical.</p>
<p>This also might have an impact in discouraging companies from entering into areas where takedown notices might be frequent. Faced with the choice of having to spend the manpower to remove every work subject to takedown notice or being sued for a filtering system, some will likely just walk away. This, in turn, could give rise to more legally dubious sites.</p>
<p>After all, if showing good faith either bankrupts your startup or gets you sued, there isn&#8217;t much point in showing it at all. It may be more attractive to move to a country with more relaxed regulations and avoid the problem all together.</p>
<p>In short, by suing the people who tried to help her, she may be hurting every other copyright holder in the world.</p>
<h4>Bottom Line</h4>
<p>In the end, most copyright experts, including <a href="http://copyrightsandcampaigns.blogspot.com/2009/09/class-action-copyright-suit-filed.html">Ben Sheffner</a> and <a href="http://blog.ericgoldman.org/archives/2009/09/another_copyrig.htm">Eric Goldman</a> (who many more reasons why the outlook is bleak for Scott) are skeptical about the suit&#8217;s future.</p>
<p>Pretty much all of the recent case history goes against Scott in this matter but, more importantly, logic itself is against her.</p>
<p>Though it is understandable that authors and artists want to protect their work and profit from as many uses of their content as practical, suing a company for infringement when they are trying to prevent your work from being infringed is a classic case of cutting off your nose to spite your face. It makes no sense and will only cause more problems, most of them for other copyright holders.</p>
<p>This case is most likely doomed but the sooner it is thrown out the better. The more fear Scott can sew into the providers who act in good faith, the more room others have to grow. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/10/02/elaine-scotts-copyright-folly/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Corrigon: Fast, Reliable Image Matching</title>
		<link>http://www.plagiarismtoday.com/2009/08/12/corrigon-fast-reliable-image-matching/</link>
		<comments>http://www.plagiarismtoday.com/2009/08/12/corrigon-fast-reliable-image-matching/#comments</comments>
		<pubDate>Wed, 12 Aug 2009 21:59:14 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[image matching]]></category>
		<category><![CDATA[image-search]]></category>
		<category><![CDATA[Photography]]></category>
		<category><![CDATA[Plagiarism]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=4344</guid>
		<description><![CDATA[Image matching company Corrigon promises to revolutionize the image detection world, and make it faster and more accessible along the way. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/08/corrigon-logo.png" alt="corrigon-logo" title="corrigon-logo" width="235" height="88" class="alignleft size-full wp-image-4345" /></p>
<p>Some time ago I predicted that image matching technology would become both cheaper and more effective, making it possible to search for duplicate or near-duplicate images the same way that writers do with text while keeping the price affordable. <a href="http://tineye.com">Tineye</a> was, and remains, a major step in the right direction on that front but its limited database, about 1.2 billion has <a href="http://www.plagiarismtoday.com/2008/08/19/tineye-protecting-images-preventing-orphans/">hindered its ability to be comprehensive</a> (Facebook, Photobucket and Flickr alone <a href="http://www.datacenterknowledge.com/archives/2008/10/15/facebook-hosts-10-billion-photos/">combine for over 18 billion images</a>). </p>
<p>Corrigon, however, hopes to bring something new to this field. According to them, they&#8217;ve developed new technology that makes it faster and cheaper to search for copies of images, that will not only enable them to launch a service targeted at individuals later this year, but provide results in near-realtime.</p>
<p>I recently sat down with Corrigon&#8217;s CEO, Avinoam Omer and CTO Einav Itamar to talk about their service and their future plans. They gave me a demo of the product and showed me some of its promising new features.<span id="more-4344"></span></p>
<h4>What Corrigon Does</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  src="http://files.plagiarismtoday.com/wp-content/uploads/2009/08/mona-lisa-sample.png" alt="mona-lisa-sample" title="mona-lisa-sample" width="281" height="257" class="alignright size-full wp-image-4346" /></p>
<p>What Corrigon does is similar to the services that stock photo agencies have been using for some time. Users provide a collection of images that are then put into a database. The service then crawls the Web, looking for matching images. </p>
<p>What makes Corrigon unusual is that it doesn&#8217;t store the images, but rather, fingerprints them and compares the fingerprint against other matches it finds on the Web. This is very similar to what <a href="http://c-registry.us/">C-registry.us</a> is <a href="http://www.plagiarismtoday.com/2009/05/05/c-registry-orphan-work-prevention/">doing with its matching technology</a>. However, where C-Registry is more geared toward preventing works from becoming orphans, Corrigon is more about image search (though C-Registry <a href="http://www.plagiarismtoday.com/2009/07/01/c-registry-announces-image-matching/">has added image search</a>)</p>
<p>The advantage of this system is simple. By not having to store the images or search against them, Corrigon needs much less space and processing power. This both takes less resources, thus cutting costs, and speeds up the process. </p>
<p>Though I was unable to take screenshots of the demo of since corrigon asked me to keep the identities the image owners secret, the matching seemed to be very near realtime Several of the matches were for stories posted that same day and new ones seemed to be constantly discovered. Though not quite realtime in the sense that Twitter or Facebook provide realtime search, it was definitely much faster than similar services, which often take days or weeks to find results.</p>
<p>In the demo provided me, this was especially well-used in tracking uses of images on news sites, blogs, etc. </p>
<p>As for the matching itself, it seemed to be very solid. The examples on the public site involving the mona lisa are good examples of the flexibility of the service. It was able to detect matches where the copy had been used in a package (with writing over it) modified to make a new image, etc. Though very  interesting, it seemed to mirror closely what other services, including Tineye and C-Registry have been able to do in that area.</p>
<p>However, the real selling point for the service seems to be speed and price, something they are making a major play on.</p>
<h4>Future Plans</h4>
<p>Though Corrigon is currently targeting larger companies, it is planning to offer a solution to smaller users sometime later this year, likely in the 3rd quarter. Though there isn&#8217;t much to be certain about pricing, they are also planning on introducing a resolution system that helps resolve cases of copyright infringement, either by obtaining licensing from users or seeking removal of the content, depending on the needs of the user.</p>
<p>Needless to say that many of the plans are unsettled as of yet, but this is clearly a market that they plan on targeting soon.</p>
<h4>Some Concerns and a Request for Help</h4>
<p>The biggest concern I have is that, without an actual analysis, I can&#8217;t say exactly how comprehensive or accurate the matching is. The samples and demo I viewed were impressive, but how that translates into the real world is hard to say. So, while I am impressed, I am cautious as well.</p>
<p>Since I am not a visual artist and do not have a body of work to test the system with, I&#8217;m limited in what I can do. However, I am getting in touch with photoraphers and other artists I know to see they would be interested in having their work used for at least a limited test, mostly to give Corrigon feedback but also let me know how effective the system is.</p>
<p>If you think you might be a good candidate, namely a smaller artist with a large amount of reuse of their work, please <a href="http://www.plagiarismtoday.com/contact-pt/">let me know</a>. I can not make any promises but would like to hear from you.</p>
<h4>Bottom Line</h4>
<p>There&#8217;s little doubt that Corrigon shows a great deal of promise in this field. All that remains to be seen is how that promise translates to real-world application. </p>
<p>However, Corrigon is not the only company entering this market. Another is preparing to drop its prices and reach out more to individual blogers and artists with an improved technology.</p>
<p>I&#8217;ll have a write up on their efforts closer in to their launch.</p>
<p>The main thing is that, when you look at the bigger picture, you see services such as Tineye, C-Registry, Corrigon and others bringing image matching to the masses. This has a huge impact on how visual artists track their works and also works to mitigate <a href="http://www.plagiarismtoday.com/2008/05/01/a-realistic-look-at-orphan-works/">the orphan works issue</a>.</p>
<p>Hopefully we&#8217;ll soon be reaching a point where artists of all stripes can find their content as easily and as cheaply as writers do today. This will have a profound impact on both the protection of copyrighted images, but also in empowering the legal use of said images through proper licensing and a sheer understanding of how one&#8217;s works are used across the Web.</p>
<p>It will be a better world for artists, that much is certain.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2009/08/12/corrigon-fast-reliable-image-matching/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Gazopa: Not For Copy Detection</title>
		<link>http://www.plagiarismtoday.com/2008/09/19/gazopa-not-for-copy-detection/</link>
		<comments>http://www.plagiarismtoday.com/2008/09/19/gazopa-not-for-copy-detection/#comments</comments>
		<pubDate>Fri, 19 Sep 2008 15:53:37 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[gazopa]]></category>
		<category><![CDATA[image protection]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[Orphan Works]]></category>
		<category><![CDATA[orphans]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[tineye]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1763</guid>
		<description><![CDATA[A new image search engine, Gazopa, was recently announced and holds a great deal of promise for making image search easier. Unfortunately, one area it struggles is in detecting image copying. ]]></description>
			<content:encoded><![CDATA[<p><img class="picleft" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/gazopa-logo.jpg" border="0" alt="gazopa-logo.jpg" width="224" height="91" align="left" />After <a href="http://www.plagiarismtoday.com/2008/08/19/tineye-protecting-images-preventing-orphans/">Tineye impressed me with its technology but disappointed me with its database</a> (Note: Tineye has added 200 million more images since the first review was written), I was excited to hear about a new image search company claiming that it could find similar images based upon an upload.</p>
<p>That company, a Japanese organization called <a href="http://www.gazopa.com/">Gazopa</a> produced a technology that seemed to be very similar to Tineye at first glance.</p>
<p>However, after using and experimenting with the service, I can safely say that it was not built for this kind of searching and, at this time, it not ready to be used as a means to detect image copyright infringement.<span id="more-1763"></span></p>
<h4>How Gazopa Works</h4>
<p>In terms of how one uses Gazopa, it actually works very similar to Tineye. The user either uploads or points the search engine to an image on the Web and Gazopa then searches the Web for similar images.</p>
<p>However, Gazopa does offer two additional options. First, it allows users to search for images based upon a keyword, thus making it <a href="http://images.google.com/">more like a traditional image search engine</a>, or via a drawing that the user makes on an electronic whiteboard.</p>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/gazopa-samples.jpg" border="0" alt="gazopa-samples.jpg" width="500" height="60" /></p>
<p>If you upload an image or use a drawing, Gazopa then searches its database of approximately 50 million images (Note: Tineye currently has approximately 900 million) for images that are similar to it. But rather than using fingerprinting to perform the matching, it is based on more vague elements such as shape, color and face.</p>
<p>The concept is interesting and it would seem to have a lot of potential for detecting heavily modified image copying, such as images that were severely cropped or distorted.</p>
<p>However, after experimenting some with the service, it appears that the service is not quite ready for such use, at least not at this time.</p>
<h4>Limitations of Gazopa</h4>
<p>The problem with Gazopa is that, no matter what I search for, the site does not seem to return exact duplicates of the image or even modifications of it. Rather, it returns images that feel like the original, but actually have little in common.</p>
<p>A good example is what happens when I search for the Layered Tech logo using Gazopa. As you can see in the image below, none of the top results are copies of the logo, though many exist on the Web.</p>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/gazopa-large2.jpg" border="0" alt="Gazopa Sample" width="500" height="323" /></p>
<p>Though Gazopa returned 1000 matches, none of them, at least on the first five pages, were direct copies of the Layered Tech Logo.</p>
<p>However, when I punched the same image into Tineye, I was given much different results. Even though Tineye was only able to pull one result down, it was a perfect match for the original logo.</p>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/09/tineye-large3.jpg" border="0" alt="tineye-large3.jpg" width="499" height="295" /></p>
<p>The result is that, where Gazopa produces far more results per item, most of them are questionable in terms of being a direct match. However, when you consider that Gazopa has a feature to prevent direct matches from appearing in the results, it is clear that isn&#8217;t the goal of the service.</p>
<h4>Better Uses</h4>
<p>At this time, Gazopa does not appear to be well-geared toward detecting image copying. Though I can see a lot of usefulness for the search engine, this doesn&#8217;t appear to be one of the better applications.</p>
<p>I would be more fond of this search if I were a designer looking for ideas or wanting to see if there was anything extremely similar, not identical, to a work I had created.</p>
<p>Likewise, with the drawing feature, I might be able to see if an idea I came up with has been done before or how unique a new logo I got really was.</p>
<p>It is a great idea for an image search engine, it just doesn&#8217;t fill this one function. Though that may change as the database grows, it is still barely 1/20th the site of Tineye&#8217;s limited database, it still seems as if the technology is not well geared toward this kind of use.</p>
<p>There is a lot to be excited about in Gazopa, but unfortunately, this kind of detection is just not what they are good at. There are many other reasons to try it out, but copy detection isn&#8217;t one of them.</p>
<h4>Conclusions</h4>
<p>Overall, I like Gazopa. It has a slick interface, a fast search and seems to return some interesting results. However, it doesn&#8217;t meet the needs for myself and others like me who are trying to track how their work is being used on the Web.</p>
<p>Still, I definitely recommend giving Gazopa a try. Not only might it work better for you, but it is just something that is fun to use and to play around with as well as something that can be useful.</p>
<p>It may not be the end all solution to finding copied images on the Web, but it certainly can help in other ways.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/09/19/gazopa-not-for-copy-detection/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Frustration and Fighting Plagiarism</title>
		<link>http://www.plagiarismtoday.com/2008/04/30/the-problem-with-fighting-plagiarism/</link>
		<comments>http://www.plagiarismtoday.com/2008/04/30/the-problem-with-fighting-plagiarism/#comments</comments>
		<pubDate>Wed, 30 Apr 2008 16:37:21 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[DMCA]]></category>
		<category><![CDATA[Legal Issues]]></category>
		<category><![CDATA[Personal Experiences]]></category>
		<category><![CDATA[Attributor]]></category>
		<category><![CDATA[bitscan]]></category>
		<category><![CDATA[blogwerx]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[copyscape]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Search-Engines]]></category>
		<category><![CDATA[searching]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=998</guid>
		<description><![CDATA[If you read this site regularly, you probably know a great deal about how to protect your content on the Web. However, you probably also know that the current system is an inelegant solution in need of serious repair. Here's what's wrong and what can be done about it. ]]></description>
			<content:encoded><![CDATA[<p><img style=' float: left; padding: 4px; margin: 0 7px 2px 0;'  class="picleft alignleft size-full wp-image-999" title="google-alerts-logo" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/04/google-alerts-logo.jpg" alt="" width="156" height="72" />We live in a strange time of copyright history. We exist in an age of rapid change, both in terms of technology and of law, we live in a time where people&#8217;s views about copyright are changing and artists are trying to convert &#8220;free&#8221; into a business model. It is the age of <a title="Creative Commons" href="http://www.creativecommons.org">Creative Commons</a>, free culture and mashups. an era of free thought and free exchange.</p>
<p>But, even in the face of that, artists have many reasons to want to exert reasonable control over their work. Even those that are comfortable giving away certain rights still don&#8217;t want their work to be plagiarized, scraped in its entirety by spammers or otherwise leeched off of.</p>
<p>Even in the era of free culture, there is <a title="Trademark as the new Copyright" href="http://www.plagiarismtoday.com/2007/11/15/is-trademark-the-new-copyright/">still a place for copyright enforcement</a> and it still has a role in protecting authors as they post their works online. However, enforcing copyright, for your average author, is no small challenge.</p>
<p>Simply put, the technology is not there, the law is not accessible and, even after three years of working in this field, the improvements have been slim.</p>
<p>If individual and smaller copyright holders are going to be able to protect their works at all, there are going to have to be improvements. The bad guys are not getting any fewer and the technology they use is developing rapidly as they compete with search engines to get their work ranked well.</p>
<p>In short, we&#8217;re caught in the cross-fire of a different war and we lack the means to protect ourselves. That is something that has to change soon.</p>
<h4><span id="more-998"></span>The Issues We Face</h4>
<p>When a new artist or Webmaster seeks to persue an infringement of their work, the learning curve is steep and the stakes are very high. Many refuse to start simply because it is too intimidating and I am hard pressed to fault them.</p>
<p>This learning curve is exasperated by a collection of inadequacies and problems that Webmasters have to endure and work around. Those problems include the following:</p>
<ol>
<li><strong>Copyright Law is Confusing:</strong> Even if one discounts the <a title="Copyright Myths" href="http://www.templetons.com/brad/copymyths.html">slew of copyright myths</a>, copyright law is confusing and vague. It is so bad that many lawyers call it &#8220;unreadable&#8221; and few touch if they can avoid it. Laypeople have almost no hope of navigating it successfully, especially when it comes to issues such as fair use.</li>
<li><strong>Lack of Detection Tools:</strong> It is a simple fact, there are no tools currently available to the average consumer designed, from the ground up, to track content copying. There are ways to use existing tools for that purpose, such as with <a title="Using Google Alerts" href="http://www.plagiarismtoday.com/2008/01/24/video-how-to-use-google-alerts/">Google Alerts</a>, but these are hacks designed to turn a generic search engine into a content detection tool. Even services such as <a title="Copyscape" href="http://www.copyscape.com">Copyscape</a> and <a href="http://www.bitscan.com">Bitscan</a> are search engine hacks in and of themselves, just significantly more user-friendly and elegant in nature. Sadly, <a href="http://www.plagiarismtoday.com/2007/10/02/copyscape-improved-again/">despite improvements</a>, this simplicity comes at the expense of accuracy.</li>
<li><strong>Networking is Hard:</strong> Assuming one can navigate copyright law and locate an infringement they are prepared to act upon, finding out who to write can be difficult. <a href="http://www.plagiarismtoday.com/stopping-internet-plagiarism/3-finding-the-host/">Determining the host</a> of the site is no easy task in many cases and, though <a href="http://www.whoishostingthis.com">new tools</a> have come forward to help with that, the accuracy can <a title="Who is Hosting This Review" href="http://www.plagiarismtoday.com/2008/02/22/useful-site-who-is-hosting-this/">leave something to be desired</a>.</li>
<li><strong>Hosts are Uncooperative: </strong>Even if you can find the host and determine who to contact, there is no guarantee that they will be helpful. Many hosts are uncooperative in these matters and refuse to remove infringing works, even if the law is very clear. Even those who are cooperative can throw up <a title="Google and the DMCA" href="http://www.plagiarismtoday.com/2006/06/02/google-the-dmca-and-you/">unnecessary roadblocks</a> and create huge delays.</li>
<li><strong>The Process Doesn&#8217;t Scale:</strong> Though the process gets easier every time you do it, Keeping track of your information gets harder as you grow. I&#8217;ve used databases, spreadsheets and even Word files to keep track of the cases I&#8217;ve handled, However, every solution has been both inefficient and  incomplete.</li>
</ol>
<p>With these problems in mind, it is very easy to see why so many either avoid pursuing infringements of their work or stop soon after they start. It is an intimidating process and one that can grow into a tremendous time-sink very easily.</p>
<p>Fixing this problem is going to require more than just more changes to the current system, it is going to require a full solution that addresses these issues by creating a completely new set of tools, ones built from the ground up.</p>
<h4>Fixing What&#8217;s Broken</h4>
<p>If we assume that the current system is broken, or at least not functioning very well, we then have to look at what we can do to fix it.</p>
<p>Unfortunately, some things can not be repaired, at least not anytime soon. Copyright law is not going to magically become intelligible nor are we going to be able to consistently make locating a host easy. Though we can improve these problems with knowledge and technology, some element of these problems will remain.</p>
<p>Fortunately other elements can be addressed.</p>
<ol>
<li><strong>A</strong><strong> Centralized Solution:</strong> Currently detection and cessation are two different functions requiring very different tools. Unifying these elements would not only make the process faster, but easier to track. This would make it much less intimidating for new comers and streamline the entire process for veterans.</li>
<li><strong>Better Searching:</strong> The problem with relying on Google, or any other search engine, is that they routinely ban and remove sites that appear to be spam-like in nature. Sadly, these are the exact sites we are sometimes trying to find. Furthermore, any site that relies on Google for searching will make trade offs between simplicity and accuracy, either missing matches or requiring more human filtering of results.</li>
<li><strong>Real Image Searching:</strong> Image searching right now is a failure. There is no way for a visual artist, especially a smaller one, to effectively search for copies of their work on the Web. <a title="BAYTSP" href="http://www.baytsp.com/">Fingerprinting technology exists</a>, but is only available to those with deep pockets.</li>
<li><strong>Better Host Accountability:</strong> Hosts often ignore DMCA notices and spam reports because they realize that there is very little chance of them being sued. Though much of this is due to the <a title="Why Your Copyright Protection is Second Rate" href="http://www.plagiarismtoday.com/2008/01/11/why-your-copyright-is-second-rate/">nature of copyright law in the United States</a>, one would hope that there would be other methods of holding hosts accountable other than lawsuits. Though I made an attempt of that with my <a title="Host Report" href="http://www.plagiarismtoday.com/host-report/">Host Report</a>, my status on the Web combined with limited sample size and lack of time hamstrung the effort. Such a project would have to be larger than just this site.</li>
<li><strong>Integration:</strong> Though it is a cliche, we are in Web 2.0. However, the tools are decidedly from the old Web. RSS feeds, widgets and APIs open up a new world of possibilities and ways to simplify the process. New tools should both be aware of these elements and take advantage of them.</li>
</ol>
<p>Though this sounds like a mammoth task, and it is, much of the work has already been done. Large copyright holders already have tools capable of many of these things, they are just yet to be made available to the public at large.</p>
<p>Fortunately, there may be help on the horizon.</p>
<h4>A New Interest</h4>
<p><img style=' float: right; padding: 4px; margin: 0 0 2px 7px;'  class="picright alignright size-full wp-image-1000" title="attributor-logo" src="http://www.plagiarismtoday.com/wp-content/uploads/2008/04/attributor-logo.jpg" alt="" width="206" height="77" />Though the tools and technology have not changed much in the past few years, what has shifted is that there is now a growing interest in creating them. At least two companies, <a title="Attributor" href="http://www.attributor.com">Attributor</a> and <a title="Blogwerx" href="http://www.blogwerx.com/">Blogwerx</a>, seek to bring many of the features above to the table.</p>
<p>Attributor, for its part, seems to be farther along, already doing beta testing and having signed up both the <a title="Attributor AP" href="http://www.attributor.com/morenews2.php">AP</a> and <a title="Attributor Reuters" href="http://www.plagiarismtoday.com/2007/09/17/attributor-signs-up-reuters/">Reuters</a>, among others. They&#8217;ve also <a title="Attributor News" href="http://www.techcrunch.com/2007/11/04/attributor-launches-service-to-track-copyright-infringement-across-the-web/">announced plans</a> for a &#8220;self serve&#8221; version of the service aimed at bloggers.</p>
<p>Hopefully, we will see fruits from that later this year.</p>
<p>Also, there are rumors of other companies looking to set up and provide similar services, including at least one company that may be producing a stand-alone software application that will provide much of that functionality.</p>
<p>However, no matter what company it is that makes the big leap, this is an area that is ripe for a great deal of innovation in the months and years to come.</p>
<h4>Conclusions</h4>
<p>In the nearly three years that I have been running this site, precious little has changed for small copyright holders. Even as new tools have helped deep-pocketed corporations protect their works, often to the point of waging an unreasonable war, those same tools have passed over the rest of us.</p>
<p>However, the powers that be have started to see the potential for making the technology available to the rest of us. This has the potential to not only help us protect our content, but further the copyright dialog by bringing about a greater understanding about the frequency and ways that content is being reused.</p>
<p>Right now, for the most part, we are all just feeling around in the dark, hoping to get an idea of what is going on outside of our site. Even those who are against copyright enforcement have to agree that more knowledge is never a bad thing.</p>
<p>Hopefully, that time is coming soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/04/30/the-problem-with-fighting-plagiarism/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Attributor Launches Content Monitoring Service</title>
		<link>http://www.plagiarismtoday.com/2007/11/06/attributor-launches-content-monitoring-service/</link>
		<comments>http://www.plagiarismtoday.com/2007/11/06/attributor-launches-content-monitoring-service/#comments</comments>
		<pubDate>Tue, 06 Nov 2007 17:15:00 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Attributor]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Music Lyrics]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[Recipes]]></category>
		<category><![CDATA[Search-Engines]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[Song Lyrics]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/11/06/attributor-launches-content-monitoring-service/</guid>
		<description><![CDATA[Web startup Attributor, previously covered here, has announced the launch of their content monitoring and analysis platform and is openly accepting customers for their service. According to Attributor, the system &#8220;works by allowing publishers to register and digitally fingerprint each piece of content&#8221; and then checking for uses of that content that violate previously determined...]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.divshare.com/download/2657396-26b"><img hspace="5" align="left" src="http://www.divshare.com/img/2657396-26b.jpg" border="0" /></a>Web startup Attributor, <a href="http://www.plagiarismtoday.com/2007/09/17/attributor-signs-up-reuters/">previously covered here</a>, has <a href="http://www.attributor.com/morenews5.php">announced the launch</a> of their content monitoring and analysis platform and is openly accepting customers for their service.</p>
<p>According to Attributor, the system &#8220;works by allowing publishers to register and digitally fingerprint each piece of content&#8221; and then checking for uses of that content that violate previously determined rules for use. The system can then take action against any misuse of the content, including requesting a link back or filing a takedown notice.</p>
<p>The enterprise version, which is what Attributor is taking customers for at this time, costs tens of thousands of dollars per year to operate <a href="http://www.techcrunch.com/2007/11/04/attributor-launches-service-to-track-copyright-infringement-across-the-web/">according to TechCrunch</a>. A different version of the service aimed at individuals and bloggers should be available some time in 2008.</p>
<p>Also, according to the TechCrunch article, Attributor is currently indexing over 100 million pages per day and has already indexed over 15 billion. The current system only works with text but an image detection system is in early beta and audio/video detection is also planned.</p>
<p>Clearly, Attributor could be a game-changer in this industry and this will be a product to keep track of and follow as it grows.</p>
<p><span id="more-722"></span><strong>Content Analysis</strong></p>
<p>In a <a href="http://www.attributor.com/morenews6.php">separate announcement</a>, Attributor unveiled the results of an analysis the company did on the copying and pasting of song lyrics. </p>
<p>The results of the analysis found that song lyrics are copied widely on lyric sites, fan sites and blogs and that very few are officially licensed. What was surprising was that the only official source studied, <a href="http://music.yahoo.com/">Yahoo! Music</a>, was ranked behind all of the unofficial copies in Google and almost all in their own search engine.</p>
<p>Other tests done by Attributor, including one revolving around the recipe site <a href="http://www.epicurious.com">Epicurious</a>, <a href="http://attributor.com/blog/?p=23">have found similar results</a>. In that study, over 10,000 copies were found and over half had a higher search engine rank than the originals. Also, over half of the sites had ads on the pages and 60% failed to link back to the source.</p>
<p>Though these results are not likely to be a surprise to anyone who reads this site regularly, it is an embarrassment for Google, which has repeatedly said that it can distinguish between copies and originals. In a <a href="http://googlewebmastercentral.blogspot.com/2006/12/deftly-dealing-with-duplicate-content.html">blog post</a> in December 2006, they said, &#8220;Though annoying, it&#8217;s highly unlikely that such sites (scrapers) can negatively impact your site&#8217;s presence in Google.&#8221;</p>
<p>Apparently Google&#8217;s accuracy in this area leaves a great deal to be desired. This may explain some of Attributor&#8217;s harsh words for Google, calling it &#8220;unacceptable to publishers&#8221;, and why some feel that an Adsense lawsuit is on the horizon. </p>
<p><strong>Conclusions</strong></p>
<p>Attributor has the potential to be a major game-changer in this field. It is already making its impressions on the enterprise level and has signed on many prominent customers, including both the AP and Reuters, but it remains to be seen what impact it will have to individuals and bloggers.</p>
<p>Since the single user version of Attributor is not slated until some time in 2008, we will have to simply wait and see.</p>
<p>In the meantime, we are forced into a choice between paying high dollar for an enterprise or academic solution and using a hodgepodge of limited but free services. Though it is possible to track and monitor one&#8217;s content very well using the currently available means, it is growing more and more time restrictive as the Internet, along with the misuse of content, grows by the minute.</p>
<p>The value of a system like Attributor is not just its ability to detect more reuse of content, to organize it in a way that is maintainable and offer solutions that are practical. If Attributor can do that, then real progress will be made in protecting content on the Web.</p>
<p>Regardless, this is a company to follow and to watch.</p>
<p><em><strong>Disclaimer:</strong> I am a consultant for Attributor. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/11/06/attributor-launches-content-monitoring-service/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Watermarking vs. Fingerprinting: A War in Terminology</title>
		<link>http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/</link>
		<comments>http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/#comments</comments>
		<pubDate>Tue, 09 Oct 2007 20:32:40 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Prevention]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[Copyright-Law]]></category>
		<category><![CDATA[embedding]]></category>
		<category><![CDATA[fingerprinting]]></category>
		<category><![CDATA[hashing]]></category>
		<category><![CDATA[image editing]]></category>
		<category><![CDATA[sha]]></category>
		<category><![CDATA[watermarking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/</guid>
		<description><![CDATA[The terminology when dealing with copyright law is notoriously thick. Throw in technology and you have a kludge of confusing terms, misnomers and techno-speak. Navigating these waters can be tough and, as the latest episode of the Copyright 2.0 Show indicated, even the brightest in the field can be tripped up. However, in honor of...]]></description>
			<content:encoded><![CDATA[<p>The terminology when dealing with copyright law is notoriously thick. Throw in technology and you have a kludge of confusing terms, misnomers and techno-speak.</p>
<p>Navigating these waters can be tough and, as the <a href="http://www.plagiarismtoday.com/2007/10/08/copyright-20-show-episode-27-setting-precedent/">latest episode of the Copyright 2.0 Show indicated</a>, even the brightest in the field can be tripped up.</p>
<p>However, in honor of that confusion, I&#8217;ve decided to take a close look at two of the terms that have been giving headaches to many in the field, myself included, digital watermarking and digital fingerprinting.</p>
<p>Though the two terms sound very much the same and serve the same end goal many times, they are actually very different in the way that they operate and the protections they provide. </p>
<p>Confusing the two terms is very easy, for obvious reasons, but distinguishing between them is very important.</p>
<p><span id="more-699"></span><strong>Watermarking</strong></p>
<p>If <a href="http://en.wikipedia.org/wiki/Digital_watermarking">Wikipedia is to be believed</a>, then digital watermarking &#8220;is a technique which allows an individual to add hidden copyright notices or other verification messages to digital audio, video, or image signals and documents.&#8221;</p>
<p>The important element to remember is that, when watermarking something, you are adding information to an existing product. That information could be visible, such as those provided by <a href="http://www.visualwatermark.com/">Visual Watermark</a>, or invisible, such as those <a href="http://www.digimarc.com/mypicturemarc/">provided by Digimarc</a>.</p>
<p>Either way, the effect is the same. You are taking an existing work and embedding new information into it in order to make the work more easily identified as yours. This identification can be done by a third party, by you or by an automated tool.</p>
<p>It is that act of altering the content that is crucial to remember because, as you will see, it distinguishes itself from how fingerprinting works. </p>
<p><strong>Fingerprinting</strong></p>
<p>Fingerprinting has at least two definitions when it comes to protecting content. The first deals with taking each copy of your content and making it unique to the person who receives it. This way, if the work is shared, you know exactly which person spread the work initially.</p>
<p>A variation of this technique is used by the <a href="http://wordpress.org/extend/plugins/copyfeed/">CopyFeed plugin</a>, which embeds the IP address of the feed reader into every entry. Thus, if the feed is scraped and reposted, the person doing the scraping can be identified and blocked.</p>
<p>However, the more common definition deals with a technique, <a href="http://en.wikipedia.org/wiki/Digital_fingerprinting">as Wikipedia puts it</a>, &#8220;in which sophisticated software identifies, extracts and then compresses characteristic components&#8230; enabling that video [or other content] to be immediately and uniquely identified by its resultant &#8216;fingerprint&#8217;&#8221;.</p>
<p>In short, what fingerprinting usually does is take the content, use some kind of software to convert it into a unique number or string of characters and then use that string to match it against other content out there. </p>
<p>It basically does to the content what a fingerprint scanner does to your fingerprint. The principles are very much the same.</p>
<p>The simplest and best-known form of fingerprinting is <a href="http://www.w3.org/PICS/DSig/SHA1_1_0.html">SHA Hashing</a>. This is how downloaders verify they received the complete file and how <a href="http://www.numly.com">Numly</a> matches content against its database.</p>
<p>However, hashing can be thwarted easily by the alteration of just one character or byte. Since that changes the hash for the entire content, it will not return a match even if everything else is the same.</p>
<p>More advanced fingerprinting technology, such as what is employed by major media corporations and is under construction for <a href="http://gizmodo.com/gadgets/press/youtube-leashed-claim-your-content-is-on-its-way-253113.php">YouTube via the &#8220;Claim Your Content&#8221; site</a> are much more accurate and can detect snippets of material as short as a few seconds. These fingerprints are available for video, audio, image and even textual works. However, they are less commonly used with text due to other readily-available search methods.</p>
<p>In short, digital fingerprinting is something that you do to an original work that does not modify it, but makes it easily searchable by compressing the content into a string which is both unique to it and can easily be compared against other works.</p>
<p>If you&#8217;re confused, well, things are about to get even worse.</p>
<p><strong>Exceptions to the Rule</strong></p>
<p>Right now, the rule seems simple enough. Watermarking is anything you actively embed into your content to aid in detection or identification, fingerprinting is something you do to your work to make it easily searchable and comparable to other works by using some element of programming.</p>
<p>However, watermarking is a strange term in that it is usually applied only to multimedia works. You can watermark a movie, an image, an audio file or even a downloadable document, but the term is almost never used to describe something done to clear text.</p>
<p>For example, the Digital Fingerprint Plugin actually works more like a watermarking plugin. It embeds a unique string into the RSS feed, one selected by the user, to make the work more easily identifiable. It fits the description of a watermark almost perfectly but calling it the &#8220;Digital Watermarking&#8221; plugin would have created a great deal of confusion.</p>
<p>The reason is that watermaking is usually associated with some kind of visual or audible change in the work. It might be invisible to the naked eye, but it is supposed to be a modification to the look or sound of the work. At the very least, it is a modification to a file, such as a Word document. For whatever reason, embedding a new line of text or adding an signature is called by many things, but not watermarking.</p>
<p>The term fingerprinting has been expanded to encompass just about anything done to text to make it uniquely identifiable including traditional fingerprinting and some of what we might call watermarking. This has been going on since long before the Digital Fingerprint Plugin and is likely to carry on deep into the future.</p>
<p>That is, unless linguists suddently decide it is possible to &#8220;watermark&#8221; a textual work. </p>
<p><strong>Conclusions</strong></p>
<p>When you look at these two terms and how they overlap and intertwine, it is easy to see how and why there is so much confusion regarding them. Even this lengthy article is not by any means a final report on the differences between the two and I&#8217;m already looking forward to the discussion in the comments.</p>
<p>But if you need a simple way to remember the two, try this. If I needed to prove that your thumb belonged to you, I could either A) Tattoo a unique symbol onto it, that would be a form of watermarking, or B) I could put your thumb up against a scanner and let analyze the lines and curves of your thumb, making sure it was unique to you. That would be, quite predictably, fingerprinting.</p>
<p>Though that analogy isn&#8217;t perfect by any stretch, at least it brings some clarity to the table.</p>
<p>Now, if we could only deal with the dozens of other misnomers and confusing names, we might actually be able to have a serious conversation on these topics. </p>
<p>Perhaps another day. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2007/10/09/watermarking-vs-fingerprinting-a-war-in-terminology/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 14:22:33 -->
