<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Plagiarism Todaypoetry | Plagiarism Today</title>
	<atom:link href="http://www.plagiarismtoday.com/tag/poetry/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.plagiarismtoday.com</link>
	<description>Content Theft, Plagiarism, Copyright Infringement</description>
	<lastBuildDate>Mon, 13 Feb 2012 06:51:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Myspace Problem</title>
		<link>http://www.plagiarismtoday.com/2008/10/03/the-myspace-problem/</link>
		<comments>http://www.plagiarismtoday.com/2008/10/03/the-myspace-problem/#comments</comments>
		<pubDate>Fri, 03 Oct 2008 16:41:54 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Punditry]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[image protection]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[Myspace]]></category>
		<category><![CDATA[photobucket]]></category>
		<category><![CDATA[Photography]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[poetry]]></category>
		<category><![CDATA[Social-Networking]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/?p=1852</guid>
		<description><![CDATA[Myspace may be the most popular social networking site, but its share of plagiarism seems to far outstrip its market share. Along with its sister site Photobucket, Myspace is likely the current king of human-born plagiarism.  ]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/10/myspace-logo.jpg" alt="myspace-logo.jpg" border="0" width="186" height="45" align="left" class="picleft" />I&#8217;ve been tracking plagiarists on the Web for the better part of a decade and I&#8217;ve watched as they seem to have followed well-known trends on the Internet, constantly shifting to the latest tools and hang outs.</p>
<p>When I first started dealing with plagiarists in 2001, the problem was primarily on message boards and forums. This made sense. Forums were the most popular means of socialization at the time and they were by far the easiest way for someone with limited Web experience to publish content.</p>
<p>So, even though I regularly ran across a personal home page or even a blog that had plagiarized copies of my work, most of my time was spent working with forum admins.</p>
<p>However, over the years that has shifted. Forums are now more of an Internet curiosity than an established meeting place. Social networking is now, <a href="http://blog.freshnetworks.com/2008/09/social-networking-more-popular-than-porn/">quite literally</a>, the hottest thing on the Web and with profile, blogging and image tools built in, they are by far the easiest way to publish a work on the Web.</p>
<p>So, with this trend, plagiarists have also begun to flock to social networks. Of all of the human-made plagiarism of my content I deal with, approximately 75% take place on social networking sites.</p>
<p>However, whenever I say &#8220;Social Networking&#8221;, I am actually being generous. Nearly 95% of all of my social networking issues, and thus well over half of all of my issues in general, come from one site: Myspace. </p>
<p>This is something that has me very worried. <span id="more-1852"></span><br />
<h4>Rupert Murdoch&#8217;s Empire of Social Plagiarism</h4>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/10/photobucket-upload.jpg" alt="photobucket-upload.jpg" border="0" width="202" height="159" align="right" class="picright"/>Sadly, I am not alone in my issues with Myspace. Nearly every author I have asked about their plagiarism problems have put Myspace either at or near the top of their list. This is especially true for anyone who writes poetry, short stories or other creative works that might appeal to Myspace users.</p>
<p>The only group that doesn&#8217;t seem to put Myspace at the top of their list is fan fiction authors. They are one of the few groups that still see the worst issue with forums and community sites. However, fan fiction plagiarism creates <a href="http://www.plagiarismtoday.com/2006/01/18/fan-fiction-plagiarism/">a whole new set of problems</a> beyond the scope of this article. </p>
<p>The Myspace issue isn&#8217;t limited to writers either. Ask any artist or photographer where they see the most unwanted copying, they&#8217;ll usually mention Photobucket. Photobucket, however, is owned by Fox Corp., which also owns Myspace, and the two sites are closely linked. In fact, much of Photobucket&#8217;s initial popularity, before it was bought out, was derived from <a href="http://ecoustics-cnet.com.com/MySpace-blocks-Photobucket-videos-and-slide-shows/2100-1026_3-6175272.html">Myspace users wanting to host images and videos</a> in their profiles.</p>
<p>But while it makes sense that the largest social networking site and the largest photo sharing site would attract the largest number of plagiarists, these sites see copyright infringement way out of proportion to their numbers. Where Myspace may be a little bit ahead of Facebook, I see hundreds of cases of plagiarism the former with almost none on the latter. </p>
<p>The question becomes &#8220;Why is that true?&#8221; and &#8220;What can be done about it?&#8221;</p>
<h4>Copy and Paste Haven</h4>
<p><img src="http://www.plagiarismtoday.com/wp-content/uploads/2008/10/photobucket-dard.jpg" alt="myspace-logo.jpg" border="0" align="left" class="picleft" />To see the problem first hand, all you have to do is either browse around a few Myspace profiles or visit Photobucket&#8217;s home page. Though many people use these sites to post legitimate content, the majority seems to have performed at least one act of likely copyright infringement.</p>
<p>The problem is so serious that <a href="http://dard.freehostia.com/">scripts have been developed exclusively to detect image copying on Photobucket</a> (more on this script later) and it seems that you can&#8217;t go anywhere on Myspace or Photobucket without encountering likely infringements.</p>
<p>Users of these sites are content hungry. They need text, images, video and audio to complete their profiles. On one hand, this has created a whole cottage industry around providing licensed content for Myspace, <a href="http://www.pimp-my-profile.com/">including profile layouts</a>. On the other, it has lead to a copy and paste culture that can be very dangerous.</p>
<p>Though plagiarism and copyright infringement takes place everywhere, in my experience, the number of likely infringements on Myspace profiles dwarf those on Facebook. Why this is seemingly true is complicated, but there are five potential reasons I see for it.</p>
<ol>
<li><strong>Public Profiles:</strong> Myspace profiles are, for the most part, visible to general public. This not only makes it easier to find infringements, thus increasing the number found, but adds motivation for people to beef up their profiles, sometimes motivating people to take content that is not theirs. I see a similar trend on dating sites that have public profiles. </li>
<li><strong>Younger Audience:</strong> Most of the plagiarists I deal with on Myspace are under the age of 25. Myspace seems to cater to a younger audience and, for whatever reason, they seem more prone to abusing copy and paste on the Web.</li>
<li><strong>Low Expectation of Originality:</strong> In a way, it is hard to call much of the content use on Myspace plagiarism. There is a very low expectation of originality on the site as most people recognize that the images and other profile touches came from somewhere else, attributed or not. This is less true with the blogs and profile information sections, but it is clear that most Myspace users don&#8217;t expect fellow members to create every element of their profile. </li>
<li><strong>Permissive Takedown System:</strong> Though Fox responds quickly to all DMCA notices and removes infringing works, it is very slow to ban users. I have seen many cases where a Myspace user has lifted ten or more works and Myspace simply removed the infringing work without deleting the user as a repeat infringer.</li>
<li><strong>High Degree of Customization:</strong> Myspace, unlike many other networks, allows users to edit almost every aspect of their profile&#8217;s look and feel, leading to far more opportunities for abuse. </li>
</ol>
<p>All in all, the reason Myspace is such a haven for plagiarists is not a simple question to answer, but rather, there seems to be a combination of factors contributing to the problem.</p>
<p>Fixing it, however, is going to be even more difficult. </p>
<h4>Solving the Problem</h4>
<p>Myspace, as a site presents several critical challenges for writers and artists.</p>
<p>First, there is almost no way to contact the infringer directly unless you already have a Myspace account and can send them a message. However, such messages are a risky way to resolve copyright disputes as the paper trail is very limited.</p>
<p>Second, as mentioned above, Myspace does respond well to DMCA notices but rarely takes action against heavily infringing users. This can make it difficult to handle any infringer that has taken many different works.</p>
<p>Finally, the nature of Myspace is one where works posted are passed around almost immediately. It is something of an enclosed ecosystem where a poem, an image or an article will enter into the environment and then travel throughout the service, often carrying dozens of different names. </p>
<p>Overcoming this will not be easy and there is no way that a single Webmaster or copyright holder can do so. It is going to take a concerted effort to change both the culture of Myspace and the approach of its parent company.</p>
<p>The best advice I can give right now is to be aggressive when dealing with infringements on Photobucket and Myspace. Actively seek out infringements on the sites and report them quickly. Once image search tools allow users to detect infringements on Photobucket, use those tools as well and aggressively demand removal.</p>
<p>The goal is two fold. First, motivate Fox to create a more efficient and usable copyright system and the second is to change the culture of these sites. Currently, on YouTube, there is a great deal of infringement but users, for the most part, expect any likely infringing video to be pulled down. As a result, those who care about their YouTube account are careful not to post such videos and the most successful YouTube users feature original content.</p>
<p>The goal is not to eliminate copyright infringement on the site, that will never happen, but create an ecosystem that encourages creativity and rewards originality. The difference in such a system is obvious when one looks at a site such as Flickr and then goes to visit Photobucket.</p>
<h4>Conclusions</h4>
<p>The sad truth is that there is very little that we can do from the outside to help shape the type of site Myspace is to become. In addition to its large size and strong, if at times annoying, culture, there seems to be a lot of foot dragging on Fox&#8217;s end.</p>
<p>But what is frustrating about this problem is just that infringement seems to be so rampant on these sites, but that legitimate users of them seem to share the burden.</p>
<p>There is a reason why Myspace is not taken as seriously as Facebook or Photobucket as seriously as Flickr and the issue of copyright infringement is a big part of the equation.</p>
<p>This is most unfortunate not just for the artists and writers that have their work misused, but those that use those sites and try to do good things with it. Their reputations are tarnished by association. </p>
<p>Fortunately, most of those who use or did use those sites for the purpose of displaying their own work have at least established presences elsewhere.</p>
<p>If one wants to be taken seriously, it is clear that Myspace nor Photobucket is the place to do it. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/10/03/the-myspace-problem/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Using Copyscape to Detect Derivative Works</title>
		<link>http://www.plagiarismtoday.com/2008/02/13/using-copyscape-to-detect-derivative-works/</link>
		<comments>http://www.plagiarismtoday.com/2008/02/13/using-copyscape-to-detect-derivative-works/#comments</comments>
		<pubDate>Wed, 13 Feb 2008 17:12:39 +0000</pubDate>
		<dc:creator>Jonathan Bailey</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Content-Theft]]></category>
		<category><![CDATA[Copyright-Infringement]]></category>
		<category><![CDATA[copyscape]]></category>
		<category><![CDATA[Plagiarism]]></category>
		<category><![CDATA[plagiarism-detection]]></category>
		<category><![CDATA[poetry]]></category>
		<category><![CDATA[Search-Engines]]></category>
		<category><![CDATA[searching]]></category>
		<category><![CDATA[short stories]]></category>

		<guid isPermaLink="false">http://www.plagiarismtoday.com/2008/02/13/using-copyscape-to-detect-derivative-works/</guid>
		<description><![CDATA[Every tool has a purpose and Copyscape has its. Though weaker on verbatim plagiarism, it can help detect cases where the work has been rewritten and the regular search engines struggle to keep up.]]></description>
			<content:encoded><![CDATA[<p><img SRC="http://img.skitch.com/20080213-m2adnnd68ew6fn63cy3hn798ds.png" align="left" class="picleft"/>Yesterday, a concerned mother approached me worried that a poem being submitted to a local poetry contests was less than original. I offered some basic advice on how to search for the work in Google but, despite her best efforts, no results came up.</p>
<p>I then had her forward the poem on to me and I began to use my tools to try and analyze it. After a few futile Google searches of my own, I decided to give <a href="http://www.copyscape.com">Copyscape</a> a try before using even more advanced tools. </p>
<p>However, further searching proved unnecessary as Copyscape detected a match and it immediately became clear why Google searches were failing, the poem in question was not a verbatim plagiarism, but a derivative with more than a few modifications.</p>
<p>It highlighted the point that Copyscape, while perhaps not ideal for verbatim plagiarism, is best at detecting derivative works and performing the kind of thorough, line-by-line checking that simply is not practical for humans.<br />
<span id="more-818"></span></p>
<h4>Why It Works</h4>
<p>Humans and computers search for duplicates much the same way, by punching in strings of words into a search engine and comparing the sites that come up to the original. The difference, however, is that the Copyscape, and similar systems, do it much faster and process many more strings.</p>
<p>If someone takes your work and rewrites it, producing a derivative of the work, they are going to keep at least some of the original wording. That means there is searchable text to match in the derivative, unless the plagiarist changed at least one in every five or six words, but finding it is nearly impossible. </p>
<p>The problem is that the matching strings may cross punctuation or even paragraph breaks. Humans, in order to use hand search techniques, need to first remove all of the punctuation and then search for a string beginning with every word in the piece. </p>
<p>That could take hours, or even days, especially when you have to filter out false positives.</p>
<p>Computers, however, typically ignore such punctuation to start with and search for a large number of strings rapidly. If there is a match, they hold it and see if it matches for other strings as well. This works for pretty much any copy detection system including Copyscape, Turnitin and BitScan. </p>
<p>Though the technique <a href="http://www.plagiarismtoday.com/2007/10/02/copyscape-improved-again/">can detect verbatim plagiarism</a>, it works best in situations where the amount of work copied word for word is low or moderate. This is not just where these tools provide a unique and valuable service, but where they perform the best. </p>
<h4>The Problem with Verbatim Detection</h4>
<p><img SRC="http://img.skitch.com/20080213-1fnr9bdueqd8fay5gyditdjkr5.png" align="right" class="picright"/>The reason these services generally do a poor job at detecting widespread verbatim plagiarism is because of the nature of the searches.</p>
<p>If you search for a string of words in Goggle, the search engine will spit out all pages that match that text. If you choose a good phrase, all of those sites should have content lifted from your work. </p>
<p>However, with a Copyscape search, the computer has to process hundreds of search results, parse them and decide which sites are matches. How such services process these results are closely held secrets, but its safe to say that if the bar for determining a match is set too low, there will be a large number of false positives, too high, many actual matches will be discarded.</p>
<p>If you&#8217;re using such a system to detect widespread verbatim plagiarism, some of the matches are going to be discarded. This is because the search results received are not predictable and, depending on how the backend handles the process, it could discard some results it is unsure of in a bid to prevent false positives. </p>
<p>However, this isn&#8217;t to say that there aren&#8217;t times that using such an automated system is preferable. There are many situations where it is practically necessary.</p>
<h4>Derivative Destiny</h4>
<p>There are some types of work that are almost destined to be turned into unwanted derivative works. </p>
<p>Academic works are the most obvious target. Plagiarism in student papers, research reports and even academic journals are widespread and the vast majority of it involves at least some rewriting.</p>
<p>When you consider that, it is no shock that academics have had systems such as Turnitin for many years now as they are solutions targeted directly at that problem. In fact, many of the solutions we use to day are based, at least in spirit, upon these tools. </p>
<p>Another category of work likely to be rewritten as it is being plagiarized is marketing and promotional copy. In the fields of advertising and PR, effective copy is hard to write and similarities are expected. However, on the Web it often goes beyond taking ideas and into the realm of copyright infringement. However, since originality is still prized, both by humans and search engines, the copying is generally not verbatim. </p>
<p>Finally, artistic works, especially poetry and short stories, are particularly vulnerable to unwanted derivatives. Sometimes it is simple to detect, as when the plagiarist changes a character&#8217;s name or substitutes a few words in a poem, other times it can involve massive edits to the work, leaving it almost completely different from the original.</p>
<p>If you work heavily in any one of these categories, it is probably worth your while to do occasional searches for your work using a system such as Copyscape to help detect derivative use of your work. </p>
<p>The systems may not be perfect, but they are certainly preferable to doing the checks by hand.</p>
<h4>Conclusions</h4>
<p>Search engines were never designed to detect duplicate content. They were created to locate keywords and search terms within sites and determine which of those sites deserves to be ranked the highest.</p>
<p>As such, the tools we use to detect copies of our work have to push the search engines to their limit and twist their features to generate unintended results.</p>
<p>Fortunately, new tools are being developed from the ground up to index and monitor content across the Web. As these tools improve and grow, they will likely be able to replace these hacks and tweaks for those seeking to search out their own content. </p>
<p>When that happens, not only will the search engines be able to better focus on what they do best, but we will have much better data about who is using our work and how it is affecting us.</p>
<p>The push in this area has really just begun and it will be an area of radical change in the coming months and years. </p>
<h4>Next Time</h4>
<p>This is the first in a short series on search engines and plagiarism detection. Next time, I&#8217;m going to do a head-to-head comparison of five of the top search engines to see which works best at detecting verbatim copies of your work. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.plagiarismtoday.com/2008/02/13/using-copyscape-to-detect-derivative-works/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

Served from: www.plagiarismtoday.com @ 2012-02-13 06:42:40 -->
