<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Open Source Research</title>
	<atom:link href="https://openresearch.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://openresearch.wordpress.com</link>
	<description>My daily sufferings as a PhD student at Berkeley</description>
	<lastBuildDate>Thu, 05 Jan 2012 00:57:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='openresearch.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>https://s-ssl.wordpress.com/i/buttonw-com.png</url>
		<title>Open Source Research</title>
		<link>https://openresearch.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="https://openresearch.wordpress.com/osd.xml" title="Open Source Research" />
	<atom:link rel='hub' href='https://openresearch.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Notes from Neel Sundaresan&#8217;s keynote speech at RecSys 2011</title>
		<link>https://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/</link>
		<comments>https://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/#comments</comments>
		<pubDate>Wed, 26 Oct 2011 14:05:16 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=292</guid>
		<description><![CDATA[He started by stating that he won&#8217;t have any greek symbols in the talk. Arch West was the inventor of Doritos and David Pace we the inventor of Pace sauce. What they did was that they noticed the can sell more if they advertise the two products together. There is a lesson in cross-selling and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=292&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_293" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-293" title="Neel Sundaresan" src="http://openresearch.files.wordpress.com/2011/10/neel2008.jpg?w=497" alt="Neel Sundaresan"   /><p class="wp-caption-text">Neel Sundaresan</p></div>
<p>He started by stating that he won&#8217;t have any greek symbols in the talk.</p>
<p>Arch West was the inventor of Doritos and David Pace we the inventor of Pace sauce. What they did was that they noticed the can sell more if they advertise the two products together. There is a lesson in cross-selling and recommender systems that we can learn from this story.</p>
<p>eBay started when Pierre Omidyar wanted to sell his broken laser pointer. He listed the laser pointer online for 99 cents and finally sold for 14 dollar. Was wondering if the person who bought it knew that it is broken. The guy responded &#8220;Yes I am a collector of broken laser pointers&#8221;</p>
<p>Why people buy something? it is hard to say. Some people buy stories! remember the toast that sold for 27K that was the cheapest marketing campaign that a casino had.  One man&#8217;s trash is another&#8217;s treasure.</p>
<p>The long tail in eBay&#8217;s context mean most people sell very few items and most of eBay&#8217;s revenue comes from these people. (i.e the mean is way larger than the median</p>
<p>The users constantly are running experiments to maximize their revenue. They are constantly testing to see if free shipping can sell more, different selling strategies are being tested by users at any time on eBay.</p>
<p>This causes an interesting behavior. If you promote a user&#8217;s product on the homepage they may increase their price! This is an interesting dynamics between the user and the seller (eBay)</p>
<p>One of the problems that locations like eBay have is the problem of big data. Complex algorithms are often impossible to work with in that scale. If you are looking for a job at eBay you need to know how to work with data in that scale.  A goal at eBay lab is that when a new scientist joins the lab on Monday he got access to all the data by Friday.</p>
<p>This amount of data has changed how economics is doing experiments. They can now run experiments on 400 million data points.</p>
<p>What are you optimizing for at eBay? is it profit maximization? do you want to increase the shopping cart size? are you looking for maximum customer satisfaction?</p>
<p>The other thing is how do you measure success?</p>
<p>Everything we do at eBay is a recommendation.</p>
<blockquote><p>I KEEP six honest serving-men<br />
(They taught me all I knew);<br />
Their names are What and Why and When<br />
And How and Where and Who.</p>
<p>&#8211; from <em>The Elephant&#8217;s Child </em></p></blockquote>
<p>When we look at the tag cloud of eBay you see keywords like &#8220;used&#8221;, &#8220;vintage&#8221; and &#8220;antique&#8221; a lot more than &#8220;new&#8221;</p>
<p>The search is an interesting problem some people are looking for &#8220;ipod nano 4gb black new&#8221; and some are looking for the skin for their ipod. Our search engine should be able to differentiate between &#8221;ipod nano 4gb black new&#8221; &#8221;ipod nano 4gb black new skin&#8221;. This proposes a hard and challenging research questions.</p>
<p>Click trails can help us tremendously with building recommender systems that can capture these behavior and improve recommender systems.  At eBay a data cleanup is an important part of recommender. Specially when they use click trails.</p>
<p>eBay has a language like pig that allows them to do pattern recognition at scale. Sometimes a search is followed by some page views and another search. This pattern is useful to do recommendation to other users who have similar initial search queries. See two recent papers from Sundaresan for the results and model.</p>
<p>Fashion item buyer on eBay are very brand aware. Sometimes ebay does not have enough inventories and needs to recommend proper products from outside websites.</p>
<p>one of the challenges at eBay is that we do not have a catalogue of items (remember the laser pointer story?) Amazon does not have such a problem, you cannot sell anything on Amazon unless it is on the catalog.</p>
<p>eBay uses its own matrix factorization see their ICML paper. The sparsity in eBay&#8217;s data is fascinating it is 100 times the sparsity in the Netflix data.</p>
<p>eBay clusters items into pseudo products using LDA. He shows an example of a recommendation for a broken blackberry cellphone.</p>
<p>The most important thing is &#8220;why&#8221; are you recommending this to the user and &#8220;why&#8221; they should buy it. HCI is a useful tool here. reveal to the user why you are recommending. Something like &#8220;52% of the people who bought this item also bought &#8230;&#8221; are very effective.  Be very explicit on why certain recommendations are made.</p>
<p>Let&#8217;s look at &#8220;When&#8221;. things like reminders, post purchases, urgency, upgrades, seasonal sales fall into this. Reminders can be like &#8220;you have viewed this item&#8221; that reminds people that they can still go and buy. There is a temporal element to this problem too. They may not need the same item until after 30 days but need to buy it again after 30 days is passed.</p>
<p>See this <a href="http://www.wired.com/magazine/2011/04/st_essay_persuasion_profiling/" target="_blank">wired article </a>on persuasion based profiling and recommendation systems. (thanks to twitter).</p>
<p>There is a lot of seasonality on eBay. Mother and Father&#8217;s days, Christmas. There are other events that are we don&#8217;t know (so my question is how can we find them algorithmically)</p>
<p>We get more data from mobile devices than we get from online. It is a huge research opportunity.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/292/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/292/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=292&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/neel2008.jpg" medium="image">
			<media:title type="html">Neel Sundaresan</media:title>
		</media:content>
	</item>
		<item>
		<title>Notes from From Understanding to Enabling Networks: Using Web Science to Enhance Recommender Systems</title>
		<link>https://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/</link>
		<comments>https://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 17:15:33 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=284</guid>
		<description><![CDATA[The keynote at #recsys2011 is by Noshir Contractor. He is the coauthor of &#8220;Theories of communication networks&#8221; which seems to be an interesting book from amazon reviews. the presentation stack is available here (thanks to @barrysmyth for the link) He started by presenting SNIF. SNIF is a device and social networks for dogs! Kind of social petworking. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=284&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_289" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-289" title="Noshir Contractor" src="http://openresearch.files.wordpress.com/2011/10/contractor_noshir.jpg?w=497" alt="Noshir Contractor"   /><p class="wp-caption-text">Noshir Contractor</p></div>
<p>The keynote at #recsys2011 is by<a href="http://nosh.northwestern.edu/"> Noshir Contractor</a>. He is the coauthor of &#8220;Theories of communication networks&#8221; which seems to be an interesting book from amazon <a href="http://www.amazon.com/gp/product/0195160371/ref=as_li_ss_tl?ie=UTF8&amp;tag=resodail-20&amp;linkCode=as2&amp;camp=217145&amp;creative=399369&amp;creativeASIN=0195160371" target="_blank">reviews</a>.</p>
<p>the presentation stack is <a href="http://www.sis.smu.edu.sg/SocInfo2011/documents/noshir.pdf">available</a> here (thanks to <a href="http://twitter.com/barrysmyth">@barrysmyth</a> for the link)</p>
<p>He started by presenting SNIF. SNIF is a device and social networks for dogs! Kind of social petworking. In contrast lovegety is the SNIF technology for people. Find love through random encounters.</p>
<p>Today we will talk about How we can take research in social sciences and bring it to recommender systems.</p>
<p>People have looked at citations and papers and found that people who write papers in teams have a high impact. Also articles by teams from different disciplines from different geographic locations have the highest impact. Fining the appropriate team from a diverse background and geography is much harder.</p>
<p>Thus we are looking at assembeling these type of teams. But how do we decide whom to bring to the team?</p>
<p>The exciting thing about our time is that we have theories, data and methods, additionally we have computation infrastructure to run these models</p>
<p>Why do people collaborate with each other?</p>
<p>MTML model:</p>
<ul>
<li>self interest (from econ theories)</li>
<li>Social and resource exchange</li>
<li>Mutual interest and collective action</li>
<li>Theories of contagion</li>
<li>Theories of balance</li>
<li>Theories of homophily</li>
<li>Theories of proximity</li>
</ul>
<div>My note: How about Robert Spolsky&#8217;s theory?</div>
<div><em>Exponential randome graphs</em> can explain how these collaboration networks is formed (the shape of the graph)</div>
<div>They have looked at the structure of NSF proposals and they wanted to see if they can build a recommnder system that by using characteristics of the proposal make recommendations for acceptance</div>
<div>The likelihood of collaboration is highers if:</div>
<div>
<ul>
<li>you have written an NSF proposal together</li>
<li>you have cited each other</li>
</ul>
<div>Didn&#8217;t know about <a href="http://en.wikipedia.org/wiki/H-index">H-index</a>. Interesting factor. Apparently those with higher H-index are less likely to collaborate.</div>
</div>
<div>Citing your collaborators actually reduces the likelihood of getting NSF funding (!)</div>
<div>Solving the link recommendation problem (recommending who should be on the team)</div>
<div>Link prediction approaches: node-wise similarity, network topology, or probabilistic modeling</div>
<div>P* for link prediction</div>
<div>Use p* models to calculate link probability</div>
<div>
<ul>
<li>Estimate p*/ERGM</li>
<li>the rest I didn&#8217;t get to type (!)</li>
</ul>
<div>I think the probabilistic model that he is refering to is the same as model fitting on Bayesian nets but I am not sure.</div>
</div>
<div>The talk ended by a demo of the implementation that is available <a href="http://ciknow1.northwestern.edu/sw_nucats/">here </a></div>
<div>Noshir&#8217;s book is also available for free on his <a href="http://nosh.northwestern.edu/cv/dissertation-and-publications/">personal </a>website</div>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/284/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/284/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=284&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/contractor_noshir.jpg" medium="image">
			<media:title type="html">Noshir Contractor</media:title>
		</media:content>
	</item>
		<item>
		<title>Notes from &#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang</title>
		<link>https://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/</link>
		<comments>https://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 15:56:59 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=275</guid>
		<description><![CDATA[These are my unedited notes from Daniel Tunkelang&#8217;s presentation at #recsys2011. I am editing as you are reading this post. &#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang Goal is to have a better relationship with the user Three take aways from this talk: Consider asking vs guessing Ask good questions It&#8217;s okay [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=275&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_277" class="wp-caption alignleft" style="width: 145px"><img class="size-full wp-image-277" title="Daniel Tunkelang" src="http://openresearch.files.wordpress.com/2011/10/daniel-tunkelang.jpg?w=497" alt="Daniel Tunkelang"   /><p class="wp-caption-text">Daniel Tunkelang</p></div>
<p>These are my unedited notes from Daniel Tunkelang&#8217;s presentation at #recsys2011. I am editing as you are reading this post.</p>
<p><strong>&#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang</strong><br />
Goal is to have a better relationship with the user</p>
<p>Three take aways from this talk:</p>
<ul>
<li>Consider asking vs guessing</li>
<li>Ask good questions</li>
<li>It&#8217;s okay to make mistakes<strong> if</strong> you have a good explanation and adapt to feedback</li>
</ul>
<p><strong>Theory</strong></p>
<p><strong></strong>&#8220;<a href="http://www.amazon.com/gp/product/B004J8HY8K/ref=as_li_ss_tl?ie=UTF8&amp;tag=resodail-20&amp;linkCode=as2&amp;camp=217145&amp;creative=399373&amp;creativeASIN=B004J8HY8K">The Man Who Lied to His Laptop</a>&#8221;  is a great related read<br />
Paul Grice&#8217;s maxims of conversations:</p>
<ol>
<li>Quality</li>
<li>Quantity</li>
<li>Relation</li>
<li>Manner</li>
</ol>
<p><strong>**Do not lie</strong></p>
<ul>
<li>Don&#8217;t use &#8220;recommended&#8221; when you really mean &#8220;sponsored&#8221; or &#8220;excess inventory&#8221;. User&#8217;s loss of trust will cost you. but users do not have a model on how on how to trust a system</li>
<li>Optimize for the user&#8217;s utility</li>
<li>Apply a standard of evidence (quality, quantity) that you believe in</li>
</ul>
<p><strong>Right amount of information</strong></p>
<ul>
<li>Exchange small units of information</li>
<li>If recommendations supplement other content consider overall cognitive load</li>
<li>provide short meaningful explanations</li>
</ul>
<p><strong>Maxim 3: Relation. Relevant to the user</strong></p>
<ul>
<li>Offer value to the user</li>
<li>respect task context</li>
<li>don&#8217;t be obnoxious</li>
</ul>
<p><strong>Maxim 4: Manner</strong></p>
<ul>
<li>relevant to the user</li>
<li>Eschew obfuscation</li>
<li>Avoid ambiguity</li>
<li>be brief</li>
<li>be orderly</li>
</ul>
<p><strong>Another perspective</strong></p>
<p>Another perspective is Gary Marchionini&#8217;s perspective on Human computer information retrieval</p>
<p><span style="text-decoration:underline;"><em>Empower people to explore large-scale information but demand that people also take responsibly for the control be expending cognitive and physical energy</em></span></p>
<p><strong>principles of hcir</strong></p>
<ol>
<li>do more than deliver information: facilitate sense-making</li>
<li>require and reward effort</li>
<li>adapt to increasingly knowledgeable users over time</li>
<li>be engaging and fun to use</li>
</ol>
<p><strong>Adapt to user knowledge</strong><br />
Systems that don&#8217;t get better over time will frustrate users, because users<strong> DO</strong> get better over time</p>
<p><strong>Personalized recommendations</strong></p>
<ul>
<li>be transparent about model so users gain insight</li>
<li>allow users to modify models to correct</li>
<li>solicit just enough information to provide value</li>
<li>Exemplars are interesting tools to communicate the recommender model to the user</li>
<li>Users should be able to modify the recommender system say you have a recommender system that uses location and user is using a proxy. He should be able to turn if off to make it noncreepy!</li>
</ul>
<p><strong>Social recommnedations</strong></p>
<ul>
<li>identify the right set of similar users</li>
<li>allow users to manipulate the social lens</li>
<li>accommodate users who break your model</li>
</ul>
<p>When making item recs, explain your recommendations! <em>Watch for non-sequiturs (diapers -&gt; beer problem)</em></p>
<p><strong>**Tell me about yourself is friendlier than &#8220;fill out 20 pages of survey&#8221;</strong></p>
<p>Corpse bride is in the recommnded set and I have watched it, it is good. it gives me the feeling that recommender is working properly</p>
<p><strong>Learning from netflix</strong></p>
<ul>
<li>Ask users for help upfront but not too much help</li>
<li>pay attention to what the user tells you</li>
<li>give users value often and early</li>
</ul>
<p>75% of netflix views result from recommendation</p>
<p>Underpromissing and overdelivering is sometimes a <strong>good idea</strong></p>
<p>Soe models more explainabel than others</p>
<ol>
<li>consider decision trees and rule based models</li>
<li>avoid using latent, unlabled features</li>
<li>if the model is opaque use exaples as surrougates</li>
</ol>
<p>Make a good first impression<br />
your user&#8217;s first experience is critical</p>
<p>See &#8220;Machine learning for large scale recommender systems&#8221; by Agrawal and Chen ICL 2011 Tutorial</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/275/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/275/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=275&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/daniel-tunkelang.jpg" medium="image">
			<media:title type="html">Daniel Tunkelang</media:title>
		</media:content>
	</item>
		<item>
		<title>&#8220;We Will All Be Jedi Masters Soon&#8221; or &#8220;Random But Coherent Thoughts on Modern Education&#8221;</title>
		<link>https://openresearch.wordpress.com/2011/10/12/education/</link>
		<comments>https://openresearch.wordpress.com/2011/10/12/education/#comments</comments>
		<pubDate>Wed, 12 Oct 2011 09:48:13 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=262</guid>
		<description><![CDATA[Three things have happened recently: I am going on the job market and looking for a research job Steve Jobs passed away I read a blog post entitled &#8220;Unless you are awesome, you will be outsourced&#8220; 1- We are living in an exciting era. Stanford is offering their AI course online for free and I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=262&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://openresearch.files.wordpress.com/2011/10/education.jpg"><img class="aligncenter size-full wp-image-263" title="education" src="http://openresearch.files.wordpress.com/2011/10/education.jpg?w=497" alt=""   /></a></p>
<p>Three things have happened recently:</p>
<ul>
<li>I am going on the job market and looking for a research job</li>
<li>Steve Jobs passed away</li>
<li>I read a blog post entitled &#8220;<a href="http://blog.summation.net/2011/10/awesome-or-outsourced.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+typepad%2Fsummation+%28Summation%29">Unless you are awesome, you will be outsourced</a>&#8220;</li>
</ul>
<p>1- We are living in an exciting era. Stanford is offering their <a href="http://www.ai-class.com/home/">AI course</a> online for free and I am telling you, it is not the crappy study material that MIT dumps on their open course ware website. These are serious, well curated videos with quizzes and assignments.  Basically what it means is that a brown kid in the desserts of south Oman can now learn what a wealthy full-of-himself Stanford kid in Palo Alto learns about AI .</p>
<p>2- For my PhD I worked on the intersection of human-computer interaction (HCI) and machine learning (ML). It took me a couple of years and I can assure you, you can find online and free educational material to become as bad-ass as I am in both of the fields. Ironically the number of educational videos on machine learning outnumbers the number of videos on CHI!</p>
<p>3- Steve Jobs died a couple of days ago. As an open source contributor I hated him while he was around. But I felt extremely sad when passed away. Let&#8217;s all face it, he might have been an ass when it comes to treating others or exploiting and abusing child labor. But he lead many amazing projects. Apple was a mecca for HCI people. They really set the standard for innovation in consumer products. I have recently found<a href="http://www.youtube.com/watch?v=ob_GX50Za6c"> this video</a> by Steve Jobs in which he emphasizes tool building and how computers are making us super humans. And it is very true. The guy is a visionary.  In his words we can quote him, disagree with him, glorify or vilify him, but the only thing we can&#8217;t do is to ignore him because he changed our lives forever.</p>
<p>4- In terms of computing we are living in an amazing time too. I work in a lab that has a pretty strong cluster of computers and 90% of the time the load on the cluster is not that much. Do you see it? It means we are entering an era that our computing power is way more than our computing needs! And if we utilize our computing power well we may actually have excess cycles. Things like Hadoop have allowed us to treat multi-computers like single computers and run massive jobs on them. Do things that were impossible before.</p>
<p><strong>I guess what I am saying is that:</strong></p>
<p>The other blog post is claiming that we need to be awesome otherwise we will loose our jobs. What I am saying is that the cost of becoming awesome is decreasing dramatically. With all these free courses, education is becoming cheap (while schooling becomes more and more expensive), our  tools are getting better and I also believe that seeing our friends on facebook/twitter/google+ has given us an incentive for self improvement and encourages us to learn and educate ourselves more. It has become much easier to push ourselves to become a Jedi master.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/262/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/262/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=262&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/10/12/education/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/education.jpg" medium="image">
			<media:title type="html">education</media:title>
		</media:content>
	</item>
		<item>
		<title>Methods for testing data-heavy applications &#8211; Part I</title>
		<link>https://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/</link>
		<comments>https://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 02:32:56 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=254</guid>
		<description><![CDATA[For the past two months, I have been building a piece of code for a very interesting company that is not on the Fortune 500 list yet but if it goes public it may very well be a Fortune 1 company! This application is very heavy on data. I am still developing, I feed it chunks of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=254&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_255" class="wp-caption alignright" style="width: 257px"><a href="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg"><img class="size-full wp-image-255" title="Software Testing" src="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg?w=497" alt="Software Testing"   /></a><p class="wp-caption-text">Software Testing</p></div>
<p>For the past two months, I have been building a piece of code for a very interesting company that is not on the Fortune 500 list yet but if it goes public it may very well be a Fortune 1 company! This application is very heavy on data. I am still developing, I feed it chunks of 100 megabyte datasets, it goes through labor for a couple of minutes and crunches the numbers, and eventually comes back with a table with about 100 rows that I could take and do some statistical analysis on and plot. That big chunk of code gets reduced to bite size human-readable numbers and graphs. We are still testing it and I honestly have no idea how it will perform once it gets connected to the data firehose of the company! and that drives me insane!</p>
<p>I have been testing this software. It works fine. It works fine on all of my test datasets and I have been scrolling through its algorithms that span over a thousand line of code to make sure they work as I expect. I am going insane over this. What if we ship it next week and there is a bug in it? It is not like a Google+ application that you write, ship and then notice that instead of the photo of your female friend it brings back the photo of your dad! there is no way to see the bug when the product is out. your users cannot report the bugs because they won&#8217;t see any of it.</p>
<p>I have been looking into the scientific literature and there is tons of methods in the field of software development with tons of different names but none is about the type of software that I am working on. So I asked on twitter how other people are testing their software and here is a number of responses that I have received.</p>
<p>1- Take the current data files and manually perform your algorithm on the data set (using something like Excel) and see if the final results match</p>
<p>2- Fabricate test data sets for which you know the final result. Run the code on those test files. For example say you have written a code for summation. You can just feed it an array of ones and see if it works on them. Test for boundary cases, test for data that produce intuitive outputs, if you know your algorithm should have a specific behavior, test for that too (like when you expect symmetry in your output)</p>
<p>3 &#8211; Hopefully you break your code into atomic blocks that only do one thing (inverse a matrix, svd calculations, etc) write unit tests for those functions individually. Then combine those individual functions into larger functions and components and test them too and so on.</p>
<p>I am more interested to know your thoughts. what are your suggestions for testing big data applications?</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/254/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=254&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg" medium="image">
			<media:title type="html">Software Testing</media:title>
		</media:content>
	</item>
		<item>
		<title>Mechanical Turk Market, Ethics and Milton Friedman</title>
		<link>https://openresearch.wordpress.com/2011/04/07/mechanical-turk-market-ethics-and-milton-friedman/</link>
		<comments>https://openresearch.wordpress.com/2011/04/07/mechanical-turk-market-ethics-and-milton-friedman/#comments</comments>
		<pubDate>Fri, 08 Apr 2011 05:50:34 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=244</guid>
		<description><![CDATA[Disclaimer: unlike what it may seem like, I am not a conservative. I am just another liberal researcher from Berkeley who values and celebrates weed and women As an experimental economist I found the mechanical turk marketplace a very interesting example of macro economy. Yes, I know it is a zoo. But at the same [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=244&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://openresearch.files.wordpress.com/2011/04/friedman.jpg"><img class="alignleft size-full wp-image-246" title="friedman" src="http://openresearch.files.wordpress.com/2011/04/friedman.jpg?w=497" alt=""   /></a></p>
<p><em>Disclaimer: unlike what it may seem like, I am not a conservative. I am just another liberal researcher from Berkeley who values and celebrates weed and women</em></p>
<p>As an experimental economist I found the mechanical turk marketplace a very interesting example of macro economy. Yes, I know it is a zoo. But at the same time people get interesting results out of this market. Apparently <a href="http://behind-the-enemy-lines.blogspot.com/2011/04/want-to-improve-sales-fix-grammar-and.html" target="_blank">Zappos</a> is using it to correct the grammar and wording of the reviews on their website and Professor Franklin&#8217;s team has been building a <a href="http://www.google.com/search?aq=f&amp;sourceid=chrome&amp;ie=UTF-8&amp;q=Crowddb" target="_blank">database engine</a> that can use the crowd to fill the missing values in their database. In some sense you can really get Shakespeare&#8217;s work out of this crowd of people who just tap on their keyboards. No complain on my part though, many turkers are honest workers and use it to get a legitimate salary but many (including myself) are just there for fun and they won&#8217;t blink if they can screw over your work.</p>
<p>Those researcher who piss me off the most are the ones who claim that they have found a groundbreaking theory on mechanical turk, I keep seeing people who claim they have observed a &#8220;new behavior&#8221; in the crowd while what they are really seeing is just another validation for old economics theories.</p>
<p>One of the issues that keeps coming up is &#8220;<em>Is it ethical that there is no minimum wage for mechanical turk?</em>&#8221; This is the question that was answered 60 years ago not on this stupid mechanical turk market but on large labour markets by our old friend, professor <a href="http://en.wikipedia.org/wiki/Milton_Friedman" target="_blank">Milton Friedman</a> (he has some other crazy ideas including geting rid of IRS and FDA but those are a little crazier than what I can really explain here <img src='https://s-ssl.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  )</p>
<p>In essence, Friedman claims that when you set a minimum wage for people&#8217;s work, you are just causing part of your society to be out of work. It affects minorities, women and teenagers dramatically, increases unemployment and in these years it highly encourages businesses to hire people offshore. Watch <a href="http://www.youtube.com/watch?v=ca8Z__o52sk" target="_blank">Friedman&#8217;s video here</a>. Or Google his books on Google books.</p>
<p>So an answer to those of my friends who question the ethics of having these labor markets like Mechanical Turk, here is my answer:</p>
<blockquote><p>Not only it is ethical not to have a minimum wage in mechanical turk but it is also unethical to have a minimum wage in the US. In some scene mechanical turk was formed due to some nonsense minimum wage regulations in the US labor market.  If business owners could hire people in the US to clean up their messy databases for $3.00 they wouldn&#8217;t need to go on mechanical turk to hire the same person for $1.50 an hour.</p></blockquote>
<p>I am interested in knowing what you think especially counter arguments are highly appreciated.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/244/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/244/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=244&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/04/07/mechanical-turk-market-ethics-and-milton-friedman/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/04/friedman.jpg" medium="image">
			<media:title type="html">friedman</media:title>
		</media:content>
	</item>
		<item>
		<title>Everybody is spamming everybody else on Mechanical Turk</title>
		<link>https://openresearch.wordpress.com/2011/03/09/everybody-is-spamming-everybody-else-on-mechanical-turk/</link>
		<comments>https://openresearch.wordpress.com/2011/03/09/everybody-is-spamming-everybody-else-on-mechanical-turk/#comments</comments>
		<pubDate>Wed, 09 Mar 2011 08:17:37 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=232</guid>
		<description><![CDATA[Two months ago Panos Ipeirotis, an NYU professor who studies anthropological aspects of online slavery as well as doing nerdy computer science stuff, pointed out in his blog post that 41 percent of mechanical turk HITs are posted by spammers. These requesters are typically those who ask turkers to click on a link, friend somebody on facebook or trash [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=232&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Two months ago <a rel="author" href="http://www.blogger.com/profile/15283752183704062501">Panos Ipeirotis</a>, an NYU professor who studies anthropological aspects of online slavery as well as doing nerdy computer science stuff, pointed out in his <a href="http://behind-the-enemy-lines.blogspot.com/2010/12/mechanical-turk-now-with-4092-spam.html" target="_blank">blog post </a>that 41 percent of mechanical turk HITs are posted by spammers. These requesters are typically those who ask turkers to click on a link, friend somebody on facebook or trash somebody on facebook. These tasks may not look like spam to us as I personally friend and/or trash people on facebook on regular basis so no big deal, but I never get paid for it:)</p>
<p>Anyways as a curious researcher who  doesn&#8217;t want to accept anything until he has done his due diligence I posted a spam HIT with a generous 1 dollar reward asking people to write a review about interesting places to visit in the  &#8221;Shiraz City, France&#8221;. Shiraz is known for its ridiculously hot girls and great wine but the problem is that it is not city in France, surprisingly it is in Iran.</p>
<p>I exclusively asked for those turkers who have lived in &#8220;Shiraz City, France&#8221; and made sure to highlight that if the turker has never lived in that city the work is going to be rejected. (For those who would like to bring up ethical issues for  this experiment I would like to point out that turkers have screwed me over and over on countless number of occasions so I reserve the right for myself to mess with them occasionally, additionally you can look at this as the Milgram experiment of the crowdsourcing era. Who doesn&#8217;t want to be called Philip Zimbardo of the online slavery market? Now that I am thinking about it I actually kind of like it.</p>
<div id="attachment_234" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-234" title="Antoine Dodson" src="http://openresearch.files.wordpress.com/2011/03/dodson.jpg?w=497" alt="Antoine Dodson"   /><p class="wp-caption-text">The title of this post was inspired by a poem by Antoine Dodson, a frustrated community observer who once shouted out of frustration &quot;Hide yo&#039; keeds, hide yo&#039; wife, and hide yo&#039; husband cause they rapin&#039;everybody out here&quot; He was the first one to voice his frustration about the kind of society that we all have become, a community in which people don&#039;t care about each other, the society that leaves no choice for people but hiding! </p></div>
<p>Anyways if the market was fair I assume two things would have never happend</p>
<p>1) I would have never been allowed to submit such an obscene work</p>
<p>2) I would have never received any responses</p>
<p>To my surprise both happened.</p>
<p>Nobody really polices the mturk market so I was able to get my HIT through. but the more interesting thing was that I was actually receiving suggestions for visiting a city that didn&#8217;t even exist!</p>
<p>I receive three types of responses:</p>
<ul>
<li> 1- Random excerpts from our old friend &#8220;the internet&#8221; about cities in france (mostly Paris)</li>
<li>2- Email addresses of the turkers</li>
<li>3- Random user names</li>
</ul>
<p>We all know that Mechanical Turk challenges the whole &#8220;Junk-in, Junk-out&#8221; dilemma and makes it more like &#8220;Always junk-out, regardless of the input process&#8221; but I feel that users posted junk for my junk HIT assuming that there is no quality assurance process working behind the task and I would probably just accept them all (which I have done for many jobs before)</p>
<p>Anyways I just wanted to highlight that the spamming goes both ways. Turkers spam requesters, requesters spam turkers, everybody wins!</p>
<p>(On a side note, I really need some behind the scene data from Amazon, how can I convince amazon to give me some data for research? In return I am willing to give them a high quality travel booklet that I have curated about a city in France, the city is called &#8220;Shiraz&#8221;    : )</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/232/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/232/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/232/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=232&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/03/09/everybody-is-spamming-everybody-else-on-mechanical-turk/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/03/dodson.jpg" medium="image">
			<media:title type="html">Antoine Dodson</media:title>
		</media:content>
	</item>
		<item>
		<title>Distribution of Mechanical Turkers in the US</title>
		<link>https://openresearch.wordpress.com/2011/02/16/distribution-of-mechanical-turkers-in-the-us/</link>
		<comments>https://openresearch.wordpress.com/2011/02/16/distribution-of-mechanical-turkers-in-the-us/#comments</comments>
		<pubDate>Wed, 16 Feb 2011 09:15:06 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=226</guid>
		<description><![CDATA[A couple of weeks ago our friends at Techlist published one of their visualizations of the distribution mechanical turkers of the US.  They had chosen very large icons for each turker and it was hard to understand the general distribution. I took their XML file and read it in R and generated a csv file [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=226&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago our friends at Techlist published one of their visualizations of the <a href="http://techlist.com/mturk/mturk-worker-map.htm" target="_blank">distribution mechanical turkers of the US</a>.  They had chosen very large icons for each turker and it was hard to understand the general distribution. I took their <a href="http://techlist.com/mturk/mturkgeo.xml" target="_blank">XML file </a>and read it in R and generated a csv file in R. The following visualization is produced in Tableau and might be more clear than their Google map visualization (h/t to techlist of their dataset)</p>
<div id="attachment_227" class="wp-caption aligncenter" style="width: 472px"><a href="http://openresearch.files.wordpress.com/2011/02/usturkers.png"><img class="size-full wp-image-227" title="Distribution of US Turkers" src="http://openresearch.files.wordpress.com/2011/02/usturkers.png?w=497" alt="Distribution of US Turkers"   /></a><p class="wp-caption-text">Distribution of US Turkers (16,000 Turkers)</p></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/226/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/226/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/226/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=226&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/02/16/distribution-of-mechanical-turkers-in-the-us/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/02/usturkers.png" medium="image">
			<media:title type="html">Distribution of US Turkers</media:title>
		</media:content>
	</item>
		<item>
		<title>Why do people work on Mechanical Turk</title>
		<link>https://openresearch.wordpress.com/2011/02/07/why-do-you-work-on-mechanical-turk/</link>
		<comments>https://openresearch.wordpress.com/2011/02/07/why-do-you-work-on-mechanical-turk/#comments</comments>
		<pubDate>Mon, 07 Feb 2011 08:48:58 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=219</guid>
		<description><![CDATA[I posted a mechanical turk task and asked the workers (a.k.a. Turkers) why they work there. I am still receiving responses but here is a wordl visualization of their responses so far<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=219&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I posted a mechanical turk task and asked the workers (a.k.a. Turkers) why they work there. I am still receiving responses but here is a wordl visualization of their responses so far</p>
<div id="attachment_220" class="wp-caption aligncenter" style="width: 310px"><a href="http://openresearch.files.wordpress.com/2011/02/earnmoney.png"><img class="size-medium wp-image-220" title="Why do you work on Mechanical Turk" src="http://openresearch.files.wordpress.com/2011/02/earnmoney.png?w=300&#038;h=174" alt="Why do you work on Mechanical Turk" width="300" height="174" /></a><p class="wp-caption-text">Why do you work on Mechanical Turk (wordl visualization)</p></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/219/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/219/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/219/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=219&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/02/07/why-do-you-work-on-mechanical-turk/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/02/earnmoney.png?w=300" medium="image">
			<media:title type="html">Why do you work on Mechanical Turk</media:title>
		</media:content>
	</item>
		<item>
		<title>How to produce a better circle-map visualization</title>
		<link>https://openresearch.wordpress.com/2011/01/29/circle-map-visualization-social-network/</link>
		<comments>https://openresearch.wordpress.com/2011/01/29/circle-map-visualization-social-network/#comments</comments>
		<pubDate>Sun, 30 Jan 2011 02:40:27 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=207</guid>
		<description><![CDATA[Circle-maps are sometimes used to visualize connections in graphs. I have been working on a visualization library in Python and as part of it I want to have the capability to visualize circle-maps. Below are two of my initial drafts (produced by matplotlib library in python). They visualize 12,000 unique votes on 700 status updates [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=207&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://openresearch.files.wordpress.com/2011/01/inner-circle.jpg"><img class="size-medium wp-image-211 alignright" title="circlemapsIcon" src="http://openresearch.files.wordpress.com/2011/01/inner-circle.jpg?w=180&#038;h=180" alt="" width="180" height="180" /></a>Circle-maps are sometimes used to visualize connections in graphs. I have been working on a visualization library in Python and as part of it I want to have the capability to visualize circle-maps.</p>
<p>Below are two of my initial drafts (produced by matplotlib library in python). They visualize 12,000 unique votes on 700 status updates in a social network.</p>
<p>The first one shows nodes on a circle and connects each note to its neighbor with a straight line. The other one connects nodes with a Bézier curve with four control points (starting node, end node, a point that is 2/3 of the way between the start point and the center, a point that is 2/3 of the way between the center and the end point). I still do not know what is the best way to draw arcs for a circle map and perhaps a circular arc might work better than a Bézier curve. but I&#8217;ve been happy with the Bézier curve so far. (make sure you take a look at the animations too [<a href="http://dl.dropbox.com/u/10279222/votes_lines_v2.gif">1</a>], [<a href="http://dl.dropbox.com/u/10279222/votes_curves_v2.gif">2</a>] they are large files so please wait a little for the animated gif to load completely)</p>
<div id="attachment_208" class="wp-caption aligncenter" style="width: 507px"><a href="http://openresearch.files.wordpress.com/2011/01/agreevalslines120.png"><img class="size-full wp-image-208" title="Circle-map of a social network with straight lines connecting adjacent people" src="http://openresearch.files.wordpress.com/2011/01/agreevalslines120.png?w=497&#038;h=487" alt="Circle-map of a social network with straight lines connecting adjacent people" width="497" height="487" /></a><p class="wp-caption-text">Circle-map of a social network with straight lines connecting adjacent people</p></div>
<div id="attachment_209" class="wp-caption aligncenter" style="width: 507px"><a href="http://openresearch.files.wordpress.com/2011/01/agreevals120.png"><img class="size-full wp-image-209" title="Circle-map of a social network with Bezier lines connecting adjacent people" src="http://openresearch.files.wordpress.com/2011/01/agreevals120.png?w=497&#038;h=492" alt="Circle-map of a social network with Bezier lines connecting adjacent people" width="497" height="492" /></a><p class="wp-caption-text">Circle-map of a social network with Bézier lines connecting adjacent people</p></div>
<p>I also produced two animations of these votes using Image Magick scripts. See the animation for <a href="http://dl.dropbox.com/u/10279222/votes_lines_v2.gif" target="_blank">straight lines </a>and for <em><a href="http://dl.dropbox.com/u/10279222/votes_curves_v2.gif" target="_blank">Bézier curves</a>. </em></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/openresearch.wordpress.com/207/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/openresearch.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/openresearch.wordpress.com/207/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&amp;blog=7772417&amp;post=207&amp;subd=openresearch&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>https://openresearch.wordpress.com/2011/01/29/circle-map-visualization-social-network/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="https://secure.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/01/inner-circle.jpg?w=300" medium="image">
			<media:title type="html">circlemapsIcon</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/01/agreevalslines120.png" medium="image">
			<media:title type="html">Circle-map of a social network with straight lines connecting adjacent people</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/01/agreevals120.png" medium="image">
			<media:title type="html">Circle-map of a social network with Bezier lines connecting adjacent people</media:title>
		</media:content>
	</item>
	</channel>
</rss>
