<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Open Source Research</title>
	<atom:link href="http://openresearch.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://openresearch.wordpress.com</link>
	<description>My daily sufferings as a PhD student at Berkeley</description>
	<lastBuildDate>Sat, 15 Dec 2012 22:36:43 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='openresearch.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Open Source Research</title>
		<link>http://openresearch.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://openresearch.wordpress.com/osd.xml" title="Open Source Research" />
	<atom:link rel='hub' href='http://openresearch.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Learning C++11</title>
		<link>http://openresearch.wordpress.com/2012/06/22/learning-c11/</link>
		<comments>http://openresearch.wordpress.com/2012/06/22/learning-c11/#comments</comments>
		<pubDate>Sat, 23 Jun 2012 03:50:07 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=429</guid>
		<description><![CDATA[(I am still updating this post, I am learning C++11 and this is my live blog post. There might be typos and bugs) I have been hearing about the modern C++ and I feel that it is something that has a future. I mean C++ is already strong, it has survived for 30 years but when I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=429&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>(I am still updating this post, I am learning C++11 and this is my live blog post. There might be typos and bugs)</p>
<p>I have been hearing about the modern C++ and I feel that it is something that has a future. I mean C++ is already strong, it has survived for 30 years but when I code in C++ I need to use my brain cycles for stupid things. I need to think about data structures carefully but when I am working in something like Python I am free to get creative, I do not need to care about low level stuff. I can just code and hope that my code is going to run reletivley fast.</p>
<p>The new C++ seems to be pretty amazing. I am not just talking about the &#8220;auto&#8221; keywork and type inference. The lamba functions seem to be very useful and C++ now has things that were available in Java and C# from the beinning (remember for_each?). What I like to say is that C++ is now a</p>
<p>In this thread I will collect documents that can teach you and myself about the new C++. I will be a little careless about copyrights but nothing on this page is mine and I have just compiled it.</p>
<p>1- Watch the video, &#8220;<a href="http://channel9.msdn.com/Events/Lang-NEXT/Lang-NEXT-2012/-Not-Your-Father-s-C-">Not your father&#8217;s C++</a>&#8221; by our man Herb Sutter</p>
<p>2- Read Herb Sutter&#8217;s blog post &#8220;<a href="http://herbsutter.com/elements-of-modern-c-style/">Elements of Modern C++ Style</a>&#8220;</p>
<p>3- Lambda Expressions</p>
<p>An example of lambda functions is on <a href="http://www.cprogramming.com/c++11/c++11-lambda-closures.html">this page </a></p>
<blockquote>
<pre>#include &lt;iostream&gt;

using namespace std;

int main()
{
    auto func = [] () { cout &lt;&lt; "Hello world"; };
    func(); // now call the function
}</pre>
</blockquote>
<p>I use Visual Studio 2010 and Lambda functions already work in it. You may want to add #include &#8221;StdAfx.h&#8221; on top of your source for your code to work New Features The new features are summarized <a href="http://blog.smartbear.com/software-quality/bid/167271/">here</a></p>
<p>This is another good example of Lambda expressions &#8220;<a href="http://bit.ly/KNo5EH">Using Lambda Expressions for Shorter, More Readable C++ Code</a>&#8220;</p>
<p>The <a href="http://en.wikipedia.org/wiki/C%2B%2B11">wikipedia article</a> for C++11 is relatively useful the only problem is that it does not highlight which features are already implemented in VS 2010 or 2011. For example constant expressions are not yet supported in Visual Studio (at least not in the 2010 version that I use)</p>
<p>TODO: add this on Modern C++ <a href="http://msdn.microsoft.com/en-us/library/hh279654(v=vs.110).aspx">http://msdn.microsoft.com/en-us/library/hh279654(v=vs.110).aspx</a></p>
<p><a href="http://www.cpprocks.com/wp-content/uploads/c++11-rocks-free-sample-lambdas.pdf">http://www.cpprocks.com/wp-content/uploads/c++11-rocks-free-sample-lambdas.pdf</a></p>
<p><a href="http://www.cpprocks.com/c11-a-visual-summary-of-changes/?goback=%2Egde_86998_member_124675384#!prettyPhoto">http://www.cpprocks.com/c11-a-visual-summary-of-changes/?goback=%2Egde_86998_member_124675384#!prettyPhoto</a></p>
<p><a href="https://www.hackerschool.com/blog/5-learning-c-with-gdb">Learning C with gdb </a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/429/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/429/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=429&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2012/06/22/learning-c11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>
	</item>
		<item>
		<title>My notes from the &#8220;Learning to learn&#8221; talk by Stanford&#8217;s Benjamin Von Roy</title>
		<link>http://openresearch.wordpress.com/2012/04/17/my-notes-from-the-learning-to-learn-talk-by-stanfords-benjamin-von-roy/</link>
		<comments>http://openresearch.wordpress.com/2012/04/17/my-notes-from-the-learning-to-learn-talk-by-stanfords-benjamin-von-roy/#comments</comments>
		<pubDate>Wed, 18 Apr 2012 04:30:13 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=419</guid>
		<description><![CDATA[Below are my notes from a talk entitled &#8220;Learning to Learn&#8221; by Benjamin Von Roy. I am reading some of the references and will add more to this document to make it readable for others soon. I will discuss the importance of learning to learn, and how this is a distinctive element of reinforcement learning relative to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=419&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_420" class="wp-caption alignright" style="width: 189px"><img class=" wp-image-420  " title="Benjamin Von Roy" src="http://openresearch.files.wordpress.com/2012/04/bvr.jpg?w=179&#038;h=179" alt="Benjamin Von Roy" width="179" height="179" /><p class="wp-caption-text">Benjamin Von Roy</p></div>
<p>Below are my notes from a talk entitled &#8220;Learning to Learn&#8221; by Benjamin Von Roy. I am reading some of the references and will add more to this document to make it readable for others soon.</p>
<blockquote><p>I will discuss the importance of learning to learn, and how this is a distinctive element of reinforcement learning relative to other areas of statistical learning. I will then survey some relevant research and discuss recent work with Zheng Wen on an algorithm that efficiently learns to learn (and learns) in dynamic systems with arbitrarily large state spaces by combining optimistic exploration and value function generalization.</p>
<p>&nbsp;</p>
<p><strong><em>Bio:</em> </strong>Benjamin Van Roy is broadly interested in the formulation and analysis of mathematical models that address problems in information technology, business, and public policy. He is a Professor of Management Science and Engineering and Electrical Engineering, and, by courtesy, Computer Science, at Stanford University. He has held visiting positions as the Wolfgang and Helga Gaul Visiting Professor at the University of Karlsruhe and as the Chin Sophonpanich Foundation Professor of Banking and Finance at Chulalongkorn University. He has served on the editorial boards of Discrete Event Dynamic Systems, Machine Learning, Mathematics of Operations Research, and Operations Research, for which he is currently the Financial Engineering Area Editor. He has served as a researcher, advisor, founder, or director, for several technology companies. He received the SB (1993) in Computer Science and Engineering and the SM (1995) and PhD (1998) in Electrical Engineering and Computer Science, all from the Massachusetts Institute of Technology.</p>
<p>&nbsp;</p>
<p>&nbsp;</p></blockquote>
<p>Reinforcement Learning Models in Literature</p>
<ul>
<li>Myopic Learning</li>
<li>Dithering??</li>
<li>Reinforcement Learning</li>
</ul>
<p>What  is this &#8220;Multi-armed bandit&#8221; I keep hearing about it everywhere there is an online ad talk. I should learn it. Watch this <a href="http://videolectures.net/icml09_niculescu_mizil_mabb/">video lecture</a> later.</p>
<p>Literature on efficient reinforcement learning:</p>
<ol>
<li>Kearns-Singh 2002
<ol>
<li>Devise plan to learn soon if possible</li>
<li>Otherwise plan to exploit</li>
</ol>
</li>
<li>Braffman-Tennenholts 2002
<ol>
<li>Optimistic exploration</li>
</ol>
</li>
<li>Kearns-Koller 1999</li>
<li>Abbasi -Yadkori-Szepesvari 2011</li>
</ol>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/419/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/419/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=419&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2012/04/17/my-notes-from-the-learning-to-learn-talk-by-stanfords-benjamin-von-roy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2012/04/bvr.jpg" medium="image">
			<media:title type="html">Benjamin Von Roy</media:title>
		</media:content>
	</item>
		<item>
		<title>A Mandelbrot Fractal in Python</title>
		<link>http://openresearch.wordpress.com/2012/04/12/a-mandelbrot-fractal-in-python/</link>
		<comments>http://openresearch.wordpress.com/2012/04/12/a-mandelbrot-fractal-in-python/#comments</comments>
		<pubDate>Fri, 13 Apr 2012 06:27:27 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=412</guid>
		<description><![CDATA[I coded up this Mandelbrot fractal in python while watching TV. Not sure if is helpful for anybody but you may want to take a look at it and enjoy the bauty of chaotic dynamical systems. The code is posted here and below too. Here is a fascinating high quality version of it. # Mandelbrot set # [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=412&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>I coded up <a href="http://en.wikipedia.org/wiki/Mandelbrot_set">this Mandelbrot fractal</a> in python while watching TV. Not sure if is helpful for anybody but you may want to take a look at it and enjoy the bauty of chaotic dynamical systems. The code is<a href="http://pastebin.com/Mw7NQtHv"> posted here</a> and below too. Here is a fascinating <a href="https://github.com/faridani/PyFractal/raw/master/mandel.png">high quality version</a> of it.</p>
<p><a href="http://openresearch.files.wordpress.com/2012/04/1200.png"><img class="aligncenter size-full wp-image-413" title="Mandelbrot" src="http://openresearch.files.wordpress.com/2012/04/1200.png?w=497&#038;h=497" alt="" width="497" height="497" /></a></p>
<pre># Mandelbrot set
# By Mark Alen
# linux_jvm@yahoo.com
# April 2012
import ImageDraw
from PIL import Image, ImageFilter
from math import log
white = (255, 255, 255)
width = 5000
height = width
image1 = Image.new("RGB", (width, height), white)
draw = ImageDraw.Draw(image1)

# http://en.wikipedia.org/wiki/Mandelbrot_set
for xpix in range(1,width+1):
 for ypix in range(1,height+1):
 x0 = (xpix*1.0/width*3.5) -2.5
 y0 = (ypix*1.0/height*2)-1
 x = 0
 y = 0
 iteration = 0
 max_iteration = 1000
 while ( (x*x + y*y) &lt; 4) &amp; (iteration &lt; max_iteration ):
 xtemp = x*x - y*y + x0
 y = 2*x*y + y0
 x = xtemp
 iteration = iteration + 1
 mycol =int(255.0*(1-log(iteration/1000.0*255+1)/log(256)))
 color = (255-mycol,255-mycol,255-mycol)
 if iteration == max_iteration:
 color = white
 draw.point((xpix,ypix), color)
 if ((xpix*(100/1)%(width))==0):
 print (xpix*ypix*1.0)/((width*height))*100.0, "%"
filename = "mandel.png"
image1.save(filename)
print "Done!"</pre>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/412/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/412/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=412&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2012/04/12/a-mandelbrot-fractal-in-python/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2012/04/1200.png" medium="image">
			<media:title type="html">Mandelbrot</media:title>
		</media:content>
	</item>
		<item>
		<title>Dear Coursera and Udacity! Don&#8217;t congratulate yourself too much</title>
		<link>http://openresearch.wordpress.com/2012/03/27/dear-coursera-and-udacity-dont-congratulate-yourself-too-much/</link>
		<comments>http://openresearch.wordpress.com/2012/03/27/dear-coursera-and-udacity-dont-congratulate-yourself-too-much/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 08:06:37 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=391</guid>
		<description><![CDATA[So a couple of smart professors from Stanford have started two separate startups and have put their courses on the web and the world is going nuts. Everyone is talking about them and they are busy congratulating themselves on this amazing accomplishment. Every major paper is writing about how these professors are revolutionizing education and how amazing these [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=391&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://openresearch.files.wordpress.com/2012/03/ff_aiclass_f-1.jpg"><img class="aligncenter size-full wp-image-394" title="Norvig and Thrun" src="http://openresearch.files.wordpress.com/2012/03/ff_aiclass_f-1.jpg?w=497&#038;h=354" alt="" width="497" height="354" /></a></p>
<p>So a couple of smart professors from Stanford have started two separate startups and have put their courses on the web and the world is going nuts. Everyone is talking about them and they are busy congratulating themselves on this amazing accomplishment. Every major paper is writing about how these professors are revolutionizing education and how amazing these wesbites are (see <a href="http://bit.ly/GWOcp6">Wired</a> or <a href="http://nyti.ms/GWOImT">NY times</a> articles). Do not get me wrong, the work that Udacity and Coursera are doing are way more superior to MIT&#8217;s course dump (<a href="http://ocw.mit.edu/index.htm">OCW</a>) but is it really what we were all envisioning for online education? I mean come on! We did all that research on distance learning, collaborative whiteboards, online labs and we ended up with these low quality Khaan Academy videos? are you kidding me? We have a whole freaking academic community specifically around <a href="http://www.asee.org/">engineering education</a>, they even publish scientific<a href="http://www.jee.org/"> journals</a>!</p>
<p>I have a lot of respect for the professor who started this, Sebastian Thrun, whose wonderful book on probabilistic robotics was my bible for a long time, but here is what I think they are doing  wrong.</p>
<blockquote><p>Both classrooms (udacity and Coursera) are too similar to regular classroms.</p></blockquote>
<p>Just like a classroom, the course starts on specific dates and goes on for 7 weeks. Students need to stick to deadlines, do quizzes, submit homeworks, and finish on time. There is no flexibility, there is no customization, you will take the same course as the next guy over the internet with a completely different background.  What if I want to learn a topic in a year instead of 7 weeks? What if I want to learn it in <a href="http://norvig.com/21-days.html">10 years</a>? For example I was busy last week and was catching up on my emails today, one of the emails was from Coursera announcing that their algorithms course was going live last week, when I went to sign up today it told me that I cannot enroll now! My question is: <strong>why</strong>? seriously why cannot I start whenever I want and finish whenever I want? This is the same thing that I hated about my old fashion offline university!</p>
<p>In fact  Professor Thrun has published  his vision for online education as a university that has the follwoing elements:</p>
<blockquote><p>&#8230;Nine essential components of a university education: admissions, lectures, peer interaction, professor interaction, problem-solving, assignments, exams, deadlines, and certification.</p></blockquote>
<p>Are you kidding me? I know a system that was around way before the web and had the same elements, its name is &#8220;College&#8221;. So all you have done is taking the same lectures and making a video out of them and it has become the revolution in education that we were all dreaming about?</p>
<p>This is what I think: People are taking these courses because for many it is the only way to learn about interesting topics like robotics or machine learning. Take a video of a Stanford professor talking about a hot topic and people will eat that up. That does not necessarily mean that we have unlocked the power of online education. I also doubt it will give any value to a Stanford student who can sit in the real classroom.</p>
<p>To me this is aiming low, it is giving up on our dreams,  it really is a<strong><em> failure</em>.</strong></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/391/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/391/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=391&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2012/03/27/dear-coursera-and-udacity-dont-congratulate-yourself-too-much/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2012/03/ff_aiclass_f-1.jpg" medium="image">
			<media:title type="html">Norvig and Thrun</media:title>
		</media:content>
	</item>
		<item>
		<title>5 Reasons Why We Live In A Freaking Exciting Time</title>
		<link>http://openresearch.wordpress.com/2012/02/18/5-reasons-why-we-live-in-a-freaking-exciting-time/</link>
		<comments>http://openresearch.wordpress.com/2012/02/18/5-reasons-why-we-live-in-a-freaking-exciting-time/#comments</comments>
		<pubDate>Sun, 19 Feb 2012 00:51:42 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=353</guid>
		<description><![CDATA[http://www.youtube.com/watch?v=8r1CZTLk-Gk
<p></p><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=353&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Our friend and comedy extraordinaire Louis C.K. has a short clip called &#8220;Everything is amazing and nobody is happy&#8221;. In the clip he basically asks why don&#8217;t we get excited about simple things that technology has brought to us? Watch it below.</p>
<p><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='497' height='310' src='http://www.youtube.com/embed/8r1CZTLk-Gk?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<p>I keep telling my girlfriend that we live in an exciting and extraordinary time. And she keeps telling me that my dad has been probably saying the same thing 30 years ago. The thing she doesn’t know is that my dad was just a kid on the streets of a poor third world country, struggling to finish his PhD without Wikipedia and the Internet (even though Al Gore had invented the Internet a couple of years earlier <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  But now we have all these exciting things at our fingertips. With Wikipedia I am hundred times smarter than my dad at his smartest point in his life.</p>
<p>Below, I’ll give you 5 reasons from my everyday life and hopefully I can convince you that we should all go “Oh my God, this life is fucking awesome&#8221;. I will just give you examples about education as we rarely get excited about anything else.</p>
<p><strong>Reason 1. TV is changing and educational videos are becoming cooler to watch:</strong> We have a big TV at home, but we do not have cable. It is hooked to a small computer with which we can watch all sorts of shit, from the <a href="http://www.youtube.com/education">education channel </a>in youtube or some of the TED videos that do not suck major buttocks. By the way have you watched &#8220;<a href="http://www.youtube.com/course?list=PL30C13C91CFFEFEA6">Justice with Michael Sandel</a>&#8220;? highly recommended.</p>
<p><strong>Reason 2. We can now take our education on the road with us</strong>: Even I, as a poor grad student can now afford to have a couple of wireless devices. I can load PDF files onto my tablet and read them on the road. Through our university we get full access to books from O&#8217;Reilly and Springer. I can read them on the train to work without killing a lot of trees.</p>
<div id="attachment_362" class="wp-caption alignright" style="width: 210px"><a href="http://openresearch.files.wordpress.com/2012/02/johncanny.jpg"><img class="size-full wp-image-362" title="John Canny" src="http://openresearch.files.wordpress.com/2012/02/johncanny.jpg?w=497" alt="John Canny"   /></a><p class="wp-caption-text">John Canny is a professor at Berkeley. He is mainly known for this fact that he unlocked the secrets to longer than 24 hours days. When he was at MIT he used only 24 hours of his 70 hour days to invent the Canny edge detector and used the rest to date women.  After he got married and settled down he started utilizing the rest of his days to invent things in HCI, machine learning and God knows in many unrelated fields like healthcare and psychology.  To this date I am still wondering how he can make all these contributions. Many scientist go into severest depressions as soon as they realize that they can never be John Canny. </p></div>
<p>3. <strong>High quality education is becoming accessible to every fucking idiot:</strong> this is the most exciting thing for me. I have probably paid thousands of dollars to UC Berkeley. And I am kind of happy about it. That gives me the privilege to be able to sit in classes taught by professors like<a href="http://www.cs.berkeley.edu/~jfc/"> John Canny </a>and <a href="http://www.cs.berkeley.edu/~jordan/">Micheal Jordan</a>. But I can also stay home and get the same education from the Internet. I can watch Andrew Ng&#8217;s <a href="http://www.ml-class.org">machine learning class</a> or Berkeley&#8217;s <a href="http://alex.smola.org/teaching/berkeley2012/statistics.html">scalable machine learning </a>without paying a penny (well, I pay 40 bucks to those bloodsuckers at  AT&amp;T for the internet but that&#8217;s another story). Also if you like these things I highly recommend following John Canny&#8217;s <a href="http://bid.berkeley.edu/cs294-1-spring12/index.php/Main_Page">Behavioral Data Mining</a> course.</p>
<p>4. <strong>Science is actually being used now:</strong> When you read something about technology you know that it is being used right now. I was reading a paper about the All-Reduce method and it was great to know that my homeboy, <a href="http://hunch.net/~jl/">John Langford</a>, has used it in in his <a href="https://github.com/JohnLangford/vowpal_wabbit/wiki">Vowpal Wabbit </a>and Yahoo is using it for spam filtering.</p>
<p>5. <strong>Science experiments becoming inexpensive</strong>: I have an awfully incapable laptop but for a very cheap price I can now get a large cluster crunching numbers for me. My friends at Udacity are now teaching high school kids<a href="http://www.youtube.com/watch?v=IYZNchSxaHk&amp;feature=youtu.be"> how to build a kick-ass search engine</a> using commodity computers.</p>
<p>I do not know about you but every day when I wake up I feel freaking blessed that I live in this time. As Salman Khaan says, if these things do not make you excited, you might have no soul <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Disclaimer: I&#8217;m indebted to professor Canny immensely. And I have a lot of respect for the man.  This is just a joke.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/353/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/353/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=353&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2012/02/18/5-reasons-why-we-live-in-a-freaking-exciting-time/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2012/02/johncanny.jpg" medium="image">
			<media:title type="html">John Canny</media:title>
		</media:content>
	</item>
		<item>
		<title>Notes from Neel Sundaresan&#8217;s keynote speech at RecSys 2011</title>
		<link>http://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/</link>
		<comments>http://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/#comments</comments>
		<pubDate>Wed, 26 Oct 2011 14:05:16 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=292</guid>
		<description><![CDATA[He started by stating that he won&#8217;t have any greek symbols in the talk. Arch West was the inventor of Doritos and David Pace we the inventor of Pace sauce. What they did was that they noticed the can sell more if they advertise the two products together. There is a lesson in cross-selling and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=292&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_293" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-293" title="Neel Sundaresan" src="http://openresearch.files.wordpress.com/2011/10/neel2008.jpg?w=497" alt="Neel Sundaresan"   /><p class="wp-caption-text">Neel Sundaresan</p></div>
<p>He started by stating that he won&#8217;t have any greek symbols in the talk.</p>
<p>Arch West was the inventor of Doritos and David Pace we the inventor of Pace sauce. What they did was that they noticed the can sell more if they advertise the two products together. There is a lesson in cross-selling and recommender systems that we can learn from this story.</p>
<p>eBay started when Pierre Omidyar wanted to sell his broken laser pointer. He listed the laser pointer online for 99 cents and finally sold for 14 dollar. Was wondering if the person who bought it knew that it is broken. The guy responded &#8220;Yes I am a collector of broken laser pointers&#8221;</p>
<p>Why people buy something? it is hard to say. Some people buy stories! remember the toast that sold for 27K that was the cheapest marketing campaign that a casino had.  One man&#8217;s trash is another&#8217;s treasure.</p>
<p>The long tail in eBay&#8217;s context mean most people sell very few items and most of eBay&#8217;s revenue comes from these people. (i.e the mean is way larger than the median</p>
<p>The users constantly are running experiments to maximize their revenue. They are constantly testing to see if free shipping can sell more, different selling strategies are being tested by users at any time on eBay.</p>
<p>This causes an interesting behavior. If you promote a user&#8217;s product on the homepage they may increase their price! This is an interesting dynamics between the user and the seller (eBay)</p>
<p>One of the problems that locations like eBay have is the problem of big data. Complex algorithms are often impossible to work with in that scale. If you are looking for a job at eBay you need to know how to work with data in that scale.  A goal at eBay lab is that when a new scientist joins the lab on Monday he got access to all the data by Friday.</p>
<p>This amount of data has changed how economics is doing experiments. They can now run experiments on 400 million data points.</p>
<p>What are you optimizing for at eBay? is it profit maximization? do you want to increase the shopping cart size? are you looking for maximum customer satisfaction?</p>
<p>The other thing is how do you measure success?</p>
<p>Everything we do at eBay is a recommendation.</p>
<blockquote><p>I KEEP six honest serving-men<br />
(They taught me all I knew);<br />
Their names are What and Why and When<br />
And How and Where and Who.</p>
<p>&#8211; from <em>The Elephant&#8217;s Child </em></p></blockquote>
<p>When we look at the tag cloud of eBay you see keywords like &#8220;used&#8221;, &#8220;vintage&#8221; and &#8220;antique&#8221; a lot more than &#8220;new&#8221;</p>
<p>The search is an interesting problem some people are looking for &#8220;ipod nano 4gb black new&#8221; and some are looking for the skin for their ipod. Our search engine should be able to differentiate between &#8221;ipod nano 4gb black new&#8221; &#8221;ipod nano 4gb black new skin&#8221;. This proposes a hard and challenging research questions.</p>
<p>Click trails can help us tremendously with building recommender systems that can capture these behavior and improve recommender systems.  At eBay a data cleanup is an important part of recommender. Specially when they use click trails.</p>
<p>eBay has a language like pig that allows them to do pattern recognition at scale. Sometimes a search is followed by some page views and another search. This pattern is useful to do recommendation to other users who have similar initial search queries. See two recent papers from Sundaresan for the results and model.</p>
<p>Fashion item buyer on eBay are very brand aware. Sometimes ebay does not have enough inventories and needs to recommend proper products from outside websites.</p>
<p>one of the challenges at eBay is that we do not have a catalogue of items (remember the laser pointer story?) Amazon does not have such a problem, you cannot sell anything on Amazon unless it is on the catalog.</p>
<p>eBay uses its own matrix factorization see their ICML paper. The sparsity in eBay&#8217;s data is fascinating it is 100 times the sparsity in the Netflix data.</p>
<p>eBay clusters items into pseudo products using LDA. He shows an example of a recommendation for a broken blackberry cellphone.</p>
<p>The most important thing is &#8220;why&#8221; are you recommending this to the user and &#8220;why&#8221; they should buy it. HCI is a useful tool here. reveal to the user why you are recommending. Something like &#8220;52% of the people who bought this item also bought &#8230;&#8221; are very effective.  Be very explicit on why certain recommendations are made.</p>
<p>Let&#8217;s look at &#8220;When&#8221;. things like reminders, post purchases, urgency, upgrades, seasonal sales fall into this. Reminders can be like &#8220;you have viewed this item&#8221; that reminds people that they can still go and buy. There is a temporal element to this problem too. They may not need the same item until after 30 days but need to buy it again after 30 days is passed.</p>
<p>See this <a href="http://www.wired.com/magazine/2011/04/st_essay_persuasion_profiling/" target="_blank">wired article </a>on persuasion based profiling and recommendation systems. (thanks to twitter).</p>
<p>There is a lot of seasonality on eBay. Mother and Father&#8217;s days, Christmas. There are other events that are we don&#8217;t know (so my question is how can we find them algorithmically)</p>
<p>We get more data from mobile devices than we get from online. It is a huge research opportunity.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/292/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/292/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=292&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2011/10/26/notes-from-neel-sundaresans-keynote-speech-at-recsys-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/neel2008.jpg" medium="image">
			<media:title type="html">Neel Sundaresan</media:title>
		</media:content>
	</item>
		<item>
		<title>Notes from From Understanding to Enabling Networks: Using Web Science to Enhance Recommender Systems</title>
		<link>http://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/</link>
		<comments>http://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 17:15:33 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=284</guid>
		<description><![CDATA[The keynote at #recsys2011 is by Noshir Contractor. He is the coauthor of &#8220;Theories of communication networks&#8221; which seems to be an interesting book from amazon reviews. the presentation stack is available here (thanks to @barrysmyth for the link) He started by presenting SNIF. SNIF is a device and social networks for dogs! Kind of social petworking. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=284&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_289" class="wp-caption alignleft" style="width: 210px"><img class="size-full wp-image-289" title="Noshir Contractor" src="http://openresearch.files.wordpress.com/2011/10/contractor_noshir.jpg?w=497" alt="Noshir Contractor"   /><p class="wp-caption-text">Noshir Contractor</p></div>
<p>The keynote at #recsys2011 is by<a href="http://nosh.northwestern.edu/"> Noshir Contractor</a>. He is the coauthor of &#8220;Theories of communication networks&#8221; which seems to be an interesting book from amazon <a href="http://www.amazon.com/gp/product/0195160371/ref=as_li_ss_tl?ie=UTF8&amp;tag=resodail-20&amp;linkCode=as2&amp;camp=217145&amp;creative=399369&amp;creativeASIN=0195160371" target="_blank">reviews</a>.</p>
<p>the presentation stack is <a href="http://www.sis.smu.edu.sg/SocInfo2011/documents/noshir.pdf">available</a> here (thanks to <a href="http://twitter.com/barrysmyth">@barrysmyth</a> for the link)</p>
<p>He started by presenting SNIF. SNIF is a device and social networks for dogs! Kind of social petworking. In contrast lovegety is the SNIF technology for people. Find love through random encounters.</p>
<p>Today we will talk about How we can take research in social sciences and bring it to recommender systems.</p>
<p>People have looked at citations and papers and found that people who write papers in teams have a high impact. Also articles by teams from different disciplines from different geographic locations have the highest impact. Fining the appropriate team from a diverse background and geography is much harder.</p>
<p>Thus we are looking at assembeling these type of teams. But how do we decide whom to bring to the team?</p>
<p>The exciting thing about our time is that we have theories, data and methods, additionally we have computation infrastructure to run these models</p>
<p>Why do people collaborate with each other?</p>
<p>MTML model:</p>
<ul>
<li>self interest (from econ theories)</li>
<li>Social and resource exchange</li>
<li>Mutual interest and collective action</li>
<li>Theories of contagion</li>
<li>Theories of balance</li>
<li>Theories of homophily</li>
<li>Theories of proximity</li>
</ul>
<div>My note: How about Robert Spolsky&#8217;s theory?</div>
<div><em>Exponential randome graphs</em> can explain how these collaboration networks is formed (the shape of the graph)</div>
<div>They have looked at the structure of NSF proposals and they wanted to see if they can build a recommnder system that by using characteristics of the proposal make recommendations for acceptance</div>
<div>The likelihood of collaboration is highers if:</div>
<div>
<ul>
<li>you have written an NSF proposal together</li>
<li>you have cited each other</li>
</ul>
<div>Didn&#8217;t know about <a href="http://en.wikipedia.org/wiki/H-index">H-index</a>. Interesting factor. Apparently those with higher H-index are less likely to collaborate.</div>
</div>
<div>Citing your collaborators actually reduces the likelihood of getting NSF funding (!)</div>
<div>Solving the link recommendation problem (recommending who should be on the team)</div>
<div>Link prediction approaches: node-wise similarity, network topology, or probabilistic modeling</div>
<div>P* for link prediction</div>
<div>Use p* models to calculate link probability</div>
<div>
<ul>
<li>Estimate p*/ERGM</li>
<li>the rest I didn&#8217;t get to type (!)</li>
</ul>
<div>I think the probabilistic model that he is refering to is the same as model fitting on Bayesian nets but I am not sure.</div>
</div>
<div>The talk ended by a demo of the implementation that is available <a href="http://ciknow1.northwestern.edu/sw_nucats/">here </a></div>
<div>Noshir&#8217;s book is also available for free on his <a href="http://nosh.northwestern.edu/cv/dissertation-and-publications/">personal </a>website</div>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/284/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/284/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=284&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2011/10/24/notes-from-from-understanding-to-enabling-networks-using-web-science-to-enhance-recommender-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/contractor_noshir.jpg" medium="image">
			<media:title type="html">Noshir Contractor</media:title>
		</media:content>
	</item>
		<item>
		<title>Notes from &#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang</title>
		<link>http://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/</link>
		<comments>http://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 15:56:59 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=275</guid>
		<description><![CDATA[These are my unedited notes from Daniel Tunkelang&#8217;s presentation at #recsys2011. I am editing as you are reading this post. &#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang Goal is to have a better relationship with the user Three take aways from this talk: Consider asking vs guessing Ask good questions It&#8217;s okay [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=275&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_277" class="wp-caption alignleft" style="width: 145px"><img class="size-full wp-image-277" title="Daniel Tunkelang" src="http://openresearch.files.wordpress.com/2011/10/daniel-tunkelang.jpg?w=497" alt="Daniel Tunkelang"   /><p class="wp-caption-text">Daniel Tunkelang</p></div>
<p>These are my unedited notes from Daniel Tunkelang&#8217;s presentation at #recsys2011. I am editing as you are reading this post.</p>
<p><strong>&#8220;Recommendations as a Conversation with the User&#8221; by Daniel Tunkelang</strong><br />
Goal is to have a better relationship with the user</p>
<p>Three take aways from this talk:</p>
<ul>
<li>Consider asking vs guessing</li>
<li>Ask good questions</li>
<li>It&#8217;s okay to make mistakes<strong> if</strong> you have a good explanation and adapt to feedback</li>
</ul>
<p><strong>Theory</strong></p>
<p><strong></strong>&#8220;<a href="http://www.amazon.com/gp/product/B004J8HY8K/ref=as_li_ss_tl?ie=UTF8&amp;tag=resodail-20&amp;linkCode=as2&amp;camp=217145&amp;creative=399373&amp;creativeASIN=B004J8HY8K">The Man Who Lied to His Laptop</a>&#8221;  is a great related read<br />
Paul Grice&#8217;s maxims of conversations:</p>
<ol>
<li>Quality</li>
<li>Quantity</li>
<li>Relation</li>
<li>Manner</li>
</ol>
<p><strong>**Do not lie</strong></p>
<ul>
<li>Don&#8217;t use &#8220;recommended&#8221; when you really mean &#8220;sponsored&#8221; or &#8220;excess inventory&#8221;. User&#8217;s loss of trust will cost you. but users do not have a model on how on how to trust a system</li>
<li>Optimize for the user&#8217;s utility</li>
<li>Apply a standard of evidence (quality, quantity) that you believe in</li>
</ul>
<p><strong>Right amount of information</strong></p>
<ul>
<li>Exchange small units of information</li>
<li>If recommendations supplement other content consider overall cognitive load</li>
<li>provide short meaningful explanations</li>
</ul>
<p><strong>Maxim 3: Relation. Relevant to the user</strong></p>
<ul>
<li>Offer value to the user</li>
<li>respect task context</li>
<li>don&#8217;t be obnoxious</li>
</ul>
<p><strong>Maxim 4: Manner</strong></p>
<ul>
<li>relevant to the user</li>
<li>Eschew obfuscation</li>
<li>Avoid ambiguity</li>
<li>be brief</li>
<li>be orderly</li>
</ul>
<p><strong>Another perspective</strong></p>
<p>Another perspective is Gary Marchionini&#8217;s perspective on Human computer information retrieval</p>
<p><span style="text-decoration:underline;"><em>Empower people to explore large-scale information but demand that people also take responsibly for the control be expending cognitive and physical energy</em></span></p>
<p><strong>principles of hcir</strong></p>
<ol>
<li>do more than deliver information: facilitate sense-making</li>
<li>require and reward effort</li>
<li>adapt to increasingly knowledgeable users over time</li>
<li>be engaging and fun to use</li>
</ol>
<p><strong>Adapt to user knowledge</strong><br />
Systems that don&#8217;t get better over time will frustrate users, because users<strong> DO</strong> get better over time</p>
<p><strong>Personalized recommendations</strong></p>
<ul>
<li>be transparent about model so users gain insight</li>
<li>allow users to modify models to correct</li>
<li>solicit just enough information to provide value</li>
<li>Exemplars are interesting tools to communicate the recommender model to the user</li>
<li>Users should be able to modify the recommender system say you have a recommender system that uses location and user is using a proxy. He should be able to turn if off to make it noncreepy!</li>
</ul>
<p><strong>Social recommnedations</strong></p>
<ul>
<li>identify the right set of similar users</li>
<li>allow users to manipulate the social lens</li>
<li>accommodate users who break your model</li>
</ul>
<p>When making item recs, explain your recommendations! <em>Watch for non-sequiturs (diapers -&gt; beer problem)</em></p>
<p><strong>**Tell me about yourself is friendlier than &#8220;fill out 20 pages of survey&#8221;</strong></p>
<p>Corpse bride is in the recommnded set and I have watched it, it is good. it gives me the feeling that recommender is working properly</p>
<p><strong>Learning from netflix</strong></p>
<ul>
<li>Ask users for help upfront but not too much help</li>
<li>pay attention to what the user tells you</li>
<li>give users value often and early</li>
</ul>
<p>75% of netflix views result from recommendation</p>
<p>Underpromissing and overdelivering is sometimes a <strong>good idea</strong></p>
<p>Soe models more explainabel than others</p>
<ol>
<li>consider decision trees and rule based models</li>
<li>avoid using latent, unlabled features</li>
<li>if the model is opaque use exaples as surrougates</li>
</ol>
<p>Make a good first impression<br />
your user&#8217;s first experience is critical</p>
<p>See &#8220;Machine learning for large scale recommender systems&#8221; by Agrawal and Chen ICL 2011 Tutorial</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/275/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=275&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2011/10/24/notes-from-recommendations-as-a-conversation-with-the-user-by-daniel-tunkelang/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/daniel-tunkelang.jpg" medium="image">
			<media:title type="html">Daniel Tunkelang</media:title>
		</media:content>
	</item>
		<item>
		<title>&#8220;We Will All Be Jedi Masters Soon&#8221; or &#8220;Random But Coherent Thoughts on Modern Education&#8221;</title>
		<link>http://openresearch.wordpress.com/2011/10/12/education/</link>
		<comments>http://openresearch.wordpress.com/2011/10/12/education/#comments</comments>
		<pubDate>Wed, 12 Oct 2011 09:48:13 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=262</guid>
		<description><![CDATA[Three things have happened recently: I am going on the job market and looking for a research job Steve Jobs passed away I read a blog post entitled &#8220;Unless you are awesome, you will be outsourced&#8220; 1- We are living in an exciting era. Stanford is offering their AI course online for free and I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=262&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://openresearch.files.wordpress.com/2011/10/education.jpg"><img class="aligncenter size-full wp-image-263" title="education" src="http://openresearch.files.wordpress.com/2011/10/education.jpg?w=497" alt=""   /></a></p>
<p>Three things have happened recently:</p>
<ul>
<li>I am going on the job market and looking for a research job</li>
<li>Steve Jobs passed away</li>
<li>I read a blog post entitled &#8220;<a href="http://blog.summation.net/2011/10/awesome-or-outsourced.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+typepad%2Fsummation+%28Summation%29">Unless you are awesome, you will be outsourced</a>&#8220;</li>
</ul>
<p>1- We are living in an exciting era. Stanford is offering their <a href="http://www.ai-class.com/home/">AI course</a> online for free and I am telling you, it is not the crappy study material that MIT dumps on their open course ware website. These are serious, well curated videos with quizzes and assignments.  Basically what it means is that a brown kid in the desserts of south Oman can now learn what a wealthy full-of-himself Stanford kid in Palo Alto learns about AI .</p>
<p>2- For my PhD I worked on the intersection of human-computer interaction (HCI) and machine learning (ML). It took me a couple of years and I can assure you, you can find online and free educational material to become as bad-ass as I am in both of the fields. Ironically the number of educational videos on machine learning outnumbers the number of videos on CHI!</p>
<p>3- Steve Jobs died a couple of days ago. As an open source contributor I hated him while he was around. But I felt extremely sad when passed away. Let&#8217;s all face it, he might have been an ass when it comes to treating others or exploiting and abusing child labor. But he lead many amazing projects. Apple was a mecca for HCI people. They really set the standard for innovation in consumer products. I have recently found<a href="http://www.youtube.com/watch?v=ob_GX50Za6c"> this video</a> by Steve Jobs in which he emphasizes tool building and how computers are making us super humans. And it is very true. The guy is a visionary.  In his words we can quote him, disagree with him, glorify or vilify him, but the only thing we can&#8217;t do is to ignore him because he changed our lives forever.</p>
<p>4- In terms of computing we are living in an amazing time too. I work in a lab that has a pretty strong cluster of computers and 90% of the time the load on the cluster is not that much. Do you see it? It means we are entering an era that our computing power is way more than our computing needs! And if we utilize our computing power well we may actually have excess cycles. Things like Hadoop have allowed us to treat multi-computers like single computers and run massive jobs on them. Do things that were impossible before.</p>
<p><strong>I guess what I am saying is that:</strong></p>
<p>The other blog post is claiming that we need to be awesome otherwise we will loose our jobs. What I am saying is that the cost of becoming awesome is decreasing dramatically. With all these free courses, education is becoming cheap (while schooling becomes more and more expensive), our  tools are getting better and I also believe that seeing our friends on facebook/twitter/google+ has given us an incentive for self improvement and encourages us to learn and educate ourselves more. It has become much easier to push ourselves to become a Jedi master.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/262/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/262/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=262&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2011/10/12/education/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/10/education.jpg" medium="image">
			<media:title type="html">education</media:title>
		</media:content>
	</item>
		<item>
		<title>Methods for testing data-heavy applications &#8211; Part I</title>
		<link>http://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/</link>
		<comments>http://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 02:32:56 +0000</pubDate>
		<dc:creator>marksalen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://openresearch.wordpress.com/?p=254</guid>
		<description><![CDATA[For the past two months, I have been building a piece of code for a very interesting company that is not on the Fortune 500 list yet but if it goes public it may very well be a Fortune 1 company! This application is very heavy on data. I am still developing, I feed it chunks of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=254&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_255" class="wp-caption alignright" style="width: 257px"><a href="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg"><img class="size-full wp-image-255" title="Software Testing" src="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg?w=497" alt="Software Testing"   /></a><p class="wp-caption-text">Software Testing</p></div>
<p>For the past two months, I have been building a piece of code for a very interesting company that is not on the Fortune 500 list yet but if it goes public it may very well be a Fortune 1 company! This application is very heavy on data. I am still developing, I feed it chunks of 100 megabyte datasets, it goes through labor for a couple of minutes and crunches the numbers, and eventually comes back with a table with about 100 rows that I could take and do some statistical analysis on and plot. That big chunk of code gets reduced to bite size human-readable numbers and graphs. We are still testing it and I honestly have no idea how it will perform once it gets connected to the data firehose of the company! and that drives me insane!</p>
<p>I have been testing this software. It works fine. It works fine on all of my test datasets and I have been scrolling through its algorithms that span over a thousand line of code to make sure they work as I expect. I am going insane over this. What if we ship it next week and there is a bug in it? It is not like a Google+ application that you write, ship and then notice that instead of the photo of your female friend it brings back the photo of your dad! there is no way to see the bug when the product is out. your users cannot report the bugs because they won&#8217;t see any of it.</p>
<p>I have been looking into the scientific literature and there is tons of methods in the field of software development with tons of different names but none is about the type of software that I am working on. So I asked on twitter how other people are testing their software and here is a number of responses that I have received.</p>
<p>1- Take the current data files and manually perform your algorithm on the data set (using something like Excel) and see if the final results match</p>
<p>2- Fabricate test data sets for which you know the final result. Run the code on those test files. For example say you have written a code for summation. You can just feed it an array of ones and see if it works on them. Test for boundary cases, test for data that produce intuitive outputs, if you know your algorithm should have a specific behavior, test for that too (like when you expect symmetry in your output)</p>
<p>3 &#8211; Hopefully you break your code into atomic blocks that only do one thing (inverse a matrix, svd calculations, etc) write unit tests for those functions individually. Then combine those individual functions into larger functions and components and test them too and so on.</p>
<p>I am more interested to know your thoughts. what are your suggestions for testing big data applications?</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/openresearch.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/openresearch.wordpress.com/254/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=openresearch.wordpress.com&#038;blog=7772417&#038;post=254&#038;subd=openresearch&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://openresearch.wordpress.com/2011/07/14/methods-for-testing-data-heavy-applications-part-i/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/46c83b6cafe5077dfdc0cd6b818e7e4f?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">marksalen</media:title>
		</media:content>

		<media:content url="http://openresearch.files.wordpress.com/2011/07/test-in-progress.jpg" medium="image">
			<media:title type="html">Software Testing</media:title>
		</media:content>
	</item>
	</channel>
</rss>
