<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.0.5" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Greg's Blog</title>
	<link>http://greggurevich.com</link>
	<description>To be in the Tech industry, you need to have a blog, here is mine. I promise to make it as boring as the rest of them, so you might as well leave now.                                        I am even using a cookie cutter Wordpress theme, so you know this won't be good.</description>
	<pubDate>Tue, 12 Jun 2007 19:26:26 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.5</generator>
	<language>en</language>
			<item>
		<title>Finished Reading Learning Bayesian Networks</title>
		<link>http://greggurevich.com/2007/05/03/finished-reading-learning-bayesian-networks/</link>
		<comments>http://greggurevich.com/2007/05/03/finished-reading-learning-bayesian-networks/#comments</comments>
		<pubDate>Thu, 03 May 2007 05:26:34 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
		
		<category>Uncategorized</category>

		<guid isPermaLink="false">http://greggurevich.com/2007/05/03/finished-reading-learning-bayesian-networks/</guid>
		<description><![CDATA[I recently read through most of Learning Bayesian Networks. I have been using BNs for a while now they are what neural networks should be. I read this book because I wanted to learn more theory. It&#8217;s a weird thing that these days you don&#8217;t need to know the theory behind how everything works in [...]]]></description>
			<content:encoded><![CDATA[<p>I recently read through most of <a xhref="http://www.amazon.com/b?%5Fencoding=UTF8&#038;node=1000&#038;tag=dealresort&#038;linkCode=ur2&#038;camp=1789&#038;creative=9325">Learning Bayesian Networks</a>. I have been using BNs for a while now they are what neural networks should be. I read this book because I wanted to learn more theory. It&#8217;s a weird thing that these days you don&#8217;t need to know the theory behind how everything works in order to use it. For example <a href="http://www.openbayes.org">OpenBayes</a>  and many other libraries allow you to dive into data processing.</p>
<p>That&#8217;s the great thing about open source, that there are libraries there for just about everything allowing you to create your application much quicker. For example I would never want to develop a BN from scratch, but for stuff like machine learning it&#8217;s necessary to know what is going on in the background especially for when things go wrong.</p>
<p>I don&#8217;t usually do blog posts about books I read, but it has been a while since I read a &#8220;textbook,&#8221; closest thing recently was <a href="http://www.amazon.com/Neural-Networks-Finance-Predictive-Academic/dp/0124859674/ref=pd_bbs_sr_1/002-1705045-1461624?ie=UTF8&#038;s=books&#038;qid=1178169131&#038;sr=8-1">Neural Networks in Finance</a>. I need to say I always preferred books which are closer to practice than theory.
</p>
]]></content:encoded>
			<wfw:commentRss>http://greggurevich.com/2007/05/03/finished-reading-learning-bayesian-networks/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Better than rules, how Bayesian theory is being applied on the Net</title>
		<link>http://greggurevich.com/2007/04/19/the-rise-of-statistics/</link>
		<comments>http://greggurevich.com/2007/04/19/the-rise-of-statistics/#comments</comments>
		<pubDate>Thu, 19 Apr 2007 04:21:57 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
		
		<category>Uncategorized</category>

		<guid isPermaLink="false">http://greggurevich.com/2007/04/19/the-rise-of-statistics/</guid>
		<description><![CDATA[Bayesian statistics has influenced the way we use the internet.  It will soon influence how we navigate online. Currently everything from spam filters to misspelled words to video recommendations is based on some form of Bayesian probability.  In the past, humans were doing the editing.  Now they are done automatically using Statistics.
The use of Bayesian [...]]]></description>
			<content:encoded><![CDATA[<p>Bayesian statistics has influenced the way we use the internet.  It will soon influence how we navigate online. Currently everything from spam filters to misspelled words to video recommendations is based on some form of Bayesian probability.  In the past, humans were doing the editing.  Now they are done automatically using Statistics.</p>
<p>The use of Bayesian mathematics for online applications follows a common progression that parallels the way Bayesian math is taught in colleges. Bayesian formulas are not inherently logical&#8211;often after the first time learning it you quickly forget the formula.  I, like many other students, had an Aha! moment where all of a sudden one understands the power of this tool; one starts looking at the world in a different light. Paul Graham in his article “A plan for Spam” describes it perfectly: “I spent about six months writing software that looked for individual spam features before I tried the statistical approach. What I found was that recognizing that last few percent of spams got very hard, and that as I made the filters stricter I got more false positives.” Although computers are not able to classify data as well as humans (for now at least) they do see relationships among data which we miss. Not only that, but computers are able to see many more relationships that we ourselves can. Now most spam filters are based upon Paul Graham´s essay.</p>
<p>Bayesian mathematics already changed the way we browse the internet or look for information online. Before Google came along the way to find what you where looking for was through a directory. I just went to yahoo.com and couldn´t find the directory anywhere on their homepage; just like point-based spam systems which was proven to be completely obsolete compared to statistics. In the past,  everything had to be classified by people into directories and sub directories. This is very similar to the point approach.  As you create more and more rules or subdirectories you find that you get a lot more false positives; if the website is a travel website then it goes in the travel directory etc.</p>
<p>Statistics takes a completely different approach by looking through the entire website and matching it with your search term.  So far, it has proven to be a much more effective way of locating information online.</p>
<p>By using statistics we are able to start to personalize a user experience, and do automatic classification. There are lots of machine learning systems out there.   However the jump between using a good system to a great system is a lot smaller than the jump from no system to a good system.  In the field of finance it is the same.  Initially,  with the use of systems such as an option pricing formula, people where able to see in a market which was blind before. Although we all know the limitations of this simple formula, it is still pretty good. Currently we are seeing trading desks come up with more and more accurate models (and price discrepancies get smaller and smaller).  But none of those systems made as much of a difference as the initial one. I think we will see this in search engines: if someone creates search results that are slightly better than Google´s,  it will not be enough.   In fact, there are now companies (such as ask.com) which already claim to have done just that, but the cost here of getting the results perfect is not as big as in finance. (people can just look at the next result). One area where we should see increasingly better algorithms is in spam.  There are already techniques (such as spam bombing, etc) to get past Bayesian filters. The future of this area will certainly be very interesting.</p>
<p>A Bayesian approach can go two ways.   Up until now we have seen a lot of the first one: the ability to classify results. The next change that we will see with machine learning is the ability to classify people. Programs such as AJAX made websites easier to surf (when used correctly).  Machine Learning techniques have the ability to present us with the information that we <em>want </em>to see. Currently, most websites follow a directory based approach; first you pick your vertical.  For example, I want to see world or business news or I add my RSS feeds to my favorite reader.  I then browse through them one by one. This should be replaced (just like yahoo directories) with  a one page solution. There is no reason you should not be able to go to your homepage and see exactly what you want. With the website learning, you see what you’re interested in<em> as</em> you surf.</p>
<p>Often the simpler the system is, the better it performs.  The first jump from no system to machine learning is often the biggest because it changes the way you behave; everything after that is just there to squeeze out accuracy and performance. There are many techniques that I am currently using for work, as well as learning.  Specifically, Neural Networks (a whole other beast) and, more recently, Bayesian Networks. A word of caution though: Do not expect machine learning to be perfect. “I think it’s possible to stop spam, and that content-based filters are the way to do it. ” Paul Graham said. We still have not stopped spam. And I don’t think we will do so any time soon.</p>
<p><span style="font-family: "Verdana","sans-serif""> </span></p>
<p>-Greg</p>
<p class="MsoNormal">
]]></content:encoded>
			<wfw:commentRss>http://greggurevich.com/2007/04/19/the-rise-of-statistics/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Mikhail started up a blog</title>
		<link>http://greggurevich.com/2006/03/28/mikhail-started-up-a-blog/</link>
		<comments>http://greggurevich.com/2006/03/28/mikhail-started-up-a-blog/#comments</comments>
		<pubDate>Tue, 28 Mar 2006 04:34:23 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
		
		<category>ClickFacts</category>

		<category>Personal</category>

		<guid isPermaLink="false">http://greggurevich.com/2006/03/28/mikhail-started-up-a-blog/</guid>
		<description><![CDATA[   Mikhail started up a blog at http://ledvich.com/blog/. I work with Mikhail at ClickFacts, and he is a marketing machine. Hopefully he will write a little more than me. (I haven’t posted since November) It&#8217;s not that I don&#8217;t have anything to say, a ton of stuff has been happening its just that [...]]]></description>
			<content:encoded><![CDATA[<p>   Mikhail started up a blog at <a href="http://ledvich.com/blog/">http://ledvich.com/blog/</a>. I work with Mikhail at ClickFacts, and he is a marketing machine. Hopefully he will write a little more than me. (I haven’t posted since November) It&#8217;s not that I don&#8217;t have anything to say, a ton of stuff has been happening its just that most of it is confidential, and when you get home at 2 in the morning every night writing is the last thing on your mind.<br />
   Lehigh is just about over, I have finals, although I barely make it to class anymore, and when I do I usually end up coding and talking business on my cellphone / pda. I was going to stop after midterms, but I did well enough on them that there really is no point. Needless to say I can’t wait to finish up school and to get out into the “real world.”</p>
]]></content:encoded>
			<wfw:commentRss>http://greggurevich.com/2006/03/28/mikhail-started-up-a-blog/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Tremblant over break</title>
		<link>http://greggurevich.com/2005/11/26/tremblant-over-break/</link>
		<comments>http://greggurevich.com/2005/11/26/tremblant-over-break/#comments</comments>
		<pubDate>Sat, 26 Nov 2005 18:14:07 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
		
		<category>Personal</category>

		<guid isPermaLink="false">http://greggurevich.com/?p=8</guid>
		<description><![CDATA[
Went to Canada during winter break, spent the days skiing and the nights coding. Good break from school, and got lots of work done.

]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/21587941@N00/72217616"><img src="http://static.flickr.com/20/72217616_5682e1c8b7_s.jpg" width="75" height="75" alt="Russian Thanksgiving 021" border="0" /></a><br />
Went to Canada during winter break, spent the days skiing and the nights coding. Good break from school, and got lots of work done.
</p>
]]></content:encoded>
			<wfw:commentRss>http://greggurevich.com/2005/11/26/tremblant-over-break/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Working Late</title>
		<link>http://greggurevich.com/2005/11/15/working-late/</link>
		<comments>http://greggurevich.com/2005/11/15/working-late/#comments</comments>
		<pubDate>Tue, 15 Nov 2005 07:20:16 +0000</pubDate>
		<dc:creator>Administrator</dc:creator>
		
		<category>Uncategorized</category>

		<guid isPermaLink="false">http://greggurevich.com/?p=4</guid>
		<description><![CDATA[Something about working late at night makes me more productive, that ungodly hour where all the sane people are sleeping. I am not sure whether it’s the urgency of knowing that the night is coming to an end, or just that fact there are no interruptions, but around 1 am is when my productivity really [...]]]></description>
			<content:encoded><![CDATA[<p>Something about working late at night makes me more productive, that ungodly hour where all the sane people are sleeping. I am not sure whether it’s the urgency of knowing that the night is coming to an end, or just that fact there are no interruptions, but around 1 am is when my productivity really picks up. This summer we would regularly work till 4 or 5 in the morning, with classes I do not have the option of sleeping till 12, but that doesn’t mean I can’t try. </p>
]]></content:encoded>
			<wfw:commentRss>http://greggurevich.com/2005/11/15/working-late/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
