<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>20bits &#187; Articles</title>
	<atom:link href="http://20bits.com/category/articles/feed/" rel="self" type="application/rss+xml" />
	<link>http://20bits.com</link>
	<description>Driven by Data</description>
	<lastBuildDate>Wed, 24 Aug 2011 16:54:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.1</generator>
		<item>
		<title>The Value of a Social Commerce Referral</title>
		<link>http://20bits.com/articles/the-value-of-a-social-commerce-referral/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-value-of-a-social-commerce-referral</link>
		<comments>http://20bits.com/articles/the-value-of-a-social-commerce-referral/#comments</comments>
		<pubDate>Mon, 22 Aug 2011 18:44:37 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[commerce]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[social]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=825</guid>
		<description><![CDATA[At Everlane we recently shut down a side-project of ours, Indie Cases, where we were selling some pretty sweet iPhone cases. Over the month and a half that the store was live we saw some good traffic from up-and-coming social commerce sites like Svpply and Pinterest. I thought I&#8217;d take the time to share some [...]]]></description>
			<content:encoded><![CDATA[<p>At <a href="http://www.everlane.com">Everlane</a> we recently shut down a side-project of ours, <a href="http://www.indiecases.com">Indie Cases</a>, where we were selling <a href="http://www.indiecases.com/collections/frontpage/products/made-to-measure">some</a> <a href="http://www.indiecases.com/collections/frontpage/products/staches-in-space">pretty</a> <a href="http://www.indiecases.com/collections/frontpage/products/peace">sweet</a> iPhone cases.  Over the month and a half that the store was live we saw some good traffic from up-and-coming social commerce sites like <a href="http://svpply.com">Svpply</a> and <a href="http://pinterest.com">Pinterest</a>.</p>
<p>
I thought I&#8217;d take the time to share some of the numbers.
</p>

<h2>The Sites</h2>
<p>
Startup ideas come in bunches, and these social commerce sites are no exception.  They all are built around aiding discovery, typically by allowing users on the site to share &#8220;finds&#8221; from around the web  and build a following.
</p>
<p>
These finds may or may not be products and have a price tag associated with them.  They may also be restricted to a specific category (e.g., women&#8217;s high fashion) on the site.
</p>
<p>
Here&#8217;s a list of all the sites I know.  If you know any other, please, <a href="mailto:jesse@20bits.com">send me an email</a> and I&#8217;ll update the list.</p>
<ul style="list-style: none;margin-left: 20px;">
<li><a href="http://buyosphere.com">Buyosphere</a></li>
<li><a href="http://ly.st">Ly.st</a></li>
<li><a href="http://notcouture.notcot.org/">NotCouture</a></li>
<li><a href="http://pinterest.com">Pinterest</a></li>
<li><a href="http://svpply.com">Svpply</a></li>
<li><a href="http://thefancy.com">The Fancy</a></li>
</ul>

<p>
Of these sites, only Pinterest, Svpply, and The Fancy sent any traffic to <a href="http://www.indiecases.com">Indie Cases</a>.
</p>

<h2>The Numbers</h2>
<p>
The hope for these startups is that referrals from these sites are worth more than the average, non-qualified visitor.  Some, like The Fancy, are even experimenting <a href="http://shop.thefancy.com">directly with commerce</a>.
</p>

<p>
<table class="monthly-data" style="margin: 0 auto">
  <tr class="top">
    <th colspan="4">Value of a Social Commerce Visitor</th>
  </tr>
  <tr class="odd">
    <td>Site</td>
    <td>Conversion %</td>
    <td>$/visitor</td>
  </tr>
  <tr>
    <td>Pinterest</td>
    <td>0.93%</td>
    <td>$0.31</td>
  </tr>
  <tr class="odd">
    <td>Svpply</td>
    <td>3.70%</td>
    <td>$1.09</td>
  </tr>
  <tr>
    <td>The Fancy</td>
    <td>0.48%</td>
    <td>$0.13</td>
  </tr>
</table>
</p>

<p>
I left out the traffic number deliberately, but will say this: The Fancy drove approximately 10x the traffic of Svpply, which drove the least amount of traffic of the three.
</p>

<h2>A Wrinkle</h2>
<p>
Why did Svpply convert so much better than Pinterest or The Fancy?  Well, half the purchases from Svpply were of <a href="http://www.indiecases.com/products/class-a-cellphone">this iPhone case</a>, which Ben Pieratt, CEO of Svpply, tweeted about directly.</p>

<p>
<a href="http://twitter.com/#!/pieratt/status/98392249120993280"><img src="http://20bits.com/wp-content/uploads/2011/08/Screen-shot-2011-08-22-at-12.07.00-AM.png" alt="" title="Ben Pieratt &lt;3 Indie Cases" width="594" height="288" class="aligncenter size-full wp-image-842" /></a>
</p>

<h2>Social Commerce?</h2>

<p>
Consider the above: a plug from a respected member of a community interested in our products (Svpply) produced roughly the same number of sales as a site driving 10x the traffic (The Fancy).
</p>

<p>
That, in a nutshell, is the promise of social commerce: the right recommendation at the right time from the right person.
</p>

<p>
The success of sites like Svpply, Pinterest, and The Fancy will hinge on their ability to consistently produce that.
</p>

<h2>Additional Info</h2>
<p>
Just for reference, here are the Compete graphs of the sites that sent traffic to <a href="http://www.indiecases.com">Indie Cases</a>.
<a href='http://siteanalytics.compete.com/pinterest.com+svpply.com+thefancy.com/?metric=uv' class="math"><img src='http://grapher.compete.com/pinterest.com+svpply.com+thefancy.com_uv_460.png' /></a>
</p>
<p>
Pinterest dominates in raw site traffic, but the question is whether their referrals are coming with intent to purchase.  I&#8217;m sure they were grilled nonstop about that while they were out fundraising &mdash; a traffic graph like that allows for a lot of leeway. ;)
</p>

<h2>Does this interest you?</h2>
<p>
At <a href="http://www.everlane.com">Everlane</a>, we&#8217;re not building a social commerce platform, we&#8217;re building a full-on store selling our own products.  Indie Cases was a small preview of what&#8217;s coming.
</p>

<p>
If you are an engineer or designer interested in defining what online retail should look like in a world where Twitter and Facebook exist, and YouTube creates stars in weeks (not months), shoot me an email at <a href="mailto:jesse@everlane.com">jesse@everlane.com</a> and let&#8217;s talk.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/the-value-of-a-social-commerce-referral/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Click Hacking for Fun and Profit</title>
		<link>http://20bits.com/articles/click-hacking-for-fun-and-profit/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=click-hacking-for-fun-and-profit</link>
		<comments>http://20bits.com/articles/click-hacking-for-fun-and-profit/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 23:12:38 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[social network analysis]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=758</guid>
		<description><![CDATA[A friend IMed me the other day, asking, &#8220;You know how to make people click on things. I&#8217;m submitting something to Reddit &#8212; can you help me title the post?&#8221; A stark description of my skills, certainly, but it made me laugh and inspired me to write an article about the art of click hacking. [...]]]></description>
			<content:encoded><![CDATA[<p>A friend IMed me the other day, asking, &#8220;You know how to make people click on things.  I&#8217;m submitting something to Reddit &mdash; can you help me title the post?&#8221; A stark description of my skills, certainly, but it made me laugh and inspired me to write an article about the art of click hacking.</p>

<h3>What is click hacking?</h3>
<p>
Aside from being a term I just made up, <strong>click hacking</strong> is a type of <a href="http://en.wikipedia.org/wiki/Social_engineering_(security)">social engineering</a> where the goal is to get someone to click a hyperlink.
</p>

<p>
The link could be the title of a Reddit post, a button on a landing page form that&#8217;s trying to capture your email, or an image in a Facebook ad.  Deception can be, but is not necessarily involved.
</p>

<p>
I&#8217;m going to use Facebook, Hacker News, and Reddit as the primary examples throughout this article because I know them best.  If you have other examples please leave a comment!
</p>

<h3>For Good or Evil</h3>
<p>
All the tactics I&#8217;m about to describe can be used for good or evil, and I&#8217;ve seen each used for both.  Don&#8217;t take this article as an endorsement for spamming, scamming, or other internet trickery.
</p>

<p>
There&#8217;s plenty of grey area, too.  At its start Reddit faked on-site activity to avoid looking like a ghost town.  Was that unethical?  A mistake?
</p>

<p>
None of these tactics are a substitute for generating real value for your customer, though they will help you understand how your customers react to what you present them (and how to incentivize them to react the way you want).  The hard (and most important) parts are still up to you.
</p>

<p>With that disclaimer aside, let&#8217;s dive into specifics.</p>

<h3>Understand Your Audience</h3>
<p>
Above all else, I take the time to understand my audience.  Each online community has its own customs, norms, and standards of behavior.  You have to understand them before you can blend in or stand out as necessary.
</p>

<p>
<a href="http://24.media.tumblr.com/tumblr_ljyor743nc1qzrifqo1_500.png"><img src="http://24.media.tumblr.com/tumblr_ljyor743nc1qzrifqo1_500.png" width="200" align="right" /></a>

For example, Hacker News values civility<sup><a href="http://20bits.com/articles/click-hacking-for-fun-and-profit/#footnote_0_758" id="identifier_0_758" class="footnote-link footnote-identifier-link" title="See, e.g., Some Tips to Improve the Civility on Hacker News.">1</a></sup>, straight-dealing, and intellectual honesty, but can be a little short-sighted and dour.  Reddit values wit (especially puns and in-jokes/memes) and has a strong sense of community justice.  It can also be completely juvenile. 
</p>


<p>
From the click hacker&#8217;s perspective, a pun-filled title could do well on Reddit, but would never see the light of day on Hacker News.
</p>

<h3>Ask for Help or Feedback</h3>
<p>
One way to get people to pay attention is to ask for help.  Hacker News has a &#8220;<a href="http://news.ycombinator.com/ask">Ask HN</a>&#8221; feature which evolved purely from community behavior.  There are similar &#8220;Tell HN&#8221; and &#8220;Review my Startup&#8221; features.
</p>

<p>
For example, here is <a href="http://news.ycombinator.com/item?id=8863">Drew Houston&#8217;s original Hacker News post</a> asking for reviews of Dropbox.  Every startup I know wants to publish a &#8220;showoff&#8221; entry on Hacker News and have it get to the front page, and their motives range from honest (they really want feedback) to self-serving (they just want the attention).
</p>

<h3>Give a Gift</h3>
<p>
Everyone loves gifts, but when we receive them we also feel pressure to reciprocate.  The click hacker can use that pressure to get people to do what they want.
</p>

<p>
This tactic is the difference between &#8220;I baked a cake&#8221; and &#8220;I baked a cake <em>for you</em>.&#8221;  A normal person is obliged to accept and even reciprocate.
</p>

<p>
For example, <a href="http://www.reddit.com/r/programming/comments/e45ch/">this Reddit user</a> wrote a CSS hack to change the appearance of the site &mdash; a little present for anyone using Reddit.  The Reddit community reciprocated by giving him upvotes.
</p>

<p>
Notice how he says he made it &#8220;for Reddit.&#8221;
</p>

<p>
The <a href="http://www.reddit.com/r/pics/comments/9h520/my_wife_made_me_a_reddit_alien_birthday_cake_it/c0cracz">cake example</a> works, too, though. :)
</p>

<h3>Bribery</h3>
<p>
Bribery is the opposite of gift-giving.  In this situation the click hacker says, &#8220;Do this for me and I&#8217;ll give you something.&#8221;  If that something is really awesome people will do almost anything.
</p>

<p>
The stereotypical example here are those scammy ads for free iPods and the like.  All you have to do is fill out this form and click this link and you&#8217;ll be entered to win.  Or, hey, instead maybe you can forward this offer to three of your friends, and if one of them wins, you win, too!
</p>

<p>
This can be done well in certain contexts.  For example, Facebook game developers often trade installs by paying players virtual currency, e.g., &#8220;Install this game to earn ten farm dollars.&#8221;  Or a game developer might have a special landing page offering players virtual currency as a way to encourage the player to click the install button. 
</p>

<p>
The player gets what they want (virtual currency) and the game developer acquired a customer for free.
</p>

<p>
Assuming the terms are clearly outlined there&#8217;s nothing wrong with this.  Of course, there&#8217;s plenty of room for outright dishonesty by promising goods that never arrive.  Don&#8217;t be that guy.
</p>

<h3>Scarcity</h3>
<p>
People want what they can&#8217;t have.  Or, more accurately, people want what they think they can&#8217;t have.
</p>

<p>
Gilt, for example, relies heavily on this tactic to drive sales.  Better click that buy button now, because we&#8217;re running out!
</p>

<p>
In Gilt&#8217;s case the scarcity is legitimate, but it can be artificial, too.  A game developer might make an in-game object ultra-rare in order to drive specific behavior.  Limited beta invites are a tried-and-true method of generating traffic and interest; give out 20 invites to an audience of 20,000 and collect follow-up information for the people who don&#8217;t get there in time.
</p>

<p>
Scarcity also works with time, e.g., &#8220;If you install this game within the next thirty seconds, you&#8217;ll get a ten credit bonus.&#8221;  Tick, tick, tick, tock.
</p>

<h3>Social Proof</h3>
<p>
Similar to scarcity, people don&#8217;t want to feel like they&#8217;re missing out.  If they see other people doing something, especially people they know or respect, they&#8217;ll be more likely to do it.
</p>

<p>
This is the <em>raison d&#8217;être</em> for Facebook&#8217;s <a href="http://developers.facebook.com/docs/reference/plugins/facepile/">Facepile</a> widget.  Click that Like button.  You know you want to.  Come on, all your friends are doing it.
</p>

<p>
Even adding faces of random but friendly-looking people can be effective in getting people to click through.  Here&#8217;s a screenshot of Match.com&#8217;s homepage.
</p>

<a href="http://20bits.com/wp-content/uploads/2011/04/match-com.jpg"><img src="http://20bits.com/wp-content/uploads/2011/04/match-com.jpg" alt="Match.com homepage" title="Match.com homepage" width="500" class="aligncenter math size-medium wp-image-811" /></a>

<p>Look at all those happy people.  Don&#8217;t you want to be happy, too?</p>

<h3>Pick a Fight</h3>
<p>
Rather than trying to blend in with a community, some times it helps to generate controversy.  This can either be you vs. the community, or setting two factions within a community against each other.
</p>

<p>
I&#8217;ll share a story myself.  Back in 2009 I was working on a polling application for Facebook.  People would vote in polls and their vote would appear in their newsfeed.
</p>

<p>
The most controversial topics were the most successful, so to get things started I created a poll: &#8220;Do you support same-sex marriage?&#8221;
</p>

<p>
This was just after the Proposition 8 fight, so I decided to run two sets of ads on Facebook: one targeting 50 miles around San Francisco and another targeting 50 miles around Salt Lake City.
</p>

<p>
I can&#8217;t say it was a win for civil political discourse, but it generated enough controversy to make that poll viral and in turn make the entire application viral.<sup><a href="http://20bits.com/articles/click-hacking-for-fun-and-profit/#footnote_1_758" id="identifier_1_758" class="footnote-link footnote-identifier-link" title="The way that application died makes a good story, too.">2</a></sup>
</p>

<h3>Environmental Flaws</h3>
<p>
Sometimes the environment has flaws which allow the click hacker to do some interesting things.
</p>

<p>
When Facebook first launched their user-generated polling product, I remember seeing several polls asking questions like, &#8220;Which model is hotter?&#8221;  The possible answers were URLs of images.
</p>

<p>
Unfortunately for the user, clicking on the text of the answer (the URL in this case) would register a vote and post that vote to your newsfeed.  Their friends, who of course also wanted to see pictures of hot models,  clicked the URLs, accidentally voted, and spread the poll to their friends.  Oops!
</p>

<p>
Click hackers were using this mis-feature deliberately to spread new polls across Facebook.<sup><a href="http://20bits.com/articles/click-hacking-for-fun-and-profit/#footnote_2_758" id="identifier_2_758" class="footnote-link footnote-identifier-link" title="There&amp;#8217;s a blog post describing data from this social hack &amp;mdash; if anyone has it please share it in the comments!">3</a></sup>
</p>

<p>
I don&#8217;t think many people will disagree when I say that most of the growth on the Facebook Platform from 2007-2009 was built on similar environmental flaws.  Here&#8217;s a two-year old example from Slide&#8217;s Super Pocus:
</p>

<img src="http://20bits.com/downloads/super-pocus.png" alt="" title="Super Pocus click hacking" class="alignnone size-medium wp-image-378 math">

<h3>What else?</h3>
<p>What else am I missing?  Post examples in the comments.</p>

<p>Have you ever been &#8220;tricked&#8221; into clicking something?  (I know I have.) Have a funny story?  Have experience optimizing landing pages, etc. and think about this all the time anyway?  Leave a comment and share!</p>

<h3>Everlane is Hiring</h3>
<p>
My startup, <a href="http://www.everlane.com">Everlane</a>, is hiring.  We&#8217;re trying to take the best elements of shopping offline &mdash; visual merchandising, personality, and curation &mdash; and bring them online.  Check out our jobs page at <a href="http://www.everlane.com/jobs">http://www.everlane.com/jobs</a> and send an email to <a href="mailto:jesse@everlane.com?subject=[20bits]%20Everlane">jesse@everlane.com</a> if you&#8217;re interested in talking.
</p><ol class="footnotes"><li id="footnote_0_758" class="footnote">See, e.g., <a href="http://news.ycombinator.com/item?id=1400882">Some Tips to Improve the Civility on Hacker News</a>.</li><li id="footnote_1_758" class="footnote">The way that application died <a href="http://www.huffingtonpost.com/2009/10/01/facebook-obama-death-poll_n_306637.html">makes a good story</a>, too.</li><li id="footnote_2_758" class="footnote">There&#8217;s a blog post describing data from this social hack &mdash; if anyone has it please share it in the comments!</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/click-hacking-for-fun-and-profit/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Speed vs. Certainty in A/B Testing</title>
		<link>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=speed-vs-certainty-in-ab-testing</link>
		<comments>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/#comments</comments>
		<pubDate>Mon, 25 May 2009 17:00:18 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[ab testing]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[tactics]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=717</guid>
		<description><![CDATA[A/B testing is a great tactical tool for studying customer behavior on the web. But like any randomized trial there&#8217;s some chance that the improvement we measure is just statistical noise. How worried should we be that the feature we thought improved our product actually does nothing, or worse, hurts our bottom line? How can [...]]]></description>
			<content:encoded><![CDATA[<p>
<a href="http://20bits.com/articles/an-introduction-to-ab-testing/">A/B testing</a> is a great tactical tool for studying customer behavior on the web.  But like any randomized trial there&#8217;s some chance that the improvement we measure is just statistical noise.
</p>

<p>
How worried should we be that the feature we thought improved our product actually does nothing, or worse, hurts our bottom line?  How can we ever really know that we&#8217;re making the correct decision?  And is it better to run tests more quickly or more accurately?
</p>

<p>
The answers to these questions depend on the cost of a bad decision.  If mistakes are cheap then it&#8217;s better to make 1,000 decisions and get only 60% of them right than to make 100 decisions and get 100% of them right.
</p>

<p>
One way to achieve this balance in the context of A/B testing is to tune the confidence level.
</p>

<h3>Tuning the Confidence Level</h3>
<p>
Intuitively, the confidence level of an A/B test tells you how certain you can be of the result of the A/B test.  For example, a confidence level of 95% means that there&#8217;s a 5% chance that a statistically significant result is actually random variation, i.e., there is a 5% chance of a false positive.
</p>

<p>
Of course, we&#8217;re free to choose some other confidence level besides 95%.  We could choose 80%, 90%, or 99.999%.  A higher confidence level requires more data before reaching statistical significance, but we will be more certain of the result.
</p>

<p>
If you&#8217;re not comfortable with the nuts and bolts of statistical analysis, confidence levels, and A/B testing I recommend reading my article about <a href="http://20bits.com/articles/statistical-analysis-and-ab-testing/">statistical analysis and A/B testing</a>, which explains exactly how one &#8220;chooses&#8221; a confidence level.
</p>

<p>
In short, the confidence level acts as a dial between speed and certainty, and we&#8217;re free to choose where to set that dial depending on the priorities of our business or product.
</p>

<h3>Speed vs. Certainty</h3>
<p>
So where on the speed-certainty spectrum should you, as a product manager or startup entrepreneur, sit?
</p>

<p>
Mike Cassidy has a great presentation where he argues that <a href="http://www.slideshare.net/dmc500hats/best-strategy-is-speed-startup2startup-may-2008">speed is the primary business startegy</a> for startups.
</p>

<p>
Why is speed great for startups?  Because mistakes are cheap and calculated risks are rewarded.  Most product decisions can be undone, and important early tests can be redone at a higher confidence level when the product has more traction.
</p>

<p>
But mistakes aren&#8217;t always cheap.  Here are some factors that increase the cost of a mistake.
</p>

<h4>Volume</h4>
<p>
Volume is leverage.  If you have millions of customers, like Google or Amazon, a 1% improvement to the bottom line is a huge win.  Conversely, a 1% mistake is a huge hit. 
</p>

<p>
Fortunately this problem helps mitigate itself.  Increased volume affords you the luxury of running A/B tests at a higher confidence level in the same amount of time.
</p>

<h4>Reversibility</h4>
<p>
Most product decisions in a consumer technology startup can be undone, for a price.  For example, it&#8217;s easy to undo a bad decision for a web-based product, slightly harder to undo a decision for desktop software, and very difficult (and costly) to undo a decision for a physical product.
</p>

<p>
The less reversible a decision is the more certain you should be before you make it.  In the context of A/B testing a product feature this means a higher confidence level, even if it takes longer to run the test.
</p>

<h4>Real Money</h4>
<p>
Imagine you&#8217;re an ad network.  You&#8217;re constantly A/B testing formatting, positioning, offers, etc. to see which performs best.  Making a mistake in this regard costs your publishers money.
</p>

<p>
Like volume, money creates leverage.  But it is more complicated than that: publishers don&#8217;t just want increased revenues, they want reliable cash flow.  That is, when money is involved, not only do you have to perform better but you have to perform more consistently because of phenomena like the <a href="http://en.wikipedia.org/wiki/Peak-end_rule">peak-end rule</a>.
</p>

<p>
In this case a &#8220;three steps forward one step back&#8221; strategy might actually be worse than going step-by-step in the right direction, even if the former averages out to better performance.
</p>

<h3>Conclusions</h3>
<p>
Maintaining momentum in a startup isn&#8217;t about making <em>only</em> correct decisions &mdash; it&#8217;s about making <em>enough</em> correct decisions.  This presents a continuum from speed to certainty.  At one extreme you run the business with a magic eight-ball; at the other you agonize over every detail until you&#8217;re 100% certain that you&#8217;ve made the correct choice.
</p>

<p>
This thought process extends naturally to A/B testing where the idea of &#8220;certainty&#8221; and &#8220;cost&#8221; can be quantified.  To recap:
</p>

<ul>
	<li>A/B testing is a great tactical tool for testing <em>specific</em> hypotheses about your customers.</li>
	<li>However, there is a tradeoff between speed and certainty, controlled by the confidence level of the A/B test.</li>
	<li>The cost of doing A/B tests quickly is that you will make more wrong decisions, but that is ok if mistakes are cheap.</li>
	<li>For example, it&#8217;s better to make 1,000 decisions and get only 60% of them right than to make 100 decisions and get 100% of them right, all else being equal.</li>
</ul>

<h3>A Spreadsheet Model</h3>
<p>
Below is a little spreadsheet model that illustrates all my points above.
</p>

<p>
The two independent variables are the gain from a good decision and the cost of a bad decision.  The spreadsheet assumes a fixed time period, so a higher confidence level means more certainty but <strong>fewer tests</strong>.  The ideal confidence level is highlighted as you change the parameters of the model.
</p>

<p>
You can download <a href="http://20bits.com/downloads/confidence-model.xls">the A/B testing confidence model</a> here.
</p>

<p>
For the statistically inclined this model assumes that traffic increases linearly over time, that the sample statistic is normally distributed, and that a one-tailed t-test is the appropriate statistical test.
</p>

<h3>Credits</h3>
<p>
This article was inspired by a conversation with <a href="http://startup-marketing.com/">Sean Ellis</a> and edited by my <a href="http://twitter.com/aleeeex">wonderful girlfriend</a>, who is probably going to yell at me for linking to her Twitter account.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>8 Tips for Crafting Metrics That Matter</title>
		<link>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=8-tips-for-crafting-metrics-that-matter</link>
		<comments>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/#comments</comments>
		<pubDate>Tue, 12 May 2009 17:00:31 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data-driven-development]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[product development]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=661</guid>
		<description><![CDATA[Metrics are the marketer&#8217;s microscope. They show him what his customers are actually doing, as opposed to what they say they are doing or intend to do. With proper metrics he can make decisions faster and more accurately. You can decide to measure anything, but what metrics matter and what ones are just for show? [...]]]></description>
			<content:encoded><![CDATA[<p>
Metrics are the marketer&#8217;s microscope.  They show him what his customers are actually doing, as opposed to what they say they are doing or intend to do. 
With proper metrics he can make decisions faster and more accurately.
</p>

<p>
You can decide to measure anything, but what metrics matter and what ones are just for show?  Here are some rules I hope will guide you toward creating meaningful metrics that help, rather than hinder, the decision-making process.
</p>


<h3>Be Actionable</h3>
<p>
If I had to give a one-sentence answer to the question &#8220;What metrics should I implement for my product?&#8221; it would be &#8220;Whatever metrics are actionable.&#8221;  This means the line from a question to a metric and the line from a metric to an action should be as short as possible.
</p>

<p>
Most of the tips below are meant to focus attention on this issue.  What can you do to make sure your metrics are actionable?
</p>

<h3>Be Understandable and Trustworthy</h3>
<p>
Do you understand what your metric measures?  Does everyone in your organization also understand and do they trust it?
</p>

<p>
Trust is the important part.  Everyone has to trust the metric if you&#8217;re going to use it to make decisions, otherwise you&#8217;ll be getting constant pushback.  This will slow the decision-making process and cause a lot of ill-tempered arguments.
</p>

<h3>Measure Results</h3>
<p>
Does your metric measure customer behavior or a correlate of customer behavior?  In the past approximations and correlations were necessary because measuring behavior directly was hard, but on the web there&#8217;s no excuse &mdash; you have access to every single thing a person does on your site, down to where their mouse is hovering and for how long.
</p>

<p>
For example, if you want to know how good Twitter is for your business don&#8217;t measure the number of positive tweets about your company.  Instead, measure how many customers it drives to your product and how much money those customers put in your hands.
</p>

<h3>Understand the Downside</h3>
<p>
What would you do if your metric were 50% off the mark today?  Would you be able to articulate why this is a problem for your business?  What it costs you?  Would you know where to start looking for possible causes?
</p>

<p>
As an example, I&#8217;ve worked with a startup that used &#8220;number of MySpace friends&#8221; as a go-to metric in every marketing meeting.  Is that really material to the business? 
</p>

<p>
What would happen if tomorrow we had half as many MySpace friends? Would we lose $100?  $10,000?
</p>

<p>
This number says nothing about the value of MySpace as a marketing channel, which in the most charitable case is what it is trying to approximate, and the downside is completely ambiguous.  Is the number of MySpace friends tied to anything meaningful?
</p>

<p>
Like the Twitter example above, if I thought MySpace were an important marketing channel for my product I&#8217;d be measuring things like the number of qualified leads from MySpace and the value they generate for the business.
</p>

<h3>Understand the Upside</h3>
<p>
Conversely, ask yourself, &#8220;What value does improving the metric bring to the company?&#8221;  Some metrics are blindingly obvious in this regard, e.g., top-line revenue numbers and some &#8220;efficiency&#8221; metrics like effective CPM and revenue per user. 
</p>

<p>
Some metrics are less obvious.  What about page views?  Can you imagine a scenario where your pageviews double but the net effect was bad for your business?
</p>

<p>
Metrics like these are dangerous because they lull you into a false sense of security.  Everything on your analytics dashboard is going up and to the right, but the fundamentals of the business might still be suffering.
</p>

<h3>Don&#8217;t Be Ambiguous</h3>
<p>
Does your metric measure one thing and one thing only?  Or is it really an aggregation of several variables, each of which can rise or fall independently?
</p>

<p>
A good example of an ambiguous metric is the notion of a &#8220;daily active user,&#8221; or the total number of people who interacted with your product today.  This number is ambiguous because it is really the sum of two metrics: the number of new users and the number of returning users.
</p>

<p>
Ambiguous metrics are bad because they obscure the underlying variables that truly reflect customer behavior and delay the decision-making process as you are forced to determine which of these variables is actually affecting the aggregate metric.  Furthermore, it&#8217;s possible that one variable accounts for all the growth in the metric, e.g., you have millions of new users per day but no returning users.
</p>

<p>
This latter scenario has been the death of many Facebook apps.
</p>

<h3>Segment by Purpose</h3>
<p>
Whenever I&#8217;m building a web product I divide usage into key segments.  Generally these are acquisition, retention, engagement, and monetization.  That is, how do customers find my product?  Are those customers coming back?  Are they doing the things I want or need them to do?  And how much money are those customers making me?
</p>

<p>
See Dave McClure&#8217;s <a href="http://www.slideshare.net/dmc500hats/startup-metrics-for-pirates-long-version">Startup Marketing for Pirates</a> presentation, which focuses on this idea.
</p>

<p>
Doing this lets you tackle each segment independently and allows for a sharper product focus.  Early on you might want to focus on acquisition, or maybe you want to focus on monetization from day one.  Eventually you&#8217;ll starting caring about longer-term metrics like retention and you won&#8217;t be distracted or overwhelmed by unrelated metrics from different segments.
</p>

<p>
You might select one or two from each category to act as top-line variables in a dashboard that you look at every day, e.g., new users (by channel), returning users (by channel), activity, and revenue.
</p>

<h3>Appropriate Granularity</h3>
<p>
Sometimes you need a bird eye&#8217;s view and sometimes you need a tunneling electron microscope.  Know when you need which.
</p>

<p>
As a general rule of thumb I focus on the microscopic when I am designing specific optimizations but focus on the macroscopic when I&#8217;m determining whether the decisions we&#8217;re making are working.  Another way is to finish this sentence, &#8220;I know my product is healthy because&#8230;&#8221;
</p>

<p>
You&#8217;d never finish that sentence with &#8220;because the click-through-rate on my login page is 20%.&#8221;  You&#8217;d say something like &#8220;because 80% of my customers return every week&#8221; or &#8220;because our revenue is growing by 5% month-over-month.&#8221;
</p>

<h3>Conclusions</h3>
<p>
These are just tips, not hard-and-fast rules.  They are meant to focus the discussion of what metrics matter and why because the choice of metrics has a long-lasting impact if your team is committed to building a data-driven culture.
</p>

<p>
If you have any tips on deciding what metrics matter and why, leave them in the comments!  This list wasn&#8217;t meant to be exhaustive and I know (or hope) people will have some strong opinions!
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Building a Social Network, Island by Island</title>
		<link>http://20bits.com/articles/building-a-social-network-island-by-island/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=building-a-social-network-island-by-island</link>
		<comments>http://20bits.com/articles/building-a-social-network-island-by-island/#comments</comments>
		<pubDate>Mon, 11 May 2009 15:30:36 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[density]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hi5]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[product development]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[xfire]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=649</guid>
		<description><![CDATA[A necessary condition for building a self-sustaining social network is density. We understand this intuitively. After all, a network of one person is hardly a &#8220;network&#8221; at all. Metcalfe&#8217;s Law, which states that the value of a network grows in proportion to the square of the number of users of that network, express this idea [...]]]></description>
			<content:encoded><![CDATA[<p>
A necessary condition for building a self-sustaining social network is density.  We understand this intuitively.  After all, a network of one person is hardly a &#8220;network&#8221; at all.
</p>

<p>
Metcalfe&#8217;s Law, which states that the value of a network grows in proportion to the square of the number of users of that network, express this idea formally.  The value of a social network rests in its ability to foster communication, in its connections.
</p>

<p>	
If you&#8217;re building a social network, whether it&#8217;s a destination website or an application that exists on another social network, density must figure into key strategic decisions.  Let&#8217;s see how.
</p>

<h3>Islands</h3>
<p>
One way to think about social networks is as a network of networks.  On Facebook, for example, if I graph the connections between all my friends I see distinct groups: my high school friends, my college friends, my current circle of friends, and my professional network.
</p>

<p>
Each group of people is more or less isolated from each other.  Density exists within each of these islands, but not between them.  I&#8217;m fairly certain this is a topological property of any social network.
</p>

<h3>Case Study: Facebook</h3>
<p>
Facebook started at Harvard and was initially college-only.  Their growth strategy was explicit from day one: move from school to school as demand warranted.  I&#8217;ve been told that Sean Parker wouldn&#8217;t consider opening up access to a new school until at least several dozen students from that school requested an account.
</p>

<p>
Each school was an island.  Once Facebook saturated a specific set of colleges it moved onto the next round.  Eventually there was enough anticipatory buzz that they could launch at large, state schools without risk of fading away.
</p>

<p>
They still pursue this strategy today.  After establishing a critical density among colleges they opened up access to high schools and then to everyone with an email address.  From there they started moving country-to-country.
</p>

<p>
The countries where Facebook is having the most difficulty gaining traction are the ones with already-established social networks, like Germany with <a href="http://en.wikipedia.org/wiki/StudiVZ">StudiVZ</a>.  In fact, if you look at <a href="http://gawker.com/tech/data-junkie/the-world-map-of-social-networks-273201.php">this old map</a> of the most popular social network in each country, you get an idea of how isolated this country-by-country growth really is.
</p>

<h3>Other Networks</h3>
<p>
Facebook isn&#8217;t the only example of a social network who grew this way.  hi5 has a similar story, starting with smaller markets overseas and spreading from country-to-country.  Or Craigslist, by starting small in San Francisco and eventually becoming a presence in most major US cities.
</p>

<p>
The MySpace team had a background in direct marketing, which is all about targeting specific offers at the people who are most likely to respond.  They started with the club scene in LA and grew from there.
</p>

<p>
The key to all these strategies was density.  
</p>

<p>
If you&#8217;re launching a new social service, even if your end goal is to have everyone and their mother using it, it&#8217;s important to understand the impact density has on the growth
</p>

<h3>Multiple Dimensions of Density</h3>
<p>
So far the only kind of density we&#8217;ve talked about is network density, i.e., multiple people connected through their shared use of a service.  You could call this &#8220;product density.&#8221;
</p>

<p>
Sometimes product density isn&#8217;t enough.  Take IM, for example, or any network that requires synchronous communication.  Not only do two people have to be using the same product but they have to be using it at the same time.  What good is your friend being on IM if you&#8217;re never awake at the same time?
</p>

<p>
<a href="http://en.wikipedia.org/wiki/Xfire">Xfire</a>, an IM client for gamers, is an example of a product that innovated in this space by tackling a segment of customers who were already interacting synchronously.
</p>

<p>
Mobile social networks take this to an even greater extreme.  To connect with people on Loopt or Google Latitude not only do we have to be using the same product at the same time, but we have to be in the same place!
</p>

<p>
This isn&#8217;t to say building these networks is impossible.  Rather, they come with an extra handicap in the form of reduced density.  Overcoming that problem has to be a key part of the product strategy.
</p>

<h3>Conclusion and Counterexamples?</h3>
<p>
Most product strategy discussions, in my experience, are focused on acquisition or other topline metrics that go &#8220;up and to the right.&#8221;  Instead, if you&#8217;re building social software, I believe density is a necessary condition for long-term success and needs to be a part of the strategy discussion from day one.
</p>

<p>
First, understand the density requirements for your product.  Do customers need to sign up for the same service?  Do they need to be using it at the same time?  Do they need to be in the same place?  Is there anything you can do to lower the density requirement?
</p>

<p>
Second, build a &#8220;depth first&#8221; strategy.  Are there any naturally dense customer segments that might fit your product?  Do you have the ability to target specific demographics or segments for acquisition?  Which ones respond positively and is it possible to build density there?
</p>

<p>
Once you&#8217;ve achieved sufficient density on one island hop to the next and repeat.
</p>

<p>
And if anyone out there can think of any counterexamples &mdash; social networks or services that got big &#8220;all at once&#8221; &mdash; leave a comment and let me know!  I honestly can&#8217;t think of any.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/building-a-social-network-island-by-island/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>What Verna Taught Me</title>
		<link>http://20bits.com/articles/what-verna-taught-me/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=what-verna-taught-me</link>
		<comments>http://20bits.com/articles/what-verna-taught-me/#comments</comments>
		<pubDate>Wed, 06 May 2009 15:30:28 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[customers]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[product developer]]></category>
		<category><![CDATA[strategy]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=627</guid>
		<description><![CDATA[When I was student at the University of Chicago I worked for Residential Computing, or ResCom. ResCom was responsible for maintaining all the computer labs and IT systems in the residential dorms. I had two jobs. The first was to help manage the dorm computer labs. If a new virus broke out and computers were [...]]]></description>
			<content:encoded><![CDATA[<p>
When I was student at the University of Chicago I worked for Residential Computing, or ResCom.  ResCom was responsible for maintaining all the computer labs and IT systems in the residential dorms.
</p>

<p>
I had two jobs.  The first was to help manage the dorm computer labs.  If a new virus broke out and computers were banned from the network because they were spamming everybody on campus, I had to go in and clean it up.  I still remember when the <a href="http://en.wikipedia.org/wiki/Blaster_(computer_worm)">Blaster worm</a> infected most of the Windows machines on campus.
</p>

<p>
The second job was to help build a web-based dorm management system called Chopin.  This system was at the center of the daily operation of the dorms.  Students could use it to submit work requests and report problems with the dorms.  Staff could use it to send out mass mailings, sell students printing credits for the dorm printers, and any number of other routine tasks.
</p>

<h3>Enter Verna</h3>

<p>
Verna was a front-desk clerk at one of the dorms.  She was responsible for helping students if a problem came up and used Chopin to get her job done.
</p>

<p>
She had also lived in Hyde Park, the neighborhood surrounding the University, for most of her life and just wanted to do her job without having to deal with whiny students or crappy software.  Chopin was supposed to help her do that.
</p>

<p>
One day while fixing the the front-desk computer I asked Verna, &#8220;What do you think of Chopin?&#8221;  She immediately supplied a laundry list of complaints.  It didn&#8217;t work well, it was confusing, she never knew where to go or what to do, and so forth.
</p>

<p>
It would have been easy to blame her.  Maybe she just needed better training or maybe she wasn&#8217;t trying hard enough.  But that was all ego: I was just upset because she told me the product I helped build was pretty awful.
</p>

<p>
Her critique was absolutely fair.  I had built solutions I liked for problems I wasn&#8217;t even sure existed.  In fact, before then, I had never honestly talked to any of the people who actually <em>used</em> Chopin about the product itself.  In retrospect that seems completely insane.
</p>

<p>
And watching her use Chopin I saw so much waste.  She would click ten times where two would have sufficed.  Some features would never get used, others used for things they weren&#8217;t intended for.  Once I was there, talking with her and watching how she used Chopin, I saw that the whole process was messed up, from top to bottom.  Most of her energy was spent working around problems I had caused!
</p>

<h3>Go and See</h3>

<p>
If my job hadn&#8217;t required that I work on Chopin and get out of the office I never would have even realized there was a problem.
</p>

<p>
That experience taught me that whenever I didn&#8217;t understand a customer&#8217;s frustration or thought that maybe they were feeling this way or that way I should just go ask them before building solutions that might be worse than the problem.  When in doubt, go and see for yourself. Actually, scratch that: <em>always</em> go see for yourself.
</p>

<p>
Too often I&#8217;d pass off mere belief as knowledge, or generalize from a specific set of circumstances to a fundamentally different set of circumstances.  Maybe this solution worked over there, but why should it work over here?  The only people who can validate your product are your customers &mdash; everyone else, including yourself, can wait their turn.
</p>

<p>
Verna taught me that.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/what-verna-taught-me/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Notification Strategies for Social Networks</title>
		<link>http://20bits.com/articles/notification-strategies-for-social-networks/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=notification-strategies-for-social-networks</link>
		<comments>http://20bits.com/articles/notification-strategies-for-social-networks/#comments</comments>
		<pubDate>Tue, 05 May 2009 16:00:02 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[notifications]]></category>
		<category><![CDATA[retention]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=599</guid>
		<description><![CDATA[You&#8217;ve built a social application and launched a new feature. The number of notifications you can send out is constrained. Which set of users should you notify to guarantee the most people start using this new feature? This problem might seem artificial. Why not put up an ad on your product, or send a notification [...]]]></description>
			<content:encoded><![CDATA[<p>
You&#8217;ve built a social application and launched a new feature.  The number of notifications you can send out is constrained.  Which set of users should you notify to guarantee the most people start using this new feature?
</p>

<p>
This problem might seem artificial.  Why not put up an ad on your product, or send a notification to every single person who might be interested?  There are several reasons the number of people you can notify might be constrained.
</p>
<ol>
<li>There is a technical constraint, e.g., Facebook limits the number of application-to-user notifications at the API level.</li>
<li>There is a financial constraint, e.g., you&#8217;re sending notifications over SMS and every message costs you money.</li>
<li>There is a strategic constraint, e.g., sending notifications too frequently causes fatigue and reduces the effectiveness of future notifications.</li>
</ol>

<p>
So, the situation is not too far fetched.  Let&#8217;s investigate the issue.
</p>

<p>
For the rest of the article the &#8220;application&#8221; is going to be a Facebook application and it can only send 100 application-to-user notifications per week.  Which 100 users should we notify?
</p>

<h3>The Basic Considerations</h3>
<p>
In the <a href="http://20bits.com/articles/behavior-adoption-on-social-networks/">linear cascade model</a> when a user in a social network adopts a new behavior there is a probability that each neighbor in the network will adopt it.
</p>

<p>
Under this model we probably wouldn&#8217;t want to notify two people who are friends, and especially not a cluster of friends or a <a href="http://en.wikipedia.org/wiki/Clique_(graph_theory)">clique</a>.  The new feature wouldn&#8217;t spread very far beyond this group.
</p>

<p>
Likewise, we wouldn&#8217;t want to notify people who are very far apart on the social network because a user is more likely to adopt a behavior if more than one of his friends has also adopted it.  So there is a balancing act between notifying users who are close together, to achieve density, and notifying users who are far apart, to achieve breadth.
</p>

<h3>Heuristics and Centrality Measures</h3>
<p>
The easiest solution is to pick 100 random users to notify, but this is also the most naive since it takes into account neither the structure of the network nor likelihood that a person will influence their neighbors.
</p>

<p>
A better<sup><a href="http://20bits.com/articles/notification-strategies-for-social-networks/#footnote_0_599" id="identifier_0_599" class="footnote-link footnote-identifier-link" title="&amp;#8220;Better&amp;#8221; according to what?  As we&amp;#8217;ll see, randomly selecting seed users performs worse than all the other heuristics.">1</a></sup> solution to this problem is to develop a heuristic that ranks every user in the network according to some metric.  If we can only send 100 notifications then they are sent to the first 100 people on this ranked list.
</p>

<p>
The idea here is to use <a href="http://en.wikipedia.org/wiki/Centrality">centrality measures</a> to come up with heuristics.  In graph theory &#8220;centrality&#8221; is a measure of how important an individual node is.
</p>

<p>
The simplest measure is called &#8220;degree centrality&#8221; and is equal to the number of neighbors of a node.  On a social network this is the number of friends of a given user.  So, if you wanted to send out 100 notifications using this heuristic we&#8217;d send notifications to the 100 users with the most friends.  This heuristic involves convincing celebrities to use the new feature.
</p>

<p>
There are other, more complex heuristics.  The Wikipedia article linked above has a list of other centrality measures, and I wrote an article about calculating <a href="http://20bits.com/articles/graph-theory-part-iii-facebook/">eigenvalue centrality</a>, which is similar to PageRank.  Each of these admits a heuristic which can tell us which users to notify.
</p>

<p>
Of course, which strategy works best is hard to know beforehand, as it varies with respect to both time and the underlying notification.  A/B testing this is difficult because the effects are intentionally dependent.  If anyone has a good solution to this that doesn&#8217;t involve collecting massive amounts of data about user behavior I&#8217;d be interested in hearing it!
</p>

<p>
It should be noted that each of these heuristics only takes into account the underlying structure of the graph and not the probability of &#8220;infection.&#8221;  By including the latter we can come up with a nearly exact model of the optimal subset of users to notify.
</p>

<h3>A Global Solution</h3>
<p>
<a href="http://www.cs.cmu.edu/~bmeeder/">Brendan Meeder</a> at CMU pointed me to a great paper that discuss this very topic, <a href="http://www.cs.cornell.edu/home/kleinber/kdd03-inf.pdf">Maximizing the Spread of Inﬂuence through a Social Network</a> by Kempe, et al.
</p>

<p>
Rather than take a localized view of the problem by ranking each node individually, we create a statistical model of how the new feature propagates through the network.
</p>

<p>
First, we start with a finite seed set, A.  In our case A is a set of 100 users.  Say we convert each of these 100 users.
</p>

<p>
In our model if a user <em>u</em> is converted then for each neighbor <em>v</em> there is some probability
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=p_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='p_{u,v}' title='p_{u,v}' class='latex' />
</div>
<p>
that v will also be converted.
</p>

<p>
After the process has run its course some set of users has adopted the new feature.  Because adoption is probabilistic the size of this final configuration is a random variable.  Using the notation from Kempe, et al., for a given seed set A the size of the final set of adopters is a random variable denoted by
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Cvarphi%5Cleft%28A%5Cright%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{\varphi\left(A\right)}' title='\displaystyle{\varphi\left(A\right)}' class='latex' />
</div>

<p>
Our goal is to pick the set A which maximizes the expected value of this random variable.
</p>

<p>
Formally, we want to find the subset A such that
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Csigma%5Cleft%28A%5Cright%29%20%3D%20E%5Cleft%5B%5Cvarphi%5Cleft%28A%5Cright%29%5Cright%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{\sigma\left(A\right) = E\left[\varphi\left(A\right)\right]}' title='\displaystyle{\sigma\left(A\right) = E\left[\varphi\left(A\right)\right]}' class='latex' />
</div>
<p>
is maximized, where
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Csigma%5Cleft%28%5Ccdot%5Cright%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{\sigma\left(\cdot\right)}' title='\displaystyle{\sigma\left(\cdot\right)}' class='latex' />
</div>
<p>
is called the <em>influence function</em>.
</p>

<h3>The Algorithm and The Results</h3>
<p>
It turns out that calculating the influence function exactly is <a href="http://en.wikipedia.org/wiki/NP-hard">NP-hard</a>, but there is a <a href="http://en.wikipedia.org/wiki/Greedy_algorithm">greedy algorithm</a> which approximates the value under certain (unrestrictive) conditions.
</p>

<p>
If you want more details read the paper linked above or the related <a href="http://www.cs.cornell.edu/home/kleinber/icalp05-inf.pdf">Inﬂuential Nodes in a Diﬀusion Model for Social Networks</a> by Kempe, et al.
</p>

<p>
Using Monte Carlo methods Kempe, et al. simulated the diffusion process using this algorithm versus several of the heuristics I described above.  The results are fairly striking: their algorithm performs at least 18% beter than the best-performing heuristic (degree centrality) and 48% better than if the seed set were randomly selected.  I&#8217;ve embedded a graph of their results below.
</p>
<img src="http://20bits.com/wp-content/uploads/2009/05/kempe-graph.png" alt="kempe-graph" title="kempe-graph" width="429" height="334" class="alignnone math size-full wp-image-613" />
<p>
The &#8220;target set&#8221; is the initial seed set of users to notify, and the &#8220;active set&#8221; is the final set of users who actually adopted the new feature or product.  The more users who adopt the feature the better the strategy.
</p>

<h3>Feasibility</h3>
<p>
Kempe&#8217;s algorithm is more feasible than many of the heuristics discussed above, although the best performing heuristic &mdash; degree centrality &mdash; is also the easiest to calculate.  He also doesn&#8217;t include eigenvalue centrality in his analysis, which I&#8217;d be interested in comparing.
</p>

<p>
The biggest downside to his algorithm is that it requires both full knowledge of the underlying graph and an accounting of all the user-to-user transmission probabilities.  Modeling these probabilities would require a lot of data about users over an extended period of time.
</p>

<p>
Whether the additional 18% is worth the extra computation and data collection depends on a lot on specific circumstances, but personally I&#8217;m going to try to implement it in my projects and see how the performance compares first-hand.
</p><ol class="footnotes"><li id="footnote_0_599" class="footnote">&#8220;Better&#8221; according to what?  As we&#8217;ll see, randomly selecting seed users performs worse than all the other heuristics.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/notification-strategies-for-social-networks/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Why hi5 Might Have an Edge on Facebook</title>
		<link>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=why-hi5-might-have-an-edge-on-facebook</link>
		<comments>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 17:50:26 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hi5]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[opinion]]></category>
		<category><![CDATA[social-networking]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[virtual goods]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=579</guid>
		<description><![CDATA[Facebook has been trying hard to find a business model. Their Beacon advertising product is probably the most infamous example. So far they&#8217;ve been left empty handed and have been forced to look outside the company for money, first from Microsoft1 and then from foreign investors.2 If Facebook wants to be the internet&#8217;s cable company [...]]]></description>
			<content:encoded><![CDATA[<p>
Facebook has been trying hard to find a business model.  Their <a href="http://en.wikipedia.org/wiki/Beacon_(Facebook)">Beacon</a> advertising product is probably the most infamous example.  So far they&#8217;ve been left empty handed and have been forced to look outside the company for money, first from Microsoft<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_0_579" id="identifier_0_579" class="footnote-link footnote-identifier-link" title="Microsoft acquires equity stake in Facebook, expands ad partnership (cnet)">1</a></sup> and then from foreign investors.<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_1_579" id="identifier_1_579" class="footnote-link footnote-identifier-link" title="Update On Facebook&amp;#8217;s Dubai Fundraising Trip (Business Insider)">2</a></sup>
</p>

<p>
If Facebook wants to be <a href="http://news.cnet.com/8301-10784_3-9946606-7.html">the internet&#8217;s cable company</a> what are they going to have to do to turn themselves into a <a href="http://www.google.com/finance?q=NASDAQ%3ACMCSA">$40Bn</a> company?
</p>

<h3>Are Virtual Goods the Key?</h3>
<p>
Not all social networks are struggling to find a great business model.  Tencent, a Chinese social networking company, pulled in over $1Bn in revenue last year, primarily through its use of virtual goods.<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_2_579" id="identifier_2_579" class="footnote-link footnote-identifier-link" title="The world&rsquo;s most lucrative social network? China&rsquo;s Tencent beats $1 billion revenue mark. (VentureBeat)">3</a></sup>
</p>

<p>
But Facebook doesn&#8217;t need to look overseas to see that virtual goods could work for them.  Most of the top Facebook games use virtual currency to make money, powered by leadgen-based ad networks like <a href="http://offerpal.com">Offerpal</a> and <a href="http://getgambit.com">Gambit</a>.  There are reports that some of these apps are pulling in eight figures per year.<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_3_579" id="identifier_3_579" class="footnote-link footnote-identifier-link" title="Developer Analytics: Facebook game Mob Wars making $22,000 a day (VentureBear)">4</a></sup>
</p>

<p>
And of course there&#8217;s Facebook&#8217;s own gifting service, which has recently moved to a virtual currency system, pricing gifts in &#8220;points&#8221; that can be bought with real money.<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_4_579" id="identifier_4_579" class="footnote-link footnote-identifier-link" title="Gift Shop Credits Have Arrived (Facebook)">5</a></sup>
</p>

<p>
All of this is to say that it appears that virtual goods are a natural business model for social networks and Facebook has enough data to see that.  Why isn&#8217;t Facebook pursuing this strategy more aggressively?  Why do they seem dead-set on building advertising technologies like Social Ads and Beacon?
</p>

<h3>The US Advertising Crutch</h3>
<p>
In the world of advertising not all countries are equal.  US traffic is generally valued the highest, followed by other English-speaking countries, the <a href="http://en.wikipedia.org/wiki/G20_industrial_nations">G20</a>
, and finally the rest of the world.
</p>

<p>
Until recently Facebook was concentrated in the English-speaking world.  It&#8217;s the second largest social network in the US, after MySpace, and the largest in both Canada and the UK.
</p>

<p>
Unlike other social networks which don&#8217;t have a significant presence in the English-speaking world, Facebook can support itself through advertising.  This is a crutch that prevents Facebook making bold decisions with their business model.  I believe Facebook sees themselves as the next Google, one piece of technology away from <a href="http://www.roughtype.com/archives/2007/11/the_social_graf_1.php">changing the world of advertising</a>.
</p>

<h3>The Demographic Crunch</h3>
<p>
Not all social networks have Facebook&#8217;s demographics, of course. hi5, the world&#8217;s third largest social network after MySpace and Facebook, has an extensive presence throughout Latin America and other countries which advertisers and publishers typically ignore.  The same can be said of the advertising market in China, but recall that Tencent pulled in $1Bn last year through virtual goods.
</p>

<p>
It&#8217;s little wonder, then, that hi5 is aggressively pursuing a virtual goods strategy.<sup><a href="http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#footnote_5_579" id="identifier_5_579" class="footnote-link footnote-identifier-link" title="Hi5&rsquo;s virtual entertainment plans could hit a virtual jackpot (VentureBeat)">6</a></sup>  Their demographics makes this strategy much more appealing.  Facebook has the money and the audience to waste pursuing a pure-advertising strategy for social networks.
</p>

<p>
What once seemed like a demographic disadvantage might turn out to be a demographic advantage for hi5.  Will they beat Facebook to the business model punch?
</p>
<p>
And a year from now will we be reading articles about Facebook&#8217;s virtual goods strategy compares to hi5&#8242;s, as opposed to articles about how Facebook&#8217;s new homepage compares to Twitter?
</p>

<h3> You&#8217;re crazy.  You know that, right?</h3>

<p>
Obviously hi5 has an uphill battle.  Facebook is growing on the order of 500,000 new users <em>per day</em> and shows no signs of slowing.  But the same was said of MySpace and Friendster when Facebook launched.  I think we still have a few more twists in the story of social networking on the web, and this is just one possible twist among many.
</p><ol class="footnotes"><li id="footnote_0_579" class="footnote"><a href="http://news.cnet.com/8301-13577_3-9803872-36.html">Microsoft acquires equity stake in Facebook, expands ad partnership</a> (cnet)</li><li id="footnote_1_579" class="footnote"><a href="http://www.businessinsider.com/2008/11/update-on-facebook-s-dubai-fundraising-trip">Update On Facebook&#8217;s Dubai Fundraising Trip</a> (Business Insider)</li><li id="footnote_2_579" class="footnote"><a href="http://venturebeat.com/2009/03/19/the-worlds-most-lucrative-social-network-chinas-tencent-beats-1-billion-revenue-mark/">The world’s most lucrative social network? China’s Tencent beats $1 billion revenue mark.</a> (VentureBeat)</li><li id="footnote_3_579" class="footnote"><a href="http://venturebeat.com/2008/08/25/developer-analytics-facebook-game-mob-wars-making-22000-a-day/">Developer Analytics: Facebook game Mob Wars making $22,000 a day</a> (VentureBear)</li><li id="footnote_4_579" class="footnote"><a href="http://blog.facebook.com/blog.php?post=36577782130">Gift Shop Credits Have Arrived</a> (Facebook)</li><li id="footnote_5_579" class="footnote"><a href="http://venturebeat.com/2009/01/22/hi5s-virtual-entertainment-plans-could-hit-a-virtual-jackpot/">Hi5’s virtual entertainment plans could hit a virtual jackpot</a> (VentureBeat)</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Behavior Adoption on Social Networks</title>
		<link>http://20bits.com/articles/behavior-adoption-on-social-networks/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=behavior-adoption-on-social-networks</link>
		<comments>http://20bits.com/articles/behavior-adoption-on-social-networks/#comments</comments>
		<pubDate>Fri, 24 Apr 2009 16:20:12 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[graph-theory]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[obama]]></category>
		<category><![CDATA[social-networking]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[viral growth]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=548</guid>
		<description><![CDATA[Why and how do people adopt new behaviors? Why do they start using new products? Did you sign up for Facebook because all of your friends were on it, or because a specific friend recommended it to you? Or do you refuse to sign up at all? In this article I&#8217;m going to outline two [...]]]></description>
			<content:encoded><![CDATA[<p>
Why and how do people adopt new behaviors?  Why do they start using new products?  Did you sign up for Facebook because all of your friends were on it, or because a specific friend recommended it to you?  Or do you refuse to sign up at all?
</p>

<p>
In this article I&#8217;m going to outline two models that describe how new behaviors, ideas, and messages propagate through social networks.
</p>

<h3>The Threshold Model</h3>
<p>
The first model is called the Threshold Model.<sup><a href="http://20bits.com/articles/behavior-adoption-on-social-networks/#footnote_0_548" id="identifier_0_548" class="footnote-link footnote-identifier-link" title="See Threshold Models of Collective Behavior (1978) by the famous sociologist Mark Granovetter. ">1</a></sup>  It says that people adopt a new behavior bceause a sufficiently large proportion of their friends have adopted that behavior.  Early adopters have a very low threshold, say 5% or 10%, while late adopters would have a much higher threshold. Every person, however, has their own individual threshold.
</p>

<p>
For example, my girlfriend&#8217;s stated reason for signing up for Twitter was that &#8220;all my friends were using it.&#8221;  And during the 2008 US Presidential election, some Obama supporters would adopt Hussein as their middle name.<sup><a href="http://20bits.com/articles/behavior-adoption-on-social-networks/#footnote_1_548" id="identifier_1_548" class="footnote-link footnote-identifier-link" title="See Obama Supporters Adopting Middle Name &amp;#8220;Hussein&amp;#8221; As Their Own">2</a></sup>  When I saw that lots of my friends were doing it I was certainly tempted to do the same.
</p>

<p>
The underlying psychological principle is one of &#8220;missing out&#8221; or &#8220;when in Rome.&#8221;  The key variable here is the initial distribution of thresholds across a social network, which describes in totality the final extent of the behavior.
</p>

<p>
It&#8217;s worth noting that this model says nothing about how people <em>initially</em> adopt behavior.  That is, it says nothing about innovators, only about the spread of innovation through a social network.
</p>

<h3>The Cascade Model</h3>
<p>
The second model is called the Cascade or Word-of-Mouth Model<sup><a href="http://20bits.com/articles/behavior-adoption-on-social-networks/#footnote_2_548" id="identifier_2_548" class="footnote-link footnote-identifier-link" title="See Talk of the Network: A Complex Systems Look at the 
Underlying Process of Word-of-Mouth (2001) by Goldenburg, Libari, and Muller.">3</a></sup>, and is the method of &#8220;viral growth&#8221; that most <a href="http://20bits.com/articles/social-applications-are-social-networks/">social application developers</a> are familiar with.  It says that every person has a chance of adopting a new behavior whenever one of their neighbors adopts it.
</p>

<p>
This model describes phenomena like product recommendations or user-to-user notifications on Facebook.  The probability that a person adopts the new behavior is the conversion rate for the notification.<sup><a href="http://20bits.com/articles/behavior-adoption-on-social-networks/#footnote_3_548" id="identifier_3_548" class="footnote-link footnote-identifier-link" title="More accurately, we&amp;#8217;d model the &amp;#8220;probability&amp;#8221; as a random variable whose mean was the conversion rate.">4</a></sup>
</p>

<p>
This probability is both a function of the sender and the recipient, so more influential people are more likely to convince you to adopt a behavior (or purchase a product, or install an application).
</p>

<h3>Practical Implications</h3>
<p>
Both of these models describe facets of real-world interaction on social networks.  My take is that the cascade model is more accurate at the beginning of a social network&#8217;s life, where behavior is spreading through sparse areas, connected by influencers.  Later on, after a critical density has settled in, people start adopting the behavior because everyone else is adopting it and there&#8217;s a social cost to not doing the same.
</p>

<p>
We see this pattern in services like Facebook and MySpace, both of which got their start by harvesting emails and spreading through word-of-mouth (and spam) across a social network.<sup><a href="http://20bits.com/articles/behavior-adoption-on-social-networks/#footnote_4_548" id="identifier_4_548" class="footnote-link footnote-identifier-link" title="See Stealing MySpace: The Battle to Control the Most Popular Website in America for details about the MySpace team&amp;#8217;s background in direct marketing.  The ConnectU vs. Facebook court documents, which you can find via Google, paint a similar story for Facebook&amp;#8217;s early years.">5</a></sup> Eventually each network reached a point where a sufficient number of people were familiar with the product and new users adopted it not because their friends recommended it (the cascade model), but because there was a social expectation that they do (the threshold model).
</p>

<p>
Also, with respect to analytics and viral growth, the threshold model is more difficult to track.  In the cascade model we record who sent what to whom and which messages they responded to.  It&#8217;s clear who gets credit for a user&#8217;s conversion.  In the threshold model you have to track passive exposures, and there&#8217;s no clear causal relationship. 
</p>

<p>
If ten of my friends are doing something and I decide to start doing the same thing, who gets credit?  Most analytics packages will show this behavior as a direct visit, with no connection to other users&#8217; behavior, even though there is a viral process underlying it.
</p>

<p>
In short, the threshold model requires a certain level of behavioral density, while the cascade model doesn&#8217;t.  However, we see both models expressed in how people actually adopt new behaviors in social contexts.
</p>

<h3>Formalisms</h3>
<p>
In the threshold model every person <em>u</em> has a threshold
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=T_u%20%5Cin%20%5B0%2C1%5D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='T_u \in [0,1]' title='T_u \in [0,1]' class='latex' /></div>
<p>
and each of their neighbors <em>v</em> is weighted according to
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=w_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='w_{u,v}' title='w_{u,v}' class='latex' /></div>
<p>
If
</p>
<div class="math"<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BT_u%20%3C%20%5Csum_%7Bv%20%5Cin%20%5Ctext%7Badopters%7D%7D%20w_%7Bu%2Cv%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{T_u &lt; \sum_{v \in \text{adopters}} w_{u,v}}' title='\displaystyle{T_u &lt; \sum_{v \in \text{adopters}} w_{u,v}}' class='latex' /></div>
<p>
then the person <em>u</em> adopts the behavior.
</p>

<p>
The set of thresholds, weights, and initial adopters completely determines the extent of the behavior in the social network.
</p>

<p>
In the cascade model, for every person <em>u</em> and neighbor <em>v</em> there is a random variable
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=X_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='X_{u,v}' title='X_{u,v}' class='latex' /></div>
<p>
which describes the likelihood of <em>u</em> adopting the behavior if <em>v</em> has adopted it.
</p>

<h3>Takeaways</h3>
<p>
I&#8217;ll try to boil all this down into a few, practical takeaways.
</p>
<ol>
	<li>The Threshold and Cascade Models describe two mechanisms of behavior adoption in social networks.</li>
	<li>The Threshold Model says that people do something if enough of their friends are doing it.</li>
	<li>The Cascade Model says that people have a chance of doing something if one of their friends is doing it.</li>
	<li>Both models correspond to different real-life adoption patterns.</li>
	<li>The typical &#8220;viral loop&#8221; involves the cascade model, but most successful social networks rely on the mechanics of the threshold model in the long run, i.e., density is important for long-term success.</li>
	<li>The cascade model is a good tool for analyzing acquisition scenarios, but the threshold model is probably more helpful for understanding retention and engagement &mdash; it at least implies that <em>density</em> is a key factor in social network growth, a metric that&#8217;s not often discussed publicly.</li>
</ol> 

<p>
Agree?  Disagree?  Leave a comment, send me an email, or <a href="http://twitter.com/jessefarmer">follow me on Twitter</a>!
</p><ol class="footnotes"><li id="footnote_0_548" class="footnote"><footnote>See <a href="http://rumordynamics.awardspace.com/phfs/Threshold_Models_of_Collective_Behavior.pdf">Threshold Models of Collective Behavior</a> (1978) by the famous sociologist Mark Granovetter. </li><li id="footnote_1_548" class="footnote">See <a href="http://www.huffingtonpost.com/2008/06/28/obama-supporters-adopting_n_109788.html">Obama Supporters Adopting Middle Name &#8220;Hussein&#8221; As Their Own</a></li><li id="footnote_2_548" class="footnote">See <a href="http://pluto.huji.ac.il/~msgolden/home_page/pdf/TalkofNetworks.pdf">Talk of the Network: A Complex Systems Look at the 
Underlying Process of Word-of-Mouth</a> (2001) by Goldenburg, Libari, and Muller.</li><li id="footnote_3_548" class="footnote">More accurately, we&#8217;d model the &#8220;probability&#8221; as a random variable whose mean was the conversion rate.</li><li id="footnote_4_548" class="footnote">See <a href="http://www.amazon.com/Stealing-MySpace-Control-Popular-Website/dp/1400066948">Stealing MySpace: The Battle to Control the Most Popular Website in America</a> for details about the MySpace team&#8217;s background in direct marketing.  The ConnectU vs. Facebook court documents, which you can find via Google, paint a similar story for Facebook&#8217;s early years.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/behavior-adoption-on-social-networks/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Almost Viral: A Hybrid Acquisition Strategy</title>
		<link>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=almost-viral-a-hybrid-acquisition-strategy</link>
		<comments>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/#comments</comments>
		<pubDate>Wed, 15 Apr 2009 16:00:54 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[arpu]]></category>
		<category><![CDATA[data-driven-development]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[viral]]></category>
		<category><![CDATA[viral coefficient]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=520</guid>
		<description><![CDATA[Two common acquisition strategies for a new application are a paid acquisition strategy and a viral acquisition strategy. The former involves acquiring users at a cost less than the revenue they generate. The latter involves users inviting their friends to the application. &#8220;Going viral&#8221; has become a sort of holy grail and most people would [...]]]></description>
			<content:encoded><![CDATA[<p>
Two common acquisition strategies for a new application are a paid acquisition strategy and a viral acquisition strategy.  The former involves acquiring users at a cost less than the revenue they generate.  The latter involves users inviting their friends to the application.
</p>

<p>
&#8220;Going viral&#8221; has become a sort of holy grail and most people would say they&#8217;d rather have a viral application than not, but it has a distinct downside: uncontrolled growth and ever-increasing operational costs.  By being almost-but-not-quite viral you can dramatically reduce your cost of acquisition without setting your servers on fire.
</p>

<h3>The Paid Strategy</h3>
<p>
The key variables for a paid strategy are cost of acquisition and average revenue per user, or ARPU.  If your ARPU is greater than your cost of acquisition then you can buy as many users as your budget allows, reinvesting the new revenue into acquiring new users.  Generally this is done by advertising your product through something like AdWords or Facebook&#8217;s Social Ads.
</p>

<p>
The best thing about a paid strategy is its relative simplicity.  If your ARPU is $2.00 then you can run as many ads you want so long as the cost of acquisition is less than $2.00 and still be profitable.
</p>

<p>
Eventually, though, you will run out of ad inventory.  There are only so many publishers who are willing to accept $0.10 per click.  At this point your only options are to decrease your cost of acquisition<sup><a href="http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/#footnote_0_520" id="identifier_0_520" class="footnote-link footnote-identifier-link" title="If you increase the conversion rates for your ads then you can pay more per click to get more impressions without hurting your bottom line &amp;mdash; you pay the same to acquire a user, but get more users through the door.">1</a></sup> or increase your ARPU.
</p>

<h3>The Viral Strategy</h3>
<p>
The viral acquisition strategy requires you get current users to invite other users to the application.  The two key variables for the viral strategy are the average number of invites each new users sends and the rate at which those invites convert into new users.
</p>

<p>
The ratio of converting invites to new users is your viral coefficient, k.  If this is greater than one you will see self-sustaining, viral growth.<sup><a href="http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/#footnote_1_520" id="identifier_1_520" class="footnote-link footnote-identifier-link" title="See my article Three Myths of Viral Growth for more information about viral growth.">2</a></sup>  If the coefficient is less than one each user will bring in a fixed number of new users, but the application&#8217;s growth is still linear.
</p>

<p>
People who have decided on a viral acquisition strategy focus on this number obsessively.  It&#8217;s the first big hurdle to clear and if you haven&#8217;t had experience engineering a viral application it can take months to build something viral.
</p>

<p>
But being viral isn&#8217;t an either-or proposition.  Increasing your viral coefficient from 0.5 to 0.8 has other advantages, especially if you integrate it with a paid strategy.  Let&#8217;s see how.
</p>

<h3>A Hybrid Strategy</h3>
<p>
Say you&#8217;re building a game on Facebook backed by a virtual currency.  Users give you money or fill out offers to get coins, so you have a positive ARPU.  This means you&#8217;re free to pursue a paid acquisition strategy.
</p>

<p>
On the other hand, you&#8217;re on Facebook, which has many viral hooks.  Many of the technical hurdles are much smaller there, so it becomes more a question of design and optimization rather than implementation.  At the very least you encourage players to invite their friends.
</p>

<p>
The first version of your application has a viral coefficient of k=0.5, that is, every new user who joins brings in 0.5 new users.  Equivalently, for every two users you acquire you get one free.
</p>

<p>
That&#8217;s interesting, especially if you&#8217;re also <em>paying</em> for users.  If you&#8217;re paying $1.50 per user then you paid $3.00 to get two users, but acquired a third user for free!  This means that you effectively paid $1.00 per user: $3.00 paid / 3 users. 
</p>

<p>
This process is actually geometric, however.  If you purchased 4 users with a viral coefficient of 0.5, you&#8217;d first get 2 new users for free.  These 2 new users would then bring in 1 additional user, for a total of 3 new users, reducing your cost of acquisition even further.  This is an infinite geometric series, which I&#8217;ll outline below.
</p>

<p>
<strong>Having a non-zero viral coefficient reduces your effective cost of acquisition.</strong>
</p>
<p>
This means that you get more users for every dollar you spend on ads.  Or, if you&#8217;ve run out of inventory, this means you can now spend more per user and retain the same net revenue level.
</p>

<p>
Let&#8217;s figure out how to calculate your <em>effective</em> cost per acquisition.
</p>

<h3>Effective Cost of Acquisition</h3>
<p>
If you just want the formula, here it is:
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=C%27%20%3D%20C%281-k%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='C&#039; = C(1-k)' title='C&#039; = C(1-k)' class='latex' /></div>
<p>
Where C is your cost of acquisition, k is your viral coefficient, and C&#8217; is your effective cost of acquisition.
</p>
<p>
If k=0, i.e., you have no viral acquisition, then C&#8217; = C.
</p>
<p>
If k = 1 and your application is viral then C&#8217; = 0 and your application grows without spending any additional money.  But rather than being an either-or proposition &mdash; you&#8217;re either viral or you&#8217;re not &mdash; there&#8217;s a sliding scale.  The more viral you are the cheaper it is to acquire users.
</p>

<h3>The Benefits of Being Almost Viral</h3>
<p>
From the above formula, if you have a viral coefficient of 0.90 then you have reduced your cost of acquisition by 90%.  This is a great situation to be in.  You might ask, &#8220;Why wouldn&#8217;t I want go the last 0.10 and make my application viral?&#8221;
</p>

<p>
The one benefit of being viral is huge growth, which looks sexy on a graph and can tip an investor to your side if you&#8217;re looking for outside money, but the growth is unpredictable.  Not only do you have little control over what demographics come to dominate your application, but sometimes it grows so quickly that you run into operational problems (servers on fire, etc.).
</p>

<p>
By being almost viral you can grow very cheaply, control your rate of growth and demographics, and get enough traffic to conduct meaningful <a href="http://20bits.com/articles/scientific-product-development/">experiments</a>.  Need to grow more slowly?  Just decrease your daily ad spend.  Need statistically significant results more quickly?  Increase your daily ad spend.
</p>

<p>
Put another way, with a viral coefficient of 0.9 you&#8217;ve dealt with your acquisition risk.  Rather than going fully viral and dealing with the operational difficulties, it might be worth your time to deal with other market risks: retention, engagement, and monetization.
</p>

<p>
So, stop sweating about &#8220;being viral.&#8221;  Sometimes it&#8217;s better to be almost viral.
</p>

<h3>Deriving the Formula for Effective Cost of Acquisition</h3>

<p>
You can skip this and go right to the comments if you&#8217;re not interested in the math.
</p>

<p>
We have an application with a viral coefficient of k = 0.5.  Every new user who joins brings in 0.5 new users.  Another way of thinking of it is that for every new user who joins there is a 50% probability that he will bring in another user.
</p>

<p>
But this potential user also has a 50% chance of bringing in a new user, so the expected number of users is now 1 + 0.5 + 0.5*0.5.  This continues <em>ad infinitum</em>.
</p>

<p>
Formally, if we acquire one user and have a viral coefficient of k then the number of users we expect to join is N(k), given by the formula
</p>

<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BN%28k%29%20%3D%201%20%2B%20k%20%2B%20k%5E2%20%2B%20k%5E3%20%2B%20%5Ccdots%20%3D%20%5Csum_%7Bi%20%3D%200%7D%5E%7B%5Cinfty%7D%20k%5Ei%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{N(k) = 1 + k + k^2 + k^3 + \cdots = \sum_{i = 0}^{\infty} k^i}' title='\displaystyle{N(k) = 1 + k + k^2 + k^3 + \cdots = \sum_{i = 0}^{\infty} k^i}' class='latex' />
</div>

<p>
The initial 1 is our original user, whom we paid for, and each k represents the expected number of users from each step in the viral process.
</p>

<p>
This is a <a href="http://en.wikipedia.org/wiki/Geometric_series">geometric series</a>, so we know that
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BN%28k%29%20%3D%20%5Cfrac%7B1%7D%7B1-k%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{N(k) = \frac{1}{1-k}}' title='\displaystyle{N(k) = \frac{1}{1-k}}' class='latex' />
</div>

<p>
Therefore, our effective cost of acquisition is
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BC%27%20%3D%20%5Cfrac%7BC%7D%7BN%28k%29%7D%20%3D%20%5Cfrac%7BC%7D%7B%5Cfrac%7B1%7D%7B1-k%7D%7D%20%3D%20C%281-k%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle{C&#039; = \frac{C}{N(k)} = \frac{C}{\frac{1}{1-k}} = C(1-k)}' title='\displaystyle{C&#039; = \frac{C}{N(k)} = \frac{C}{\frac{1}{1-k}} = C(1-k)}' class='latex' />
</div><ol class="footnotes"><li id="footnote_0_520" class="footnote">If you increase the conversion rates for your ads then you can pay more per click to get more impressions without hurting your bottom line &mdash; you pay the same to acquire a user, but get more users through the door.</li><li id="footnote_1_520" class="footnote">See my article <a href="http://20bits.com/articles/three-myths-of-viral-growth/">Three Myths of Viral Growth</a> for more information about viral growth.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>

