<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>20bits</title>
	<atom:link href="http://20bits.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://20bits.com</link>
	<description>Driven by Data</description>
	<lastBuildDate>Wed, 07 Oct 2009 06:07:48 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Speed vs. Certainty in A/B Testing</title>
		<link>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/</link>
		<comments>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/#comments</comments>
		<pubDate>Mon, 25 May 2009 17:00:18 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[ab testing]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[tactics]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=717</guid>
		<description><![CDATA[
A/B testing is a great tactical tool for studying customer behavior on the web.  But like any randomized trial there&#8217;s some chance that the improvement we measure is just statistical noise.



How worried should we be that the feature we thought improved our product actually does nothing, or worse, hurts our bottom line?  How [...]]]></description>
			<content:encoded><![CDATA[<p>
<a href="http://20bits.com/articles/an-introduction-to-ab-testing/">A/B testing</a> is a great tactical tool for studying customer behavior on the web.  But like any randomized trial there&#8217;s some chance that the improvement we measure is just statistical noise.
</p>

<p>
How worried should we be that the feature we thought improved our product actually does nothing, or worse, hurts our bottom line?  How can we ever really know that we&#8217;re making the correct decision?  And is it better to run tests more quickly or more accurately?
</p>

<p>
The answers to these questions depend on the cost of a bad decision.  If mistakes are cheap then it&#8217;s better to make 1,000 decisions and get only 60% of them right than to make 100 decisions and get 100% of them right.
</p>

<p>
One way to achieve this balance in the context of A/B testing is to tune the confidence level.
</p>

<h3>Tuning the Confidence Level</h3>
<p>
Intuitively, the confidence level of an A/B test tells you how certain you can be of the result of the A/B test.  For example, a confidence level of 95% means that there&#8217;s a 5% chance that a statistically significant result is actually random variation, i.e., there is a 5% chance of a false positive.
</p>

<p>
Of course, we&#8217;re free to choose some other confidence level besides 95%.  We could choose 80%, 90%, or 99.999%.  A higher confidence level requires more data before reaching statistical significance, but we will be more certain of the result.
</p>

<p>
If you&#8217;re not comfortable with the nuts and bolts of statistical analysis, confidence levels, and A/B testing I recommend reading my article about <a href="http://20bits.com/articles/statistical-analysis-and-ab-testing/">statistical analysis and A/B testing</a>, which explains exactly how one &#8220;chooses&#8221; a confidence level.
</p>

<p>
In short, the confidence level acts as a dial between speed and certainty, and we&#8217;re free to choose where to set that dial depending on the priorities of our business or product.
</p>

<h3>Speed vs. Certainty</h3>
<p>
So where on the speed-certainty spectrum should you, as a product manager or startup entrepreneur, sit?
</p>

<p>
Mike Cassidy has a great presentation where he argues that <a href="http://www.slideshare.net/dmc500hats/best-strategy-is-speed-startup2startup-may-2008">speed is the primary business startegy</a> for startups.
</p>

<p>
Why is speed great for startups?  Because mistakes are cheap and calculated risks are rewarded.  Most product decisions can be undone, and important early tests can be redone at a higher confidence level when the product has more traction.
</p>

<p>
But mistakes aren&#8217;t always cheap.  Here are some factors that increase the cost of a mistake.
</p>

<h4>Volume</h4>
<p>
Volume is leverage.  If you have millions of customers, like Google or Amazon, a 1% improvement to the bottom line is a huge win.  Conversely, a 1% mistake is a huge hit. 
</p>

<p>
Fortunately this problem helps mitigate itself.  Increased volume affords you the luxury of running A/B tests at a higher confidence level in the same amount of time.
</p>

<h4>Reversibility</h4>
<p>
Most product decisions in a consumer technology startup can be undone, for a price.  For example, it&#8217;s easy to undo a bad decision for a web-based product, slightly harder to undo a decision for desktop software, and very difficult (and costly) to undo a decision for a physical product.
</p>

<p>
The less reversible a decision is the more certain you should be before you make it.  In the context of A/B testing a product feature this means a higher confidence level, even if it takes longer to run the test.
</p>

<h4>Real Money</h4>
<p>
Imagine you&#8217;re an ad network.  You&#8217;re constantly A/B testing formatting, positioning, offers, etc. to see which performs best.  Making a mistake in this regard costs your publishers money.
</p>

<p>
Like volume, money creates leverage.  But it is more complicated than that: publishers don&#8217;t just want increased revenues, they want reliable cash flow.  That is, when money is involved, not only do you have to perform better but you have to perform more consistently because of phenomena like the <a href="http://en.wikipedia.org/wiki/Peak-end_rule">peak-end rule</a>.
</p>

<p>
In this case a &#8220;three steps forward one step back&#8221; strategy might actually be worse than going step-by-step in the right direction, even if the former averages out to better performance.
</p>

<h3>Conclusions</h3>
<p>
Maintaining momentum in a startup isn&#8217;t about making <em>only</em> correct decisions &mdash; it&#8217;s about making <em>enough</em> correct decisions.  This presents a continuum from speed to certainty.  At one extreme you run the business with a magic eight-ball; at the other you agonize over every detail until you&#8217;re 100% certain that you&#8217;ve made the correct choice.
</p>

<p>
This thought process extends naturally to A/B testing where the idea of &#8220;certainty&#8221; and &#8220;cost&#8221; can be quantified.  To recap:
</p>

<ul>
	<li>A/B testing is a great tactical tool for testing <em>specific</em> hypotheses about your customers.</li>
	<li>However, there is a tradeoff between speed and certainty, controlled by the confidence level of the A/B test.</li>
	<li>The cost of doing A/B tests quickly is that you will make more wrong decisions, but that is ok if mistakes are cheap.</li>
	<li>For example, it&#8217;s better to make 1,000 decisions and get only 60% of them right than to make 100 decisions and get 100% of them right, all else being equal.</li>
</ul>

<h3>A Spreadsheet Model</h3>
<p>
Below is a little spreadsheet model that illustrates all my points above.
</p>

<p>
The two independent variables are the gain from a good decision and the cost of a bad decision.  The spreadsheet assumes a fixed time period, so a higher confidence level means more certainty but <strong>fewer tests</strong>.  The ideal confidence level is highlighted as you change the parameters of the model.
</p>

<p>
You can download <a href="/downloads/confidence-model.xls">the A/B testing confidence model</a> here.
</p>

<p>
For the statistically inclined this model assumes that traffic increases linearly over time, that the sample statistic is normally distributed, and that a one-tailed t-test is the appropriate statistical test.
</p>

<h3>Credits</h3>
<p>
This article was inspired by a conversation with <a href="http://startup-marketing.com/">Sean Ellis</a> and edited by my <a href="http://twitter.com/aleeeex">wonderful girlfriend</a>, who is probably going to yell at me for linking to her Twitter account.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/speed-vs-certainty-in-ab-testing/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>8 Tips for Crafting Metrics That Matter</title>
		<link>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/</link>
		<comments>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/#comments</comments>
		<pubDate>Tue, 12 May 2009 17:00:31 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data-driven-development]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[product development]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=661</guid>
		<description><![CDATA[
Metrics are the marketer&#8217;s microscope.  They show him what his customers are actually doing, as opposed to what they say they are doing or intend to do. 
With proper metrics he can make decisions faster and more accurately.



You can decide to measure anything, but what metrics matter and what ones are just for show? [...]]]></description>
			<content:encoded><![CDATA[<p>
Metrics are the marketer&#8217;s microscope.  They show him what his customers are actually doing, as opposed to what they say they are doing or intend to do. 
With proper metrics he can make decisions faster and more accurately.
</p>

<p>
You can decide to measure anything, but what metrics matter and what ones are just for show?  Here are some rules I hope will guide you toward creating meaningful metrics that help, rather than hinder, the decision-making process.
</p>


<h3>Be Actionable</h3>
<p>
If I had to give a one-sentence answer to the question &#8220;What metrics should I implement for my product?&#8221; it would be &#8220;Whatever metrics are actionable.&#8221;  This means the line from a question to a metric and the line from a metric to an action should be as short as possible.
</p>

<p>
Most of the tips below are meant to focus attention on this issue.  What can you do to make sure your metrics are actionable?
</p>

<h3>Be Understandable and Trustworthy</h3>
<p>
Do you understand what your metric measures?  Does everyone in your organization also understand and do they trust it?
</p>

<p>
Trust is the important part.  Everyone has to trust the metric if you&#8217;re going to use it to make decisions, otherwise you&#8217;ll be getting constant pushback.  This will slow the decision-making process and cause a lot of ill-tempered arguments.
</p>

<h3>Measure Results</h3>
<p>
Does your metric measure customer behavior or a correlate of customer behavior?  In the past approximations and correlations were necessary because measuring behavior directly was hard, but on the web there&#8217;s no excuse &mdash; you have access to every single thing a person does on your site, down to where their mouse is hovering and for how long.
</p>

<p>
For example, if you want to know how good Twitter is for your business don&#8217;t measure the number of positive tweets about your company.  Instead, measure how many customers it drives to your product and how much money those customers put in your hands.
</p>

<h3>Understand the Downside</h3>
<p>
What would you do if your metric were 50% off the mark today?  Would you be able to articulate why this is a problem for your business?  What it costs you?  Would you know where to start looking for possible causes?
</p>

<p>
As an example, I&#8217;ve worked with a startup that used &#8220;number of MySpace friends&#8221; as a go-to metric in every marketing meeting.  Is that really material to the business? 
</p>

<p>
What would happen if tomorrow we had half as many MySpace friends? Would we lose $100?  $10,000?
</p>

<p>
This number says nothing about the value of MySpace as a marketing channel, which in the most charitable case is what it is trying to approximate, and the downside is completely ambiguous.  Is the number of MySpace friends tied to anything meaningful?
</p>

<p>
Like the Twitter example above, if I thought MySpace were an important marketing channel for my product I&#8217;d be measuring things like the number of qualified leads from MySpace and the value they generate for the business.
</p>

<h3>Understand the Upside</h3>
<p>
Conversely, ask yourself, &#8220;What value does improving the metric bring to the company?&#8221;  Some metrics are blindingly obvious in this regard, e.g., top-line revenue numbers and some &#8220;efficiency&#8221; metrics like effective CPM and revenue per user. 
</p>

<p>
Some metrics are less obvious.  What about page views?  Can you imagine a scenario where your pageviews double but the net effect was bad for your business?
</p>

<p>
Metrics like these are dangerous because they lull you into a false sense of security.  Everything on your analytics dashboard is going up and to the right, but the fundamentals of the business might still be suffering.
</p>

<h3>Don&#8217;t Be Ambiguous</h3>
<p>
Does your metric measure one thing and one thing only?  Or is it really an aggregation of several variables, each of which can rise or fall independently?
</p>

<p>
A good example of an ambiguous metric is the notion of a &#8220;daily active user,&#8221; or the total number of people who interacted with your product today.  This number is ambiguous because it is really the sum of two metrics: the number of new users and the number of returning users.
</p>

<p>
Ambiguous metrics are bad because they obscure the underlying variables that truly reflect customer behavior and delay the decision-making process as you are forced to determine which of these variables is actually affecting the aggregate metric.  Furthermore, it&#8217;s possible that one variable accounts for all the growth in the metric, e.g., you have millions of new users per day but no returning users.
</p>

<p>
This latter scenario has been the death of many Facebook apps.
</p>

<h3>Segment by Purpose</h3>
<p>
Whenever I&#8217;m building a web product I divide usage into key segments.  Generally these are acquisition, retention, engagement, and monetization.  That is, how do customers find my product?  Are those customers coming back?  Are they doing the things I want or need them to do?  And how much money are those customers making me?
</p>

<p>
See Dave McClure&#8217;s <a href="http://www.slideshare.net/dmc500hats/startup-metrics-for-pirates-long-version">Startup Marketing for Pirates</a> presentation, which focuses on this idea.
</p>

<p>
Doing this lets you tackle each segment independently and allows for a sharper product focus.  Early on you might want to focus on acquisition, or maybe you want to focus on monetization from day one.  Eventually you&#8217;ll starting caring about longer-term metrics like retention and you won&#8217;t be distracted or overwhelmed by unrelated metrics from different segments.
</p>

<p>
You might select one or two from each category to act as top-line variables in a dashboard that you look at every day, e.g., new users (by channel), returning users (by channel), activity, and revenue.
</p>

<h3>Appropriate Granularity</h3>
<p>
Sometimes you need a bird eye&#8217;s view and sometimes you need a tunneling electron microscope.  Know when you need which.
</p>

<p>
As a general rule of thumb I focus on the microscopic when I am designing specific optimizations but focus on the macroscopic when I&#8217;m determining whether the decisions we&#8217;re making are working.  Another way is to finish this sentence, &#8220;I know my product is healthy because&#8230;&#8221;
</p>

<p>
You&#8217;d never finish that sentence with &#8220;because the click-through-rate on my login page is 20%.&#8221;  You&#8217;d say something like &#8220;because 80% of my customers return every week&#8221; or &#8220;because our revenue is growing by 5% month-over-month.&#8221;
</p>

<h3>Conclusions</h3>
<p>
These are just tips, not hard-and-fast rules.  They are meant to focus the discussion of what metrics matter and why because the choice of metrics has a long-lasting impact if your team is committed to building a data-driven culture.
</p>

<p>
If you have any tips on deciding what metrics matter and why, leave them in the comments!  This list wasn&#8217;t meant to be exhaustive and I know (or hope) people will have some strong opinions!
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/8-tips-for-crafting-metrics-that-matter/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Building a Social Network, Island by Island</title>
		<link>http://20bits.com/articles/building-a-social-network-island-by-island/</link>
		<comments>http://20bits.com/articles/building-a-social-network-island-by-island/#comments</comments>
		<pubDate>Mon, 11 May 2009 15:30:36 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[density]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hi5]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[product development]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[xfire]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=649</guid>
		<description><![CDATA[
A necessary condition for building a self-sustaining social network is density.  We understand this intuitively.  After all, a network of one person is hardly a &#8220;network&#8221; at all.



Metcalfe&#8217;s Law, which states that the value of a network grows in proportion to the square of the number of users of that network, express this [...]]]></description>
			<content:encoded><![CDATA[<p>
A necessary condition for building a self-sustaining social network is density.  We understand this intuitively.  After all, a network of one person is hardly a &#8220;network&#8221; at all.
</p>

<p>
Metcalfe&#8217;s Law, which states that the value of a network grows in proportion to the square of the number of users of that network, express this idea formally.  The value of a social network rests in its ability to foster communication, in its connections.
</p>

<p>	
If you&#8217;re building a social network, whether it&#8217;s a destination website or an application that exists on another social network, density must figure into key strategic decisions.  Let&#8217;s see how.
</p>

<h3>Islands</h3>
<p>
One way to think about social networks is as a network of networks.  On Facebook, for example, if I graph the connections between all my friends I see distinct groups: my high school friends, my college friends, my current circle of friends, and my professional network.
</p>

<p>
Each group of people is more or less isolated from each other.  Density exists within each of these islands, but not between them.  I&#8217;m fairly certain this is a topological property of any social network.
</p>

<h3>Case Study: Facebook</h3>
<p>
Facebook started at Harvard and was initially college-only.  Their growth strategy was explicit from day one: move from school to school as demand warranted.  I&#8217;ve been told that Sean Parker wouldn&#8217;t consider opening up access to a new school until at least several dozen students from that school requested an account.
</p>

<p>
Each school was an island.  Once Facebook saturated a specific set of colleges it moved onto the next round.  Eventually there was enough anticipatory buzz that they could launch at large, state schools without risk of fading away.
</p>

<p>
They still pursue this strategy today.  After establishing a critical density among colleges they opened up access to high schools and then to everyone with an email address.  From there they started moving country-to-country.
</p>

<p>
The countries where Facebook is having the most difficulty gaining traction are the ones with already-established social networks, like Germany with <a href="http://en.wikipedia.org/wiki/StudiVZ">StudiVZ</a>.  In fact, if you look at <a href="http://gawker.com/tech/data-junkie/the-world-map-of-social-networks-273201.php">this old map</a> of the most popular social network in each country, you get an idea of how isolated this country-by-country growth really is.
</p>

<h3>Other Networks</h3>
<p>
Facebook isn&#8217;t the only example of a social network who grew this way.  hi5 has a similar story, starting with smaller markets overseas and spreading from country-to-country.  Or Craigslist, by starting small in San Francisco and eventually becoming a presence in most major US cities.
</p>

<p>
The MySpace team had a background in direct marketing, which is all about targeting specific offers at the people who are most likely to respond.  They started with the club scene in LA and grew from there.
</p>

<p>
The key to all these strategies was density.  
</p>

<p>
If you&#8217;re launching a new social service, even if your end goal is to have everyone and their mother using it, it&#8217;s important to understand the impact density has on the growth
</p>

<h3>Multiple Dimensions of Density</h3>
<p>
So far the only kind of density we&#8217;ve talked about is network density, i.e., multiple people connected through their shared use of a service.  You could call this &#8220;product density.&#8221;
</p>

<p>
Sometimes product density isn&#8217;t enough.  Take IM, for example, or any network that requires synchronous communication.  Not only do two people have to be using the same product but they have to be using it at the same time.  What good is your friend being on IM if you&#8217;re never awake at the same time?
</p>

<p>
<a href="http://en.wikipedia.org/wiki/Xfire">Xfire</a>, an IM client for gamers, is an example of a product that innovated in this space by tackling a segment of customers who were already interacting synchronously.
</p>

<p>
Mobile social networks take this to an even greater extreme.  To connect with people on Loopt or Google Latitude not only do we have to be using the same product at the same time, but we have to be in the same place!
</p>

<p>
This isn&#8217;t to say building these networks is impossible.  Rather, they come with an extra handicap in the form of reduced density.  Overcoming that problem has to be a key part of the product strategy.
</p>

<h3>Conclusion and Counterexamples?</h3>
<p>
Most product strategy discussions, in my experience, are focused on acquisition or other topline metrics that go &#8220;up and to the right.&#8221;  Instead, if you&#8217;re building social software, I believe density is a necessary condition for long-term success and needs to be a part of the strategy discussion from day one.
</p>

<p>
First, understand the density requirements for your product.  Do customers need to sign up for the same service?  Do they need to be using it at the same time?  Do they need to be in the same place?  Is there anything you can do to lower the density requirement?
</p>

<p>
Second, build a &#8220;depth first&#8221; strategy.  Are there any naturally dense customer segments that might fit your product?  Do you have the ability to target specific demographics or segments for acquisition?  Which ones respond positively and is it possible to build density there?
</p>

<p>
Once you&#8217;ve achieved sufficient density on one island hop to the next and repeat.
</p>

<p>
And if anyone out there can think of any counterexamples &mdash; social networks or services that got big &#8220;all at once&#8221; &mdash; leave a comment and let me know!  I honestly can&#8217;t think of any.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/building-a-social-network-island-by-island/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What Verna Taught Me</title>
		<link>http://20bits.com/articles/what-verna-taught-me/</link>
		<comments>http://20bits.com/articles/what-verna-taught-me/#comments</comments>
		<pubDate>Wed, 06 May 2009 15:30:28 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[customers]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[product developer]]></category>
		<category><![CDATA[strategy]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=627</guid>
		<description><![CDATA[
When I was student at the University of Chicago I worked for Residential Computing, or ResCom.  ResCom was responsible for maintaining all the computer labs and IT systems in the residential dorms.



I had two jobs.  The first was to help manage the dorm computer labs.  If a new virus broke out and [...]]]></description>
			<content:encoded><![CDATA[<p>
When I was student at the University of Chicago I worked for Residential Computing, or ResCom.  ResCom was responsible for maintaining all the computer labs and IT systems in the residential dorms.
</p>

<p>
I had two jobs.  The first was to help manage the dorm computer labs.  If a new virus broke out and computers were banned from the network because they were spamming everybody on campus, I had to go in and clean it up.  I still remember when the <a href="http://en.wikipedia.org/wiki/Blaster_(computer_worm)">Blaster worm</a> infected most of the Windows machines on campus.
</p>

<p>
The second job was to help build a web-based dorm management system called Chopin.  This system was at the center of the daily operation of the dorms.  Students could use it to submit work requests and report problems with the dorms.  Staff could use it to send out mass mailings, sell students printing credits for the dorm printers, and any number of other routine tasks.
</p>

<h3>Enter Verna</h3>

<p>
Verna was a front-desk clerk at one of the dorms.  She was responsible for helping students if a problem came up and used Chopin to get her job done.
</p>

<p>
She had also lived in Hyde Park, the neighborhood surrounding the University, for most of her life and just wanted to do her job without having to deal with whiny students or crappy software.  Chopin was supposed to help her do that.
</p>

<p>
One day while fixing the the front-desk computer I asked Verna, &#8220;What do you think of Chopin?&#8221;  She immediately supplied a laundry list of complaints.  It didn&#8217;t work well, it was confusing, she never knew where to go or what to do, and so forth.
</p>

<p>
It would have been easy to blame her.  Maybe she just needed better training or maybe she wasn&#8217;t trying hard enough.  But that was all ego: I was just upset because she told me the product I helped build was pretty awful.
</p>

<p>
Her critique was absolutely fair.  I had built solutions I liked for problems I wasn&#8217;t even sure existed.  In fact, before then, I had never honestly talked to any of the people who actually <em>used</em> Chopin about the product itself.  In retrospect that seems completely insane.
</p>

<p>
And watching her use Chopin I saw so much waste.  She would click ten times where two would have sufficed.  Some features would never get used, others used for things they weren&#8217;t intended for.  Once I was there, talking with her and watching how she used Chopin, I saw that the whole process was messed up, from top to bottom.  Most of her energy was spent working around problems I had caused!
</p>

<h3>Go and See</h3>

<p>
If my job hadn&#8217;t required that I work on Chopin and get out of the office I never would have even realized there was a problem.
</p>

<p>
That experience taught me that whenever I didn&#8217;t understand a customer&#8217;s frustration or thought that maybe they were feeling this way or that way I should just go ask them before building solutions that might be worse than the problem.  When in doubt, go and see for yourself. Actually, scratch that: <em>always</em> go see for yourself.
</p>

<p>
Too often I&#8217;d pass off mere belief as knowledge, or generalize from a specific set of circumstances to a fundamentally different set of circumstances.  Maybe this solution worked over there, but why should it work over here?  The only people who can validate your product are your customers &mdash; everyone else, including yourself, can wait their turn.
</p>

<p>
Verna taught me that.
</p>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/what-verna-taught-me/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Notification Strategies for Social Networks</title>
		<link>http://20bits.com/articles/notification-strategies-for-social-networks/</link>
		<comments>http://20bits.com/articles/notification-strategies-for-social-networks/#comments</comments>
		<pubDate>Tue, 05 May 2009 16:00:02 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[notifications]]></category>
		<category><![CDATA[retention]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=599</guid>
		<description><![CDATA[
You&#8217;ve built a social application and launched a new feature.  The number of notifications you can send out is constrained.  Which set of users should you notify to guarantee the most people start using this new feature?



This problem might seem artificial.  Why not put up an ad on your product, or send [...]]]></description>
			<content:encoded><![CDATA[<p>
You&#8217;ve built a social application and launched a new feature.  The number of notifications you can send out is constrained.  Which set of users should you notify to guarantee the most people start using this new feature?
</p>

<p>
This problem might seem artificial.  Why not put up an ad on your product, or send a notification to every single person who might be interested?  There are several reasons the number of people you can notify might be constrained.
</p>
<ol>
<li>There is a technical constraint, e.g., Facebook limits the number of application-to-user notifications at the API level.</li>
<li>There is a financial constraint, e.g., you&#8217;re sending notifications over SMS and every message costs you money.</li>
<li>There is a strategic constraint, e.g., sending notifications too frequently causes fatigue and reduces the effectiveness of future notifications.</li>
</ol>

<p>
So, the situation is not too far fetched.  Let&#8217;s investigate the issue.
</p>

<p>
For the rest of the article the &#8220;application&#8221; is going to be a Facebook application and it can only send 100 application-to-user notifications per week.  Which 100 users should we notify?
</p>

<h3>The Basic Considerations</h3>
<p>
In the <a href="http://20bits.com/articles/behavior-adoption-on-social-networks/">linear cascade model</a> when a user in a social network adopts a new behavior there is a probability that each neighbor in the network will adopt it.
</p>

<p>
Under this model we probably wouldn&#8217;t want to notify two people who are friends, and especially not a cluster of friends or a <a href="http://en.wikipedia.org/wiki/Clique_(graph_theory)">clique</a>.  The new feature wouldn&#8217;t spread very far beyond this group.
</p>

<p>
Likewise, we wouldn&#8217;t want to notify people who are very far apart on the social network because a user is more likely to adopt a behavior if more than one of his friends has also adopted it.  So there is a balancing act between notifying users who are close together, to achieve density, and notifying users who are far apart, to achieve breadth.
</p>

<h3>Heuristics and Centrality Measures</h3>
<p>
The easiest solution is to pick 100 random users to notify, but this is also the most naive since it takes into account neither the structure of the network nor likelihood that a person will influence their neighbors.
</p>

<p>
A better<sup>1</sup> solution to this problem is to develop a heuristic that ranks every user in the network according to some metric.  If we can only send 100 notifications then they are sent to the first 100 people on this ranked list.
</p>

<p>
The idea here is to use <a href="http://en.wikipedia.org/wiki/Centrality">centrality measures</a> to come up with heuristics.  In graph theory &#8220;centrality&#8221; is a measure of how important an individual node is.
</p>

<p>
The simplest measure is called &#8220;degree centrality&#8221; and is equal to the number of neighbors of a node.  On a social network this is the number of friends of a given user.  So, if you wanted to send out 100 notifications using this heuristic we&#8217;d send notifications to the 100 users with the most friends.  This heuristic involves convincing celebrities to use the new feature.
</p>

<p>
There are other, more complex heuristics.  The Wikipedia article linked above has a list of other centrality measures, and I wrote an article about calculating <a href="http://20bits.com/articles/graph-theory-part-iii-facebook/">eigenvalue centrality</a>, which is similar to PageRank.  Each of these admits a heuristic which can tell us which users to notify.
</p>

<p>
Of course, which strategy works best is hard to know beforehand, as it varies with respect to both time and the underlying notification.  A/B testing this is difficult because the effects are intentionally dependent.  If anyone has a good solution to this that doesn&#8217;t involve collecting massive amounts of data about user behavior I&#8217;d be interested in hearing it!
</p>

<p>
It should be noted that each of these heuristics only takes into account the underlying structure of the graph and not the probability of &#8220;infection.&#8221;  By including the latter we can come up with a nearly exact model of the optimal subset of users to notify.
</p>

<h3>A Global Solution</h3>
<p>
<a href="http://www.cs.cmu.edu/~bmeeder/">Brendan Meeder</a> at CMU pointed me to a great paper that discuss this very topic, <a href="http://www.cs.cornell.edu/home/kleinber/kdd03-inf.pdf">Maximizing the Spread of Inﬂuence through a Social Network</a> by Kempe, et al.
</p>

<p>
Rather than take a localized view of the problem by ranking each node individually, we create a statistical model of how the new feature propagates through the network.
</p>

<p>
First, we start with a finite seed set, A.  In our case A is a set of 100 users.  Say we convert each of these 100 users.
</p>

<p>
In our model if a user <em>u</em> is converted then for each neighbor <em>v</em> there is some probability
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=p_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='p_{u,v}' title='p_{u,v}' class='latex' />
</div>
<p>
that v will also be converted.
</p>

<p>
After the process has run its course some set of users has adopted the new feature.  Because adoption is probabilistic the size of this final configuration is a random variable.  Using the notation from Kempe, et al., for a given seed set A the size of the final set of adopters is a random variable denoted by
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Cvarphi%5Cleft%28A%5Cright%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{\varphi\left(A\right)}' title='\displaystyle{\varphi\left(A\right)}' class='latex' />
</div>

<p>
Our goal is to pick the set A which maximizes the expected value of this random variable.
</p>

<p>
Formally, we want to find the subset A such that
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Csigma%5Cleft%28A%5Cright%29%20%3D%20E%5Cleft%5B%5Cvarphi%5Cleft%28A%5Cright%29%5Cright%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{\sigma\left(A\right) = E\left[\varphi\left(A\right)\right]}' title='\displaystyle{\sigma\left(A\right) = E\left[\varphi\left(A\right)\right]}' class='latex' />
</div>
<p>
is maximized, where
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7B%5Csigma%5Cleft%28%5Ccdot%5Cright%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{\sigma\left(\cdot\right)}' title='\displaystyle{\sigma\left(\cdot\right)}' class='latex' />
</div>
<p>
is called the <em>influence function</em>.
</p>

<h3>The Algorithm and The Results</h3>
<p>
It turns out that calculating the influence function exactly is <a href="http://en.wikipedia.org/wiki/NP-hard">NP-hard</a>, but there is a <a href="http://en.wikipedia.org/wiki/Greedy_algorithm">greedy algorithm</a> which approximates the value under certain (unrestrictive) conditions.
</p>

<p>
If you want more details read the paper linked above or the related <a href="http://www.cs.cornell.edu/home/kleinber/icalp05-inf.pdf">Inﬂuential Nodes in a Diﬀusion Model for Social Networks</a> by Kempe, et al.
</p>

<p>
Using Monte Carlo methods Kempe, et al. simulated the diffusion process using this algorithm versus several of the heuristics I described above.  The results are fairly striking: their algorithm performs at least 18% beter than the best-performing heuristic (degree centrality) and 48% better than if the seed set were randomly selected.  I&#8217;ve embedded a graph of their results below.
</p>
<img src="http://20bits.com/wp-content/uploads/2009/05/kempe-graph.png" alt="kempe-graph" title="kempe-graph" width="429" height="334" class="alignnone math size-full wp-image-613" />
<p>
The &#8220;target set&#8221; is the initial seed set of users to notify, and the &#8220;active set&#8221; is the final set of users who actually adopted the new feature or product.  The more users who adopt the feature the better the strategy.
</p>

<h3>Feasibility</h3>
<p>
Kempe&#8217;s algorithm is more feasible than many of the heuristics discussed above, although the best performing heuristic &mdash; degree centrality &mdash; is also the easiest to calculate.  He also doesn&#8217;t include eigenvalue centrality in his analysis, which I&#8217;d be interested in comparing.
</p>

<p>
The biggest downside to his algorithm is that it requires both full knowledge of the underlying graph and an accounting of all the user-to-user transmission probabilities.  Modeling these probabilities would require a lot of data about users over an extended period of time.
</p>

<p>
Whether the additional 18% is worth the extra computation and data collection depends on a lot on specific circumstances, but personally I&#8217;m going to try to implement it in my projects and see how the performance compares first-hand.
</p><ol class="footnotes"><li id="footnote_0_599" class="footnote">&#8220;Better&#8221; according to what?  As we&#8217;ll see, randomly selecting seed users performs worse than all the other heuristics.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/notification-strategies-for-social-networks/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Why hi5 Might Have an Edge on Facebook</title>
		<link>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/</link>
		<comments>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 17:50:26 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hi5]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[myspace]]></category>
		<category><![CDATA[opinion]]></category>
		<category><![CDATA[social-networking]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[virtual goods]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=579</guid>
		<description><![CDATA[
Facebook has been trying hard to find a business model.  Their Beacon advertising product is probably the most infamous example.  So far they&#8217;ve been left empty handed and have been forced to look outside the company for money, first from Microsoft1 and then from foreign investors.2



If Facebook wants to be the internet&#8217;s cable [...]]]></description>
			<content:encoded><![CDATA[<p>
Facebook has been trying hard to find a business model.  Their <a href="http://en.wikipedia.org/wiki/Beacon_(Facebook)">Beacon</a> advertising product is probably the most infamous example.  So far they&#8217;ve been left empty handed and have been forced to look outside the company for money, first from Microsoft<sup>1</sup> and then from foreign investors.<sup>2</sup>
</p>

<p>
If Facebook wants to be <a href="http://news.cnet.com/8301-10784_3-9946606-7.html">the internet&#8217;s cable company</a> what are they going to have to do to turn themselves into a <a href="http://www.google.com/finance?q=NASDAQ%3ACMCSA">$40Bn</a> company?
</p>

<h3>Are Virtual Goods the Key?</h3>
<p>
Not all social networks are struggling to find a great business model.  Tencent, a Chinese social networking company, pulled in over $1Bn in revenue last year, primarily through its use of virtual goods.<sup>3</sup>
</p>

<p>
But Facebook doesn&#8217;t need to look overseas to see that virtual goods could work for them.  Most of the top Facebook games use virtual currency to make money, powered by leadgen-based ad networks like <a href="http://offerpal.com">Offerpal</a> and <a href="http://getgambit.com">Gambit</a>.  There are reports that some of these apps are pulling in eight figures per year.<sup>4</sup>
</p>

<p>
And of course there&#8217;s Facebook&#8217;s own gifting service, which has recently moved to a virtual currency system, pricing gifts in &#8220;points&#8221; that can be bought with real money.<sup>5</sup>
</p>

<p>
All of this is to say that it appears that virtual goods are a natural business model for social networks and Facebook has enough data to see that.  Why isn&#8217;t Facebook pursuing this strategy more aggressively?  Why do they seem dead-set on building advertising technologies like Social Ads and Beacon?
</p>

<h3>The US Advertising Crutch</h3>
<p>
In the world of advertising not all countries are equal.  US traffic is generally valued the highest, followed by other English-speaking countries, the <a href="http://en.wikipedia.org/wiki/G20_industrial_nations">G20</a>
, and finally the rest of the world.
</p>

<p>
Until recently Facebook was concentrated in the English-speaking world.  It&#8217;s the second largest social network in the US, after MySpace, and the largest in both Canada and the UK.
</p>

<p>
Unlike other social networks which don&#8217;t have a significant presence in the English-speaking world, Facebook can support itself through advertising.  This is a crutch that prevents Facebook making bold decisions with their business model.  I believe Facebook sees themselves as the next Google, one piece of technology away from <a href="http://www.roughtype.com/archives/2007/11/the_social_graf_1.php">changing the world of advertising</a>.
</p>

<h3>The Demographic Crunch</h3>
<p>
Not all social networks have Facebook&#8217;s demographics, of course. hi5, the world&#8217;s third largest social network after MySpace and Facebook, has an extensive presence throughout Latin America and other countries which advertisers and publishers typically ignore.  The same can be said of the advertising market in China, but recall that Tencent pulled in $1Bn last year through virtual goods.
</p>

<p>
It&#8217;s little wonder, then, that hi5 is aggressively pursuing a virtual goods strategy.<sup>6</sup>  Their demographics makes this strategy much more appealing.  Facebook has the money and the audience to waste pursuing a pure-advertising strategy for social networks.
</p>

<p>
What once seemed like a demographic disadvantage might turn out to be a demographic advantage for hi5.  Will they beat Facebook to the business model punch?
</p>
<p>
And a year from now will we be reading articles about Facebook&#8217;s virtual goods strategy compares to hi5&#8217;s, as opposed to articles about how Facebook&#8217;s new homepage compares to Twitter?
</p>

<h3> You&#8217;re crazy.  You know that, right?</h3>

<p>
Obviously hi5 has an uphill battle.  Facebook is growing on the order of 500,000 new users <em>per day</em> and shows no signs of slowing.  But the same was said of MySpace and Friendster when Facebook launched.  I think we still have a few more twists in the story of social networking on the web, and this is just one possible twist among many.
</p><ol class="footnotes"><li id="footnote_0_579" class="footnote"><a href="http://news.cnet.com/8301-13577_3-9803872-36.html">Microsoft acquires equity stake in Facebook, expands ad partnership</a> (cnet)</li><li id="footnote_1_579" class="footnote"><a href="http://www.businessinsider.com/2008/11/update-on-facebook-s-dubai-fundraising-trip">Update On Facebook&#8217;s Dubai Fundraising Trip</a> (Business Insider)</li><li id="footnote_2_579" class="footnote"><a href="http://venturebeat.com/2009/03/19/the-worlds-most-lucrative-social-network-chinas-tencent-beats-1-billion-revenue-mark/">The world’s most lucrative social network? China’s Tencent beats $1 billion revenue mark.</a> (VentureBeat)</li><li id="footnote_3_579" class="footnote"><a href="http://venturebeat.com/2008/08/25/developer-analytics-facebook-game-mob-wars-making-22000-a-day/">Developer Analytics: Facebook game Mob Wars making $22,000 a day</a> (VentureBear)</li><li id="footnote_4_579" class="footnote"><a href="http://blog.facebook.com/blog.php?post=36577782130">Gift Shop Credits Have Arrived</a> (Facebook)</li><li id="footnote_5_579" class="footnote"><a href="http://venturebeat.com/2009/01/22/hi5s-virtual-entertainment-plans-could-hit-a-virtual-jackpot/">Hi5’s virtual entertainment plans could hit a virtual jackpot</a> (VentureBeat)</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/why-hi5-might-have-an-edge-on-facebook/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Behavior Adoption on Social Networks</title>
		<link>http://20bits.com/articles/behavior-adoption-on-social-networks/</link>
		<comments>http://20bits.com/articles/behavior-adoption-on-social-networks/#comments</comments>
		<pubDate>Fri, 24 Apr 2009 16:20:12 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[graph-theory]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[obama]]></category>
		<category><![CDATA[social-networking]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[viral growth]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=548</guid>
		<description><![CDATA[
Why and how do people adopt new behaviors?  Why do they start using new products?  Did you sign up for Facebook because all of your friends were on it, or because a specific friend recommended it to you?  Or do you refuse to sign up at all?



In this article I&#8217;m going to [...]]]></description>
			<content:encoded><![CDATA[<p>
Why and how do people adopt new behaviors?  Why do they start using new products?  Did you sign up for Facebook because all of your friends were on it, or because a specific friend recommended it to you?  Or do you refuse to sign up at all?
</p>

<p>
In this article I&#8217;m going to outline two models that describe how new behaviors, ideas, and messages propagate through social networks.
</p>

<h3>The Threshold Model</h3>
<p>
The first model is called the Threshold Model.<sup>1</sup>  It says that people adopt a new behavior bceause a sufficiently large proportion of their friends have adopted that behavior.  Early adopters have a very low threshold, say 5% or 10%, while late adopters would have a much higher threshold. Every person, however, has their own individual threshold.
</p>

<p>
For example, my girlfriend&#8217;s stated reason for signing up for Twitter was that &#8220;all my friends were using it.&#8221;  And during the 2008 US Presidential election, some Obama supporters would adopt Hussein as their middle name.<sup>2</sup>  When I saw that lots of my friends were doing it I was certainly tempted to do the same.
</p>

<p>
The underlying psychological principle is one of &#8220;missing out&#8221; or &#8220;when in Rome.&#8221;  The key variable here is the initial distribution of thresholds across a social network, which describes in totality the final extent of the behavior.
</p>

<p>
It&#8217;s worth noting that this model says nothing about how people <em>initially</em> adopt behavior.  That is, it says nothing about innovators, only about the spread of innovation through a social network.
</p>

<h3>The Cascade Model</h3>
<p>
The second model is called the Cascade or Word-of-Mouth Model<sup>3</sup>, and is the method of &#8220;viral growth&#8221; that most <a href="http://20bits.com/articles/social-applications-are-social-networks/">social application developers</a> are familiar with.  It says that every person has a chance of adopting a new behavior whenever one of their neighbors adopts it.
</p>

<p>
This model describes phenomena like product recommendations or user-to-user notifications on Facebook.  The probability that a person adopts the new behavior is the conversion rate for the notification.<sup>4</sup>
</p>

<p>
This probability is both a function of the sender and the recipient, so more influential people are more likely to convince you to adopt a behavior (or purchase a product, or install an application).
</p>

<h3>Practical Implications</h3>
<p>
Both of these models describe facets of real-world interaction on social networks.  My take is that the cascade model is more accurate at the beginning of a social network&#8217;s life, where behavior is spreading through sparse areas, connected by influencers.  Later on, after a critical density has settled in, people start adopting the behavior because everyone else is adopting it and there&#8217;s a social cost to not doing the same.
</p>

<p>
We see this pattern in services like Facebook and MySpace, both of which got their start by harvesting emails and spreading through word-of-mouth (and spam) across a social network.<sup>5</sup> Eventually each network reached a point where a sufficient number of people were familiar with the product and new users adopted it not because their friends recommended it (the cascade model), but because there was a social expectation that they do (the threshold model).
</p>

<p>
Also, with respect to analytics and viral growth, the threshold model is more difficult to track.  In the cascade model we record who sent what to whom and which messages they responded to.  It&#8217;s clear who gets credit for a user&#8217;s conversion.  In the threshold model you have to track passive exposures, and there&#8217;s no clear causal relationship. 
</p>

<p>
If ten of my friends are doing something and I decide to start doing the same thing, who gets credit?  Most analytics packages will show this behavior as a direct visit, with no connection to other users&#8217; behavior, even though there is a viral process underlying it.
</p>

<p>
In short, the threshold model requires a certain level of behavioral density, while the cascade model doesn&#8217;t.  However, we see both models expressed in how people actually adopt new behaviors in social contexts.
</p>

<h3>Formalisms</h3>
<p>
In the threshold model every person <em>u</em> has a threshold
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=T_u%20%5Cin%20%5B0%2C1%5D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='T_u \in [0,1]' title='T_u \in [0,1]' class='latex' /></div>
<p>
and each of their neighbors <em>v</em> is weighted according to
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=w_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='w_{u,v}' title='w_{u,v}' class='latex' /></div>
<p>
If
</p>
<div class="math"<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BT_u%20%3C%20%5Csum_%7Bv%20%5Cin%20%5Ctext%7Badopters%7D%7D%20w_%7Bu%2Cv%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{T_u &lt; \sum_{v \in \text{adopters}} w_{u,v}}' title='\displaystyle{T_u &lt; \sum_{v \in \text{adopters}} w_{u,v}}' class='latex' /></div>
<p>
then the person <em>u</em> adopts the behavior.
</p>

<p>
The set of thresholds, weights, and initial adopters completely determines the extent of the behavior in the social network.
</p>

<p>
In the cascade model, for every person <em>u</em> and neighbor <em>v</em> there is a random variable
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=X_%7Bu%2Cv%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='X_{u,v}' title='X_{u,v}' class='latex' /></div>
<p>
which describes the likelihood of <em>u</em> adopting the behavior if <em>v</em> has adopted it.
</p>

<h3>Takeaways</h3>
<p>
I&#8217;ll try to boil all this down into a few, practical takeaways.
</p>
<ol>
	<li>The Threshold and Cascade Models describe two mechanisms of behavior adoption in social networks.</li>
	<li>The Threshold Model says that people do something if enough of their friends are doing it.</li>
	<li>The Cascade Model says that people have a chance of doing something if one of their friends is doing it.</li>
	<li>Both models correspond to different real-life adoption patterns.</li>
	<li>The typical &#8220;viral loop&#8221; involves the cascade model, but most successful social networks rely on the mechanics of the threshold model in the long run, i.e., density is important for long-term success.</li>
	<li>The cascade model is a good tool for analyzing acquisition scenarios, but the threshold model is probably more helpful for understanding retention and engagement &mdash; it at least implies that <em>density</em> is a key factor in social network growth, a metric that&#8217;s not often discussed publicly.</li>
</ol> 

<p>
Agree?  Disagree?  Leave a comment, send me an email, or <a href="http://twitter.com/jessefarmer">follow me on Twitter</a>!
</p><ol class="footnotes"><li id="footnote_0_548" class="footnote"><footnote>See <a href="http://rumordynamics.awardspace.com/phfs/Threshold_Models_of_Collective_Behavior.pdf">Threshold Models of Collective Behavior</a> (1978) by the famous sociologist Mark Granovetter. </li><li id="footnote_1_548" class="footnote">See <a href="http://www.huffingtonpost.com/2008/06/28/obama-supporters-adopting_n_109788.html">Obama Supporters Adopting Middle Name &#8220;Hussein&#8221; As Their Own</a></li><li id="footnote_2_548" class="footnote">See <a href="http://pluto.huji.ac.il/~msgolden/home_page/pdf/TalkofNetworks.pdf">Talk of the Network: A Complex Systems Look at the 
Underlying Process of Word-of-Mouth</a> (2001) by Goldenburg, Libari, and Muller.</li><li id="footnote_3_548" class="footnote">More accurately, we&#8217;d model the &#8220;probability&#8221; as a random variable whose mean was the conversion rate.</li><li id="footnote_4_548" class="footnote">See <a href="http://www.amazon.com/Stealing-MySpace-Control-Popular-Website/dp/1400066948">Stealing MySpace: The Battle to Control the Most Popular Website in America</a> for details about the MySpace team&#8217;s background in direct marketing.  The ConnectU vs. Facebook court documents, which you can find via Google, paint a similar story for Facebook&#8217;s early years.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/behavior-adoption-on-social-networks/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Almost Viral: A Hybrid Acquisition Strategy</title>
		<link>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/</link>
		<comments>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/#comments</comments>
		<pubDate>Wed, 15 Apr 2009 16:00:54 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[arpu]]></category>
		<category><![CDATA[data-driven-development]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[strategy]]></category>
		<category><![CDATA[viral]]></category>
		<category><![CDATA[viral coefficient]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=520</guid>
		<description><![CDATA[
Two common acquisition strategies for a new application are a paid acquisition strategy and a viral acquisition strategy.  The former involves acquiring users at a cost less than the revenue they generate.  The latter involves users inviting their friends to the application.



&#8220;Going viral&#8221; has become a sort of holy grail and most people [...]]]></description>
			<content:encoded><![CDATA[<p>
Two common acquisition strategies for a new application are a paid acquisition strategy and a viral acquisition strategy.  The former involves acquiring users at a cost less than the revenue they generate.  The latter involves users inviting their friends to the application.
</p>

<p>
&#8220;Going viral&#8221; has become a sort of holy grail and most people would say they&#8217;d rather have a viral application than not, but it has a distinct downside: uncontrolled growth and ever-increasing operational costs.  By being almost-but-not-quite viral you can dramatically reduce your cost of acquisition without setting your servers on fire.
</p>

<h3>The Paid Strategy</h3>
<p>
The key variables for a paid strategy are cost of acquisition and average revenue per user, or ARPU.  If your ARPU is greater than your cost of acquisition then you can buy as many users as your budget allows, reinvesting the new revenue into acquiring new users.  Generally this is done by advertising your product through something like AdWords or Facebook&#8217;s Social Ads.
</p>

<p>
The best thing about a paid strategy is its relative simplicity.  If your ARPU is $2.00 then you can run as many ads you want so long as the cost of acquisition is less than $2.00 and still be profitable.
</p>

<p>
Eventually, though, you will run out of ad inventory.  There are only so many publishers who are willing to accept $0.10 per click.  At this point your only options are to decrease your cost of acquisition<sup>1</sup> or increase your ARPU.
</p>

<h3>The Viral Strategy</h3>
<p>
The viral acquisition strategy requires you get current users to invite other users to the application.  The two key variables for the viral strategy are the average number of invites each new users sends and the rate at which those invites convert into new users.
</p>

<p>
The ratio of converting invites to new users is your viral coefficient, k.  If this is greater than one you will see self-sustaining, viral growth.<sup>2</sup>  If the coefficient is less than one each user will bring in a fixed number of new users, but the application&#8217;s growth is still linear.
</p>

<p>
People who have decided on a viral acquisition strategy focus on this number obsessively.  It&#8217;s the first big hurdle to clear and if you haven&#8217;t had experience engineering a viral application it can take months to build something viral.
</p>

<p>
But being viral isn&#8217;t an either-or proposition.  Increasing your viral coefficient from 0.5 to 0.8 has other advantages, especially if you integrate it with a paid strategy.  Let&#8217;s see how.
</p>

<h3>A Hybrid Strategy</h3>
<p>
Say you&#8217;re building a game on Facebook backed by a virtual currency.  Users give you money or fill out offers to get coins, so you have a positive ARPU.  This means you&#8217;re free to pursue a paid acquisition strategy.
</p>

<p>
On the other hand, you&#8217;re on Facebook, which has many viral hooks.  Many of the technical hurdles are much smaller there, so it becomes more a question of design and optimization rather than implementation.  At the very least you encourage players to invite their friends.
</p>

<p>
The first version of your application has a viral coefficient of k=0.5, that is, every new user who joins brings in 0.5 new users.  Equivalently, for every two users you acquire you get one free.
</p>

<p>
That&#8217;s interesting, especially if you&#8217;re also <em>paying</em> for users.  If you&#8217;re paying $1.50 per user then you paid $3.00 to get two users, but acquired a third user for free!  This means that you effectively paid $1.00 per user: $3.00 paid / 3 users. 
</p>

<p>
This process is actually geometric, however.  If you purchased 4 users with a viral coefficient of 0.5, you&#8217;d first get 2 new users for free.  These 2 new users would then bring in 1 additional user, for a total of 3 new users, reducing your cost of acquisition even further.  This is an infinite geometric series, which I&#8217;ll outline below.
</p>

<p>
<strong>Having a non-zero viral coefficient reduces your effective cost of acquisition.</strong>
</p>
<p>
This means that you get more users for every dollar you spend on ads.  Or, if you&#8217;ve run out of inventory, this means you can now spend more per user and retain the same net revenue level.
</p>

<p>
Let&#8217;s figure out how to calculate your <em>effective</em> cost per acquisition.
</p>

<h3>Effective Cost of Acquisition</h3>
<p>
If you just want the formula, here it is:
</p>
<div class="math"><img src='http://s.wordpress.com/latex.php?latex=C%27%20%3D%20C%281-k%29&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='C&#039; = C(1-k)' title='C&#039; = C(1-k)' class='latex' /></div>
<p>
Where C is your cost of acquisition, k is your viral coefficient, and C&#8217; is your effective cost of acquisition.
</p>
<p>
If k=0, i.e., you have no viral acquisition, then C&#8217; = C.
</p>
<p>
If k = 1 and your application is viral then C&#8217; = 0 and your application grows without spending any additional money.  But rather than being an either-or proposition &mdash; you&#8217;re either viral or you&#8217;re not &mdash; there&#8217;s a sliding scale.  The more viral you are the cheaper it is to acquire users.
</p>

<h3>The Benefits of Being Almost Viral</h3>
<p>
From the above formula, if you have a viral coefficient of 0.90 then you have reduced your cost of acquisition by 90%.  This is a great situation to be in.  You might ask, &#8220;Why wouldn&#8217;t I want go the last 0.10 and make my application viral?&#8221;
</p>

<p>
The one benefit of being viral is huge growth, which looks sexy on a graph and can tip an investor to your side if you&#8217;re looking for outside money, but the growth is unpredictable.  Not only do you have little control over what demographics come to dominate your application, but sometimes it grows so quickly that you run into operational problems (servers on fire, etc.).
</p>

<p>
By being almost viral you can grow very cheaply, control your rate of growth and demographics, and get enough traffic to conduct meaningful <a href="http://20bits.com/articles/scientific-product-development/">experiments</a>.  Need to grow more slowly?  Just decrease your daily ad spend.  Need statistically significant results more quickly?  Increase your daily ad spend.
</p>

<p>
Put another way, with a viral coefficient of 0.9 you&#8217;ve dealt with your acquisition risk.  Rather than going fully viral and dealing with the operational difficulties, it might be worth your time to deal with other market risks: retention, engagement, and monetization.
</p>

<p>
So, stop sweating about &#8220;being viral.&#8221;  Sometimes it&#8217;s better to be almost viral.
</p>

<h3>Deriving the Formula for Effective Cost of Acquisition</h3>

<p>
You can skip this and go right to the comments if you&#8217;re not interested in the math.
</p>

<p>
We have an application with a viral coefficient of k = 0.5.  Every new user who joins brings in 0.5 new users.  Another way of thinking of it is that for every new user who joins there is a 50% probability that he will bring in another user.
</p>

<p>
But this potential user also has a 50% chance of bringing in a new user, so the expected number of users is now 1 + 0.5 + 0.5*0.5.  This continues <em>ad infinitum</em>.
</p>

<p>
Formally, if we acquire one user and have a viral coefficient of k then the number of users we expect to join is N(k), given by the formula
</p>

<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BN%28k%29%20%3D%201%20%2B%20k%20%2B%20k%5E2%20%2B%20k%5E3%20%2B%20%5Ccdots%20%3D%20%5Csum_%7Bi%20%3D%200%7D%5E%7B%5Cinfty%7D%20k%5Ei%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{N(k) = 1 + k + k^2 + k^3 + \cdots = \sum_{i = 0}^{\infty} k^i}' title='\displaystyle{N(k) = 1 + k + k^2 + k^3 + \cdots = \sum_{i = 0}^{\infty} k^i}' class='latex' />
</div>

<p>
The initial 1 is our original user, whom we paid for, and each k represents the expected number of users from each step in the viral process.
</p>

<p>
This is a <a href="http://en.wikipedia.org/wiki/Geometric_series">geometric series</a>, so we know that
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BN%28k%29%20%3D%20%5Cfrac%7B1%7D%7B1-k%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{N(k) = \frac{1}{1-k}}' title='\displaystyle{N(k) = \frac{1}{1-k}}' class='latex' />
</div>

<p>
Therefore, our effective cost of acquisition is
</p>
<div class="math">
<img src='http://s.wordpress.com/latex.php?latex=%5Cdisplaystyle%7BC%27%20%3D%20%5Cfrac%7BC%7D%7BN%28k%29%7D%20%3D%20%5Cfrac%7BC%7D%7B%5Cfrac%7B1%7D%7B1-k%7D%7D%20%3D%20C%281-k%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=2' alt='\displaystyle{C&#039; = \frac{C}{N(k)} = \frac{C}{\frac{1}{1-k}} = C(1-k)}' title='\displaystyle{C&#039; = \frac{C}{N(k)} = \frac{C}{\frac{1}{1-k}} = C(1-k)}' class='latex' />
</div><ol class="footnotes"><li id="footnote_0_520" class="footnote">If you increase the conversion rates for your ads then you can pay more per click to get more impressions without hurting your bottom line &mdash; you pay the same to acquire a user, but get more users through the door.</li><li id="footnote_1_520" class="footnote">See my article <a href="http://20bits.com/articles/three-myths-of-viral-growth/">Three Myths of Viral Growth</a> for more information about viral growth.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/almost-viral-a-hybrid-acquisition-strategy/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Social Applications are Social Networks</title>
		<link>http://20bits.com/articles/social-applications-are-social-networks/</link>
		<comments>http://20bits.com/articles/social-applications-are-social-networks/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 15:00:51 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[engagement]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[monetization]]></category>
		<category><![CDATA[retention]]></category>
		<category><![CDATA[social network analysis]]></category>
		<category><![CDATA[social-networking]]></category>
		<category><![CDATA[top friends]]></category>
		<category><![CDATA[virality]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=485</guid>
		<description><![CDATA[
Are all social applications also social networks?  Dave McClure made a passing reference to this a little over a year ago, saying &#8220;RockYou &#038; Slide [are] arguably social networks of their own.&#8221;1  I want to make the stronger claim: social applications are always social networks.



It doesn&#8217;t matter how large you are, it doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>
Are all social applications also social networks?  Dave McClure made a passing reference to this a little over a year ago, saying &#8220;RockYou &#038; Slide [are] arguably social networks of their own.&#8221;<sup>1</sup>  I want to make the stronger claim: social applications are always social networks.
</p>

<p>
It doesn&#8217;t matter how large you are, it doesn&#8217;t matter what your goals are, and it doesn&#8217;t matter what your product is.  I think if you&#8217;re building a social application then you&#8217;re trying to build a new social network.  As we&#8217;ll see, this has both strategic and technical implications.
</p>

<h3>What is a Social Network?</h3>
<p>
First, if I&#8217;m going to convince you that something is a social network we should understand what a social network is. If you ask a person to name a few social networks, they will probably list services like Facebook, MySpace, and Twitter.  And if an investor tells you they&#8217;re &#8220;not investing in social networks,&#8221; they mean it in this concrete, social-network-as-a-product sense.
</p>

<p>
Others, like Brad Fitzpatrick and Mark Zuckerberg, use the term <em>social graph</em><sup>2</sup> to distinguish between the underlying social relations between people and the services, called social networks, that are built on top of them.
</p>
<p>
But if there&#8217;s one thing I learned from my mathematics education it&#8217;s this: we&#8217;re free to define things however we want so long as they&#8217;re consistent.  Therefore we ought to choose the definition that helps us get our job done.
</p>

<p>
So, here is my first, and most abstract definition: <blockquote>A social network is a collection of people bound together through a specific set of social relations.</blockquote>
</p>

<!-- Let's see if anyone makes me define social relation! -->

<p>
By &#8220;social relation&#8221; I mean a connection between people that permits the exchange of information.  This prevents artificial relations like &#8220;Alex and James are connected if they have the same hair color.&#8221;
</p>

<p>
When I say &#8220;social network&#8221; I always mean the actual collection of people.  Facebook is a social network.  There are actual people engaged with the site, creating relationships, sharing information, and doing all the things they&#8217;d do in &#8220;real life.&#8221;  Or, put another way: a family is a social network, a family tree is not.<sup>3</sup>
</p>



<p>
If you don&#8217;t like the above definition I can give you a functional one which I believe is equivalent. <blockquote>A collection of people is a social network if and only if it is possible for something to spread virally through that collection.</blockquote>
</p>

<p>
In Web 2.0 speak, a &#8220;social network&#8221; is a collection of people over which you can &#8220;go viral&#8221;.  I believe that virality and social networks are fundamentally linked, and that both the above definitions are equivalent.
</p>

<h3>Social Applications are Social Networks</h3>
<p>
Accepting the above definitions, even if for the sake of argument, I don&#8217;t think it&#8217;s too hard to see why social applications are social networks. Let&#8217;s take Slide&#8217;s <a href="http://www.facebook.com/apps/application.php?id=2425101550">Top Friends</a> as an example.  Is Top Friends a social network in its own right?
</p>

<p>
I think it&#8217;s easier to see that Top Friends meets the first definition.  It is certainly a collection of people: the set of Facebook users who have installed the application.  Are those people bound by specific social relations?  Yes, and those relations are distinct from the ones represented in Facebook.  For example, Alex adding James as a top friend is a social signal distinct from Facebook.
</p>

<p>
What about the second definition?  Top Friends doesn&#8217;t have an external API so it&#8217;s impossible to build apps or plugins for Top Friends.<sup>4</sup>  So, what &#8220;goes viral&#8221; over Top Friends?  New features and patterns of usage do.<sup>5</sup>
</p>

<p>
I&#8217;d also argue that the converse is true: social networks are all social applications.  YouTube spread through MySpace, Facebook spread through email, email spread through the real-life &#8220;social graph&#8221;, and PayPal spread through eBay.<sup>6</sup> All social networks are social applications built off of pre-existing social networks.
</p>

<h3>Strategic Implications</h3>
<p>
If Top Friends is a social network in its own right then there are strategic implications for Facebook. <em>Prima facie</em>, Top Friends is competing with facebook for users&#8217; attention on its own platform.  Before Facebook launched the Platform it was the Eye of Providence, collecting, collating, and analyzing every bit of activity that occurred on its network.
</p>

<p>
After the Platform launched these third parties were able to infect portions of Facebook&#8217;s network.  In some cases, e.g., the Causes application, the relationship was symbiotic.  In others, e.g., Top Friends, the relationship was antagonistic, with Facebook actually shutting down Top Friends at one point.<sup>7</sup>  
</p>

<p>
What does Facebook gain by having Top Friends on its Platform?  Nothing substantial, as far as I can tell.  What does it lose?  Control and insight over the activities of its userbase.<sup>8</sup>
</p>

<p>
In effect, Top Friends is a social network bootstrapped off of Facebook, with its own set of communication channels over which Facebook has no authority or insight.  This tension is present everywhere in the Platform because application developers&#8217; interests are not wholly aligned with Facebook&#8217;s and will probably never be.
</p>

<h3>Technical Implications</h3>
<p>
I&#8217;m going to save the technical implications for another article, but it boils down to this: social networks in the sense that I defined above are fairly well understood.  I believe the techniques used on the web today to grow &#8220;viral&#8221; applications are under the research from fields like social network analysis and epidemiology.

<p>
Since I believe social applications and social networks are synonymous, we can better understand how these applications grow by understanding how social networks grow.
</p>

<p>
In the meantime, I recommend reading <em><a href="http://www3.interscience.wiley.com/journal/118986267/abstract">The Statistical Evaluation of Social Network Dynamics</a></em> by Tom A. B. Snijders from the University of Groningen if you&#8217;re interested in the technical aspects of social networks and social applications.
</p>

<p>
And please, leave a comment if you have any thoughts about the above!
</p><ol class="footnotes"><li id="footnote_0_485" class="footnote"><a href="http://500hats.typepad.com/500blogs/2007/11/google-open-soc.html">Google Open Social + Friends vs. Facebook Platform</li><li id="footnote_1_485" class="footnote">See, e.g., <a href="http://bradfitz.com/social-graph-problem/">Thoughts on the Social Graph</li><li id="footnote_2_485" class="footnote"><a href="http://www.artinthepicture.com/artists/Rene_Magritte/pipe.jpeg">Ceci n&#8217;est pas un Social Network</li><li id="footnote_3_485" class="footnote">For all I know Slide has an internal Top Friends API that lets them build new services that ride on Top Friends&#8217; success, but that&#8217;s only <a href="http://api.topfriends.com/">speculation</a>.</li><li id="footnote_4_485" class="footnote">This is the essence of <a href="http://startuplessonslearned.blogspot.com/2008/12/engagement-loops-beyond-viral.html">engagement loops</a>.  Eric Ries talks about going &#8220;beyond viral.&#8221;  There is no &#8220;beyond viral.&#8221;  Rather, on social networks viral processes govern the whole stack: acquisition, retention, engagement, and monetization.</li><li id="footnote_5_485" class="footnote">Slide is to Facebook as Paypal was to eBay.  Anyone buy it?</li><li id="footnote_6_485" class="footnote">See <a href="http://www.techcrunch.com/2008/06/26/did-facebook-shut-down-slides-top-friends-how-very-myspace-of-them/">this TechCrunch article</a>.</li><li id="footnote_7_485" class="footnote">There&#8217;s a broader argument that ceding control in this way is the right strategic move, but Facebook is not there yet &mdash; the limit of that argument is something like OpenSocial.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/social-applications-are-social-networks/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Where the iTunes Store Fails: Community</title>
		<link>http://20bits.com/articles/where-the-itunes-store-fails-community/</link>
		<comments>http://20bits.com/articles/where-the-itunes-store-fails-community/#comments</comments>
		<pubDate>Mon, 06 Apr 2009 15:45:04 +0000</pubDate>
		<dc:creator>Jesse</dc:creator>
				<category><![CDATA[Articles]]></category>
		<category><![CDATA[apple]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[itunes]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scientific-product-development]]></category>
		<category><![CDATA[startups]]></category>

		<guid isPermaLink="false">http://20bits.com/?p=465</guid>
		<description><![CDATA[
You don&#8217;t need me to tell you that the iTunes Store has changed the face of music distribution, digital or otherwise.  In April of 2008 it became the top music retailer in the US1 and passed 6 billion songs downloaded earlier this year2.  That&#8217;s almost one song downloaded for every person on the [...]]]></description>
			<content:encoded><![CDATA[<p>
You don&#8217;t need me to tell you that the iTunes Store has changed the face of music distribution, digital or otherwise.  In April of 2008 it became the top music retailer in the US<sup>1</sup> and passed 6 billion songs downloaded earlier this year<sup>2</sup>.  That&#8217;s almost one song downloaded for every person on the planet.
</p>

<p>
For music startups iTunes figures into most strategic decisions.  If you&#8217;re streaming music for free to consumers you&#8217;re going to be an iTunes affiliate<sup>3</sup>.  If you&#8217;re selling music to consumers you&#8217;re going to competing directly with iTunes &mdash; consumers have no reason to get their music from both you and iTunes if you both have it.
</p>
<p>
It&#8217;s understandable if your heart skips a beat when you catch rumor that Apple will be building a similar product.  How can you maneuver in this environment?
</p>


<h3>Finding Room to Breath</h3>
<p>
The iTunes Store is a lot like Wal-Mart: ubiquitous<sup>4</sup>, highly integrated, and bland.  People shop there because it&#8217;s easier, not because it&#8217;s sexier, even though in other areas Apple is very good at selling precisely that hip, sexy lifestyle.
</p>

<p>
But Wal-Mart&#8217;s strategy isn&#8217;t the only strategy out there.  Companies like Whole Foods can still thrive in their niche even though people can get cheaper food at Wal-Mart.  Where is the Whole Foods of digital music?  Does such a thing even make sense?
</p>

<h3>Building a Community</h3>
<p>
Corner record stores are about more than just the transaction.  They attract a certain crowd and embrace a certain culture.  <a href="http://en.wikipedia.org/wiki/High_Fidelity_(film)">High Fidelity</a> is a good example of this on film.
</p>

<p>
iTunes misses out on the cultural and communal aspects of music altogether.  It&#8217;s very sterile.  It&#8217;s also a terrible means of <em>discovering</em> new music, a role which traditional record stores can fulfill. 
</p>

<p>
As an example, say you&#8217;re really into electronica.   What good are the reviews on the iTunes music store to you?  They&#8217;re left by idiots who don&#8217;t know Aphex Twin from Paul van Dyk.  You go there when you know what you want and leave the second you have it<sup>5</sup>.
</p>

<p>
And knowing iTunes, they might not even have music from your favorite bands if they&#8217;re obscure enough.
</p>

<p>
Instead, imagine a hub of engaged electronica fans with a custom, electronica-centric store<sup>6</sup>.  The community itself spurs demand for its own store due to its reputation for quality electronica-related recommendations.
</p>

<p>
Improved discovery, better quality merchandise, exclusive deals with bands, and a community of like-minded people are just a few reasons why people might prefer to shop at a genre-specific music store rather than iTunes if they&#8217;re forced to choose.
</p>

<p>
Plus an independent store is more freely able to experiment with payment models, distribution methods, and marketing campaigns.  This might interest bands who see iTunes as a love-it-or-leave-it environment controlled from top to bottom by Apple.
</p>

<h3>Will This Work?</h3>
<p>
I don&#8217;t know if this will work, but I think it&#8217;s a reasonable enough <a href="http://20bits.com/articles/scientific-product-development/">hypothesis to test</a>.  This is just one possible strategy for building a music product and probably has several flaws I haven&#8217;t thought through.  Leave a comment and let me know your thoughts.
</p>

<p>
May a thousand music stores bloom!
</p>

<h3>Update</h3>
<p>
<a href="http://adam.blog.heroku.com/">Adam from Heroku</a> pointed me towards <a href="https://www.beatport.com/">Beatport</a>, which has been pursuing this exact strategy for the last few years.</p>
<p>
After a little digging I found a few others, too.  <a href="http://www.insound.com/">Insound.com</a> for indie music and <a href="http://mondomix.com/">Mondomix</a> for world music.  I also know of one stealth startup pursuing this strategy for another genre.  Do you know of any others?  How well does this strategy perform?
</p>

<p>
In the limit you can imagine a &#8220;Ning for iTunes Stores,&#8221; where the costs of implementing the store are shared but the community-building aspects are left to the company.
</p><ol class="footnotes"><li id="footnote_0_465" class="footnote"><a href="http://www.apple.com/pr/library/2008/04/03itunes.html">iTunes Store Top Music Retailer in the US</a></li><li id="footnote_1_465" class="footnote"><a href="http://www.techcrunch.com/2009/01/06/itunes-sells-6-billion-songs-and-other-fun-stats-from-the-philnote/">iTunes Sells 6 Billion Songs, And Other Fun Stats From The Philnote</a></li><li id="footnote_2_465" class="footnote">Both imeem and Last.FM are, for example</li><li id="footnote_3_465" class="footnote">Who did Apple pass as the top music retailer?  Wal-Mart</li><li id="footnote_4_465" class="footnote">This is a problem with iTunes in general.  It&#8217;s the last step in your marketing campaign, not the first.  See my article <a href="http://20bits.com/articles/the-099-app-store/">The $0.99 (App) Store</a>.</li><li id="footnote_5_465" class="footnote">Of course, you can break down genres into subgenres, and so forth.  Maybe the sweet spot is having an ambient store, a trance store, a house store, a D&#038;B store, etc.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://20bits.com/articles/where-the-itunes-store-fails-community/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
