pull down to refresh

RIP #14: A Formal Model of Posting Behavior on Stacker News[1]

One of the feedbacks I got from my department is that I should formalize the model of posting behavior on Stacker News. When I started the paper, I thought that would be mathematical overkill because the inuition is fairly obvious, and the paper is primarily about the data analysis, not formal modeling. But I decided to try it, and after doing it I think it does add a bit of "gravitas" to the paper, and I really like the new results.

The model works like this: people randomly receive "post ideas" that have a certain quality and intrinsic motivation to post. The expected amount that the post gets zapped depends on the quality, and quality and intrinsic motivation could be correlated with each other. When a user receives a post idea, they post it if the expected zaps plus intrinsic motivation exceeds the cost of posting.

The point of the model is to formalize the conditions under which we get the main result, that higher posting costs:

  • reduces the number of posts, and
  • increases the expected quality of posts

As it turns out, the logic holds under fairly unrestrictive conditions. We only require three things for the result to hold:

  1. Expected zap amounts need to be increasing in post quality (very reasonable, maybe even tautological)
  2. Quality and intrinsic motivation to post cannot be negatively related on one another, though they can be positively related or independent of each other (also very reasonable)
  3. Extreme levels of intrinsic motivation to post are sufficiently rare (this is the one that perhaps needs most justification)

Assumption 1 is almost tautological, for how would we judge post quality if not by the amount that people zap it?

Assumption 2 says that if someone has a higher quality post idea, that usually comes with a higher intrinsic motivation to post it as well. I think that's almost certainly true: if you put a lot of effort into something, you have a greater desire to show it to others, or it's because you care a lot about it.

Assumption 3 is there to rule out a weird situation in which posting costs get so high that only hyper-motivated but low quality posters remain on the platform. Even if assumption 3 fails, we can still get our main result if we simply don't focus on the extreme tails of the distributions.

One reason I like having the model in the paper is that by formalizing the conditions under which the results hold, I can make the following claim: "All three assumptions are likely to be satisfied in other social media environments, and thus it may be possible to extrapolate our results to other settings (though perhaps with different baseline cost and rewards levels.)" That addresses one of the biggest critiques of the paper, which is that results on Stacker News don't generalize to broader social media contexts.

The model and proof is stated below as a flex.

(... but also because I already did the work and want to show my proof of work)


ModelModel

We envision a model in which a user receives "post ideas" as a Poisson arrival process with rate . Post ideas arrive with a quality level , which affects how other users respond to the post, and an intrinsic motivation to post it, . The level of internal motivation may be correlated with post quality, and they are drawn from a joint probability density function . We allow different users to draw from different distributions of , but for now we focus on the decisions of a single user.

By Bayes' Rule, , where is the conditional density of given , and is the marginal density of . We assume that is strictly log-concave and weakly satisfies the monotone likelihood ratio property in with respect to :

Assumption 1. (Log Concavity) For each , is strictly log-concave; that is, is strictly concave in .

Assumption 2. (Monotone Likelihood Ratio Property) For any , the likelihood ratio of the conditional densities,

is weakly increasing in .


Remark.

Assumption 1 limits the heaviness of the tail of the motivation distribution. Intuitively, it ensures that the extreme tail of the motivation distribution is not thick enough to swamp quality selection effects at very high posting costs. It is not a strong assumption, and it is satisfied by most commonly used distributional families, including the normal and logistic distributions.[2]

Assumption 2 is a non-negative dependence condition between post quality and intrinsic motivation . Informally, it says that higher quality ideas tend to come with (weakly) higher intrinsic desire to post: when is higher, the distribution of shifts to the right. Independence between and is allowed as a special case, but the assumption rules out systematic negative dependence. In our context, this means that users who receive a high quality post idea are also (weakly) more likely to feel a strong intrinsic desire to post it. As we shall document later, this is consistent with users' self reported perceptions of their own motivations.


Once a post is made, zaps arrive according to a Poisson process. Both the arrival rate of zaps and the amount of each zap are dependent on the quality of the post, . Let be the arrival rate as a function of , and let be the probability density function of the zap amount conditional on .

Supposing the user has a discount factor of , the expected present value of the Poisson stream of zaps is equal to:[3]

We assume that is continuous and strictly increasing in , which merely says that the expected present value of zaps is higher for higher quality posts:

Assumption 3. (Monotone Returns to Quality) is continuous and strictly increasing in .

The user will make the post if their expected present value of zaps plus their intrinsic motivation is greater than the posting cost :

From the perspective of the user, is fixed and known at the time of the post.

We now demonstrate conditions under which an increase in posting cost will lead to fewer posts, but higher quality posts.


Proposition 1.
Suppose Assumptions 1-3 hold. Then an increase in $c$:

  • (i) strictly decreases the probability that the post idea is posted, and
  • (ii) strictly increases the expected quality of posts, conditional on posting.

Proof.

A post idea is posted if and only if . Define as the probability (over ) that a post of quality is posted when the cost is :

is equivalent to the survival function of , evaluated at : .

(i) The unconditional probability of posting is:

Since is a survival function, its first derivative is negative everywhere. An increase in will therefore reduce the value of at every . Since that is what we are integrating over to get , this strictly reduces . The expected number of posts per unit time is , so an increase in strictly reduces the expected number of posts.

(ii) Take any . The set of ideas that were posted at cost can be partitioned into two groups:

  • Surviving posts. Ideas that are still posted at cost , i.e.
  • Marginal posts. Ideas that are posted at but not at , i.e. .

By the Law of Iterated Expectations,

where . Noting that , we rearrange the above equation to obtain:

Thus, to show that expected post quality is increasing in , it suffices to show that the expected quality of the surviving posts is always higher than the expected quality of the marginal posts.

To demonstrate this, it is sufficient to establish that for any , the ratio is strictly increasing in . Intuitively, this says that as costs rise, higher quality posts retain a larger share of the posting probability relative to lower quality ideas. This would imply that the marginal posts are tilted towards lower quality than surviving posts.

We write:

Define the hazard rate: . Differentiating with respect to shows us that the ratio is increasing in if and only if:

The proof now proceeds in two parts.

Claim 1. First, we show that for any . By the MLRP, for any , we have that , which rearranges to:

Integrating both sides over from to gives us:

which rearranges to as was to be demonstrated.

Claim 2. Now, we show that is increasing in for each . Take the derivative of with respect to . The sign is equal to the sign of:

When , this is immediately positive. When , we rely on the log-concavity property. Since is log-concave, the following holds for every :

Exponentiating both sides:

Now integrating over from to :

Since , this can be rearranged to:

which was to be demonstrated. Thus, is strictly increasing in for all .

By the monotonicity assumption, . By Claim 2, this implies that . By Claim 1, . Together, we have , which was to be demonstrated.


Remark.

Proposition 1 implies that when posting cost increases, users are less likely to post, but that when they do post, the average quality of the post is higher. Although the proposition is stated from the perspective of a single user, we note that as long as the global joint distribution of holds to Assumptions 1 and 2, and if Assumption 3 also holds for all users, then Proposition 1 applies in the aggregate, even if users have heterogeneous marginal densities for and .

The result of Proposition 1 extends to a setting with multiple territories. Users post in the territory that maximizes , provided it is positive. When territory raises its posting cost , users' outside options are unaffected, so the same logic applies: as long as the population-level joint distribution of quality and motivation within each territory satisfies Assumptions 1-3, a territory that raises its posting cost will see fewer but higher quality posts.

  1. Note: This is a series in which I am publicly documenting the research process for an academic study into financial micro-incentives on discussion platforms, using data from Stacker.News. See here for a list of updates.

  2. Weaker, local conditions about the hazard rate in the neighborhood of the current posting cost would also suffice, but we state the stronger condition of global log-concavity for simplicity. Moreover, strict log-concavity is assumed for simplicity, but it suffices to assume either strict log-concavity or strict monotone likelihood ratio property; we do not need strictness for both.

  3. This follows from .

205 sats \ 1 reply \ @Taj 3h

Ye I would think of a post and just post it , if it was imo worthy of an upload

But if the barrier to entry, cost to post was much higher

I would probably think of an idea, perhaps shelve it, in favour of another better idea in the future, or wait for yet another idea to compliment the original idea

  • my pet hate posts

Link only posts?

Ffs put somr meat On the bones


Also we are seeing that bot called something or other commenting on every post with some inane vanilla bs

The cost of entry for that bot is the seesaw, obviously currently it's just spamming everyone

The question is , is that an acceptable annoyance or do we need SNIP-110 🤣🤣🤣

reply

SNIP-110, good one, haha!

reply

You could probably hand wave this away, but it seems like there should be an effort of creating the post that would be additional to the fee and would likely be increasing in q.

reply

I plan to tackle endogenous quality selection next. I think I need it to motivate the section on v4v and learning

reply

Really appreciate this and glad you write it up for us.

I confess I'm too tired to engage with it — and about a decade out of econ modeling/grad school stuff to grasp much. Sending some zaps and appreciation instead

reply

Main takeaway is just that you don't need very strong assumptions to get the basic result that raising posting costs reduces number of posts but raises average post quality. The channel works by filtering out lower quality posts.

The thing that could break the result is if low quality posters also have strong motivation to post. If every poster was like SS, then raising posting costs might not improve post quality coz you're left with highly motivated, low quality posters.

Very intuitive, but proving the mathematical model gives the paper a bit more meat on the bones.

reply

Would it not be of importance that the real world value of the cost is so tiny that it won't matter? (And that we can safely intuit that zaps post-posting Will cover it)

Do we care if its 63 or 103 sats to post?

For instance, I don't believe my behavior was changed at all during Undisciplined's econ cost experiment. (Doesn't have to mean anything, ofc, just that I'm way up the D curve).

reply

I share your instinct. That's why I wrote this in the paper:

Taken together, the results are both surprising and unsurprising. They are unsurprising in the sense that they conform with economic theory: demand curves slope downwards (when posting costs go up, number of posts goes down), and signaling theory works (when posting costs go up, higher quality posts are made). They are surprising in the sense that even such small micro-incentives (the average posting cost is just 51 sats, or about 5 cents) are enough to influence user behavior in such a way that post quality is improved. The results suggest that pay-to-post may be an effective mechanism for mediating content quality, even at very small monetary amounts. Moreover, the assumptions required for this result to hold more broadly are fairly weak: we require only that expected rewards are increasing in post quality (Assumption 3, that post quality and intrinsic motivation to post are positively affiliated (Assumption 2), and that extreme levels of intrinsic motivation are sufficiently rare (Assumption 1). All three assumptions are likely to be satisfied in other social media environments, and thus it may be possible to extrapolate our results to other settings (though perhaps with different baseline cost and rewards levels.)
reply
72 sats \ 1 reply \ @BlokchainB 2h

I think we should renamed these to SNIPs

Stacker news improvement proposals haha

reply

I didn't actually propose any improvements though haha

reply