Bayesian approximations in A/B tests

How close does an approximation get to numerical simulations?

4 min read · Jun 12, 2021

A/B testing is common in businesses. This test puts two versions of the same experience in contest with one another. People (such as website users) see one version, chosen through random assignment.

The research question is simple to state: which version is better? This article compares three different uncertainty intervals in R.

Uncertainty intervals

In an earlier post, I wrote, of an approximation:

For this example, that 95% (highest density) interval is from -0.1 to 8.1 points. This is alike to both numerical methods and classical approximations.

Suppose there are 1,000 users on each version of the web page. Each user converts or not, so the data follows a Binomial distribution. We set two independent prior distributions as uniform — with equal density between 0 and 1.

The first version (page A) had 300 conversions. The second version (B) had 340. The difference in conversion ratios is four percentage points.

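library(tidyverse)  # %>%, tibbles, rename(), mutate(), select()
library(tidybayes)  # mean_qi(), which is also available from ggdist

set.seed(4744)  # fix the seed so the simulated draws are reproducible
xA <- 300; nA <- 1000; alphaA <- 1; betaA <- 1
xB <- 340; nB <- 1000; alphaB <- 1; betaB <- 1
number_sims <- 50000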

The Beta distribution is a conjugate prior to our Binomial data. That means the prior and posterior distributions are in the same family.
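
The uniform prior set earlier is itself a Beta distribution, with both shape parameters equal to 1. A minimal sketch to confirm that in base R follows; the evaluation points and the `_check` names are my own choices for illustration.

```r
# Evaluate both densities at a few arbitrary points between 0 and 1
p_check <- c(0.1, 0.3, 0.5, 0.9)
beta_density <- dbeta(p_check, shape1 = 1, shape2 = 1)
unif_density <- dunif(p_check, min = 0, max = 1)

# TRUE if the two densities agree at every point
print(all.equal(beta_density, unif_density))
```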

There is a simple updating rule:

alphaA_post <- alphaA + xA; betaA_post <- betaA + nA - xA
alphaB_post <- alphaB + xB; betaB_post <- betaB + nB - xB
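
As a quick check on the updating rule, the sketch below compares the conjugate posterior for page A with a brute-force grid calculation of prior times likelihood. The grid spacing and the `_check` names are assumptions of mine for illustration, not part of the original analysis.

```r
# Figures for page A, as stated above
x_check <- 300; n_check <- 1000
alpha_check <- 1; beta_check <- 1

# Conjugate update: Beta(alpha + x, beta + n - x)
alpha_post_check <- alpha_check + x_check
beta_post_check <- beta_check + n_check - x_check

# Brute-force alternative: prior times likelihood on a fine grid
grid <- seq(0.0005, 0.9995, by = 0.001)
unnormalised <- dbeta(grid, alpha_check, beta_check) *
  dbinom(x_check, n_check, grid)
grid_post <- unnormalised / sum(unnormalised)

# Posterior mean from each route
conjugate_mean <- alpha_post_check / (alpha_post_check + beta_post_check)
grid_mean <- sum(grid * grid_post)
print(c(conjugate = conjugate_mean, grid = grid_mean))
```

The two means should agree closely, which is the practical benefit of conjugacy: the posterior is available in closed form, with no numerical integration.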

In R, we can run simulations to approximate the distribution:

pA <- rbeta(number_sims, alphaA_post, betaA_post)
pB <- rbeta(number_sims, alphaB_post, betaB_post)

# %>% binds more tightly than *, so scale first and then build the tibble
pB_minus_pA <- tibble(value = 100 * (pB - pA))
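
One way to judge whether 50,000 draws is enough is to compare the simulated mean difference with the exact difference in posterior means. This is a rough sketch in base R: the `_check` names are hypothetical, and the posterior parameters are typed in directly from the updating rule.

```r
set.seed(4744)
number_sims_check <- 50000

# Posterior draws: Beta(1 + 300, 1 + 700) and Beta(1 + 340, 1 + 660)
pA_check <- rbeta(number_sims_check, 301, 701)
pB_check <- rbeta(number_sims_check, 341, 661)
diff_check <- 100 * (pB_check - pA_check)

# Exact difference in posterior means, in percentage points
exact_diff <- 100 * (341 / 1002 - 301 / 1002)

# Monte Carlo estimate and its approximate standard error
sim_diff <- mean(diff_check)
sim_se <- sd(diff_check) / sqrt(number_sims_check)
print(c(exact = exact_diff, simulated = sim_diff, std_error = sim_se))
```

The standard error shrinks with the square root of the number of draws, so a hundredfold increase in simulations buys only one extra decimal place.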

We can then produce the credible interval, centred on the mean with equal tails:

pB_minus_pA_cred <- pB_minus_pA %>%
  mean_qi(mean = value) %>%
  rename(lower = .lower, upper = .upper) %>%
  mutate(name = "Numerical simulation", type = "Bayesian") %>%
  select(name, type, mean, lower, upper)
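
For readers without tidybayes, an equal-tailed interval can be sketched in base R with `quantile()`. This is an illustration under my own assumptions (hypothetical `_check` names, posterior parameters typed in directly), rather than the code behind the figures in this article.

```r
set.seed(4744)

# Posterior draws for each page, then the difference in percentage points
pA_check <- rbeta(50000, 301, 701)
pB_check <- rbeta(50000, 341, 661)
draws_check <- 100 * (pB_check - pA_check)

# Equal-tailed 95% interval: cut 2.5% of draws from each tail
interval_check <- quantile(draws_check, probs = c(0.025, 0.975))
print(c(mean = mean(draws_check), interval_check))
```

Equal tails are a design choice. A highest density interval instead picks the narrowest range holding 95% of the draws, and the two differ most when the posterior is skewed.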

Instead, what if we used an approximation? Kawasaki & Miyaoka describe one in a 2010 JJSS article. Computing it starts from the posterior means:

muA_post <- alphaA_post / (alphaA_post + betaA_post)
muB_post <- alphaB_post /…

Written by Anthony B. Masters