A/B testing is common in businesses. This test puts two versions of the same experience in contest with other another. People (such as website users) see one experience through random assignment.
The research question is simple to state: which version is better? This article compares three different uncertainty intervals in R.
In an earlier post, I wrote, of an approximation:
For this example, that 95% (highest density) interval is from -0.1 to 8.1 points. This is alike to both numerical methods and classical approximations.
Suppose there are 1,000 users on each version of the web page. Each user converts or…
Opinion polling is important for understanding what the public believe. It is also important for readers to recognise quality in survey estimates.
Self-selecting surveys allow people to put themselves in the sample. Sir Robert Worcester used the term voodoo polls to describe open access surveys.People could phone or send text messages to news organisations to take part. The digital era heralded two more types of self-selecting survey:
Statistics producers can make many different reports. Those reports can have distinctions, dependent on analytical choices. I look at two reporting differences for Covid-19 vaccinations in England and Wales.
NHS England produce two sets of reports on vaccinations: by day and by week. The latest weekly report goes up to 30th May 2021. These extensive analyses show vaccinations by gender, age, ethnicity and more.
We can compare those figures to the daily report for the same date. Daily reports contain only headline statistics:
What share of people have had a Covid-19 vaccine dose? For the United Kingdom on 25th May, different sources give different statistics:
There is a conceptual difference. PHE uses an Office for National Statistics estimate of the UK population aged 18 or over (~52.6m). Our World in Data uses the UN World Population Prospects 2019 estimate for 2020 (~67.9m).
Both are valid statistics. It is important to see the vaccine coverage of those eligible, and of the total population. Vaccine eligibility may change.
The birthday problem is: how large does a group need to be before it is more likely than not that two share a birthday?
Intuition may offer 183 people as an answer, since that is 365 divided by two rounded up. The answer is 23 or fewer.
A class of children announce their birthdays one at a time. It is simpler to calculate the probability of all birthdays being different. The probability of a birthday pair is then one minus the chance of all different birthdays.
Assume there are 365 possible birthdays, all with equal chance. The first child says…
How long do you expect to live? That is a key question which affects financial planning and health decisions. Life expectancy is a statistical measure of how long a person expects to live. That expectancy depends upon demographic factors, like age and sex.
There are two main ways to think about life expectancy:
The financial organisation Company Debt claimed:
Every adult in the UK will have to order 124 pints of beer this year to bring pubs back to their pre-COVID levels.
As a whole, the food and accommodation services industry fell by around 43% from 2019 to 2020. The impact of the pandemic remains stark:
The calculation goes as follows:
In recent reporting, BBC journalist Marianna Spring showed a sticker left by protesters:
A sticker reads:
“60–70% of hospital admissions and deaths are from people who have had 2 doses of the vaccines.”
That sticker provides a short link to “NHS SOURCE”. The document is not, in fact, from the National Health Service.
It is from the modelling sub-group advising the UK government. Despite the exact quote-marks, those words do not appear in the document. The sticker intends to paraphrase this part:
32. The resurgence in both hospitalisations and deaths is dominated by…
Excess deaths are deaths from all causes above a baseline. That baseline often represents an expected number of deaths.
Different institutions use different baselines:
Excess death calculations differ through choices of periods and baselines.
There many institutions collating COVID-19 surveillance deaths. There is no single source for frequent all-cause mortality.
The United Kingdom held elections on Thursday 6th May. As volunteers count votes, we can look at some graphs in election literature and on social media.
One way to understand rules of graphical integrity is to see them bent and broken. This is a short article. It is not a complete list of all poor data visualisation during the election.
One common fault with graphs in election leaflets are disproportionate bar charts.
Bar charts show their values through the length of the bars. When bar graphs do not start from zero, heights are disproportionate to the values.
This blog looks at the use of statistics in Britain and beyond. It is written by RSS Statistical Ambassador and Chartered Statistician @anthonybmasters.