Simulating The Distribution of P-Values (With Downloadable R Code Notebook)
Introduction In this simulation, we will investigate the distribution of p-values when the null hypothesis is true. The idea is simply to simulate a sample size of…
Tell Me About Yourself
A Data Scientist answered an interview question… I had an hour to kill before my meeting, so I decided to stop by the university student center. That random visit turned…
What Are P-values And Why Are They So Problematic?
Introduction P-values are probably the most discussed statistical topic in history. They are often criticized, distrusted, misused, misinterpreted, and, on the flip side, used every day in every single empirical study. So,…
Adding the “Real” to Real-World Data
The Irony of “Real-World Data” Have you noticed how many companies now include “AI” as a buzzword on their websites or in their product descriptions? How many do you think…
Power Calculation: 8 Reasons Why You Should Care About It
What is a power calculation good for? Power calculation helps a researcher determine the required sample size for a study. It is done at the planning stage of a trial.…
The Complete Biostatistics Roadmap, or How to Become a Great Biostatistician (with Books and Resources)
Budding biostatisticians, curious and eager to learn more, often ask me to advise them on a developmental roadmap. Here is what I think you need to become a great…
The Influence of Confounding Variables in Observational Studies
Observational studies play an important role in understanding associations between exposures and outcomes, particularly in fields where randomized controlled trials (RCTs) may not be feasible due to ethical, practical, or…
Do Power Calculation Before Data Collection, Not After – With Downloadable R Code Notebook
Don’t Compute the Statistical Power of Your Experiment…Even if SPSS Allows It And Your Editor Requires It! Introduction Download the R Markdown notebook here used to generate this blog post…
Sample Size Calculation for Bioequivalence Tests: A powerTOST Tutorial With Downloadable R Code
Introduction (Downloadable Code) Sample size calculation is a fundamental aspect of research design, influencing the reliability and validity of study outcomes. Researchers often inquire about the methods and factors involved…
Once Upon a Time Series
In the summer of 2015, I started to feel hope. Not that I was in a bad place—not at all. In four months, I would go on to defend my…
Who’s the Culprit for Bias in Aging Clocks?
Predicting biological age with machine learning How and why do we age? This is a lofty biological question, so what might it have to do with machine learning and statistics?…
Crash course on confounding, bias, and deconfounding remedies using R
Introduction “In theory there is no difference between theory and practice. In practice there is.” (Yogi Berra) Confounding bias is one of the most ubiquitous challenges in estimating…