George Choueiry

I am Georges Choueiry, PharmD, MPH, PhD student in epidemiology.

How to Report a Chi-Square Test

The 3 main types of Chi-square tests are: Chi-square goodness-of-fit test: used to compare the distribution of a categorical variable (with more than 2 levels) to a hypothetical distribution. Chi-square homogeneity test: used to test whether 2 groups (coming from 2 different samples) have the same distribution regarding a certain categorical variable. Chi-square independence test: …

How to Report a Chi-Square Test Read More »

Checking the Popularity of 125 Statistical Tests and Models

I analyzed the methods sections of 43,110 randomly chosen research papers, uploaded to PubMed Central between the years 2016 and 2021, in order to check the popularity of 125 statistical methods in medical research. I used the BioC API to download the articles (see the References section below). Here’s a summary of the key findings …

Checking the Popularity of 125 Statistical Tests and Models Read More »

How Many References Should a Research Paper Have? Study of 96,685 Articles

I analyzed a random sample of 96,685 full-text research papers, uploaded to PubMed Central between the years 2016 and 2021, in order to answer the question: How many references should you cite when writing a research article? I used the BioC API to download the data (see the References section below). Here’s a summary of …

How Many References Should a Research Paper Have? Study of 96,685 Articles Read More »

Statistical Software Popularity in 40,582 Research Papers

I analyzed a random sample of 76,147 full-text research papers, uploaded to PubMed Central between the years 2016 and 2021, in order to check the popularity of statistical software among medical researchers. (I used the BioC API to download the articles — see the References section below). Out of these 76,147 research papers, only 40,582 …

Statistical Software Popularity in 40,582 Research Papers Read More »

Experimental vs Quasi-Experimental Design: Which to Choose?

Here’s a table that summarizes the similarities and differences between an experimental and a quasi-experimental study design:   Experimental Study (a.k.a. Randomized Controlled Trial) Quasi-Experimental Study Objective Evaluate the effect of an intervention or a treatment Evaluate the effect of an intervention or a treatment How participants get assigned to groups? Random assignment Non-random assignment …

Experimental vs Quasi-Experimental Design: Which to Choose? Read More »

Correlation vs Collinearity vs Multicollinearity

Here’s a table that summarizes the differences between correlation, collinearity and multicollinearity:   Correlation Collinearity Multicollinearity Definition Correlation refers to the linear relationship between 2 variables Collinearity refers to a problem when running a regression model where 2 or more independent variables (a.k.a. predictors) have a strong linear relationship Multicollinearity is a special case of …

Correlation vs Collinearity vs Multicollinearity Read More »

Standardized vs Unstandardized Regression Coefficients

Here’s a table that summarizes the similarities and differences between standardized and unstandardized linear regression coefficients:   Unstandardized β Standardized β Definition Unstandardized coefficients are obtained after running a regression model on variables measured in their original scales Standardized coefficients are obtained after running a regression model on standardized variables (i.e. rescaled variables that have …

Standardized vs Unstandardized Regression Coefficients Read More »

Neyman’s [Prevalence-Incidence] Bias: A Simple Explanation

Neyman’s bias, also known as prevalence-incidence bias, occurs when studying the relationship between an exposure and an outcome using prevalence of the outcome instead of incidence in cases where prevalence is a biased estimator of incidence. Reminder:Prevalence is the proportion of individuals who have the outcome/disease at a given time.Incidence (or risk) is the number …

Neyman’s [Prevalence-Incidence] Bias: A Simple Explanation Read More »

Temporal Bias in Research

Temporal bias occurs when we assume a wrong sequence of events which misleads our reasoning about causality. It mostly affects study designs where participants are not followed over time. The most common study designs that are subject to temporal bias are: Cross-sectional studies: Because information is collected at a single moment in time Case-control studies: …

Temporal Bias in Research Read More »

Which Variables Should You Include in a Regression Model?

When building a linear or logistic regression model, you should consider including: However, you should watch out for: Below we discuss each of these points in details. 1. Selecting variables based on background knowledge Advantages of using background knowledge to select variables How to choose variables based on background knowledge? You can find out whether …

Which Variables Should You Include in a Regression Model? Read More »