How Long Should the Abstract Be? Data 61,429 from Examples

I analyzed a random sample of 61,429 full-text research papers, uploaded to PubMed Central between the years 2016 and 2021, in order to answer the questions:

What is the typical length of an abstract? and which factors influence it?

I used the BioC API to download the data (see the References section below).

Here’s a summary of the key findings

1. The median abstract was 263 words long (equivalent to 11 sentences, or 2 paragraphs), and 90% of the abstracts in the sample were between 163 and 416 words.

2. Longer research articles have slightly longer abstracts. Specifically, a research article that has 1000 more words, has an abstract that is 1.4% longer.

3. The length of the abstract does not differ between review articles and original research articles.

4. The quality of the journal does not influence the length of the abstract.

Overall length of the abstract

Here’s a table that describes the length of the abstract in terms of words, sentences, and paragraphs:

Abstract Length
Word CountSentence CountParagraph Count
Minimum19 words1 sentence1 paragraph
25th Percentile217 words9 sentences1 paragraph
50th Percentile (Median)263 words11 sentences2 paragraphs
Mean273.6 words11.2 sentences2.7 paragraphs
75th Percentile315 words13 sentences4 paragraphs
Maximum1,686 words73 sentences40 paragraphs

From these data, we can conclude that the abstracts in most research papers are between 217 and 315 words long (9 to 13 sentences).

If you are interested, here are the links to the articles with the shortest and longest abstracts.

The abstract is 7.15% the length of the entire article word count, and longer research articles have longer abstracts. In fact, I ran a Poisson regression model that predicts the abstract word count given the whole article word count, and the outcome was that: a research article that has 1000 more words, has an abstract that is 1.4% longer. To put that into perspective, an article that has 1000 words more than the median is associated with an abstract that is 3.5 words longer.

Length of the abstract for different article types

The following table shows the median word count of the abstract for different study designs:

Study designNumber of studies in the sampleMedian abstract word count
Case series140 studies273 words
Case-control443 studies275 words
Pilot study842 studies282 words
Case report407 studies285 words
Cross-sectional1,481 studies285 words
Quasi-experiment144 studies286 words
Meta-analysis3,529 studies287 words
Cohort5,180 studies288 words
Systematic review686 studies294 words
Randomized controlled trial689 studies301 words

The data show no clear pattern since the abstracts of review articles and original research articles have almost similar word counts. So we can conclude that there is no particular article type that requires a longer abstract.

Length of the abstract in different journals

In order to study the influence of the journal quality on the length of the abstract, I ran a Poisson regression that models the abstract word count given the journal impact factor. Here’s the model output:

VariablesCoefficientStandard errorp-value
(Intercept)5.627<0.001<0.001
Journal impact factor-0.004<0.001<0.001

The model shows that a higher journal impact factor is associated with a shorter abstract. Although statistically significant, this effect is practically negligible since a 1 unit increase in the journal impact factor is associated with a decrease of only 0.4% in the abstract word count. For the median article, this means that a 1 unit increase in the journal impact factor is associated with an approximate decrease of 1 word in the abstract.

References

  • Comeau DC, Wei CH, Islamaj Doğan R, and Lu Z. PMC text mining subset in BioC: about 3 million full text articles and growing, Bioinformatics, btz070, 2019.

Further reading