Too many digits: The presentation of numerical data

  • Archives of Disease in Childhood 100(7)

Tim J Cole at University College London

  • University College London

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations
  • AM J SPEECH-LANG PAT

Catriona M Steele

  • Ryan J. Burdick
  • Justine Dallal-York

Sophia Werden Abrams

  • Alyssa Jenn Estose
  • Melcris Opon

Emily Tabanao

  • CLIN CHEM LAB MED

Hanah Kim

  • Jeroen B J Smeets
  • PEDIATR BLOOD CANCER
  • Nancy Sherman
  • Muhammad Younus
  • Kevin Wolter
  • Elaine M Boyle
  • Ahmad Maqboul

Bakheet Elsadek

  • Ivan J. Golub
  • Rushabh M. Vakharia
  • Afshin E. Razi
  • AM J OBSTET GYNECOL

Nathalie Auger

  • Émilie Brousseau
  • William D. Fraser
  • J CARDIOVASC ELECTR
  • Carola Gianni

Mohanad Elchouemi

  • Amin Al-Ahmad
  • INT J NURS STUD

Tom Lang

  • Jinliang Zhu

Shengli Lin

  • Zwabragi Joel

David M Haas

  • Page Kirkpatrick

Nicky J Welton

  • William G. Hopkins

Alan M Batterham

  • J Martin Bland
  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up
  •  Sign into My Research
  •  Create My Research Account
  • Company Website
  • Our Products
  • About Dissertations
  • Español (España)
  • Support Center

Select language

  • Bahasa Indonesia
  • Português (Brasil)
  • Português (Portugal)

Welcome to My Research!

You may have access to the free features available through My Research. You can save searches, save documents, create alerts and more. Please log in through your library or institution to check if you have access.

Welcome to My Research!

Translate this article into 20 different languages!

If you log in through your library or institution you might have access to this article in multiple languages.

Translate this article into 20 different languages!

Get access to 20+ different citations styles

Styles include MLA, APA, Chicago and many more. This feature may be available for free if you log in through your library or institution.

Get access to 20+ different citations styles

Looking for a PDF of this document?

You may have access to it for free by logging in through your library or institution.

Looking for a PDF of this document?

Want to save this document?

You may have access to different export options including Google Drive and Microsoft OneDrive and citation management tools like RefWorks and EasyBib. Try logging in through your library or institution to get access to these tools.

Want to save this document?

  • More like this
  • Preview Available
  • Scholarly Journal

too many digits the presentation of numerical data

Too many digits: the presentation of numerical data

No items selected.

Please select one or more items.

Select results items first to use the cite, email, save, and export options

You might have access to the full article...

Try and log in through your institution to see if they have access to the full text.

Content area

Emperor Joseph II: My dear young man, don't take it too hard. Your work is ingenious. It's quality work. And there are simply too many notes, that's all. Just cut a few and it will be perfect.

Mozart: Which few did you have in mind, Majesty?

Emperor Joseph II: Well, there it is.

Quotation from the film Amadeus (1984)

As a statistical reviewer for Archives and BMJ I am interested in the presentation of numerical data. 1 It concerns me that numbers are often reported to excessive precision, because too many digits can swamp the reader, overcomplicate the story and obscure the message.

A number's precision relates to its decimal places or significant figures (or as preferred here, significant digits ). The number of decimal places is the number of digits to the right of the decimal point, while the number of significant digits is the number of all digits ignoring the decimal point, and ignoring all leading zeros and some trailing zeros (for a fuller definition see 'significant figures' on Wikipedia).

Ideally data should be rounded appropriately, not too much and not too little (one might call it Goldilocks rounding). 2 The European Association of Science Editors guidelines include the useful rule of thumb: "numbers should be given in (sic) 2-3 effective digits". 3

Take as an example the odds ratio (OR) of 22.68 (95% CI 7.51 to 73.67) comparing beta mimetics with placebo for side effects requiring a change of medication. 4 Its two decimal places and four significant digits are excessive when the effect size and confidence interval (CI) are so large. Reporting it rounded to two significant digits, as 23 (7.5 to 74), or even as 23 (8 to 70), with one significant digit for the CI, would be simpler and clearer.

There are several published recommendations (or reporting rules ) about rounding numbers, some of which relate to decimal places (eg, the Cochrane Style Guide 5 or APA Style 6 to round to two decimal places), some to significant digits (eg, the European Association of Science Editors guideline above 3 ) and some to a combination of the two (eg, setting the number of decimal places to ensure two significant digits for the standard deviation (SD) 7 ). However,...

You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer

Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer

Suggested sources

  • About ProQuest
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • DOI: 10.1136/archdischild-2014-307149
  • Corpus ID: 9550329

Too many digits: the presentation of numerical data

  • Published in Archives of Disease in… 15 April 2015
  • Mathematics

97 Citations

Goldilocks rounding: achieving balance between accuracy and parsimony in the reporting of relative effect estimates, missing the point: are journals using the ideal number of decimal places, how many decimals rounding descriptive and inferential statistics based on measurement precision, the bias and precision of reporting the average age of human participants., merging graphics and text to better convey experimental results: designing an “enhanced bar graph”, systemisers are better at maths, the principles of biomedical scientific writing: results, toward a more reliable characterization of fractal properties of the cerebral cortex of healthy subjects during the lifespan, setting number of decimal places for reporting risk ratios: rule of four, interrater and intrarater agreement and reliability of ratings made using the zaidi–dayal and richards–jabbour scales for the shape of the foramen magnum, 15 references, statistics notes: presentation of numerical data, publication manual of the american psychological association, improving tabular displays, with naep tables as examples and inspirations, ease guidelines for authors and translators of scientific articles to be published in english, the effects of weathering demonstrated by maternal age on low birth weight outcome in babies, speed of updating online evidence based point of care summaries: prospective cohort analysis, tocolytic therapy for preterm delivery: systematic review and network meta-analysis, effect of in vitro culture period on birthweight of singleton newborns., related papers.

Showing 1 through 3 of 0 Related Papers

  • Advanced search

Deposit your research

  • Open Access
  • About UCL Discovery
  • UCL Discovery Plus
  • REF and open access
  • UCL e-theses guidelines
  • Notices and policies

UCL Discovery download statistics are currently being regenerated.

We estimate that this process will complete on or before Mon 06-Jul-2020. Until then, reported statistics will be incomplete.

Too many digits? The presentation of numerical data

Green open access


Arch Dis Child-2015-Cole-608-9.pdf
|
Type: Article
Title: Too many digits? The presentation of numerical data
Open access status: An open access version is available from UCL Discovery
DOI:
Publisher version:
Additional information: This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http:// creativecommons.org/licenses/by/4.0/
UCL classification:
> >
> > > >
> > > > >
URI:

too many digits the presentation of numerical data

Archive Staff Only

View Item
  • Freedom of Information
  • Accessibility
  • Advanced Search
  • Econometrics

Too many digits: the presentation of numerical data

Related documents.

Chapter 1: Significant Digits and Rounding Numbers

Add this document to collection(s)

You can add this document to your study collection(s)

Add this document to saved

You can add this document to your saved list

Suggest us how to improve StudyLib

(For complaints, use another form )

Input it if you want to receive answer

Equator network

Enhancing the QUAlity and Transparency Of health Research

  • Courses & events
  • Librarian Network
  • Search for reporting guidelines

Use your browser's Back button to return to your search results

Too many digits: the presentation of numerical data

Reporting guideline provided for?
(i.e. exactly what the authors state in the paper)
Recommendations for rounding summary statistics.
Full bibliographic reference Cole TJ. Too many digits: the presentation of numerical data. Arch Dis Child. 2015;100(7):608-609.
Language English
PubMed ID
Relevant URLs
(full-text if available)
The full-text of this reporting guideline is freely available from:
Statistical methods and analyses
December 17, 2021

Reporting guidelines for main study types

Translations

Some reporting guidelines are also available in languages other than English. Find out more in our Translations section .

  • About the Library

For information about Library scope and content, identification of reporting guidelines and inclusion/exclusion criteria please visit About the Library .

Visit our Help page for information about searching for reporting guidelines and for general information about using our website.

Library index

  • What is a reporting guideline?
  • Browse reporting guidelines by specialty
  • Reporting guidelines under development
  • Translations of reporting guidelines
  • EQUATOR Network reporting guideline manual
  • Reporting guidelines for animal research
  • Guidance on scientific writing
  • Guidance developed by editorial groups
  • Research funders’ guidance on reporting requirements
  • Professional medical writing support
  • Research ethics, publication ethics and good practice guidelines
  • Links to other resources

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Malays J Med Sci
  • v.23(5); 2016 Sep

Reporting Statistical Results in Medical Journals

Wan nor arifin.

1 Statistical Editors, Malaysian Journal of Medical Sciences, Penerbit Universiti Sains Malaysia, 11800 USM, Pulau Pinang, Malaysia

2 Unit of Biostatistics and Research Methodology, School of Medical Sciences, Universiti Sains Malaysia, 16150 Kubang Kerian, Kelantan, Malaysia

Abdullah Sarimah

Bachok norsa’adah, yaacob najib majdi, ab hamid siti-azrin, musa kamarul imran.

3 Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia,16150 Kubang Kerian, Kelantan, Malaysia

Abd Aziz Aniza

4 Unit of Community Medicine, Faculty of Medicine, Universiti Sultan Zainal Abidin, 20400 Kuala Terengganu, Terengganu, Malaysia

5 PAPRSB Institute of Health Sciences, Universiti Brunei Darussalam, Gadong BE 1410, Brunei

Statistical editors of the Malaysian Journal of Medical Sciences (MJMS) must go through many submitted manuscripts, focusing on the statistical aspect of the manuscripts. However, the editors notice myriad styles of reporting the statistical results, which are not standardised among the authors. This could be due to the lack of clear written instructions on reporting statistics in the guidelines for authors. The aim of this editorial is to briefly outline reporting methods for several important and common statistical results. It will also address a number of common mistakes made by the authors. The editorial will serve as a guideline for authors aiming to publish in the MJMS as well as in other medical journals.

Introduction

Year over year, statistical editors of the Malaysian Journal of Medical Sciences (MJMS) must go through many submitted manuscripts, scrutinising the statistical and methodological soundness of the manuscripts. In 2015 alone, the MJMS received 272 manuscripts from many different countries, 52% of which were original articles ( 1 ). However, the editors have noted many different styles of reporting statistical results, and these styles are not standardised among authors. This has caused unnecessary difficulties for the editors as they have to comment not only on the methods and statistics used, but also on the technical and formatting aspects of the manuscripts. This lack of standardised reporting also causes delay in reviewing and accepting submitted articles. Admittedly, this could be due to a lack of clear written instructions on reporting statistics in the guidelines for authors. Although there are a number of guidelines available on reporting statistical results, for examples in Lang and Altman ( 2 ) and Cummings and Rivara ( 3 ), the editors of the MJMS found them incomplete as guidelines for authors.

The aim of this editorial is to outline reporting methods for several important and common statistical results. It will also address a number of common mistakes made by authors.

Presentation Forms

Statistical results can be presented in text, table, or figure form. The decision depends very much on the amount of information the authors want to present to the readers.

The text form is suitable for brief results, for example, the description of a sample (“A total of 100 patients were recruited,” “Most of the respondents were female...”). Text form is also used to highlight important results in tables that might be missed by readers given the amount of information commonly summarised in tables, for example, “Among all the studied factors, only gender and salary were found to be significantly associated with...”.

The table form is suitable for presentation of detailed statistical results. Common examples are detailed demographic profiles of study participants, results of a multiple logistic regression analysis, and cross-tabulation of factors with outcomes. It is very important to note that the table description is placed at the top of the table, while the list of abbreviations and additional relevant descriptions (especially related to statistical analysis) are placed below the table as footnotes. The footnotes should be indicated by superscript Roman letters (a, b, c, ...) instead of symbols or numbers. All abbreviations used in the table must be described again in the footnotes, although the abbreviations were already described in text or earlier tables.

The figure form includes charts, graphs, and other images. It should be reserved for results that are more presentable in this form, for example, trends or geographical distribution of disease, histopathological or radiological images, and comparison of means over time. Figure descriptions are placed below the figure.

Descriptive Statistics

The descriptive statistics summarise data from a sample, for example, demographic profiles. Whenever there are a number of groups, it is useful to provide the descriptive statistics by group and for the overall sample. This gives a visual impression of the comparability of the groups in term of their baseline characteristics. It is not necessary to report statistical tests and P -values in such a summary because the main concern is the comparability of the participants (which reflects the sampling), not the populations.

Depending on the types of variables, authors should present the appropriate descriptive statistics. For numerical variables, if the variable is normally distributed, the mean and standard deviation (SD) are presented. In the text, this is reported as mean (SD = value), for example, “the mean age was 46.5 (SD = 3.0).” In a table, the “mean (SD)” statement is included in the header. Whenever the variable is not normally distributed, the median and inter-quartile range (IQR) are reported instead. The use of “±” symbol between a mean and an SD must be avoided because the mathematical symbol has its own specific meaning. For the categorical variable, count ( n ) and percentage (%) are presented. In addition, authors must report the group size and total sample size, written as n = size in the table headers and the table description, respectively. The use of a capital N in place of n must be avoided as it refers to population size instead of sample size. A typical demographic table is presented in Table 1 .

Patient demographics ( n = 95)

VariablesDrug X ( = 45)
(%)
Placebo ( = 50)
(%)
Total
(%)
Age (years) 45.3 ( 2.6)47.8 ( 3.2)46.5 ( 3.0)
GenderMale25 (55.6)25 (50.0)50 (52.6)
Female20 (44.4)25 (50.0)45 (47.4)
BMI groupsUnderweight (BMI < 18.5)10 (22.2)11 (24.0)21 (22.1)
Normal (BMI 18.5 to 24.9)12 (26.7)13 (28.0)25 (26.3)
Overweight (BMI ≥ 25)23 (51.1)26 (48.0)49 (51.6)

Confidence Interval

Precision of the estimates, for example, single mean and proportion, are presented in the form “estimate (95% CI: lower limit, upper limit)”. In writing, for the single mean, “the mean body mass index (BMI) was 22.5 (95% CI: 21.5, 23.5)” and for the single proportion/percentage “the prevalence of obesity was 34.5% (95% CI: 30.5%, 38.5%)”. Other common examples are the reporting of mean difference (independent t -test) and odds ratio (logistic regression), which are presented under the specific statistical tests section below.

Common Statistical Tests

In order to standardise the reporting and presenting of statistical results in the MJMS, the editors offer the suggested forms of presentation summarised in Table 2 as general guidelines.

Presenting statistical results

Statistical testsTable formFigure form
Independent -test Comparison of systolic blood pressure between intervention and control groups.-
SBP (mmHg)Intervention
= 40
Control
= 40
10.0 (7.5, 12.6)7.83 (78)< 0.001
119.4 (5.06)109.4 (6.34)
SBP = systolic blood pressure. Independent test.
Paired -test Comparison of systolic blood pressure pre- and post-treatment.-

SBP (mmHg)PrePost−11.5 (−13.1, −9.9)−14.92 (29)< 0.001
136.5 (9.72)125.0 (7.64)
SBP = systolic blood pressure. Paired -test.
One-way ANOVA Comparison of mean weight between the four diet plans.
Okinawa Diet1065.5 (9.98)13.41 (3, 36)< 0.001
Eastern Diet1075.4 (4.17)
Western Diet1077.9 (5.70)
Fast food Diet1083.9 (5.07)
One-way ANOVA, Post-hoc analysis with Bonferroni corrections shows significant difference between Okinawa diet and other diet plans ( < 0.001) and between Eastern diet and Western diet ( = 0.041).
Pearson’s correlation Correlation between the study variables ( = 155).
Age (years) < 0.001 0.031
BMI (kg/m )0.55 0.008
SBP (mmHg)0.71 0.33
BMI = body mass index, SBP = systolic blood pressure. SD, -values, correlation coefficient ( ).
Linear regression Factors associated with systolic blood pressure (mmHg) ( = 150).-
BMI (kg/m )9.4 (8.6, 10.2)< 0.001
Age (years)2.5 (1.5, 3.5)0.004
BMI = body mass index. Adjusted regression coefficients, Multiple linear regression ( = 0.65).
Chi-square test Association between gender and disease status-
GenderMale20 (80.0)5 (20.0)258.33 (1)0.004
Female10 (40.0)15 (60.0)25
Chi-square test for independence
McNemar Status of skin lesion pre- and post-treatment.-
PreLesion30 (37.5)18 (22.5)8011.34 (1)< 0.001
No lesion2 ( 2.5)30 (37.5)
McNemar’s Chi-squared test with continuity correction.
Logistic regression Associated factors of coronary artery disease ( = 250).-
Diastolic Blood Pressure (mmHg)0.051.05 (1.02, 1.08)<0.001
GenderMale vs Female 0.812.24 (1.04, 4.82)0.045
OR = odds ratio. Likelihood ratio test, the reference category.
Diagnostic test Sensitivity and specificity values of selected tumor markers ( = 435).
Tumor marker A (at 20 ng/mL)78.590.20.75 (0.71, 0.79)< 0.001
Tumor marker B (at 35 ng/mL)60.150.30.54 (0.51, 0.57)0.004
Tumor marker C (at 12 ng/mL)55.581.10.45 (0.38, 0.52)0.919
AUC = Area under the curve. Null hypothesis: true area = 0.5.

Additional Concerns

In text, the P -value is written as an italic capital P followed by the value, while as a table header, it should be written as P -value. The authors should write the value instead of reporting the result as “not significant” or “NS” ( 3 ). For example, “the comparison was significant, with P = 0.003”. Three decimal places are preferred in the MJMS for all ranges of P -values. The editors are aware of different guidelines on the number of decimal places of P -values, for example, as given in Cummings and Rivara ( 3 ) and Cole ( 4 ).

Italic formatting of statistical tests and coefficients

Statistical tests that are named after the statistical distributions on which they are based, for example, t -test, F -test, and χ 2 -test, are italicised. In addition, coefficients, for example, r (Pearson’s correlation coefficient), R 2 ( R -squared), and α (Cronbach’s alpha) are also italicised.

Statistical analysis

Computer programs used for statistical analysis should be described, specifically, the name of the program and the version should be given as well as the specific add-on packages if applicable. For example, “IBM SPSS for Windows version 22.0 was...” and “ psych version 1.5.8 and lavaan version 0.5–20 packages were used in the R software environment.” The statistical analysis used should be described in sufficient detail to reproduce the analysis ( 2 ), particularly the name of the analysis, its relation to the aims of the study, and the dependent and independent variables. In addition, Lang and Altman ( 2 ) outlined in greater detail general principles of reporting statistical methods.

Formatting and presentation of numbers

In general, one decimal place is used for percentage values. Use two or more decimal places for percentage values less than 1.0%. For descriptive statistics of numerical data, add one additional decimal place to the original data. For example, if cholesterol level is reported with one decimal place (e.g. 4.8 mmol/L), the mean and SD should be reported with two decimal places (e.g. mean = 4.82, SD = 2.11 mmol/L). Use two decimal places for test statistics values, for example, values of t -statistic, F -statistic, and χ 2 -statistic.

Using a dash “−” in between any two numbers must be avoided as it could be mistaken for a minus or negative sign. For example, authors should write “the age ranges between 20 to 29 years old” instead of “the age ranges between 20 – 29 years old”. In relation to formatting of numbers in tables, the last digits of numbers must be right-aligned. The formatting is demonstrated in Table 1 and ​ and2 2 .

Closing Remarks

This editorial outlines the basics of reporting statistical results in medical journals. This editorial will serve as a guide to authors aiming to publish in the MJMS. Given the availability of the guidelines on reporting statistical results, it is hoped that the authors follow the guidelines to ensure standardisation of the submitted manuscripts. This will shorten the process of reviewing and accepting manuscripts submitted to the MJMS.

  • - Google Chrome

Intended for healthcare professionals

  • My email alerts
  • BMA member login
  • Username * Password * Forgot your log in details? Need to activate BMA Member Log In Log in via OpenAthens Log in via your institution

Home

Search form

  • Advanced search
  • Search responses
  • Search blogs
  • Statistics Notes:...

Statistics Notes: Presentation of numerical data

  • Related content
  • Peer review
  • Douglas G Altman , head a ,
  • J Martin Bland , professor of medical statistics b
  • a IRCF Medical Statistics Group, Centre for Statistics in Medicine, Institute of Health Sciences, PO Box 777, Oxford OX3 7LF
  • b Department of Public Health Sciences, St George's Hospital Medical School, London SW17 0RE
  • Correspondence to: Mr Altman.

The purpose of a scientific paper is to communicate, and within the paper this applies especially to the presentation of data.

Continuous data, such as serum cholesterol concentration or triceps skinfold thickness, can be summarised numerically either in the text or in tables or plotted in a graph. When numbers are given there is the problem of how precisely to specify them. As far as possible the numerical precision used should be consistent throughout a paper and especially within a table. In general, summary statistics such as means should not be given to more than one extra decimal place over the raw data. The same usually applies to measures of variability or uncertainty such as the standard deviation or standard error, though greater precision may be warranted for these quantities as they are often used in further calculations. Similar comments apply to the results of regression analyses, where spurious precision should be avoided. For example, the regression equation 1

birth weight=-3.0983527 + 0.142088xchest circumf + 0.158039 x midarm circumf, purports to predict birth weight to 1/1000000 g.

Categorical data, such as disease group or presence or absence of symptoms, can be summarised as frequencies and percentages. It can be confusing to give percentages alone, as the denominator may be unclear. Also, giving frequencies allows percentages to be given as integers, such as 22%, rather than more precisely. Percentages to one decimal place may sometimes be reasonable, but not in small samples; greater precision is unwarranted. Such data rarely need to be shown graphically.

Test statistics, such as values of t or χ 2 , and correlation coefficients should be given to no more than two decimal places. Confidence intervals are better presented as, say, “12.4 to 52.9” because the format “12.4-52.9” is confusing when one or both numbers are negative. P values should be given to one or two significant figures. P values are always greater than zero. Because computer output is often to a fixed number of decimal places P=0.0000 really means P<0.00005—such values should be converted to P<0.0001. P values always used to be quoted as P<0.05, P<0.01, and so on because results were compared with tabulated values of statistical distributions. Now that most P values are produced by computer they should be given more exactly, even for non-significant results—for example, P=0.2. Values such as P=0.0027 can be rounded up to P=0.003, but not in general to P<0.01 or P<0.05. In particular, the use of P<0.05 (or, even worse, P=NS) may conceal important information: there is minimal difference between P=0.06 and P=0.04. In tables, however, it may be necessary to use symbols to denote degrees of significance; a common system is to use *, **, and *** to mean P<0.05, 0.01, and 0.001 respectively. Mosteller gives a more extensive discussion of numerical presentation. 2

The choice between using a table or figure is not easy, nor is it easy to offer much general guidance. Tables are suitable for displaying information about a large number of variables at once, and graphs are good for showing multiple observations on individuals or groups, but between these cases lie a wide range of situations where the best format is not obvious. One point to consider when contemplating using a figure is the amount of numerical information contained. A figure that displays only two means with their standard errors or confidence intervals is a waste of space as a figure; either more information should be added, such as the raw data (a really useful feature of a figure), or the summary values should be put in the text.

In tables information about different variables or quantities is easier to assimilate if the columns (rather than the rows) contain like information, such as means or standard deviations. Interpretation of tables showing data for individuals (or perhaps for many groups) is aided by having the data ordered by one of the variables—for example, by the baseline value of the measurement of interest or by some important prognostic characteristic.

  • Bhargava SK ,
  • Mohan MAN ,
  • Sachdev HPS
  • Bailar JC ,
  • Mosteller F

too many digits the presentation of numerical data

  • Search Menu
  • Sign in through your institution
  • Acute Care Surgery
  • Breast Surgery
  • Cardiothoracic Surgery
  • Experimental Science
  • General Surgery
  • Hepato-Pancreato-Biliary Surgery
  • Lower Gastrointestinal Surgery
  • Orthopaedics
  • Paediatric Surgery
  • Plastic Surgery
  • Transplantation
  • Upper Gastrointestinal Surgery
  • Vascular Surgery
  • Abstract Supplements
  • Scientific Surgery
  • Author Videos
  • Digital Collections
  • Highly Cited Collection
  • Author Guidelines
  • Submission Site
  • Open Access Options
  • Self-Archiving Policy
  • Why Publish
  • Diversity, Equity, Inclusion, and Accessibility
  • Editorial Board
  • Advertising & Corporate Services
  • Strategic Partners
  • Journals on Oxford Academic
  • Books on Oxford Academic

Issue Cover

Article Contents

  • < Previous

The art of reporting numerical data

ORCID logo

  • Article contents
  • Figures & tables
  • Supplementary Data

Jonathan A Cook, Dongquan Bi, Jonas Ranstam, The art of reporting numerical data, British Journal of Surgery , Volume 109, Issue 6, 16 May 2022, Pages 548–549, https://doi.org/10.1093/bjs/znac028

  • Permissions Icon Permissions

It is unfortunate that in English and a number of other languages, we use the same term ‘statistics’ to refer to both numerical data (e.g. national statistics) and also the science of collecting and analysing data (e.g. a degree in statistics). The former can be very mundane and routine, whereas the latter, which utilizes the former to understand variability and carry out inference, is enlightening and often sobering. The reporting of numerical data should be informed by statistical principles (the science of statistics). One area where this can be counterintuitive is the level of numerical precision to report data (both of individual values and summaries like means and standard deviations). A review of three recent BJS articles identified over 1000 statistics across the articles, which either summarized data from the respective study or reported an output from an analysis of study data. Often in manuscripts submitted to BJS , values are reported to an excessive level of numerical precision.

Simplicity, neatness of presentation, a desire to report fully numerical calculations, or a false understanding of the value of the data collected, can all lead to reporting data to an excessive level of precision. For example, it is tempting but somewhat misleading to report 13 out of 39 as (33.33 per cent). The use of two decimal points (four significant figures) gives the false impression of greater precision than really occurred. In this example, each observation accounted for over 3 per cent of the final percentage. There is a quirk of numerical data that percentages will often not add up to exactly 100 per cent where there are three or more categories. However, as long as the number of observations within each category is accurately reported with the percentage in the group, this apparent discrepancy is but a small price to pay for greater clarity and a more honest representation of the data. A simple rule of thumb is that if there are less than 100 observations in a sample, reporting percentages to fractions of a per cent is not helpful. Arguably even for larger samples, it is rarely necessary except perhaps for reporting the extremes (e.g. 0.1 versus 0 per cent). Many people will find tables filled with redundant decimal points more difficult to read and understand, and find themselves lost in a sea of unnecessary figures. The number of people who are able to differentiate between tenths of a per cent, let alone 100 of a per cent, are, in our view rather miniscule. Equivalent arguments can be made for reporting proportions where two decimal points should be the standard level of precision.

Continuous data are perhaps the hardest to fairly report; to do so requires an understanding of the quantity measured, how well it can be measured, how it is or might be used, and how well the original data were recorded. While an operation time reported as 134.682341 min can be occur and could be recorded as such, it is of no greater value (and certainly of lesser clarity) to do so than recording it as simply 135 min. You would be a brave person to believe an operation time is accurately recorded to this level of precision (microseconds) in practice. For the poor reader of papers, this is even more so the case when it is recognized that operating practices and the recording of the operation time tend to vary greatly between surgeons, surgical teams, and institutions. My 135 min could easily be your 140 min. Reporting mean operation times beyond minutes is of little clinical value irrespective of how large the study sample size is.

Another area where excessive decimal points are commonly used is in reporting P -values as if more zeros create stronger evidence. This probably reflects both some misunderstanding of what a P -value is, which is an indirect measure of evidence against a null hypothesis, and not a measure of the strength, nor the magnitude 1 of statistical disagreement (only of the unlikeliness of compatibility assuming the null hypothesis is true). If the conventional statistical approach is being used with a prior statistical significance level specified (say, the typical two-sided 5 per cent level), then reporting it to a sufficient level to see how the P -value relates to that marker of statistical significance is what is important. As a consequence, reporting a P -value as ‘ P < 0.05’ or ‘n.s.’ does not provide sufficient detail as it is unclear how close or not the P -value was to the cut-off point. However, once the value is some distance from the cut-off marker, further precision is of little consequence. P -values above 0.1 can happily be reported to one decimal point (e.g. 0.2), and if 0.05 is the significance level of interest, then the differences between P -values <0.001 are of little interest. Where large numbers of hypotheses are tested (e.g. analyses of genetic data), the significance level of interest should be reduced 2 , but the same principle can be applied to the corresponding lower significance level. Here, as with reporting other outputs of statistical tests and models such as a mean difference or an odds ratio with confidence intervals, the level of precision that is appropriate should be determined by the question we are seeking to answer and how the findings might be applied, as well as the level of precision the data can bear.

Helpful guidance on reporting numbers is available elsewhere 3 for a range of statistical metrics beyond those covered here. It is difficult to be too prescriptive as surgery and science are wonderfully complex. Nevertheless, it would benefit our research and the readers of it if we were more circumspect in reporting our data and a bit more humble in our data presentation.

Disclosure . The authors declare no conflict of interest.

Cook   JA , Ranstam   J . Statistical methods that provide an effect size are to be preferred . Br J Surg   2016 ; 103 : 1365

Google Scholar

Cook   JA , Ranstam   J . Spurious findings . Br J Surg   2017 ; 104 : 97

Cole   TJ . Too many digits: the presentation of numerical data . Arch Dis Child   2015 ; 100 : 608 – 609

Month: Total Views:
March 2022 48
April 2022 8
May 2022 76
June 2022 65
July 2022 38
August 2022 23
September 2022 27
October 2022 22
November 2022 22
December 2022 31
January 2023 13
February 2023 11
March 2023 16
April 2023 12
May 2023 12
June 2023 4
July 2023 14
August 2023 13
September 2023 8
October 2023 7
November 2023 13
December 2023 7
January 2024 9
February 2024 5
March 2024 11
April 2024 9
May 2024 12
June 2024 8
July 2024 8
August 2024 8
September 2024 4

Email alerts

Citing articles via.

  • Recommend to Your Librarian
  • Advertising & Corporate Services
  • Journals Career Network

Affiliations

  • Online ISSN 1365-2168
  • Copyright © 2024 BJS Foundation Ltd.
  • About Oxford Academic
  • Publish journals with us
  • University press partners
  • What we publish
  • New features  
  • Open access
  • Institutional account management
  • Rights and permissions
  • Get help with access
  • Accessibility
  • Advertising
  • Media enquiries
  • Oxford University Press
  • Oxford Languages
  • University of Oxford

Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide

  • Copyright © 2024 Oxford University Press
  • Cookie settings
  • Cookie policy
  • Privacy policy
  • Legal notice

This Feature Is Available To Subscribers Only

Sign In or Create an Account

This PDF is available to Subscribers Only

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

Presentation of numerical data

Affiliation.

  • 1 IRCF Medical Statistics Group, Centre for Statistics in Medicine, Institute of Health Sciences, Oxford.
  • PMID: 8595293
  • PMCID: PMC2350327
  • DOI: 10.1136/bmj.312.7030.572

PubMed Disclaimer

Similar articles

  • [Presentation of statistical data in nursing. 8--Scatter plots (I): correlations become clear]. Ostermann R, Wilhelm AF, Wolf-Ostermann K. Ostermann R, et al. Pflege Z. 2004 Aug;57(8):568-70. Pflege Z. 2004. PMID: 15446341 German. No abstract available.
  • [Presentation of statistical data in nursing 12: with reflection and estimation]. Ostermann R, Wilhelm AF, Wolf-Ostermann K. Ostermann R, et al. Pflege Z. 2004 Dec;57(12):862-4. Pflege Z. 2004. PMID: 15646111 German. No abstract available.
  • Statistical tools in published articles of a public health journal in 2013 and 2014: bibliometric cross-sectional study. Arcila Quiceno V, García Restrepo E, Gómez Rúa N, Montenegro Martínez G, Silva Ayçaguer LC. Arcila Quiceno V, et al. Medwave. 2015 Aug 31;15(7):e6238. doi: 10.5867/medwave.2015.07.6238. Medwave. 2015. PMID: 26460577 English, Spanish.
  • Statistics from the inside. 6. Data structures (continued). Healy MJ. Healy MJ. Arch Dis Child. 1992 Jun;67(6):757-9. doi: 10.1136/adc.67.6.757. Arch Dis Child. 1992. PMID: 1626999 Free PMC article. Review. No abstract available.
  • Statistics: how many? Scott M, Flaherty D, Currall J. Scott M, et al. J Small Anim Pract. 2012 Jul;53(7):372-6. doi: 10.1111/j.1748-5827.2012.01231.x. J Small Anim Pract. 2012. PMID: 22747729 Review.
  • Analysis of proportions using arcsine transform with any experimental design. Laurencelle L, Cousineau D. Laurencelle L, et al. Front Psychol. 2023 Jan 30;13:1045436. doi: 10.3389/fpsyg.2022.1045436. eCollection 2022. Front Psychol. 2023. PMID: 36793367 Free PMC article.
  • Impact of the COVID-19 pandemic on prescription refills for immune-mediated inflammatory disorders: a time series analysis (January 2019 to January 2021) using the English Prescribing Dataset. Barrett R, Barrett R, Lin SX, Culliford D, Fraser S, Edwards CJ. Barrett R, et al. BMJ Open. 2022 Dec 23;12(12):e051936. doi: 10.1136/bmjopen-2021-051936. BMJ Open. 2022. PMID: 36564115 Free PMC article.
  • A CHecklist for statistical Assessment of Medical Papers (the CHAMP statement): explanation and elaboration. Mansournia MA, Collins GS, Nielsen RO, Nazemipour M, Jewell NP, Altman DG, Campbell MJ. Mansournia MA, et al. Br J Sports Med. 2021 Sep;55(18):1009-1017. doi: 10.1136/bjsports-2020-103652. Epub 2021 Jan 29. Br J Sports Med. 2021. PMID: 33514558 Free PMC article.
  • Imaging Software Programs for Reliable Mathematical Measurements in Orthodontics. Radwan ES, Scribante A, Sfondrini MF, Montasser MA. Radwan ES, et al. Dent J (Basel). 2020 Aug 3;8(3):81. doi: 10.3390/dj8030081. Dent J (Basel). 2020. PMID: 32756303 Free PMC article.
  • Development of a Greek Oral health literacy measurement instrument: GROHL. Taoufik K, Divaris K, Kavvadia K, Koletsi-Kounari H, Polychronopoulou A. Taoufik K, et al. BMC Oral Health. 2020 Jan 15;20(1):14. doi: 10.1186/s12903-020-1000-5. BMC Oral Health. 2020. PMID: 31941482 Free PMC article.
  • Br Med J (Clin Res Ed). 1985 Dec 7;291(6509):1617-9 - PubMed
  • Search in MeSH

Related information

  • Cited in Books

LinkOut - more resources

Full text sources.

  • Europe PubMed Central
  • Ovid Technologies, Inc.
  • PubMed Central
  • MedlinePlus Health Information
  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

Stack Exchange Network

Stack Exchange network consists of 183 Q&A communities including Stack Overflow , the largest, most trusted online community for developers to learn, share their knowledge, and build their careers.

Q&A for work

Connect and share knowledge within a single location that is structured and easy to search.

Should percentages be reported with decimal places?

When presenting data using a percentage, is it a good thing to have decimal places, say 2 decimal places instead of rounding off to whole numbers?

For example, instead of 23.43%, you round off to 23%.

I am looking at this from the perspective of whether the 2 decimal places accuracy will make much difference since we are dealing with percentage and not raw data value.

Silverfish's user avatar

  • 1 $\begingroup$ There are many fields where small percents are so small that people use parts per million, per billion and so forth. Either that's a different question -- because people do or should know that citing numbers like 0.000001 or even 0.0001% is in that circumstance silly and one should use different units -- or it's another answer to this question. When some or all of the numbers of interest are very small, large numbers of decimal places may be essential as well as informative. $\endgroup$ –  Nick Cox Commented Nov 8, 2018 at 17:47
  • $\begingroup$ This is a special case of the issues discussed at stats.stackexchange.com/questions/8734 . Note that this question concerns precision: accuracy is a different matter altogether. See gis.stackexchange.com/a/8674/664 for the distinction. $\endgroup$ –  whuber ♦ Commented Nov 8, 2018 at 18:10

4 Answers 4

It depends on the size of the differences between classes. In most applications, saying the 73% prefer option A and 27% prefer option B is perfectly acceptable. But if you're dealing in an election where candidate X has 50.15% of votes and candidate Y has 49.86%, the decimal places are very much necessary.

Of course, you need to take care to make sure that all classes add up to 100%. In my electoral example above, they add up to 100.01%. In that case you might even consider adding a third decimal place.

Carlos Accioly's user avatar

  • 1 $\begingroup$ I'd also say it depends on your data and your goals. If I were briefing an executive on the sources of customer complaints, no way I'd be talking about fractions of a percent. You also have to consider the margins of error: saying 50.15% of voters prefer candidate X +/- 5% is suspicious. $\endgroup$ –  Wayne Commented Mar 6, 2014 at 15:22
  • 2 $\begingroup$ If there are just two percents, rounding and adding to 100% are completely compatible. See e.g. statweb.stanford.edu/~cgates/PERSI/papers/freedman79.pdf $\endgroup$ –  Nick Cox Commented Mar 6, 2014 at 15:31
  • $\begingroup$ @NickCox That's a nice find! In case that link goes dead, for future readers the reference is "On Rounding Percentages", Persi Diaconis; David Freedman, Journal of the American Statistical Association, Vol. 74, No. 366. (Jun., 1979), pp. 359-364. Despite the rather broad title, it deals with the probability a table of rounded percentages "correctly" sum to 100. The probability declines to around 3/4 with 3 categories, around 2/3 with 4 categories, and $\sqrt{6/\pi c}$ with $c$ categories $\endgroup$ –  Silverfish Commented Mar 30, 2018 at 19:23
  • 1 $\begingroup$ I actually don't think it is necessary to ensure that percentages sum exactly to 100, and wouldn't introduce extra (possibly spurious or distracting) decimal places just to ensure it happens. A little footnote that "Percentages may not sum to 100 due to rounding" should suffice. In fact adding decimal places will not always help, eg the simplest case of having three categories with equal frequencies, then neither 33%+33%+33% nor 33.33%+33.33%+33.33% quite solve the "sum to 100" problem. $\endgroup$ –  Silverfish Commented Mar 30, 2018 at 19:34

Different organisations often have conflicting rules for the precision in reporting of results. Ultimately there is a trade-off between when seeing the extra digits is useful, versus cases where unnecessary and excessive precision "can swamp the reader, overcomplicate the story and obscure the message" — a subject explored by Tim Cole (2015) in a piece that I found gave a useful guide to "sensible" precision in reporting, and a comparison of leading style manuals. His advice on percentages was as follows:

Integers, or one decimal place for values under 10%. Values over 90% may need one decimal place if their complement is informative. Use two or more decimal places only if the range of values is less than 0.1% Examples: 0.1%, 5.3%, 27%, 89%, 99.6%

By "complement" he is referring to cases where one might be interested in the "other lot", e.g. if I tell you 98% of patients in a trial got better, you may well be interested in the 2% who did not, and in that case another decimal place to distinguish whether that "2%" really means "2.4% or "1.6%" would actually be useful.

Cole, T. J. (2015). Too many digits: the presentation of numerical data. Archives of disease in childhood , 100 (7), 608-609. http://dx.doi.org/10.1136/archdischild-2014-307149

Community's user avatar

This is a significant figures issue, and is dependent upon the precision of the numbers underlying the percentages. The technically correct number of significant figures is not dependent upon downstream use or the differences between percentage values.

If you're trying to express a percentage describing 5 items out of 7, it would be absurd to claim that it's 71.4285714285% - you simply don't have the precision to back up all those decimal places. When doing division, your answer should have as many significant figures and the fewest number of sig figs in your starting numbers. Here, you only have 1 significant figure, so the percentage should really just be 70%, not even 71%. If you had another example where you want to express 71428 items out of 100000, then you are justified in using more significant figures, all the way out to 71.428%.

Even if you have great precision, it's often preferable to truncate for human readability. Depending on your domain, adding those two extra decimal places may or may not make a difference. You should never over-report significant figures, but you may be justified in under-reporting them if your statistical precision is greater than what's needed for your application.

Nuclear Hoagie's user avatar

  • 4 $\begingroup$ Well worth bringing up this issue, but the examples in your second paragraph are rather opaque. I have seven cousins; five are male: in what sense do I lack a precise enough observation to justify reporting this as a percentage to any number of decimal places I please? $\endgroup$ –  Scortchi - Reinstate Monica ♦ Commented Mar 30, 2018 at 22:27
  • 2 $\begingroup$ I have to agree with @Scortchi. If you know the value of an integer (e.g. number of electrons orbiting an atom, etc) you know it essentially to infinite precision points. We know the proportion of Scortchi's cousins that are male to infinite precision points. What's pragmatic to report is a different issue. (In this case, I think simply saying "5/7" is the best thing.) $\endgroup$ –  Bridgeburners Commented Nov 8, 2018 at 17:52

The goal is to make it easy for the reader to understand the important differences. Too many digits obscures the meaningful difference between values in a table. Too few leaves out important information. Here's a great discussion: https://newmr.org/blog/how-many-significant-digits-should-you-display-in-your-presentation/ and here's a much more detailed analysis: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4483789/

Franklin Davis's user avatar

Your Answer

Sign up or log in, post as a guest.

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy .

Not the answer you're looking for? Browse other questions tagged reporting percentage or ask your own question .

  • Featured on Meta
  • Announcing a change to the data-dump process
  • Bringing clarity to status tag usage on meta sites

Hot Network Questions

  • What was the first "Star Trek" style teleporter in SF?
  • Is this host and 'parasite' interaction feasible?
  • RC4 decryption with WEP
  • Textile Innovations of Pachyderms: Clothing Type
  • Simulate Minecraft Redstone | Where to start?
  • How to go from Asia to America by ferry
  • Could a lawyer agree not to take any further cases against a company?
  • Best approach to make lasagna fill pan
  • Instance a different sets of geometry on different parts of mesh by index
  • VMware workstation kills Ubuntu host - rcu_preempt
  • How can I play MechWarrior 2?
  • How to raise and lower indices as a physicist would handle it?
  • A seven letter *
  • In which town of Europe (Germany ?) were this 2 photos taken during WWII?
  • What would be a good weapon to use with size changing spell
  • Why is this bolt's thread the way it is?
  • Sub-/superscript size difference between newtxmath and txfonts
  • Does a party have to wait 1d4 hours to start a Short Rest if no healing is available and an ally is only stabilized?
  • What qualifies as a Cantor diagonal argument?
  • Is reading sheet music difficult?
  • Is the 2024 Ukrainian invasion of the Kursk region the first time since WW2 Russia was invaded?
  • I'm a little embarrassed by the research of one of my recommenders
  • Is there a way to read lawyers arguments in various trials?
  • Manhattan distance

too many digits the presentation of numerical data

Log in using your username and password

  • Search More Search for this keyword Advanced search
  • Latest content
  • Current issue
  • For authors
  • BMJ Journals

You are here

  • Volume 100, Issue 7
  • Highlights from this issue
  • Article Text
  • Article info
  • Citation Tools
  • Rapid Responses
  • Article metrics

Download PDF

  • R Mark Beattie , Editor in Chief

https://doi.org/10.1136/archdischild-2015-309113

Statistics from Altmetric.com

Request permissions.

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Too many digits—the presentation of numerical data

We have all been frustrated reading numbers to too many decimal places, the simplest being digital scales in the outpatient clinic where measurements are probably not accurate to more than 10g although the implication of the weight recorded is that the accuracy is much greater. In an excellent leading article this month Tim Cole takes us back to first principles to discuss this and provide sensible, pragmatic guidelines for the presentation of numerical data. It is interesting and helpful to work through. Remember the difference between decimal places and significant figures. The number of significant figures (digits) is the number of all digits ignoring the decimal point, and ignoring all leading and some trailing zeros. Data should be rounded appropriately—not too much, not too little. Clearly, for example 22.68 (95% confidence interval 7.51–73.67) is more effectively and meaningfully written as 23 (95% confidence interval 7.5–74). The various reporting tools are discussed. Significant figures should be considered rather than just decimal places. The general principle is to use two or three significant digits for effect sizes, and one or two significant digits for measures of variability. There is a helpful summary table included with recommendations given for different scenarios. See page 608 .

Prevalence of severe childhood obesity in England

Bed sharing and sudden infant death.

Bed sharing increases the risk of sudden infant death in infants less than 3 months. The effect is most profound in infants less than 1 month (5 fold increase in risk of SIDS in infants less than 1 month). The mechanism is not clearly defined. Heyman and colleagues review the accidental deaths during sleep (as a cause of sudden infant death in infancy); New Zealand 48 cases, 2002–2009, 0.1 per 1000 live births. Deaths were due to overlay (n=30), or wedging (n=18), with 34 (71%) in a bed sharing situation. Of the overlay group 8 were by a mother while breast feeding, 4 by a sibling and 17 by a parent. In the wedging group 10 were between a sleeping surface and wall or broken cot, 6 between a cushion and couch and 2 between a sleep surface and bedding. The authors conclude these are potentially preventable deaths particularly if bed sharing is avoided, faulty or if inadequately constructed cots are avoided and extra attention is paid to the safety of sleep arrangements particularly if adhoc/temporary. In an accompanying editorial Volpe and colleagues discuss—Infant sleep related deaths: why do parents take risks. The editorial is provocative discussing these issues in the context of other factors, recent guidance from NICE and the need to inform parents about the risks and benefits in order to help them make the best decision for them and their child. See pages 610 and 603 .

Should we discourage daytime napping

Duration and quality of sleep affect child development and health with early childhood being a time in which sleep consolidates into the night and napping ceases. Many factors influence sleep patterns and childhood sleep patterns have the potential to disrupt family functioning and child well being. Thorpe and colleagues report a systematic review of the evidence regarding the effects of napping on child development and health. 26 articles were included—heterogeneous quality, observational study designs. Most of the findings were inconsistent—cognition, behaviour, health impact—probably because of variability in ages and habitual napping status. The most consistent finding was an association between napping and later onset, shorter duration and poorer quality night sleep with evidence strongest in children greater than 2 years. The authors highlight the absolute need for more data before specific advice is given. Lucy Wiggs discusses the findings and their wider implications in an accompanying editorial. It is interesting to reflect on what is normal—how should a nap be identified (quantity, quality, timing), heterogeneity of the individual, influence of the family and environment, and multiple potential outcome measures of impact and therefore difficulty in studying. Certainly napping in young children is universal and the question posed in the title of the editorial—Daytime napping in preschool aged children; is it to be encouraged—is appropriate. Ensuring children receive sufficient amounts of good quality sleep, according to their individual needs, remains the priority. See pages 615 and 604 .

Why do we treat children of Jehovah's witnesses differently from their adult parents

This is a significant, emotive and difficult issue particularly when the clinician is faced with a patient who needs a blood transfusion but refuses it for religious or other reasons. In a thought provoking leading article Robert Wheeler explores these issues, using case law to illustrate and very much highlighting the different issues in children compared to adult and as such is very relevant to us as paediatricians. The decision of a competent adult to refuse blood is legally binding on doctors. This is not the case in a child or young person under age 18 years when the law will no longer defer to a parent's wishes or religious beliefs if such deference will mean that the child is not treated in accordance with his best interests. This clearly needs to be managed carefully and with consideration of alternative options and after social care and legal advice. The issues and some of the practicalities are complex, even more so during adolescence and the article of relevance to how we manage these difficult situations when blood transfusion or other life saving treatment are needed and for complex reasons consent not forthcoming. See page 606 .

Linked Articles

  • Leading article Why do we treat the children of Jehovah's Witnesses differently from their adult parents? Robert Wheeler Archives of Disease in Childhood 2015; 100 606-607 Published Online First: 04 Mar 2015. doi: 10.1136/archdischild-2014-307354
  • Editorial Infant sleep-related deaths: why do parents take risks? Lane E Volpe Helen L Ball Archives of Disease in Childhood 2015; 100 603-604 Published Online First: 31 Mar 2015. doi: 10.1136/archdischild-2014-307745
  • Original article Infant suffocation in place of sleep: New Zealand national data 2002–2009 Rebecca M Hayman Gabrielle McDonald Nick J de C Baker Edwin A Mitchell Stuart R Dalziel Archives of Disease in Childhood 2014; 100 610-614 Published Online First: 25 Nov 2014. doi: 10.1136/archdischild-2014-306961
  • Original article Prevalence of severe childhood obesity in England: 2006–2013 Louisa J Ells Caroline Hancock Vicky R Copley Emma Mead Hywell Dinsdale Sanjay Kinra Russell M Viner Harry Rutter Archives of Disease in Childhood 2015; 100 631-636 Published Online First: 27 Jan 2015. doi: 10.1136/archdischild-2014-307036
  • Original article Napping, development and health from 0 to 5 years: a systematic review Karen Thorpe Sally Staton Emily Sawyer Cassandra Pattinson Catherine Haden Simon Smith Archives of Disease in Childhood 2015; 100 615-622 Published Online First: 17 Feb 2015. doi: 10.1136/archdischild-2014-307241
  • Leading article Too many digits: the presentation of numerical data T J Cole Archives of Disease in Childhood 2015; 100 608-609 Published Online First: 15 Apr 2015. doi: 10.1136/archdischild-2014-307149
  • Editorial Daytime napping in preschool-aged children; is it to be encouraged? Luci Wiggs Archives of Disease in Childhood 2015; 100 604-605 Published Online First: 15 Apr 2015. doi: 10.1136/archdischild-2014-307614

Read the full text or download the PDF:

Too many digits: the presentation of numerical data (Q28647834)

Language Label Description Also known as
English

Identifiers

Wikipedia (0 entries), wikibooks (0 entries), wikinews (0 entries), wikiquote (0 entries), wikisource (0 entries), wikiversity (0 entries), wikivoyage (0 entries), wiktionary (0 entries), multilingual sites (0 entries).

too many digits the presentation of numerical data

Navigation menu

Summarizing Numerical Data

Seeing the forest for the trees.

Once you have your data in front of you, you’ve seen how we can form visual summaries with ggplot2 . But how can we calculate numerical summaries? Furthermore, what if we are concerned about summarizing a portion of our data, like just one species of penguin at a time? We will answer these questions below, and introduce some new functions from the dplyr package (within the tidyverse library) along the way. We’ll also look at how factor() can come in handy while plotting.

If you are playing along in RStudio while reading these notes (which we strongly recommend!), be sure to start off by loading the two packages that are necessary for the tutorial by running the following code.

Calculating Numerical Summaries

One example of a numerical variable we could have examine is the body mass of a particular penguin (measured in grams). Let’s calculate both a measure of center and spread for this variable.

To get an idea of what summaries we should pick, let’s revisit the density plot from earlier.

too many digits the presentation of numerical data

What we can glean from this figure is that the distribution of body masses across all species of penguin is skewed right. This means that, for instance, a more typical observation lies closer to 4000 grams than 5000 grams.

If we take an average, it is likely to be pulled to the right by the larger, but less typical, observations. The median observation, however, would be more resistant to this pull. Therefore, the median might be a nice choice for a measure of center. Similarly, since the IQR is initially constructed from the median, it will serve well here as a measure of spread.

Now, let’s calculate these values. We should first isolate our variable of interest. We can do this in code by using the dplyr function select() .

As is custom with dplyr functions, the first argument goes to the data frame you are working with. The following arguments are more function specific. In select() ’s case, we tell the computer which column/variable we are interested in.

Now, we can calculate our summaries. When working with a vector, we could use functions like mean() and median() directly, e.g.  median(body_mass_g) . However, body_mass_g is not a standalone vector but is now a column in a data frame called body_mass ! Therefore, we need to access it through a dplyr function called summarise() .

Note that while the first argument goes to the name of the data frame, the following arguments are given to the names of the new columns that summarise() puts in another new data frame (one row by two columns). You can name the columns whatever you would like.

Based on what we’ve found, the median here supports the claim we made above: that a typical penguin has a body mass closer to 4000 grams than to 5000 grams. The middle 50 percent of the penguins have body masses within 1225/2 grams, or roughly 600 grams, of 4050.

Groupwise Operations

Let’s return to the bill length examine of a particular penguin, measured in millimeters. Here is the density plot for all of the data ; for simplicity, earlier we showed you the plot for only the first 16 observations.

too many digits the presentation of numerical data

This plot is interesting. It appears we have a bimodal shape! While it’s tempting to state that the data is roughly symmetric and calculate an overall mean, we should first see if there are any other variables at play. It stands to reason that different species of penguin might have different anatomical features. Let’s add species to the mix by using the color aesthetic (see if you can code along)!

too many digits the presentation of numerical data

Aha! We now see that each penguin species has its own shape of distribution when it comes to bill length.

The example above demonstrates a very common scenario: you want to perform some calculations on one particular group of observations in your data set. But what if you want to do that same calculation for every group? For example, what if we’d like to find the average and standard deviation of bill length among each species of penguin separately ?

This task - performing an operation on all groups of a data set one-by-one - is such a common data science task that nearly every software tool has a good solution. In the dplyr package, the solution is the group_by() function. Let’s see it in action.

Like most tidyverse functions, the first argument to group_by() is a data frame. The second argument is the name of the variable that you want to use to delineate groups. In this case, we want to group by species to calculate three separate mean/standard deviation pairs.

Now, assuming we roll with our new grouped_penguins data frame, we can use summarise() like we did before!

From both the visuals and the numbers, we can see that Adelie penguins have much smaller bill lengths on average when compared to Chinstrap and Gentoo penguins. We also see that the Adelie distribution of bill lengths is less variable than the distributions of the other two species.

Plotting with Categorical Variables

Finally, let’s return to the violin plot of bill lengths grouped by species of penguin.

too many digits the presentation of numerical data

What if I wanted the Adelie violin to show up on the top of the graph? By default, the violin plot puts the level first in the alphabetical order on the bottom of the plot. Therefore, I need to reorder the levels of species to put Adelie at the top. This is where factor() will do the job!

As before, bill_length_mm is not a standalone vector but a column in a data frame! We cannot access it directly, e.g. by factor(species, levels = c("Gentoo", "Chinstrap", "Adelie")) .

Therefore, we use the dplyr function mutate() . A mutation involves changing the properties of an existing column, or adding a new one altogether (which we will explore next week).

The first argument of mutate() is dedicated to our data frame, penguins . The second argument can be the name of an existing column or the name of a new column (next week). We want to change species to be an altered version of itself, hence we name the second argument species . Make sure you understand where each set of parentheses closes and ends.

Now, assuming we roll with our new reordered_penguins data frame, we can use ggplot() like we did before!

too many digits the presentation of numerical data

A summary of a summaries…this better be brief! Summaries of numerical data - graphical and numerical - often involve choices of what information to include and what information to omit. These choices involve a degree of judgement and knowledge of the criteria that were used to construct the commonly used statistics and graphics.

Europe PMC requires Javascript to function effectively.

Either your web browser doesn't support Javascript or it is currently turned off. In the latter case, please turn on Javascript support in your web browser and reload this page.

IMAGES

  1. (PDF) Too many digits: The presentation of numerical data

    too many digits the presentation of numerical data

  2. Mathematical Numbers on Black Abstract Data Science Algorithm for

    too many digits the presentation of numerical data

  3. Decimals : Definition, Facts and Examples

    too many digits the presentation of numerical data

  4. PPT

    too many digits the presentation of numerical data

  5. Random Numbers on Black Algorithmic Sequence of Numerical Digits

    too many digits the presentation of numerical data

  6. Presenting Numerical Data

    too many digits the presentation of numerical data

VIDEO

  1. 0.7 ; 2.4/Representing Decimal Numbers on the number line

  2. Significant Digits#Numerical Methods@Maths N Stats

  3. barcode with too many digits issue

  4. How many digits in 2^100

  5. Data Analysis: Numerical Representation

  6. Statistics

COMMENTS

  1. Too many digits: the presentation of numerical data

    Use the same rule as for the corresponding effect size (be it mean, percentage, mean difference, regression coefficient, correlation coefficient or risk ratio), perhaps with one less significant digit. Test statistics: t, F, χ 2, etc. Up to one decimal place and up to two significant digits. t=−1.3. F=11.

  2. PDF Too many digits: the presentation of numerical data

    One or two decimal places, or more when very close to ±1. 0.03. 0.7. − 0.89. Risk ratio. Round to two significant digits if the leading non-zero digit is four or more, otherwise round to three (the rule of four11). Alternatively use one/two significant digits rather than two/three.

  3. Too many digits: the presentation of numerical data

    Too many digits: the presentation of numerical data. Arch Dis Child. 2015 Jul;100 (7):608-9. doi: 10.1136/archdischild-2014-307149. Epub 2015 Apr 15.

  4. Too many digits: The presentation of numerical data

    As a s tatis tical revie wer for Archives and. BMJ I am interested in the presentation of. numerical data. It concerns me that. numbers are often reported to ex cessive. precision, because too ...

  5. Too many digits: the presentation of numerical

    As a statistical reviewer for Archives and BMJ I am interested in the presentation of numerical data. 1 It concerns me that numbers are often reported to excessive precision, because too many digits can swamp the reader, overcomplicate the story and obscure the message.

  6. Too many digits: the presentation of numerical data

    Emperor Joseph II : Well, there it is. Quotation from the film Amadeus (1984) As a statistical reviewer for Archives and BMJ I am interested in the presentation of numerical data.1 It concerns me that numbers are often reported to excessive precision, because too many digits can swamp the reader, overcomplicate the story and obscure the message.

  7. PDF Atoms Highlights from this issue

    TOO MANY DIGITS—THE PRESENTATION OF NUMERICAL DATA We have all been frustrated reading numbers to too many decimal places, the simplest being digital scales in the out-patient clinic where measurements are probably not accurate to more than 10g although the implication of the weight recorded is that the accuracy is much greater.

  8. Too many digits? The presentation of numerical data

    The presentation of numerical data - UCL Discovery. Too many digits? The presentation of numerical data. Cole, TJ; (2015) Too many digits? The presentation of numerical data. Archives of Disease in Childhood , 100 (7) pp. 608-609. 10.1136/archdischild-2014-307149 . Preview. Text. Arch Dis Child-2015-Cole-608-9.pdf.

  9. Too many digits: the presentation of numerical data

    BMJ I am interested in the presentation of. numerical data.1 It concerns me that. numbers are often reported to excessive. precision, because too many digits can. swamp the reader, overcomplicate the. story and obscure the message. A number's precision relates to its. decimal places or significant figures (or as.

  10. Too many digits: the presentation of numerical data

    Too many digits: the presentation of numerical data. Reporting guideline provided for? ... Full bibliographic reference: Cole TJ. Too many digits: the presentation of numerical data. Arch Dis Child. 2015;100(7):608-609. Language: English: PubMed ID: 25877157: Relevant URLs (full-text if available)

  11. Too many digits: the presentation of numerical data

    As a statistical reviewer for Archives and BMJ I am interested in the presentation of numerical data. 1 It concerns me that numbers are often reported to excessive precision, because too many digits can swamp the reader, overcomplicate the story and obscure the message. A number's precision relates to its decimal places or significant figures ...

  12. Presentation of numerical data.

    This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy. Abstract ... Similar Articles Presentation of numerical data. ...

  13. Too Many Digits: The Presentation of Numerical Data: Tjcole

    Units - Free download as PDF File (.pdf), Text File (.txt) or read online for free. 1) The presentation of numerical data in publications is often overly precise, reporting numbers with too many decimal places or significant digits, which can obscure the key message. 2) There is no single rule for rounding that works in all cases - guidelines variously specify rounding to a certain number of ...

  14. Reporting Statistical Results in Medical Journals

    For descriptive statistics of numerical data, add one additional decimal place to the original data. For example, if cholesterol level is reported with one decimal place (e.g. 4.8 mmol/L), the mean and SD should be reported with two decimal places (e.g. mean = 4.82, SD = 2.11 mmol/L). ... Too many digits: the presentation of numerical data ...

  15. Setting number of decimal places for reporting risk ratios: rule of

    Precision and rounding—decimal places and significant digits. Reporting of numerical data is an important element in medical research. Summary statistics are often reported to too many decimal places, leading to spurious precision and over-complicated presentation1; less often, too few decimal places are used, resulting in a lack of precision.. Surprisingly, few guidelines on the subject

  16. Statistics Notes: Presentation of numerical data

    For example, the regression equation 1. birth weight=-3.0983527 + 0.142088xchest circumf + 0.158039 x midarm circumf, purports to predict birth weight to 1/1000000 g. Categorical data, such as disease group or presence or absence of symptoms, can be summarised as frequencies and percentages. It can be confusing to give percentages alone, as the ...

  17. The art of reporting numerical data

    The reporting of numerical data should be informed by statistical principles (the science of statistics). One area where this can be counterintuitive is the level of numerical precision to report data (both of individual values and summaries like means and standard deviations). A review of three recent BJS articles identified over 1000 ...

  18. Presentation of numerical data

    Presentation of numerical data. Presentation of numerical data BMJ. 1996 Mar 2;312(7030):572. doi: 10.1136/bmj.312.7030.572. Authors D G Altman 1 , J M Bland. Affiliation 1 IRCF Medical Statistics Group, Centre for Statistics in Medicine, Institute of Health Sciences, Oxford. PMID: 8595293 PMCID: ...

  19. Should percentages be reported with decimal places?

    When presenting data using a percentage, is it a good thing to have decimal places, say 2 decimal places instead of rounding off to whole numbers? ... Too many digits: the presentation of numerical data. Archives of disease in childhood, 100(7), 608-609. ... Too many digits obscures the meaningful difference between values in a table. Too few ...

  20. How many decimals? Rounding descriptive and inferential statistics

    Clearly, the second statistic is too precise to be realistic. Altman and Bland (1996) make. Intrinsic measures of precision: Rounding descriptive statistics from the precision of raw measurements. The following requires the notion of significant digits. In the world of mathematics, numbers are composed of digits, each one having a definite value.

  21. Highlights from this issue

    Too many digits—the presentation of numerical data. We have all been frustrated reading numbers to too many decimal places, the simplest being digital scales in the outpatient clinic where measurements are probably not accurate to more than 10g although the implication of the weight recorded is that the accuracy is much greater.

  22. Too many digits: the presentation of numerical data

    Too many digits: the presentation of numerical data (Q28647834) From Wikidata. ... scientific article. edit. Language Label Description Also known as; English: Too many digits: the presentation of numerical data. scientific article. Statements. instance of. scholarly article. 0 references. title. Too many digits: the presentation of numerical ...

  23. Summarizing Numerical Data

    A summary of a summaries…this better be brief! Summaries of numerical data - graphical and numerical - often involve choices of what information to include and what information to omit. These choices involve a degree of judgement and knowledge of the criteria that were used to construct the commonly used statistics and graphics.

  24. Too many digits: the presentation of numerical data

    This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

  25. How many is too many? A review of the significant numbers in pediatric

    This review highlights the clinical presentation, complications, evaluation, and numerical significance, when applicable, for the following skin findings: infantile hemangiomas, capillary malformations, café-au-lait macules, hypopigmented macules, juvenile xanthogranulomas, pilomatricomas, and angiofibromas.