Analytical Instrumentation
Understanding and Learning to Calculate Repeatability and Reproducibility Using ASTM D6300
Nov 26 2024
Author: Dr. Raj Shah and Beau Eng on behalf of Koehler Instrument Company
Free to read
Articles are free to download. Unlock the article to be shown more content, graphs and images.
Introduction
In the world of petroleum and lubricant testing, few qualities are as important as precision. Precision reflects the reliability of a procedure to produce consistent results under set conditions, and two key elements of precision are repeatability (r) and reproducibility (R). Repeatability refers to the variability of multiple test results by an individual analyst using a single apparatus to test replicate specimens from a single sample. Reproducibility measures the variability of test results among different analyses, each using a different apparatus, when testing specimens from sub-samples of a sample that have been shipped to multiple laboratories.
Both types of variability are crucial for understanding the confidence limits for a test method and minimizing the risk of incorrectly assessing a property to either be out of specification when it is actually in specification (known as a Type 1 error) or, conversely, assessing the property to be in specification when it is actually not (known as a Type II error). These values then inform precision, which can establish the expected resolution between two values such that researchers performing condition monitoring tests can minimize the risk of either type of error. With an established range, researchers can more confidently test the results of an experiment and gauge whether the process itself is operating within specified parameters.
To numerically determine these parameters, ASTM International developed the ASTM D6300 standard, which provides guidelines for determining precision and bias for test methods involving petroleum products, liquid fuels, and lubricants1. ASTM D6300 has become an indispensable tool for gauging variability in test results. The formulas for calculating repeatability and reproducibility are of great importance to researchers in this field. This paper covers the calculation methods for both values, as outlined in the ASTM D6300 standard, explains the rationale behind key steps and provides a framework for manual calculation of repeatability (r) and reproducibility (R).
The Significance of Repeatability and Reproducibility
The importance of repeatability and reproducibility should be reinforced significantly. While the two values do not govern the outcome of a procedure, they act as measures to examine the variability of a testing method. Repeatability and reproducibility allow analyses to establish tolerance thresholds by indicating when two replicate test results are and are not “within normal expectation,” with a standard expectation cutoff representing 95% of the reference distribution for the “absolute difference” variable2. This is necessary for researchers performing condition monitoring tests, as an observable measure of replicability is always welcome for applications machines can automate, as mechanisms performing outside of acceptable error ranges are typically in need of maintenance and inspection.
Without the ability to gauge the irregularity of the output, researchers would be venturing blindly into a maze of Type I and Type II errors, unable to gauge whether the machine is operating inside or outside specified ranges and thus unqualified to determine whether an apparatus is working or malfunctioning. This would make system maintenance a nightmare, as without a basis to gauge the expected range of products, it would be much more difficult to determine if a mechanism requires maintenance just by observing its output. This drastically increases maintenance time and cost in factories, since each machine would then require manual inspection for errors at regular intervals to avoid the risk of catastrophic failure. With the application of repeatability and reproducibility, researchers can simply observe the precision of a baseline machine and compare it to the output of the machines under observation to gauge whether they are within acceptable margins.
Preparing Samples for Calculations:
Before properly beginning calculations, we must prepare a valid and informative data set for the experiment. According to standard practice, one would prepare enough material to supply at least 10% more than necessary for each laboratory involved in the experiment2. Each unit will be labelled with a letter for the material and a sequential number. For instance, for 10 laboratories and 2 test results of a material B, the test samples would be labeled B1 through B223. The samples themselves should include specimens in which the measured property is just above the detection limit, within the detection range, and just below the maximum measurable value without dilution.
While other precision metrics are tested and calculated within the experiment, this paper will focus solely on the formulas related to the calculation of repeatability and reproducibility.
Formula for Repeatability (r):
Of the various measures of accuracy, the fundamental precision statistics are Repeatability Standard Deviation (Sr) and Reproducibility Standard Deviation (SR). These two values are central to calculating repeatability and reproducibility of the subject, respectively. Repeatability concerns the results of multiple analyses by a single operator on specimens from a single sample. The expected data set will contain one or more samples undergoing multiple trials of the same procedure across multiple labs. As the calculations for reproducibility require repeatability, repeatability will be calculated first.
First, we must calculate the mean of test results from a single sample as tested by a single analyst. The formulas for computing the mean and standard deviation (S) of replicate tests are:
This gives us the average value of the sample within the relevant data set. With the mean, we can calculate the standard deviation of that sample with the next formula:
The standard deviation indicates how dispersed the data is relative to the mean. This serves as a general indicator of how precise the data is before any further calculations. We will then calculate the standard deviation for the same sample in each of the remaining labs in the data set using the same formulas above. With that set of standard deviations, we will calculate the repeatability standard deviation (Sr) pooled over all labs using the following formula:
Repeatability standard deviation is the standard deviation of the given test under repeatability conditions. However, this value does not account for 95% confidence. This will be rectified in the next step.This value will be further refined in the next step.
With Sr calculated, we then multiply the answer by 2.8 to calculate repeatability (r). This is to account for 95% confidence, as 1.96 standard deviations multiplied by √2 is roughly equivalent to 2.8.we now must account for the “normal expectation” cutoff, which falls within the 95th percentile, or 95% of all data. To do so, we multiply the repeatability standard deviation by 2.8, which brings the value within those bounds and thus provides our value for repeatability.
With this procedure, you can now calculate repeatability for each of the samples in the data set by simply inputting the data from that specific sample.
Formula for Reproducibility (R):
Reproducibility is found after repeatability, as Repeatability Standard Deviation is required for calculations. First, we calculate the mean of a sample in each lab within the set, as well as the total average of all the sample data across all labs (A). With the means, we calculate the Standard Deviation of the Averages (SA) using the following formula:
This serves as a measure of spread across multiple labs using the average values as a baseline, and we can use it to further extrapolate precision results across labs. With SA calculated, we then determine the Reproducibility Standard Deviation (SR) using the following formula:
Just like Repeatability Standard Definition, this is the standard deviation of the test results obtained under reproducibility conditions.
Again, this value is multiplied by 2.8 to account for normal expectation cutoffs, leaving us with this formula for reproducibility.
Like with repeatability, you can repeat this sequence to calculate the reproducibility value of other samples in the set.
Example Calculations
To demonstrate these calculations, we generated a series of random numbers. In these sample calculations, we generated 10 different samples, labelled A through J, and 5 different labs, with 6 trials per lab. The test data for the first sample is shown below, with the calculated average and standard deviation per lab set.
The repeatability (r) of Sample A is calculated below:
The Repeatability of Sample A is roughly equal to 7.101. For repeatability, a lower value is preferred, as numerical measures of variability act as inverse measures of precision. High precision would indicate that repeatability is very low.
The reproducibility (R) of Sample A is calculated below:
The Reproducibility of Sample A is roughly equal to 7.134. Again, reproducibility is a measure of variability, with lower values indicating higher precision.
Conclusion
ASTM D6300 offers an accurate method to determine both repeatability and reproducibility, which are both invaluable to the reliability and production efficiency of many procedures and production lines. With the formula outlined in this report, the repeatability and replicability values can be calculated by hand, saving valuable time and money from manufacturing errors and benefitting the field of petroleum and lubricant testing.
References
1. ASTM International. “Standard Practice for Determination of Precision and Bias Data for Use in Test Methods for Petroleum Products, Liquid Fuels, and Lubricants.” ASTM International. 2021. [Online]. Available: https://www.astm.org/d6300-21.html
2. Lau, Alex. “What Are Repeatability and Reproducibility? Part 1: A D02 Viewpoint for Laboratories.” ASTM Standardization News. astm.org. March/April 2009.
3. ASTM International. “Standard Practice for Conducting an Interlaboratory Study to Determine the Precision of a Test Method.” ASTM International. 2022. [Online]. Available: https://cdn.standards.iteh.ai/samples/112626/9e420be7e9de431e8a93062833b9bc4d/ASTM-E691-22.pdf
The authors sincerely thank Dr. Fred Passman ( an excellent mentor and brilliant scientist) for his suggestions and edits and criticisms provided. He is the president of Biodeterioration Control Associates, Inc. and a very active astm member.
Both Dr.’s Shah and Dr. Passman are also advisors to the student astm club at State Univeristy of New York, Stony brook.
The authors especially wish to thank
Dr. Alex Lau (guru of astm statistics) for perusing the article and for his invaluable comments. Attending his astm statistics course offered several times a year in various location worlwide is one of the best way to get proficient in understanding the statistics used and implemented in various ASTM.METHODS. Alex. L.is an inspiration in this field ad the authors thank him for his invaluable suggestions
About the Authors
Dr. Raj Shah is a Director at Koehler Instrument Company in New York, where he has worked for the last 25 plus years. He is an elected Fellow by his peers at IChemE, AOCS, CMI, STLE, AIC, NLGI, INSTMC, Institute of Physics, The Energy Institute and The Royal Society of Chemistry. An ASTM Eagle award recipient, Dr. Shah recently coedited the bestseller, “Fuels and Lubricants handbook”, details of which are available at ASTM’s LongAwaited Fuels and Lubricants Handbook 2nd Edition Now Available
https://bit.ly/3u2e6GY.
He earned his doctorate in Chemical Engineering from The Pennsylvania State University and is a Fellow from The Chartered Management Institute, London. Dr. Shah is also a Chartered Scientist with the Science Council, a Chartered Petroleum Engineer with the Energy Institute and a Chartered Engineer with the Engineering council, UK. Dr. Shah was recently granted the honourific of “Eminent engineer” with Tau beta Pi, the largest engineering society in the USA. He is on the Advisory board of directors at Farmingdale university (Mechanical Technology), Auburn Univ (Tribology), SUNY, Farmingdale, (Engineering Management) and State university of NY, Stony Brook ( Chemical engineering/ Material Science and engineering). An Adjunct
Professor at the State University of New York, Stony Brook, in the Department of Material Science and Chemical engineering, Raj also has over 680 publications and has been active in the energy industry for over 3 decades. More information on Raj can be found at https://bit.ly/3QvfaLX
Contact: rshah@koehlerinstrument.com
Mr. Beau Eng is part of a thriving internship program at Koehler Instrument company in Holtsville and is a student of Chemical and Molecular Engineering at Stony Brook University, Stony Brook, New York where Dr. Shah is on the external advisory board of directors.
Free to read
Articles are free to download. Please login to read this article or create an account.
Digital Edition
AET 28.4 Oct/Nov 2024
November 2024
Gas Detection - Go from lagging to leading: why investment in gas detection makes sense Air Monitoring - Swirl and vortex meters will aid green hydrogen production - Beyond the Stack: Emi...
View all digital editions
Events
Dec 02 2024 London, UK
Dec 03 2024 Dusseldorf, Germany
Dec 11 2024 Shanghai, China
Jan 12 2025 Abu Dhabi, UAE
Jan 14 2025 Abu Dhabi, UAE