Thursday, June 11, 2026

Six Sigma: Refusing to Measure Is Refusing to Improve

The Hidden Cost of “Fixing It Later” in Clinical Research


Clinical research quality is usually discussed in the language of compliance: GCP, inspection readiness, audit findings, protocol deviations, CAPA, TMF completeness, and data integrity. This language is necessary. Clinical trials must protect participants, produce reliable results, and withstand regulatory and scientific scrutiny.

A clinical trial may appear compliant at the end, but only after thousands of queries, repeated document corrections, email clarifications, monitoring follow-ups, vendor reconciliations, TMF quality-control rejections, and late-stage remediation activities. The final clinical study report may be based on acceptable data, but the operational question remains: how much rework was required to make the data, documents, and reports correct?

This is where Six Sigma thinking becomes useful.

Six Sigma does not need to be copied mechanically from manufacturing into clinical trials. Clinical research is more variable, more regulated, and more dependent on clinical judgement than most production processes. A patient visit is not a manufactured product, and a protocol deviation is not the same thing as a defect in a machine part. Nevertheless, Six Sigma offers a useful measurement discipline: define the unit, define the defect, define the opportunity for error, measure process performance, and translate poor quality into cost.

Many of the necessary measurement signals already exist in EDC, TMF, laboratory systems, spreadsheets, and emails. The challenge is to connect these signals and use them to measure quality, rework, and process improvement, potentially with AI assistance.

Why Six Sigma Is Relevant, but Not Widely Used, in Clinical Research

Lean Six Sigma has been widely discussed in healthcare, especially in relation to patient flow, waiting times, laboratory processes, hospital administration, operating room efficiency, pharmacy workflows, and documentation processes. The healthcare literature shows that Lean Six Sigma is often applied not only to medical care itself, but also to management and administrative processes. This is important for clinical research, because clinical trials contain a large amount of administrative, documentary, financial, and cross-functional operational work.

Clinical research has many process problems that are comparable to healthcare operations, but they are often less visible. A hospital can observe waiting time in an emergency department. A clinical trial may experience the same type of delay through site activation delays, unresolved EDC queries, late monitoring reports, missing TMF documents, delayed vendor reconciliation, or postponed database lock.

The literature on Lean Six Sigma in healthcare is broader than the literature on Lean Six Sigma in clinical trials. Still, several publications have discussed the applicability of Lean and Six Sigma to clinical and translational research, and industry publications have also considered Lean Six Sigma as a way to improve clinical trial efficiency, reduce rework, and improve data quality.

In parallel, modern clinical trial quality management has moved toward related concepts such as Quality by Design, Critical-to-Quality factors, Risk-Based Quality Management, Quality Tolerance Limits, and Key Risk Indicators. These are not identical to Six Sigma, but they share a similar logic. Important processes should be defined, measured, monitored, and improved. Quality should be built into the trial process rather than inspected only at the end.

This makes Six Sigma relevant to clinical research not as a slogan, but as a question:

How capable is the clinical trial process of producing correct, complete, timely, and usable outputs the first time?

Six Sigma Metrics and Possible Clinical Research KPIs

Before discussing the example in detail, it may be useful to translate several Six Sigma concepts into clinical research process language. The objective is not to claim that clinical trials should be managed as manufacturing processes. The objective is more practical: to define what can be measured, where defects occur, how often they occur, and how much rework they create.

Six Sigma conceptMeaning in process qualityPossible clinical research KPI
UnitThe item or process output being evaluatedOne source document, one patient visit, one eCRF form, one TMF document, one monitoring report, one site activation package
DefectA failure to meet a defined requirementMissing data, wrong date, incomplete source entry, unresolved query, incorrect TMF metadata, missing signature, inconsistent value between systems
OpportunityA specific place where a defect could occurEach required field, signature, date, data point, document attribute, transfer step, QC check, or reconciliation point
DPU — Defects per UnitNumber of defects per evaluated unitDefects per source document, queries per eCRF form, QC findings per TMF document, monitoring findings per visit
DPO — Defects per OpportunityNumber of defects divided by all possible defect opportunitiesDefects divided by total required fields, checks, signatures, metadata points, or reconciliation items
DPMO — Defects per Million OpportunitiesStandardized defect rate allowing comparison across processes of different complexityEDC data-entry defects per million fields, TMF metadata defects per million attributes, reconciliation discrepancies per million transferred values
RTY — Rolled Throughput YieldProbability that the process passes through all steps without reworkProbability that information moves from source document to EDC, query closure, vendor reconciliation, TMF filing, reporting, and final project delivery without avoidable correction
COQ — Cost of QualityTotal cost of prevention, appraisal, internal failure, and external failureCost of training, monitoring, QC, audits, system validation, remediation, CAPA, re-monitoring, and delayed reporting
COPQ — Cost of Poor QualityCost that would not exist if work were done correctly the first timeCost of avoidable queries, email clarification, TMF rejections, duplicate entry, late correction, remediation, delays, and budget overrun caused by rework

This translation is important because clinical research teams often measure activities but not necessarily process capability. For example, a team may know how many monitoring visits were completed, how many queries were open, or what percentage of TMF documents were filed. However, these numbers do not always show whether the process is producing correct outputs the first time, or whether quality is being achieved only through repeated correction.

From a Six Sigma perspective, the more interesting question is not only how many activities were performed. The more interesting question is how many times the process had to be corrected before the final output became acceptable.

The first concept is the unit. In clinical research, a unit can be one source document, one patient visit, one eCRF form, one laboratory result, one adverse event record, one informed consent form, one monitoring visit report, one TMF document, one site activation package, or one protocol deviation. The choice of unit matters because a process can only be measured consistently when it is clear what is being evaluated.

The second concept is the defect. A defect is any failure to meet a defined requirement. In clinical research, this could be a missing value, wrong date, late data entry, inconsistent data between systems, missing source documentation, missing signature, incorrect document version, incorrect TMF metadata, unresolved query, repeated query, missing medical review, late safety reporting, insufficient protocol deviation description, unresolved monitoring finding, or document rejected during TMF QC.

However, clinical research should not count all defects as equal. A missing administrative date, a missing safety assessment, and a data inconsistency affecting the primary endpoint do not have the same significance. Defects may need to be classified by severity, detectability, recurrence, and impact on patient safety, data integrity, reporting, audit readiness, and financial performance.

The third concept is the opportunity. An opportunity is a defined place where a defect could occur. One source document may contain 50 required fields, each representing an opportunity for a missing, incorrect, inconsistent, or unclear value. One TMF document may have several required attributes: correct document type, correct country, correct site, correct date, correct version, correct signature, correct filing location, and correct naming convention. Each attribute is an opportunity for error.

This distinction matters because processes differ in complexity. A short follow-up visit and a complex oncology screening visit should not be compared only by the number of errors. A complex visit has more opportunities for defects. Similarly, a simple administrative document and a multi-country regulatory package should not be compared without considering the number of required elements.

Defects per Unit, or DPU, measures how many defects are found per unit. For example, if 100 source documents are reviewed and 60 total defects are found, the DPU is 0.6 defects per source document. In clinical research, DPU could be used to measure defects per source document, queries per eCRF page, findings per monitoring visit, QC rejections per TMF document, missing fields per visit, or errors per site activation package. DPU is simple and understandable, but it may not fully account for process complexity.

Defects per Opportunity, or DPO, measures defects relative to the number of possible defect opportunities. If 100 source documents each contain 50 required fields, there are 5,000 opportunities. If 60 defects are found, the DPO is 60 divided by 5,000, or 0.012. This means that 1.2% of all opportunities resulted in defects. DPO can be useful when comparing processes of different complexity, such as screening visits versus follow-up visits, high-volume sites versus low-volume sites, laboratory data versus adverse event data, or different TMF document types.

Defects per Million Opportunities, or DPMO, standardizes DPO by multiplying it by one million. In manufacturing, DPMO can be translated into a sigma level. In clinical research, the sigma level itself may be less important than the discipline of standardizing defect rates. DPMO may help compare EDC data-entry defects, TMF metadata defects, laboratory reconciliation discrepancies, or safety database inconsistencies across studies, countries, vendors, or systems.

Rolled Throughput Yield, or RTY, is especially relevant for clinical research because clinical trials are long chains of dependent processes. RTY measures the probability that a process passes through all steps without requiring rework. This is important because clinical research quality is not produced by one step. Source documentation is completed, data are entered into EDC, data are reviewed, queries are generated, queries are answered, data are corrected, monitoring confirms consistency, vendor data are reconciled, documents are filed in TMF, and data are used in the clinical study report.

Even if each step performs reasonably well, the cumulative probability of error-free completion can be much lower than expected. If five process steps each have 95% first-pass quality, the rolled throughput yield is approximately 77%. If ten process steps each have 95% first-pass quality, the rolled throughput yield is approximately 60%. This may explain why clinical trial teams often feel overloaded even when individual functions appear to be performing adequately. The problem is not always that one team is failing. The problem may be that the whole process is designed in a way that normalizes small defects, late detection, and repeated rework.

Cost of Quality, or COQ, helps translate process quality into the language of management. COQ usually includes prevention costs, appraisal costs, internal failure costs, and external failure costs. In clinical research, prevention costs may include protocol review, risk assessment, site training, system validation, source documentation templates, EDC edit-check design, data standards, TMF planning, monitoring plan design, vendor qualification, and Quality by Design activities. These costs are not waste. They are investments intended to reduce future failure.

Appraisal costs include activities required to detect defects. In clinical research, these may include monitoring, source data verification, source data review, data review, medical review, TMF QC, audits, vendor oversight, reconciliation, central monitoring, and other quality checks. Some appraisal activities are necessary in regulated research. However, if a process depends too heavily on late inspection, this may indicate that prevention is insufficient.

Internal failure costs are costs caused by defects discovered before submission, inspection, final reporting, or formal project closure. Examples include query resolution, document correction, TMF remediation, re-monitoring, repeated data review, CAPA implementation, duplicate data entry correction, repeated training, vendor reconciliation, delayed database lock, and correction of statistical outputs due to late data changes.

External failure costs are costs caused by defects discovered after the output has reached an external customer, regulator, inspector, sponsor decision-maker, or final reporting stage. Examples include inspection findings, regulatory questions, delayed approval, major protocol amendments, repeated submission work, reputational damage, additional analyses, loss of confidence in trial data, delayed strategic decisions, and loss of credibility in project reporting.

Cost of Poor Quality, or COPQ, is perhaps the most important concept for clinical trial operations. COPQ includes costs that would not exist if work were done correctly the first time. In clinical research, COPQ may include time spent resolving avoidable EDC queries, email chains required to clarify missing or inconsistent source data, CRA follow-up due to incomplete documentation, site time spent correcting preventable errors, repeated TMF QC rejections, remediation projects before inspection, repeated medical review due to poor deviation descriptions, delayed database lock caused by unresolved queries, additional project management oversight caused by recurring defects, budget overruns caused by rework, delayed milestone completion, and delayed revenue recognition where work cannot be accepted as complete.

COPQ is often hidden because the work is distributed across many people and systems. One missing source value may create a chain of events: source defect, EDC query, email clarification, site correction, CRA follow-up, data management review, medical review, monitoring note, TMF update, QC rejection, correction, and final acceptance. The original defect may appear small, but the cumulative cost may be significant.

Example: Source Documentation, Data Transfer, Queries, and TMF Quality

A practical example can be taken from source documentation and clinical data flow.

In a clinical trial, source documents may contain different types of errors and omissions. These may include missing dates, incomplete medical history, inconsistent laboratory values, unclear adverse event descriptions, missing signatures, incorrect visit windows, or incomplete procedure documentation.

This information is then entered, sometimes manually, into different systems. For example, the same or related information may be entered into the EDC, CTMS, safety database, laboratory portal, vendor system, or local site tracker. Later, the data may be transferred or reconciled with other systems used for reporting, monitoring, statistical analysis, safety review, finance, and Trial Master File documentation.

At each step, there are new opportunities for defects.

A missing or incorrect value in the source document may generate an EDC query. The query may require communication between the data management team, CRA, site coordinator, investigator, and sometimes medical monitor. The same issue may also trigger emails, follow-up notes, monitoring findings, corrective actions, or TMF documentation updates.

Therefore, the cost of poor quality is not limited to the original error.

The total cost includes the full chain of work required to detect, communicate, investigate, correct, verify, document, and close the issue.

In a large clinical trial, this is not one query or one missing document. It may be thousands of queries, thousands of email exchanges, hundreds of monitoring follow-ups, repeated QC rejections, and several months of delayed resolution. Some queries may remain open for weeks or months, delaying data cleaning, medical review, database lock, reporting, audit readiness, and sometimes financial closure of project milestones.

From a Six Sigma perspective, several measurable indicators could be defined. These may include the number of defects per source document, the number of defects per data field, the number of missing or inconsistent fields per patient visit, the number of EDC queries per patient, site, visit, or form, the number of emails required to resolve one query, average query resolution time, the number of repeated queries for the same type of issue, the number of TMF QC rejections per document type, the number of documents returned for correction, the number of monitoring findings related to source documentation, and the cost of CRA, data management, medical, quality, project management, finance, and site time spent on correction.

This allows the organization to ask a more meaningful question:

What is the cost of achieving quality, and what is the cost of poor process quality?

The objective is not only to count errors. The objective is to understand where the process creates defects, where defects are detected too late, and where rework becomes more expensive than prevention.

For example, if source documentation is incomplete at the site, the defect may only become visible later through EDC queries, monitoring review, medical review, vendor reconciliation, or TMF QC. At that point, several functions may already be involved. The issue may no longer be a simple data correction. It becomes a process failure involving time, cost, delay, and potential risk to data integrity.

This also raises another important question:

What is the probability that correct, complete, and consistent information reaches the final clinical study report without rework?

In clinical research, this probability is often assumed rather than measured. Six Sigma and rolled throughput yield concepts can help make this visible. Even if each process step appears to work reasonably well, the cumulative probability of error-free flow across source documentation, data entry, query resolution, monitoring, vendor reconciliation, TMF filing, reporting, and project delivery may be much lower than expected.

This does not mean that clinical trials should mechanically apply manufacturing-style Six Sigma metrics. Clinical research is more variable, more regulated, and more dependent on clinical judgement.

However, the principle remains highly relevant:

If defects are not measured across the process, then the cost of poor quality remains hidden.

By measuring defects, rework, query burden, TMF rejection rates, resolution timelines, and the cost of repeated correction, clinical research organizations can improve not only the quality of outputs, but also the quality of the process itself.

The goal is therefore twofold.

First, to improve the clinical trial process so that fewer defects are created.

Second, to improve the quality process so that defects are detected earlier, corrected faster, and prevented from recurring.

Final Outcomes: More Than TMF Completeness

The final outcome of a clinical trial process is not only a filed document in the TMF. TMF completeness is essential, but it is only one part of the final quality picture.

The broader final outcome includes audit readiness, TMF completeness, data quality, financial performance, and the ability to remain on budget. These dimensions are connected. If source data are incomplete, EDC queries increase. If queries remain unresolved, data cleaning and database lock may be delayed. If documents are incomplete or incorrectly filed, TMF completeness and inspection readiness are affected. If rework increases, additional CRA, data management, quality, medical, site, and project management time is consumed. If effort increases without proper visibility, the project may remain technically active but financially uncontrolled.

Audit readiness depends on more than the presence of documents. It depends on whether the trial story can be reconstructed clearly, consistently, and credibly. The inspector or auditor should be able to understand what was done, when it was done, why it was done, who performed it, and how the trial maintained participant protection and data reliability. A TMF may be complete in percentage terms but still weak if documents are late, inconsistent, incorrectly classified, or disconnected from the operational reality of the study.

TMF completeness should therefore be understood not only as a document-counting exercise, but as a process outcome. If documents are created late, filed after repeated correction, rejected during QC, or disconnected from the work they are supposed to evidence, then TMF quality becomes a downstream reflection of poor process quality. In this sense, TMF completeness is not only a regulatory deliverable. It is also evidence of whether the trial process was controlled.

Data quality is another final outcome, but data quality is not created at the time of final analysis. It is created from the first source entry, the first patient visit, the first data field, the first monitoring review, and the first query response. If correct data reach the final report only after months of queries, repeated clarification, and reconciliation, the final dataset may be usable, but the process that produced it may still be inefficient and expensive.

Financial performance is also affected by process quality. Poor quality consumes time. Time consumes budget. Rework consumes budget again. A query is not only a data-management event. It may represent site effort, CRA follow-up, project management coordination, medical review, system updates, documentation, and delay. A TMF rejection is not only a document-quality issue. It may require correction, refiling, re-review, escalation, and sometimes additional explanation before audit or inspection.

Being on budget therefore depends partly on the quality of the process. A budget may be carefully negotiated at study start, but if the operational process generates avoidable rework, the original assumptions become less reliable. Poor process quality may appear as additional monitoring effort, additional data management time, additional site burden, additional vendor coordination, additional quality oversight, additional project management effort, and delayed milestone acceptance.

This is why clinical research should not only ask whether the final report is correct. It should also ask how efficiently, reliably, and transparently correct data and documents were produced.

Asking the Right Question

Clinical research often asks: is the final report correct? This is essential, but Six Sigma encourages an additional question: what is the probability that correct, complete, and consistent information reaches the final clinical study report without rework?

This question shifts attention from final inspection to process capability. If every step requires correction, reconciliation, or remediation, then quality is being achieved too late and at too high a cost.

The same question can be extended beyond the clinical study report. What is the probability that a source document, data field, TMF document, monitoring report, finance record, or project deliverable passes through the process without avoidable correction? What is the probability that the trial remains audit-ready throughout execution, not only after a remediation exercise? What is the probability that project costs remain aligned with the original budget because the process does not generate uncontrolled rework?

These are practical management questions, not theoretical statistical exercises.

Improving the Process and the Quality Process

The goal is not only to find more defects. The goal is to improve the process so that fewer defects are created.

At the same time, the quality process itself must also improve. Defects should be detected earlier, corrected faster, and prevented from recurring. Better source documentation design may reduce missing data. Better EDC edit checks may prevent incorrect entries. Better site training may reduce repeated deviations. Better system integration may reduce duplicate data entry. Better TMF metadata design may reduce QC rejections. Better dashboards may identify slow query resolution earlier. Better root cause analysis may reduce recurrence.

This creates two levels of improvement. The first is improvement of the clinical trial process itself. The second is improvement of the quality management process that detects, corrects, and prevents defects. Both are necessary. A quality system that only detects defects late may be compliant, but it may also be expensive and inefficient. A mature process should reduce the need for late correction by building quality into the work itself.

Conclusion

Clinical research already pays the cost of poor process quality. It pays through queries, rework, delays, remediation, monitoring follow-up, TMF correction, CAPA, repeated review, avoidable operational burden, and budget pressure.

However, these costs are often not measured as one connected process cost. They are distributed across systems, functions, emails, trackers, vendors, sites, and project teams. As a result, poor process quality may remain visible as individual issues, but invisible as a cumulative cost of trial delivery.

ICH E6(R3) emphasizes that quality should be built into the scientific and operational design and conduct of clinical trials. This is the essence of Quality by Design: important quality factors should be considered early, not repaired only after defects have already propagated through the process. In this context, Six Sigma does not replace GCP, Quality by Design, RBQM, QTLs, or clinical judgement. Rather, it offers a complementary measurement lens.

Six Sigma is useful because it translates quality from a compliance concept into a measurable process and financial concept. It asks not only whether quality was eventually achieved, but how much failure, detection, correction, and rework were required to achieve it. It also asks whether the process itself is capable of producing correct, complete, timely, and usable outputs with limited rework.

Clinical trials do not need to become factories. But they do need clearer measurement of process quality. A clinical trial should not only prove that the final data are correct. It should also be able to show how efficiently, reliably, and transparently correct data were produced, documents were completed, audit readiness was maintained, and budget expectations were protected.

Refusing to measure is refusing to improve.


References:

1. Rathi R, Vakharia A, Shadab M. Lean six sigma in the healthcare sector: A systematic literature review. Mater Today Proc. 2022;50:773-781. doi: 10.1016/j.matpr.2021.05.534. Epub 2021 Jun 7. PMID: 35155129; PMCID: PMC8820448. https://pmc.ncbi.nlm.nih.gov/articles/PMC8820448/

2. Schweikhart SA, Dembe AE. The applicability of Lean and Six Sigma techniques to clinical and translational research. J Investig Med. 2009 Oct;57(7):748-55. doi: 10.2310/JIM.0b013e3181b91b3a. PMID: 19730130; PMCID: PMC2835466. https://pmc.ncbi.nlm.nih.gov/articles/PMC2835466/

3. Lean Six Sigma in the Clinical Trial Industry: Two Perspectives. Applied Clinical Trials. Dawn Pope https://www.appliedclinicaltrialsonline.com/view/lean-six-sigma-clinical-trial-industry-two-perspectives 

4. McDermott, O.; Antony, J.; Bhat, S.; Jayaraman, R.; Rosa, A.; Marolla, G.; Parida, R. Lean Six Sigma in Healthcare: A Systematic Literature Review on Challenges, Organisational Readiness and Critical Success Factors. Processes 2022, 10, 1945. https://doi.org/10.3390/pr10101945