ATP Testing for Healthcare Facilities:
How RLU Benchmarks Replace the Honor System
Bioluminescence cleaning verification for hospitals, surgery centers, and medical offices. What CDC, AORN, and the 2022 SHINE trial actually require, and what the numbers mean when your surveyor asks.
ATP testing measures organic residue on cleaned surfaces in 10 to 15 seconds. The healthcare benchmark is 100 RLU or below, established by Mulvey 2011 and adopted by NHS Scotland. The 2022 SHINE trial proved ATP feedback to EVS staff drops MDRO incidence in ICUs (IRR 0.876, P=.002) where fluorescent markers did not. It is the verification layer your surveyor expects to see documented.
The Short Answer
ATP testing, adenosine triphosphate bioluminescence testing, is the method healthcare facilities use to verify that a cleaned surface actually has the biological residue removed. A swab contacts the surface, is inserted into a luminometer, and produces an RLU reading in 10 to 15 seconds. Below 100 RLU is the NHS Scotland and Mulvey 2011 benchmark for general healthcare surfaces, correlating to under 2.5 CFU per square centimeter. ICU environments commonly use 45 to 63 RLU. It is not a pathogen detection test. It is a proxy for cleaning thoroughness. The 2022 SHINE trial is the strongest published evidence that this proxy, when fed back to EVS staff in real time, actually reduces MDRO incidence in ICUs. Fluorescent markers did not produce the same result in the same trial.
How does ATP testing verify that healthcare cleaning actually happened?
Every living cell runs on ATP. Adenosine triphosphate is the energy currency of biology. Bacteria carry it. Blood carries it. Mucus, skin cells, food residue, all of it carries ATP. When a surface is poorly cleaned, that residue stays behind. ATP testing detects it.
The mechanism is the luciferin-luciferase reaction, the same bioluminescence chemistry that makes fireflies glow. A swab contacts the surface, is activated with the luciferin-luciferase reagent, and the luminometer reads the light emission produced when the reagent reacts with whatever ATP is present. The result is a single number: Relative Light Units. The reaction and the read take 10 to 15 seconds. No incubation. No lab. No 48-hour wait for culture results.
The RLU value is a proxy for biological burden, not a pathogen identification. A surface with an RLU above threshold has enough organic residue that you cannot confirm the disinfectant reached the actual surface material. A surface below threshold was cleaned thoroughly enough that residue is below the level correlated with significant microbial load.
That distinction matters. I have been in facilities where the infection prevention team was satisfied with ATP results below 100 RLU on bed rails but had never asked what product was used or whether the contact time was observed. ATP verifies cleaning thoroughness. It does not verify that the right disinfectant was used at the right dwell time. Both legs of the program need to hold.
For the full picture of what CDC, AORN, EPA, and Joint Commission require in healthcare cleaning, read the Healthcare Cleaning Standards Field Guide. ATP verification is one layer in a four-framework compliance program.
What RLU threshold counts as a passing read for healthcare surfaces?
This is where most conversations about ATP go sideways. There is no single universal pass number. The most cited reference is Mulvey et al., Journal of Hospital Infection, 2011. That study established a correlation between 100 RLU and under 2.5 CFU per square centimeter on the NHS Scotland standard. NHS Scotland adopted 100 RLU as its operational benchmark for general hospital surfaces. That number has become the de facto industry reference in the U.S. as well.
But 100 RLU is not the right threshold for every surface in every setting. ICUs, transplant units, and operating rooms typically use 45 to 63 RLU. The Methodist Specialty and Transplant Hospital published a 2024 study using 45 RLU as the inpatient threshold and reported 95 percent weekly pass compliance after baseline calibration. High-touch surfaces in those environments, bed rails, call buttons, IV poles, tray tables, need tighter thresholds than a waiting room chair.
Outpatient medical offices and ambulatory clinics commonly use 200 RLU as an alert threshold and 500 RLU as a fail. The CDC and manufacturers including Hygiena and PDI explicitly state that no single benchmark fits all surfaces, materials, and use patterns. A defensible ATP program documents its thresholds by zone, surface type, and cleaning frequency. It includes the rationale. When your surveyor asks why you use 63 RLU in the ICU, the answer should not be "that's what someone told us."
| Surface / Zone | Pass Threshold | Alert Threshold | Source Basis |
|---|---|---|---|
| ICU / Transplant unit high-touch | 45 to 63 RLU | 64 to 100 RLU | Methodist Specialty 2024; NHS Scotland ICU guidance |
| OR terminal clean surfaces | 63 RLU | 64 to 100 RLU | AORN environmental hygiene guidelines; facility SOP documentation |
| Hospital med-surg bed rail, call button | 100 RLU | 101 to 200 RLU | Mulvey et al., J Hosp Infect, 2011; NHS Scotland standard |
| Hospital med-surg floor surfaces | 200 RLU | 201 to 500 RLU | General industry practice; surface type adjustment |
| Outpatient exam room table and arm | 200 RLU | 201 to 500 RLU | Hygiena and PDI manufacturer guidance |
| Outpatient waiting area seating | 500 RLU | 501 to 1000 RLU | Low-risk zone; general commercial reference range |
| Restroom frequently touched surfaces | 100 RLU | 101 to 300 RLU | High-touch designation; NHS Scotland cross-reference |
Note: Thresholds above represent common published ranges and should be adapted to your facility-specific cleaning frequency, surface materials, and risk zone classification. Publish your facility thresholds in writing before running your first audit cycle.
Why did the SHINE trial change the conversation about ATP in healthcare?
Before 2022, the argument for ATP testing in healthcare was mostly theoretical. You could show that low RLU correlated with low microbial counts, and you could argue that clean surfaces reduce pathogen transmission. But no randomized trial had directly tied an ATP monitoring program to a reduction in actual infection rates in a real ICU population.
The SHINE trial changed that. Anderson et al., published in Clinical Infectious Diseases in 2022, ran a cluster-randomized crossover trial across six ICUs at three U.S. academic medical centers. The intervention was real-time feedback to EVS staff on ATP results and cleaning performance. The comparator was UV/fluorescent marker monitoring with similar feedback.
Results: ATP monitoring produced a statistically significant reduction in MDRO incidence. Incidence rate ratio 0.876, 95 percent CI 0.807 to 0.951, P=.002. In plain terms, ICUs using ATP feedback had 12.4 percent fewer multidrug-resistant organism acquisitions than ICUs using fluorescent marker feedback. The fluorescent marker arm produced no statistically significant reduction.
The mechanism is what matters. ATP monitoring gave EVS staff a number immediately after cleaning. That number either confirmed the clean or triggered a retraining event on the spot. Fluorescent markers require a UV light and a separate walkthrough after cleaning. They do not catch the quality of the disinfection. They catch whether a surface was wiped. ATP catches whether the wipe actually worked.
One more number from the SHINE trial that does not get cited often enough. Room turnaround time impact: ATP monitoring added one additional minute to room turnaround. Fluorescent marker monitoring added 4.5 additional minutes. The clinically superior method was also operationally faster.
If you are pricing out a medical cleaning program that includes ATP verification, the medical office cleaning cost guide for Atlanta covers how verification programs affect per-square-foot pricing in the Southeast market.
How is ATP testing different from fluorescent markers, RODAC plates, and visual inspection?
Four verification methods show up in healthcare EVS programs. They measure different things. They are not interchangeable. Here is what each one actually tells you.
| Method | What It Measures | Time to Result | False Positive Risk | SHINE Trial Result |
|---|---|---|---|---|
| ATP Bioluminescence | Organic residue (biological burden proxy) | 10 to 15 seconds | Low; reagent reacts to ATP only | IRR 0.876, P=.002, MDRO reduction confirmed |
| Fluorescent Marker (UV/GloGerm) | Whether surface was physically touched | Immediate under UV light | High; does not confirm disinfection occurred | No statistically significant MDRO reduction |
| RODAC Plate (contact agar) | Culturable microorganism count per surface | 24 to 48 hours | Low; culture is specific, but too slow for corrective action | Not evaluated in SHINE trial |
| Visual Inspection | Visible soil, visible streaks, visible residue | Immediate | Very high; misses biological residue entirely | Not evaluated; CDC explicitly does not recommend as sole method |
RODAC plates deserve a separate note. Contact agar culture is the most precise method for quantifying microbial contamination. A RODAC plate pressed against a surface and incubated for 24 to 48 hours will tell you exactly how many colony forming units were present. The problem is the 48-hour window. By the time you have the result, the room has had two more patients. RODAC plates are useful for baseline studies, program validation, and outbreak investigation. They are not useful for real-time corrective action.
The right program uses ATP for real-time feedback and spot RODAC sampling for periodic program validation. The two methods are complementary. ATP tells EVS staff whether to reclean before the next patient. RODAC tells the infection prevention committee whether the ATP thresholds are calibrated correctly.
What does an ATP testing program actually cost?
The capital cost is straightforward. A healthcare-grade luminometer, Hygiena SystemSURE Plus, 3M Clean-Trace LM1, or equivalent, runs $1,500 to $3,000 at current pricing. The luminometer is reusable. The ongoing cost is swabs, which run $1 to $3 each at volume pricing depending on the device and swab type.
Here is what that looks like at two facility scales. These are general estimates based on market pricing. They are not quotes.
50-bed community hospital
Post-discharge ATP on 20 to 30 percent of rooms plus post-terminal on all ICU and OR rooms. Data logged to Hygiena SureTrend or 3M Clean-Trace platform.
10-room medical office
Weekly spot-check on 5 high-touch surfaces per room. Results logged and reviewed monthly with cleaning contractor. Corrective action documented when over threshold.
Many cleaning contractors who operate ATP programs fold the swab cost into the service rate rather than billing it as a line item. If you are evaluating vendors, ask specifically: does the ATP program cost appear in the contract as an included service or as a separate billable? A vendor who bills swabs separately at retail pricing will often cost more annually than one who includes them in the base rate.
The honest context: ATP program cost at a 50-bed hospital is roughly $10,000 to $30,000 per year depending on swab volume and whether you hire program coordination internally or include it in the cleaning contract. Against a single MRSA HAI at $23,301 in direct costs or a single C. diff case at $34,157, the math is not complicated.
How do you run an ATP audit cycle in a Southeast medical facility?
An ATP audit cycle is not a number. It is a loop. Surface sample, result, compare to threshold, corrective action if over threshold, reclean, resample, document. Here is how that works in practice for a Southeast hospital or medical office.
Step 1: Map target surfaces by zone
Identify high-touch surfaces in each zone: patient room (bed rail, call button, TV remote, tray table, toilet flush handle, light switch), OR (anesthesia cart, OR table rail, electrosurgical unit controls), nurses station (keyboard, phone, countertop), exam room (table surface, arm support, countertop). Assign each surface a risk classification and corresponding pass threshold. Document this surface map in writing. It becomes the sampling protocol.
Step 2: Set swab cadence by zone
ICU and OR: post-terminal clean on every room, every cycle. Hospital med-surg: 20 to 30 percent random sample of rooms post-discharge. Outpatient exam rooms: weekly spot-check on three to five surfaces per room. Waiting areas and corridors: monthly. Adjust cadence based on outbreak status or survey proximity.
Step 3: Swab and read in real time
EVS staff swab each target surface immediately after cleaning, before the next patient enters. Swab activates, goes into the luminometer, result appears in 10 to 15 seconds. If the result is below threshold, the surface passes and the room is cleared. If the result is above threshold, the surface fails.
Step 4: Corrective action on failures
On a fail: reclean the surface, observe full disinfectant contact time, resample. If the second sample passes, document the failure and corrective action. If the second sample fails, escalate to the supervisor and flag for retraining. The retraining event must be logged by name, date, surface type, and the specific protocol reviewed. That log is what surveyors expect to see.
Step 5: Data logging and trend reporting
Log every result to your data platform, Hygiena SureTrend, 3M Clean-Trace, or equivalent, by surface type, zone, and date. Generate a monthly pass rate report by zone. Share the report with your infection prevention committee. Trend lines matter more than single-point readings. A zone that was at 95 percent pass last month and is at 80 percent pass this month needs investigation before it drops further.
Step 6: Document for surveyor readiness
Keep three months of ATP data accessible in your documentation system, not just in the device memory. When TJC or DNV arrives, you should be able to produce pass rates by zone, failure events and corrective actions, and staff retraining logs within the three-hour DNV document window. The data is only valuable if it is organized and producible under pressure.
"ATP testing without a published threshold and a documented corrective action loop is theater. The number on the screen is not the program. The audit trail is the program."
Austin Jones, CEO, Millennium Facility Services
For the compliance checklist that pairs with an ATP program, see the medical facility cleaning compliance checklist. It covers documentation requirements across TJC, CMS, and AORN that an ATP program feeds into.
What do CDC and Joint Commission actually say about ATP testing?
Neither CDC nor TJC mandates ATP testing by name in their current published guidance. That is often used as a reason not to implement it. It is the wrong conclusion.
The CDC HICPAC Environmental Infection Control Guidelines (MMWR Vol. 52, No. RR-10, 2003, with subsequent updates including the March 2024 supplemental guidance) explicitly state that visual inspection is insufficient as a sole verification method. The CDC Best Practices for Environmental Cleaning in Healthcare Facilities (CS314156-A) recommends an objective monitoring strategy for high-risk patient care areas. ATP bioluminescence is the primary example of an objective, real-time monitoring method. The CDC does not say "use ATP." It says use something that produces objective results. ATP is the tool the industry has built around that recommendation.
Joint Commission's IC.02.01.01 requires facilities to implement and manage their infection prevention and control program. Under the 2024 rewrite, surveyors now move beyond documentation to testing practical implementation. A facility that has only visual inspection as a verification method is likely to draw scrutiny when surveyors ask how the cleaning program is monitored. Facilities with documented ATP programs, threshold definitions, trend data, and corrective action logs are in a stronger position to satisfy IC.02.01.01 under the new survey methodology.
AORN's Guideline for Environmental Hygiene (October 2025) addresses verification of OR cleaning in explicit terms. The guideline recommends objective monitoring methods for terminal cleaning verification. ATP bioluminescence is referenced as an appropriate method. AORN also addresses the documentation requirement: date, time, personnel identity, disinfectant lot, concentration, and verification result on every terminal clean. That verification result should be an ATP reading, not "visually clean."
The documentation surveyors expect to see: ATP pass rates by zone for the prior 90 days, written facility thresholds by surface type, corrective action logs for failures, and staff retraining records tied to specific failure events. A printed report from your data platform, Hygiena SureTrend or 3M Clean-Trace, covers the first three. Your training records system covers the fourth. Facilities that can produce all four in the DNV three-hour document window are in a defensible position. Facilities that can produce none of them are not.
Millennium runs ATP verification programs at medical facilities across the Southeast. Full scope and approach: medical facility cleaning services and the healthcare industry overview.
The Healthcare Cleaning Standards Guide
CDC, AORN, EPA, and Joint Commission requirements in one place. Includes ATP program setup guidance, RLU threshold tables by zone, disinfectant formulary decision trees, and the documentation framework surveyors expect.
Download the Healthcare Cleaning Standards Guide (PDF)No email required. Updated May 2026.
Related Reading
- Healthcare Cleaning Standards: A Field Guide for Southeast Facility Directors
- Medical Office Cleaning Cost in Atlanta: What Practices Actually Pay Per Square Foot
- Medical Facility Cleaning Compliance Checklist: 20-Item Audit Tool
- Millennium Facility Services: Medical Facility Cleaning
- Healthcare Industry Services Overview
Frequently Asked Questions
ATP testing uses bioluminescence to measure adenosine triphosphate on a cleaned surface. A swab wiped across a surface goes into a luminometer and produces a result in 10 to 15 seconds as an RLU value. Lower RLU means less organic residue, which correlates with lower microbial burden. It is the only real-time objective verification method that gives EVS staff immediate feedback before the next patient enters the room.
The most widely cited benchmark is 100 RLU or below, established by Mulvey et al. in the Journal of Hospital Infection (2011). That threshold correlates with under 2.5 CFU per square centimeter on the NHS Scotland standard. ICU and transplant environments frequently use stricter thresholds of 45 to 63 RLU. A defensible program defines thresholds by surface type and zone and documents the rationale, not a single blanket number applied to every surface in the building.
Neither CDC nor Joint Commission mandates ATP testing by name. CDC HICPAC guidance strongly recommends objective monitoring methods, noting that visual inspection alone is insufficient. TJC's IC.02.01.01 requires facilities to implement and manage their infection prevention program, and surveyors increasingly expect cleaning verification data when they audit EVS operations. Facilities that lack objective verification carry higher citation exposure than those that can produce ATP trending data and corrective action logs.
Fluorescent markers detect whether the cleaning crew physically touched a surface. They do not measure biological residue or confirm disinfection occurred. The 2022 SHINE trial compared both approaches across six ICUs: ATP monitoring produced a statistically significant MDRO reduction (IRR 0.876, P=.002), while fluorescent marker monitoring produced no statistically significant reduction. ATP tells you how clean the surface is. Fluorescent markers tell you whether someone touched it.
Cadence depends on zone and risk level. ICUs and operating rooms should be swabbed after every terminal clean. Medical-surgical floors typically use random sampling on 20 to 30 percent of rooms post-discharge. Outpatient medical offices run weekly spot-checks on high-touch surfaces in exam rooms. The sampling schedule needs to be consistent enough to produce trend data, not just isolated spot readings that cannot drive corrective action.
No. ATP testing measures total organic residue, not specific pathogens. A low RLU reading does not confirm the absence of C. diff spores or Candida auris. Those pathogens require EPA List K and EPA List P disinfectants respectively, and their elimination depends on correct product selection and full contact time. ATP testing confirms a surface was thoroughly cleaned. It is a proxy for cleaning quality, not a pathogen-specific diagnostic tool.
A healthcare-grade luminometer costs between $1,500 and $3,000. Swabs run $1 to $3 each at volume pricing. A 10-room medical office running weekly spot-checks on five surfaces per room uses roughly 200 to 250 swabs per month, which runs $200 to $750 per month at retail. Many cleaning contractors who operate ATP programs include swab cost in their service rate rather than billing it separately. The luminometer capital cost can typically be amortized over 24 to 36 months.
Want us to walk your facility and show you what the numbers look like?
We run baseline ATP sampling, publish surface-level results, and tell you where the gaps are before a surveyor finds them. No obligation.
Your surveyor is going to ask for the ATP data. Do you have it?
Every Millennium healthcare account receives a monthly cleaning verification report with surface-level ATP data, contact-time compliance, and trend lines tied to high-touch zones. The report is yours. Show it to your infection prevention committee, your surveyor, or your CFO. That is the deliverable most cleaning vendors do not put in writing.
No obligation. We find the gaps before a surveyor does.