Maintenance Fundamentals

15 Maintenance KPIs Every Plant Manager Should Track

February 23, 202614 min readDovient Learning

You cannot improve what you do not measure. But tracking 50 KPIs means you are drowning in dashboards and acting on none of them. The maintenance teams that actually improve are the ones that track 12-15 metrics, review them consistently, and act on what they find.

This guide covers 15 maintenance KPIs organized into four groups: Equipment Performance, Process Effectiveness, Cost Management, and People. For each KPI, you get the formula, a target range, and an explanation of why it matters and what to do when the number moves in the wrong direction.

If you are new to maintenance metrics, start with the first four (OEE, MTBF, MTTR, Availability). Once those are being tracked consistently, add the rest one group at a time.

How to Use This Guide

Do not try to implement all 15 KPIs at once. Pick the group that matches your biggest current problem:

  • Too many breakdowns? Start with Equipment Performance KPIs.
  • PM program not delivering results? Start with Process Effectiveness KPIs.
  • Maintenance budget out of control? Start with Cost Management KPIs.
  • Struggling with workforce issues? Start with People KPIs.
Maintenance KPI Dashboard Overview Equipment Performance OEE 72% +3% vs last month MTBF 186 hrs Target: 250 hrs MTTR 2.4 hrs -0.6 hrs improved Avail. 88% Target: 90% 5-month trend Process Effectiveness PM Comp. 92% Target: 90%+ Backlog 3.2 wks Target: 2-4 wks Wrench 28% Target: 40%+ Sched. 85% Target: 80%+ Cost Management Cost/Unit $1.24 -$0.08 vs prior Parts Turn. 2.1x Target: 2-3x Planned % 68% +5% improving People Training 32 hrs Target: 40 hrs/yr Safety 0 Recordable incidents OT % 18% Target: below 10%

Group 1: Equipment Performance KPIs

These four metrics tell you how reliably your equipment is running. If you can only track four KPIs, track these.

1. OEE (Overall Equipment Effectiveness)

Formula: Availability x Performance x Quality

Target: 85% is world-class. Most plants operate between 55-75%. Continuous process industries (chemical, refining) tend to be higher; discrete manufacturing and packaging tend to be lower.

Why it matters: OEE is the single best measure of how productively you use your equipment. It combines downtime losses, speed losses, and quality losses into one number. A low OEE means you are leaving production capacity on the table. For a complete breakdown of OEE, see our what is OEE guide.

When it drops: Look at the three factors individually. Which one changed? If Availability dropped, check recent downtime events. If Performance dropped, check for speed losses or micro-stops. If Quality dropped, check for process drift or material issues.

2. MTBF (Mean Time Between Failures)

Formula: Total Operating Time / Number of Failures

Target: Depends heavily on equipment type and age. The goal is consistent improvement. If your MTBF on a critical pump was 150 hours last quarter, aim for 175 hours next quarter.

Why it matters: MTBF measures equipment reliability. A rising MTBF means your equipment is running longer between failures, which means your PM and PdM programs are working. A falling MTBF means reliability is degrading and you need to investigate. For a detailed guide, see what is MTBF.

When it drops: A sudden MTBF drop on a specific asset usually points to a new failure mode or a change in operating conditions (new material, new operator, environmental change). A gradual decline across multiple assets may indicate PM program deterioration.

3. MTTR (Mean Time to Repair)

Formula: Total Repair Time / Number of Repairs

Target: Industry average is 3-5 hours. Best-in-class plants achieve 1-2 hours on most equipment. Target depends on equipment complexity.

Why it matters: MTTR measures how fast you can get equipment back online after a failure. It reflects your team's diagnostic skills, parts availability, procedure quality, and repair execution. Even if you cannot prevent every failure, reducing MTTR shortens the impact of each one. See our MTTR guide for improvement strategies.

When it increases: Common causes: spare parts not available (technician waiting for delivery), diagnostic difficulty (new or unfamiliar failure mode), skill gaps (less experienced technicians), or lack of documented procedures.

4. Equipment Availability

Formula: (Planned Production Time - Downtime) / Planned Production Time x 100

Target: 90%+ for critical production equipment. 95%+ for bottleneck machines.

Why it matters: Availability is the most visible component of OEE and the one most directly affected by maintenance. Every unplanned stop and every PM task that runs over schedule reduces Availability. It is the link between maintenance performance and production output.

When it drops: Look at downtime event frequency and duration. Are you having more events (prevention problem) or longer events (response problem)? The answer tells you whether to focus on PM/PdM improvements or MTTR improvements.

Group 2: Process Effectiveness KPIs

These metrics measure how well your maintenance processes are working. Good equipment performance comes from good maintenance processes, and these KPIs tell you whether your processes are solid.

5. PM Compliance

Formula: (PM Tasks Completed On Time / PM Tasks Due) x 100

Target: 90%+ consistently. Below 80% means your PM program is not protecting your equipment.

Why it matters: A PM schedule is only useful if the work gets done. Low PM compliance leads to more breakdowns, which leads to more emergency work, which leads to less time for PM. It is a death spiral. Tracking PM compliance catches the problem before it cascades. Full details in our PM compliance guide.

When it drops: Common causes: too many PM tasks overwhelming the team, emergency work consuming PM time, poor scheduling, or PM tasks taking longer than estimated. Start by looking at which PMs are being missed and why.

6. Maintenance Backlog (in weeks)

Formula: Total Open Work Order Hours / Weekly Labor Capacity

Target: 2-4 weeks. Below 2 weeks may mean your team is underutilized. Above 6 weeks means work is piling up faster than you can complete it.

Why it matters: Backlog is your leading indicator of whether your team has enough capacity to keep up with maintenance demand. A growing backlog means deferred maintenance is accumulating, which means future breakdowns are building. See our maintenance backlog guide for management strategies.

When it grows: Either demand is increasing (more breakdowns, more PM tasks) or capacity is decreasing (vacancies, training time, excessive overtime leading to burnout). The backlog number alone does not tell you which one. You need to look at work order inflow rate and team capacity separately.

7. Wrench Time

Formula: (Time Spent on Actual Repair Work / Total Shift Time) x 100

Target: 40-55% is good. Industry average is 25-35%. Best-in-class operations reach 55%+.

Why it matters: Wrench time measures how much of your technicians' day is spent actually repairing or maintaining equipment vs. waiting for parts, walking to jobs, looking for manuals, filling out paperwork, or sitting in meetings. Low wrench time means your support processes are wasting your most expensive resource: skilled technician labor.

When it is low: Do a wrench time study. Follow a technician for a full shift and categorize their time. The non-wrench categories reveal where the waste is. Common findings: 15-20% travel time (storeroom too far from work area), 10-15% waiting for parts, 10-15% waiting for permits or access.

8. Schedule Compliance

Formula: (Scheduled Work Orders Completed / Scheduled Work Orders Planned) x 100

Target: 80%+. Below 70% means your weekly schedule is not reliable and planning is not driving the work.

Why it matters: Schedule compliance tells you whether the work you planned for this week actually got done. Low compliance means emergency work is disrupting your planned schedule, or your estimates are unreliable, or your priorities are shifting mid-week. The maintenance team that runs 90% schedule compliance is proactive. The one running 50% is reactive.

When it drops: Track the reason for every schedule break (work order that got bumped or delayed). The reasons cluster into: emergency work preempting scheduled work, parts not available when the job started, equipment not released by operations, or job took longer than estimated.

Group 3: Cost Management KPIs

These metrics connect maintenance performance to financial outcomes. They are the KPIs your plant manager and finance team care about.

9. Maintenance Cost Per Unit Produced

Formula: Total Maintenance Cost / Good Units Produced

Target: Varies widely by industry. The important thing is the trend. If cost per unit is decreasing while output stays stable or increases, your maintenance is getting more efficient.

Why it matters: This connects maintenance spend directly to production output. It accounts for both the cost of maintenance and the productivity of the equipment. A plant that spends $1 million on maintenance and produces 1 million units ($1.00/unit) is more efficient than a plant that spends $800,000 on maintenance but only produces 600,000 units ($1.33/unit). See our manufacturing cost guide for the full cost calculation.

When it rises: Either maintenance costs increased (more breakdowns, more parts, more overtime) or production decreased (lower OEE, fewer operating hours). Break it apart to find the cause.

10. Spare Parts Inventory Turnover

Formula: Annual Parts Consumption / Average Inventory Value

Target: 2-3x per year for most industrial operations. Higher turnover means less money tied up in inventory. Too high means you are running lean and risking stockouts.

Why it matters: Spare parts inventory is a balance between availability (having the part when you need it) and carrying cost (money sitting on shelves). A turnover of 1x means your average part sits in the storeroom for a full year before being used. A turnover of 0.5x means two years. That is money that could be deployed elsewhere.

When it is too low: You have excess or obsolete inventory. Conduct a review of parts that have not moved in 18+ months. If the equipment they belong to has been retired or the part is no longer failure-prone, remove them from inventory.

11. Planned vs. Reactive Work Ratio

Formula: (Planned Work Order Hours / Total Work Order Hours) x 100

Target: 80%+ planned, 20% or less reactive. World-class operations run above 85% planned.

Why it matters: This ratio tells you whether your team is proactive or reactive. Planned work is more efficient (parts ready, procedures available, equipment scheduled for access), safer (job planned with hazards considered), and cheaper (straight-time labor, standard parts pricing). The shift from reactive to planned work is covered in detail in our preventive vs reactive maintenance guide.

When it is below 60%: Your team is firefighting. Address the root causes of your top breakdown modes (Step 4 in our downtime reduction guide), build PM schedules for your critical assets, and protect PM time on the weekly schedule from emergency work interruptions.

Group 4: People KPIs

Equipment does not maintain itself. Your team's skills, safety, and workload directly affect every other metric on this list.

12. Training Hours Per Technician

Formula: Total Training Hours / Number of Technicians (annual)

Target: 40-80 hours per technician per year (roughly 1-2 weeks). This includes formal training, vendor training, on-the-job mentoring, and certification programs.

Why it matters: Equipment complexity increases every year. New control systems, new drives, new materials. If your team's skills are not keeping pace, MTTR increases and repeat failures increase. Training is not an expense you cut when budgets are tight. It is an investment that pays back through faster repairs and fewer mistakes.

When it is too low: Common reason: "We do not have time for training because we are too busy with breakdowns." This is the same trap as "we do not have time for PM." Invest in training and the breakdowns decrease, freeing up more time. Start with the skills that address your most common failures.

13. Maintenance Safety Incidents

Formula: Number of recordable safety incidents in maintenance per period

Target: Zero. Every recordable incident should trigger an investigation and corrective action.

Why it matters: Maintenance work is inherently higher risk than most plant activities: working with energized equipment, confined spaces, elevated work, hot work, and chemical exposure. A rising incident rate signals that procedures are not being followed, risk assessments are not thorough, or time pressure from breakdowns is causing shortcuts.

When incidents occur: Investigate every one, no matter how minor. Look for procedure gaps, training gaps, and organizational pressure that encourages shortcuts. A near-miss reporting program catches risks before they become injuries.

14. Maintenance Overtime Percentage

Formula: (Overtime Hours / Total Hours Worked) x 100

Target: Below 10% for a well-run operation. Sustained overtime above 15% leads to burnout, turnover, and safety issues.

Why it matters: Overtime at 1.5x labor rate is expensive. But the bigger cost is burnout and turnover. Skilled maintenance technicians are hard to replace, and the replacement cycle (recruiting, hiring, training) takes 12-18 months to get a new technician fully productive. Sustained overtime is a sign of understaffing, poor planning, or excessive reactive work.

When it is too high: Determine the cause. Emergency work driving weekend callouts? Address the equipment reliability issues causing the emergencies. Planned work not being completed during straight time? Improve scheduling and wrench time. Not enough people? Build the business case for additional headcount.

15. First-Time Fix Rate

Formula: (Repairs Completed Successfully on First Visit / Total Repairs) x 100

Target: 85%+. Below 70% indicates significant diagnostic or skill gaps.

Why it matters: When a technician has to come back a second or third time to fix the same problem, you are paying double or triple the labor cost and the equipment stays down longer. A low first-time fix rate means technicians are guessing rather than diagnosing, parts are being replaced speculatively, or root causes are not being identified on the first attempt.

When it is low: Improve diagnostic procedures, build a knowledge base of past repairs, and invest in diagnostic skills training. AI-powered diagnostic tools can also help by matching current symptoms to similar past repairs.

How the KPIs Connect

These 15 KPIs are not independent. They form a connected system where improving one often improves others, and deterioration in one often cascades.

KPI Hierarchy: How Maintenance KPIs Connect Cost Per Unit Business outcome OEE | MTBF | MTTR | Availability Equipment Performance PM Compliance | Schedule Compliance | Backlog Wrench Time | Planned vs Reactive | Parts Turnover Process Effectiveness Training Hours | Safety Incidents | Overtime % | First-Time Fix Rate People Foundation Lower cost per unit = better competitiveness Better equipment = more output at lower cost Better processes = better equipment performance Skilled, safe team = effective processes Drives Drives Drives Reading the pyramid bottom-up: Invest in people to improve processes. Better processes improve equipment. Better equipment lowers cost per unit. Lower cost drives business results.

Here are the most common connections:

  • PM Compliance drives MTBF. When PM gets done consistently, equipment reliability improves and time between failures increases.
  • MTBF and MTTR drive Availability. Fewer failures (higher MTBF) and shorter repairs (lower MTTR) both increase the percentage of time equipment is running.
  • Availability drives OEE. In most plants, Availability is the weakest OEE factor and the one most influenced by maintenance.
  • OEE drives Cost Per Unit. Higher OEE means more output from the same fixed costs, which reduces cost per unit produced.
  • Training Hours drive First-Time Fix Rate. Better-trained technicians diagnose correctly on the first attempt, which reduces MTTR and repeat visits.
  • Wrench Time drives Schedule Compliance. When technicians spend more time on actual work and less on non-value activities, more scheduled work gets completed on time.

Getting Started: A 90-Day KPI Implementation Plan

Month 1: Start tracking OEE (or at least Availability), MTBF, MTTR, and PM Compliance. These four give you the core picture. Use simple tools: a shared spreadsheet, whiteboard, or your CMMS reports.

Month 2: Add Maintenance Backlog, Schedule Compliance, and Planned vs. Reactive ratio. These tell you whether your processes are supporting the equipment metrics. Start posting results where the team can see them.

Month 3: Add the remaining metrics as you build the tracking capability. Start weekly or monthly review meetings where you look at the numbers, discuss trends, and assign actions for any KPIs moving in the wrong direction.

The key is consistency. A KPI tracked accurately for 12 months gives you trend data, seasonal patterns, and a real baseline for improvement targets. A KPI tracked sporadically is just noise.

Where Dovient Fits

Dovient helps improve the KPIs that are hardest to move through traditional approaches alone.

  • MTTR and First-Time Fix Rate. Dovient's AI matches symptoms to past repairs, giving technicians a diagnostic head start. This directly reduces repair time and increases the chance of a correct fix on the first visit.
  • Knowledge capture for training. When experienced technicians document their repair approaches in Dovient, that knowledge becomes training material for the entire team. This improves First-Time Fix Rate and reduces the skill gap between senior and junior technicians.
  • Better data for better decisions. Dovient tracks repair patterns and failure frequencies, giving you the data foundation for most of the KPIs in this guide. Pattern visibility shows you where to focus improvement efforts for maximum impact.

Want to talk about your plant's KPIs? Schedule a conversation with our team to discuss where your biggest improvement opportunities are.


Related Articles