Your plant manager walks into the morning meeting and says "the hydraulic press had three failures last month." Someone asks: "Is that bad?" Without context, nobody knows. Three failures in 30 days of continuous operation is very different from three failures in a single 8-hour shift.
MTBF gives you that context. It tells you, on average, how long a piece of equipment runs before the next failure. It is the most widely used reliability metric in maintenance, and it drives everything from spare parts planning to capital replacement decisions.
If your hydraulic press has an MTBF of 240 hours, you can expect a failure roughly every 10 days of continuous operation. That number tells your maintenance planner when to schedule inspections, your storeroom manager how many spare parts to keep on hand, and your production scheduler how much buffer to build into the plan.
What is MTBF?
MTBF stands for Mean Time Between Failures. It is the average elapsed time between one failure and the next failure on a repairable system. The key word is "repairable." MTBF applies to equipment you fix and put back into service, not components you throw away and replace.
MTBF is calculated only during the time the equipment is actually operating. Planned downtime for scheduled maintenance, shift changes, or holidays does not count. You are measuring how long the machine runs when it is supposed to be running.
A higher MTBF means the equipment is more reliable. A compressor with an MTBF of 2,000 hours is more reliable than one with an MTBF of 500 hours, assuming both are in similar service conditions.
The MTBF Formula
The formula is straightforward:
MTBF = Total Operating Time / Number of Failures
Total operating time is the sum of all the periods when the equipment was running. You exclude repair time, planned downtime, and any period when the machine was not supposed to be operating.
Worked Example
Let's say you are tracking a centrifugal pump over one month. The pump is scheduled to run 24 hours a day, 30 days a month. That is 720 hours of scheduled time. During the month, the pump had 3 failures:
| Failure | When | Repair Time |
|---|---|---|
| Failure 1: Seal leak | Day 8 | 4 hours |
| Failure 2: Bearing overheat | Day 17 | 6 hours |
| Failure 3: Coupling misalignment | Day 26 | 5 hours |
Step 1: Calculate total repair time = 4 + 6 + 5 = 15 hours
Step 2: Calculate total operating time = 720 - 15 = 705 hours
Step 3: MTBF = 705 / 3 = 235 hours
This means, on average, the pump runs for 235 hours (about 9.8 days) between failures. That is your baseline. Now you can set a target: maybe 400 hours after addressing the root causes of those three failures.
MTBF vs MTTR vs MTTF: What is the Difference?
These three metrics sound similar but measure very different things. Here is the breakdown:
| Metric | Full Name | What It Measures | Applies To | Higher or Lower is Better? |
|---|---|---|---|---|
| MTBF | Mean Time Between Failures | Average run time between failures | Repairable systems (pumps, motors, conveyors) | Higher is better |
| MTTR | Mean Time to Repair | Average time to fix a failed system | Repairable systems | Lower is better |
| MTTF | Mean Time to Failure | Average time until a component fails permanently | Non-repairable items (light bulbs, bearings, seals) | Higher is better |
The critical distinction: MTBF is for things you repair and keep using. MTTF is for things that fail once and get replaced. A motor is repairable (MTBF). The bearing inside that motor is not (MTTF). When the bearing fails, you don't repair the bearing. You replace it with a new one.
In practice, many people use MTBF loosely for both cases. That is fine for internal conversation, but if you are writing reliability specifications or comparing vendors, the distinction matters.
For a deep dive on the repair side of this equation, see our complete guide to MTTR.
MTBF Benchmarks by Industry
MTBF varies enormously depending on the type of equipment, operating conditions, maintenance quality, and industry. Here are typical ranges based on industry data and field experience.
| Industry | Typical Equipment | Typical MTBF | World-Class MTBF |
|---|---|---|---|
| Automotive Manufacturing | Welding robots, stamping presses | 300-600 hrs | 1,200+ hrs |
| Food & Beverage | Filling lines, pasteurizers | 200-500 hrs | 800+ hrs |
| Chemical / Process | Pumps, compressors, reactors | 1,000-3,000 hrs | 5,000+ hrs |
| Pharmaceuticals | Tablet presses, packaging lines | 150-400 hrs | 800+ hrs |
| Power Generation | Turbines, generators | 2,000-6,000 hrs | 8,000+ hrs |
| Mining & Metals | Crushers, conveyors, mills | 100-400 hrs | 800+ hrs |
| Packaging | Case packers, wrappers, labelers | 80-250 hrs | 500+ hrs |
| Semiconductor | Lithography, etching, deposition | 500-1,500 hrs | 3,000+ hrs |
These numbers are broad ranges. Your specific MTBF depends on equipment age, operating conditions, maintenance quality, and operator training. The important thing is to track your own MTBF over time and trend it upward.
How to Improve MTBF: 6 Practical Steps
Improving MTBF means making your equipment fail less often. That sounds obvious, but the methods are specific and proven. Here are six steps that work in any plant.
1. Fix the repeat offenders first
Pull your failure history for the last 12 months. Sort by equipment and failure mode. You will almost certainly find that 3-5 failure modes account for 60-70% of all failures. These are your repeat offenders.
Do a proper root cause analysis on each one. Not a quick "the bearing failed," but a real investigation: why did the bearing fail? Was it a lubrication issue? Misalignment? Wrong bearing spec? Overloading? Fix the root cause and the failure stops repeating.
2. Implement or improve your preventive maintenance program
Most equipment failures are preventable with proper PM. If you are running your equipment until it breaks and then fixing it, your MTBF will always be low. Build a preventive maintenance program based on manufacturer recommendations and your own failure data.
Start with the basics: lubrication schedules, filter changes, belt inspections, alignment checks. These simple tasks prevent the majority of early-life failures. A well-executed PM program typically improves MTBF by 25-40% in the first year.
3. Improve operating procedures
Operator error causes 15-25% of equipment failures in most plants. That is not because operators are careless. It is because procedures are unclear, training is inconsistent, or the equipment setup process is not standardized.
Watch how different operators run the same machine. If they do it differently, you have a standardization problem. Document the best method, train everyone on it, and post the procedure at the machine. Video-based SOPs work especially well for complex startup sequences.
4. Address environmental factors
Heat, dust, moisture, and vibration accelerate equipment wear. A motor in a clean, temperature-controlled room will have a much higher MTBF than the same motor next to a grinding station covered in metal dust.
Common environmental fixes include: adding filters or enclosures, improving ventilation around heat-generating equipment, installing vibration isolation mounts, and keeping electrical cabinets sealed and clean. These are cheap fixes that pay back over years of improved reliability.
5. Use better parts and materials
If the same component keeps failing, consider upgrading to a higher-grade replacement. A standard nitrile seal that lasts 3 months might cost $12. A Viton seal that lasts 12 months might cost $28. The math is obvious when you factor in the labor cost and production loss of each failure.
Talk to your bearing suppliers, seal vendors, and component manufacturers. They often know about upgraded versions that are direct replacements with longer service life. The cost premium is usually small compared to the cost of a failure event.
6. Track and trend your MTBF data
You cannot improve what you do not measure. Track MTBF by equipment, by system, and by failure mode. Review monthly. Look for equipment where MTBF is declining, because that signals a developing problem.
Set targets: if your current MTBF on a critical compressor is 500 hours, set a 12-month target of 750 hours. Break that down into specific actions (new PM tasks, root cause fixes, parts upgrades) and track progress quarterly.
Common MTBF Calculation Mistakes
MTBF seems simple, but these mistakes are surprisingly common:
- Including planned downtime in operating time. If the machine was shut down for a weekend or a scheduled overhaul, that time should not count as operating time. MTBF measures reliability during operation, not calendar time.
- Counting planned maintenance as a failure. If you stopped the machine for a scheduled PM and found a worn part, that is not a failure. A failure is an unplanned stop. If you count PMs as failures, your MTBF will be artificially low and you will understate your reliability.
- Not enough data. MTBF calculated from 2 failures is statistically meaningless. You need at least 5-10 failure events to get a useful number. For critical equipment with very few failures, consider pooling data from identical machines.
- Mixing failure modes. A pump might have an MTBF of 300 hours for seal failures but 2,000 hours for bearing failures. If you lump them together, you get one number that does not help you prioritize. Track MTBF by failure mode for actionable insights.
- Confusing MTBF with expected life. An MTBF of 1,000 hours does not mean the equipment will last exactly 1,000 hours. It means, on average, you can expect a failure every 1,000 operating hours. Some intervals will be shorter, some longer. MTBF is a statistical average, not a guarantee.
- Using MTBF from manufacturer specs without adjustment. Manufacturers calculate MTBF under ideal lab conditions. Your plant is not a lab. Temperature, dust, vibration, and maintenance quality all affect real-world MTBF. Use manufacturer numbers as a starting point, but track your own actuals.
MTBF and Equipment Availability
MTBF and MTTR together determine your equipment availability. The formula is:
Availability = MTBF / (MTBF + MTTR)
Using our pump example: MTBF = 235 hours, average MTTR = 5 hours.
Availability = 235 / (235 + 5) = 235 / 240 = 97.9%
This connects directly to your OEE calculation, where Availability is the first factor. If you want to push your OEE Availability factor above 90%, you need a combination of high MTBF (few failures) and low MTTR (fast repairs when they do happen).
Here is a quick reference showing how MTBF and MTTR combinations affect availability:
| MTBF (hours) | MTTR (hours) | Availability |
|---|---|---|
| 100 | 8 | 92.6% |
| 200 | 6 | 97.1% |
| 500 | 4 | 99.2% |
| 1,000 | 4 | 99.6% |
| 2,000 | 2 | 99.9% |
Notice the diminishing returns. Going from MTBF 100 to 200 gives you a big availability jump (92.6% to 97.1%). Going from 1,000 to 2,000 barely moves the needle. At high MTBF values, reducing MTTR becomes the more effective strategy.
MTBF in Maintenance Strategy
MTBF data drives several important maintenance decisions:
- PM scheduling. If a component has an MTBF of 500 hours, you should inspect or replace it well before that point. A common rule of thumb is to schedule PM at 60-80% of MTBF. So inspect at 300-400 hours.
- Spare parts stocking. Low MTBF equipment needs more spares on hand. If your MTBF for a motor is 200 hours and your lead time for a replacement is 2 weeks, you better have one on the shelf.
- Capital replacement. When MTBF drops below a certain threshold despite good maintenance, the equipment may be reaching end of life. Tracking MTBF over years helps you build the business case for replacement before the machine becomes a constant production bottleneck.
- Vendor comparison. When buying new equipment, compare MTBF data from different vendors. A machine that costs 20% more but has 3x the MTBF is almost always the better deal when you factor in maintenance and downtime costs.
Where Dovient Fits
Dovient helps maintenance teams improve MTBF by capturing failure patterns and making repair knowledge accessible across the entire team. Here is how:
- Failure pattern recognition. Dovient tracks every repair event with structured data: what failed, why it failed, and what fixed it. Over time, this reveals repeat failure modes that manual tracking often misses.
- Root cause knowledge sharing. When a technician solves a tricky problem, that solution gets captured in Dovient. The next time the same failure occurs, any technician can find the proven fix immediately, not just the one person who figured it out last time. Read more about capturing tribal knowledge.
- Data-driven PM adjustments. Dovient's repair history data shows you exactly which equipment needs more frequent PM and which can safely go longer between services. That means fewer unnecessary PMs and fewer failures from under-maintained equipment.
If you want to start tracking your reliability metrics, try our free OEE Calculator to see how your current equipment performance stacks up. For a deeper conversation about improving your plant's MTBF, schedule a conversation with our team.