What is MTBF? Mean Time Between Failures Explained

Your plant manager walks into the morning meeting and says "the hydraulic press had three failures last month." Someone asks: "Is that bad?" Without context, nobody knows. Three failures in 30 days of continuous operation is very different from three failures in a single 8-hour shift.

MTBF gives you that context. It tells you, on average, how long a piece of equipment runs before the next failure. It is the most widely used reliability metric in maintenance, and it drives everything from spare parts planning to capital replacement decisions.

If your hydraulic press has an MTBF of 240 hours, you can expect a failure roughly every 10 days of continuous operation. That number tells your maintenance planner when to schedule inspections, your storeroom manager how many spare parts to keep on hand, and your production scheduler how much buffer to build into the plan.

What is MTBF?

MTBF stands for Mean Time Between Failures. It is the average elapsed time between one failure and the next failure on a repairable system. The key word is "repairable." MTBF applies to equipment you fix and put back into service, not components you throw away and replace.

MTBF is calculated only during the time the equipment is actually operating. Planned downtime for scheduled maintenance, shift changes, or holidays does not count. You are measuring how long the machine runs when it is supposed to be running.

A higher MTBF means the equipment is more reliable. A compressor with an MTBF of 2,000 hours is more reliable than one with an MTBF of 500 hours, assuming both are in similar service conditions.

The MTBF Formula

The formula is straightforward:

MTBF = Total Operating Time / Number of Failures

Total operating time is the sum of all the periods when the equipment was running. You exclude repair time, planned downtime, and any period when the machine was not supposed to be operating.

Worked Example

Let's say you are tracking a centrifugal pump over one month. The pump is scheduled to run 24 hours a day, 30 days a month. That is 720 hours of scheduled time. During the month, the pump had 3 failures:

Failure	When	Repair Time
Failure 1: Seal leak	Day 8	4 hours
Failure 2: Bearing overheat	Day 17	6 hours
Failure 3: Coupling misalignment	Day 26	5 hours

Step 1: Calculate total repair time = 4 + 6 + 5 = 15 hours

Step 2: Calculate total operating time = 720 - 15 = 705 hours

Step 3: MTBF = 705 / 3 = 235 hours

This means, on average, the pump runs for 235 hours (about 9.8 days) between failures. That is your baseline. Now you can set a target: maybe 400 hours after addressing the root causes of those three failures.

MTBF vs MTTR vs MTTF: What is the Difference?

These three metrics sound similar but measure very different things. Here is the breakdown:

Metric	Full Name	What It Measures	Applies To	Higher or Lower is Better?
MTBF	Mean Time Between Failures	Average run time between failures	Repairable systems (pumps, motors, conveyors)	Higher is better
MTTR	Mean Time to Repair	Average time to fix a failed system	Repairable systems	Lower is better
MTTF	Mean Time to Failure	Average time until a component fails permanently	Non-repairable items (light bulbs, bearings, seals)	Higher is better

The critical distinction: MTBF is for things you repair and keep using. MTTF is for things that fail once and get replaced. A motor is repairable (MTBF). The bearing inside that motor is not (MTTF). When the bearing fails, you don't repair the bearing. You replace it with a new one.

In practice, many people use MTBF loosely for both cases. That is fine for internal conversation, but if you are writing reliability specifications or comparing vendors, the distinction matters.

For a deep dive on the repair side of this equation, see our complete guide to MTTR.

MTBF Benchmarks by Industry

MTBF varies enormously depending on the type of equipment, operating conditions, maintenance quality, and industry. Here are typical ranges based on industry data and field experience.

Industry	Typical Equipment	Typical MTBF	World-Class MTBF
Automotive Manufacturing	Welding robots, stamping presses	300-600 hrs	1,200+ hrs
Food & Beverage	Filling lines, pasteurizers	200-500 hrs	800+ hrs
Chemical / Process	Pumps, compressors, reactors	1,000-3,000 hrs	5,000+ hrs
Pharmaceuticals	Tablet presses, packaging lines	150-400 hrs	800+ hrs
Power Generation	Turbines, generators	2,000-6,000 hrs	8,000+ hrs
Mining & Metals	Crushers, conveyors, mills	100-400 hrs	800+ hrs
Packaging	Case packers, wrappers, labelers	80-250 hrs	500+ hrs
Semiconductor	Lithography, etching, deposition	500-1,500 hrs	3,000+ hrs

These numbers are broad ranges. Your specific MTBF depends on equipment age, operating conditions, maintenance quality, and operator training. The important thing is to track your own MTBF over time and trend it upward.

How to Improve MTBF: 6 Practical Steps

Improving MTBF means making your equipment fail less often. That sounds obvious, but the methods are specific and proven. Here are six steps that work in any plant.

1. Fix the repeat offenders first

Pull your failure history for the last 12 months. Sort by equipment and failure mode. You will almost certainly find that 3-5 failure modes account for 60-70% of all failures. These are your repeat offenders.

Do a proper root cause analysis on each one. Not a quick "the bearing failed," but a real investigation: why did the bearing fail? Was it a lubrication issue? Misalignment? Wrong bearing spec? Overloading? Fix the root cause and the failure stops repeating.

2. Implement or improve your preventive maintenance program

Most equipment failures are preventable with proper PM. If you are running your equipment until it breaks and then fixing it, your MTBF will always be low. Build a preventive maintenance program based on manufacturer recommendations and your own failure data.

Start with the basics: lubrication schedules, filter changes, belt inspections, alignment checks. These simple tasks prevent the majority of early-life failures. A well-executed PM program typically improves MTBF by 25-40% in the first year.

3. Improve operating procedures

Operator error causes 15-25% of equipment failures in most plants. That is not because operators are careless. It is because procedures are unclear, training is inconsistent, or the equipment setup process is not standardized.

Watch how different operators run the same machine. If they do it differently, you have a standardization problem. Document the best method, train everyone on it, and post the procedure at the machine. Video-based SOPs work especially well for complex startup sequences.

4. Address environmental factors

Heat, dust, moisture, and vibration accelerate equipment wear. A motor in a clean, temperature-controlled room will have a much higher MTBF than the same motor next to a grinding station covered in metal dust.

Common environmental fixes include: adding filters or enclosures, improving ventilation around heat-generating equipment, installing vibration isolation mounts, and keeping electrical cabinets sealed and clean. These are cheap fixes that pay back over years of improved reliability.

5. Use better parts and materials

If the same component keeps failing, consider upgrading to a higher-grade replacement. A standard nitrile seal that lasts 3 months might cost $12. A Viton seal that lasts 12 months might cost $28. The math is obvious when you factor in the labor cost and production loss of each failure.

Talk to your bearing suppliers, seal vendors, and component manufacturers. They often know about upgraded versions that are direct replacements with longer service life. The cost premium is usually small compared to the cost of a failure event.

6. Track and trend your MTBF data

You cannot improve what you do not measure. Track MTBF by equipment, by system, and by failure mode. Review monthly. Look for equipment where MTBF is declining, because that signals a developing problem.

Set targets: if your current MTBF on a critical compressor is 500 hours, set a 12-month target of 750 hours. Break that down into specific actions (new PM tasks, root cause fixes, parts upgrades) and track progress quarterly.

Common MTBF Calculation Mistakes

MTBF seems simple, but these mistakes are surprisingly common:

Including planned downtime in operating time. If the machine was shut down for a weekend or a scheduled overhaul, that time should not count as operating time. MTBF measures reliability during operation, not calendar time.
Counting planned maintenance as a failure. If you stopped the machine for a scheduled PM and found a worn part, that is not a failure. A failure is an unplanned stop. If you count PMs as failures, your MTBF will be artificially low and you will understate your reliability.
Not enough data. MTBF calculated from 2 failures is statistically meaningless. You need at least 5-10 failure events to get a useful number. For critical equipment with very few failures, consider pooling data from identical machines.
Mixing failure modes. A pump might have an MTBF of 300 hours for seal failures but 2,000 hours for bearing failures. If you lump them together, you get one number that does not help you prioritize. Track MTBF by failure mode for actionable insights.
Confusing MTBF with expected life. An MTBF of 1,000 hours does not mean the equipment will last exactly 1,000 hours. It means, on average, you can expect a failure every 1,000 operating hours. Some intervals will be shorter, some longer. MTBF is a statistical average, not a guarantee.
Using MTBF from manufacturer specs without adjustment. Manufacturers calculate MTBF under ideal lab conditions. Your plant is not a lab. Temperature, dust, vibration, and maintenance quality all affect real-world MTBF. Use manufacturer numbers as a starting point, but track your own actuals.

MTBF and Equipment Availability

MTBF and MTTR together determine your equipment availability. The formula is:

Availability = MTBF / (MTBF + MTTR)

Using our pump example: MTBF = 235 hours, average MTTR = 5 hours.

Availability = 235 / (235 + 5) = 235 / 240 = 97.9%

This connects directly to your OEE calculation, where Availability is the first factor. If you want to push your OEE Availability factor above 90%, you need a combination of high MTBF (few failures) and low MTTR (fast repairs when they do happen).

Here is a quick reference showing how MTBF and MTTR combinations affect availability:

MTBF (hours)	MTTR (hours)	Availability
100	8	92.6%
200	6	97.1%
500	4	99.2%
1,000	4	99.6%
2,000	2	99.9%

Notice the diminishing returns. Going from MTBF 100 to 200 gives you a big availability jump (92.6% to 97.1%). Going from 1,000 to 2,000 barely moves the needle. At high MTBF values, reducing MTTR becomes the more effective strategy.

MTBF in Maintenance Strategy

MTBF data drives several important maintenance decisions:

PM scheduling. If a component has an MTBF of 500 hours, you should inspect or replace it well before that point. A common rule of thumb is to schedule PM at 60-80% of MTBF. So inspect at 300-400 hours.
Spare parts stocking. Low MTBF equipment needs more spares on hand. If your MTBF for a motor is 200 hours and your lead time for a replacement is 2 weeks, you better have one on the shelf.
Capital replacement. When MTBF drops below a certain threshold despite good maintenance, the equipment may be reaching end of life. Tracking MTBF over years helps you build the business case for replacement before the machine becomes a constant production bottleneck.
Vendor comparison. When buying new equipment, compare MTBF data from different vendors. A machine that costs 20% more but has 3x the MTBF is almost always the better deal when you factor in maintenance and downtime costs.

Where Dovient Fits

Dovient helps maintenance teams improve MTBF by capturing failure patterns and making repair knowledge accessible across the entire team. Here is how:

Failure pattern recognition. Dovient tracks every repair event with structured data: what failed, why it failed, and what fixed it. Over time, this reveals repeat failure modes that manual tracking often misses.
Root cause knowledge sharing. When a technician solves a tricky problem, that solution gets captured in Dovient. The next time the same failure occurs, any technician can find the proven fix immediately, not just the one person who figured it out last time. Read more about capturing tribal knowledge.
Data-driven PM adjustments. Dovient's repair history data shows you exactly which equipment needs more frequent PM and which can safely go longer between services. That means fewer unnecessary PMs and fewer failures from under-maintained equipment.

If you want to start tracking your reliability metrics, try our free OEE Calculator to see how your current equipment performance stacks up. For a deeper conversation about improving your plant's MTBF, schedule a conversation with our team.