At-a-Glance: Rig maintenance best practices center on a risk-based, condition-driven program that maximizes uptime, protects people and assets, and lowers OPEX through disciplined planning, effective execution, and continuous verification. Focus on critical equipment (hoisting, well control, power, mud systems) with measurable KPIs and a closed-loop CMMS process.
I. Objective Definition and Key KPIs
- I.1 Objective
- Maximize safe drilling availability while minimizing non-productive time (NPT), unplanned failures, and total maintenance cost per operating hour.
- Assure well control barrier integrity and regulatory compliance.
- Extend equipment life and reduce emissions via optimized energy and lubrication practices.
- I.2 Core KPIs
- Technical Availability: A measure of uptime vs. total scheduled time
\( \text{Availability} = \dfrac{\text{Uptime}}{\text{Uptime} + \text{Downtime}} \times 100\% \)
- MTBF/MTTR:
\( \text{MTBF} = \dfrac{\text{Operating Time}}{\text{Number of Failures}} \), \( \text{MTTR} = \dfrac{\text{Repair Time}}{\text{Number of Repairs}} \)
- NPT due to Maintenance: hours and % of operating time.
- PM Compliance: % of planned maintenance completed on time (target = 95%).
- Backlog: total age-weighted work orders; critical PM backlog target = 2 weeks.
- Condition KPIs: vibration (mm/s RMS), oil PQ index, ISO 4406 cleanliness, thermography ?T.
- Safety: TRIR, high-potential near-misses, LOTO/PTW compliance rate.
- Well Control Integrity: successful BOP pressure test rate, accumulator precharge compliance.
- Energy/Emissions: fuel consumption (L/hr), load factor (%), CO2e/operating hour, leaks eliminated.
- Inventory Health: critical spares service level (%), stockouts (#/month), obsolescence (%)
- Cost: maintenance OPEX per operating hour; % reactive vs. preventive vs. predictive spend.
- Technical Availability: A measure of uptime vs. total scheduled time
II. Critical Parameters and Target Ranges
| System | Parameter | Target/Range (estimated) | Notes |
|---|---|---|---|
| Hoisting (drawworks, top drive, blocks) | Vibration | = 4.5 mm/s RMS (ISO 10816–3 equivalent) | Alarm at 7.1 mm/s; trip above 11.2 mm/s (estimated) |
| Hoisting | Drill line ton-miles | Slip & cut at 300–600 ton-miles (estimated) | Optimize based on sheave count, line size, WOB/loads |
| Top Drive | Gearbox oil ISO 4406 | = 18/16/13 | Water = 200 ppm; PQ trending stable |
| Mud Pumps | Liner/piston ?P rise | = 10% above baseline | Exceedance triggers inspection/change-out |
| Well Control (BOP) | Accumulator precharge | 0.9 × set pressure (±10%) | Verify bottle-by-bottle |
| Well Control | Pressure tests | As per program; typically 3,000–10,000 psi | No leak/hold per acceptance criteria |
| Rotary/Handling | Chain elongation | = 2% | Replace above limit; verify lubrication |
| Power Generation | Load factor per generator | 65–80% | Rotate units to balance hours |
| Electrical | IR/PI for motors | PI = 2.0; IR = manufacturer min | Trend degradation |
| Hydraulics | Leakage rate | Zero visible leaks; < 2% top-up/month | Use tagged leak-elimination program |
| Cranes/Marine | Wire rope diameter loss | = 10% nominal | Retire by discard criteria; NDE sheaves |
| HVAC/Fire & Gas | Detector test success | 100% per monthly function test | Calibrate per schedule |
III. Step-by-Step Procedure / Workflow / Checklist
III.1 Program Design (Pre-operations and Continuous)
- III.1.1 Asset Criticality and RCM
- Perform FMEA/RCM on major systems: hoisting, top drive, mud pumps, BOP, power, cranes, DP/marine (offshore), electrical, control, fire & gas.
- Classify maintenance strategy per failure mode: run-to-failure (non-critical), time-based PM, usage-based PM, condition-based (CBM), or redesign.
- III.1.2 CMMS Foundation
- Build equipment hierarchy, BOMs, maintenance task lists, safe job plans, estimated durations, resources, and spares.
- Load PM schedules by run-hours, starts, ton-miles, calendar, pressure-test intervals, and regulatory requirements.
- III.1.3 Standards and Procedures
- Issue task-specific procedures with torque values, tolerances, acceptance criteria, and hold points. Embed LOTO and PTW steps.
- Define quality checks: as-found/as-left data, photos, sign-offs, calibrations, and test records.
III.2 Planning and Scheduling (Rolling 12–16 Weeks)
- III.2.1 Plan
- Bundle PMs by system to minimize starts/stops; align with rig moves, casing jobs, or scheduled NPT windows.
- Identify skill and tooling needs; verify special tools, test benches, NDE kits, and calibration gear.
- Reserve critical spares; set kitting level = 95% completeness before job release.
- III.2.2 Schedule
- Use constraint-based scheduling: crane time, permit conflicts, SIMOPS, production/drilling priorities.
- Level by crew and shift; target planned work = 70% of total maintenance hours.
- III.2.3 Execute
- Pre-job brief: hazards, contingency, quality expectations, stop-work authority.
- Use standardized work packs with checklists, torque charts, and inspection forms.
- Capture as-found/as-left data in CMMS; attach measurements and photos.
- III.2.4 Review
- Close-out review: verify functional tests, remove inhibitions, restore to service; conduct post-job lessons learned.
- Update task frequencies based on condition data and defects found.
III.3 Daily–Weekly–Monthly Routines
- III.3.1 Daily
- Pre-use inspections: derrick/hoisting, top drive, mud pumps, BOP control, mud pits/shakers, cranes. Record defects immediately.
- Lubrication routes: bearings, chains, wire ropes; verify correct grade/quantity; wipe excess to avoid contamination.
- CBM rounds: vibration spot checks, IR scans on MCCs and brakes, oil level/condition checks.
- Verify critical alarms, ESDs, fire & gas, and UPS status; check accumulator pressures and HPU levels.
- III.3.2 Weekly
- Function test BOP control, diverter, annular, choke/kill lines; check for leaks and response times.
- Test emergency brakes, e-stops, and interlocks; verify brake cooling and condition of bands/discs.
- Inspect wire ropes: broken wires per lay length, diameter reduction, corrosion; grease as per spec.
- Fuel and power audit: balance generator loads; rotate lead unit; clean/inspect air filters.
- III.3.3 Monthly/Quarterly
- Full CBM: route-based vibration, oil sampling/analysis (top drive, gearboxes, mud pump crossheads, draw-works gearboxes), ultrasonic leak surveys.
- Hoisting NDE: sheaves, hooks, bail ears, links; MPI or UT as applicable.
- Electrical maintenance: IR thermography on switchgear/MCCs, insulation resistance tests, UPS battery checks.
- Fire & gas calibrations, relief valve inspections, HVAC filter changes, crane annual/periodic certifications per region.
III.4 Equipment-Specific Best Practices
- III.4.1 Drawworks & Hoisting
- Brake inspections (discs/bands, lining thickness, glazing, cracks); verify brake cooling flow and temperature.
- Drilling line slip-and-cut scheduling using ton-miles:
\( \text{Ton-miles} = \dfrac{W \times S \times N}{2{,}000} \), where W = hook load (lb), S = length moved (ft), N = number of line parts
Set a threshold and cut length to redistribute wear; trend ton-miles per day and per well phase.
- Sheave and bearing lubrication with correct NLGI grade; alignment checks; periodic NDE of grooves.
- III.4.2 Top Drive
- Gearbox oil sampling every 250–500 operating hours; change oil based on condition, not strictly hours.
- Monitor rotor/stator temps, torque response; check swivel seals for leaks; maintain washpipe packing.
- Inspect guide track rollers and torque tube fasteners; torque-verify critical bolts.
- III.4.3 Mud Pumps
- Condition index from liner/piston wear, rod packing leakage, crosshead temps, and fluid-end vibration.
- Torque check fluid-end studs; verify alignment; monitor pulsation dampener precharge.
- Use suction conditioning and clean strainers; maintain proper lubrication of power end.
- III.4.4 BOP and Well Control
- Accumulator bottle-by-bottle precharge verification; pump performance test (flow vs. pressure).
- Function and pressure testing per program; verify ROV panels (offshore) and remote panels.
- Strict torque control on flanged connections; replace elastomers by age/service; maintain cleanliness of control fluid.
- III.4.5 Power Generation and Electrical
- Run generators at optimal load factor; avoid wet-stacking via load banking when required.
- Fuel quality control: water drain and biocide management; dual-stage filtration differential pressure monitoring.
- Verify grounding, bonding, EX/ATEX integrity; maintain MCC cleanliness; tighten lugs to spec (torque audit).
- III.4.6 Cranes/Marine (Offshore)
- Focus on slewing bearings, boom heel pins, load indicators; annual NDE and lubrication of slew ring.
- Wire rope fleet angle and reeving inspection; maintain grease penetration and discard by code.
- III.4.7 Safety-Critical Systems
- Firewater pumps tests, deluge flushing, foam proportioner calibration; verify hydrant/nozzle performance.
- Gas detectors bump tests, calibrations, and proof testing intervals; ESD logic verification.
IV. Risk & Mitigation (HSE, Reliability, Redundancy)
- IV.1 HSE Controls
- LOTO and PTW mandatory; isolate electrical, hydraulic, pneumatic, and pressure energy; verify zero energy with tests/bleeds.
- Pressure hazards: bleed and lock choke/kill lines before breaking flanges; use rated blinds; maintain exclusion zones.
- Dropped objects: certified secondary retention, torque verification, DROPS inspections after storms or heavy lifts.
- Hot work controls: gas testing, fire watch, segregation; respect EX/ATEX boundaries.
- Confined space entry: ventilation, gas monitoring, rescue plan; control of ignition sources.
- IV.2 Reliability and Quality
- Use calibrated torque tools; document torque and angle; enforce hold points and peer checks.
- Contamination control: clean assembly areas, lint-free wipes, desiccant breathers, dedicated transfer pumps.
- Post-maintenance testing: function, load, and leak tests with acceptance criteria before handover.
- IV.3 Redundancy and Spares
- Maintain cold/hot standby for critical systems (pumps, generators); validate auto-changeover.
- Spares: define min/max and reorder point based on lead time and demand variability:
\( \text{Reorder Point} = d \times L + z \times \sigma_L \), where d = average demand, L = lead time, z = service factor, \( \sigma_L \) = demand variability during lead time
- Preserve shelf-life-limited items (seals, accumulators, electronics) with climate control and rotation.
V. Optimization Levers (Analytics, Strategy, Debottlenecking)
- V.1 Condition-Based and Predictive Maintenance
- Deploy online sensors for critical rotating equipment: vibration, temperature, oil condition (dielectric, moisture), pressure ripple.
- Use anomaly detection on SCADA/PLC data to flag drift in torque response, pump efficiency, and control valve position vs. flow.
- Oil analysis program: trend wear metals, viscosity, TAN/TBN; shift to condition-based oil changes to cut OPEX and waste.
- V.2 Maintenance Strategy Mix
- Target spend mix: preventive 50–60%, predictive 20–30%, corrective = 20% (estimated best-in-class).
- Dynamic PM: increase/decrease PM frequency based on findings density and condition trends.
- V.3 Execution Efficiency
- Kitting and point-of-use tooling; reduce wrench time losses; aim for wrench time = 60%.
- Standard job packs with photos and checklists; reduce rework via QA sign-offs and digital procedures.
- V.4 Energy and Emissions
- Power management: optimize generator fleet to operate near best specific fuel consumption; leverage VFDs for large motors.
- Track fuel burn vs. load to quantify gains; prevent leaks; maintain air systems to reduce compressor run hours.
- Calculate and trend CO2e intensity:
\( \text{CO2e Intensity} = \dfrac{\text{Fuel Used (L)} \times \text{EF}}{\text{Operating Hours}} \), EF = emission factor
- V.5 Reliability Modeling
- System availability for series components:
\( A_{\text{series}} = \prod_{i=1}^{n} A_i \)
- Parallel redundancy:
\( A_{\text{parallel}} = 1 - \prod_{i=1}^{n} (1 - A_i) \)
- Use results to justify redundancy or redesign for chronic bad actors.
- System availability for series components:
VI. Verification & Monitoring Plan
- VI.1 What to Measure
- Availability, NPT, MTBF/MTTR, PM compliance, backlog age, wrench time, TRIR, leak counts, fuel per hour, condition metrics (vibration/oil/IR).
- Well control: BOP test results, accumulator pressures, function times, control fluid cleanliness.
- Hoisting: ton-miles, brake temps, wire rope condition indices.
- VI.2 How Often
- Real-time/online: critical rotating equipment KPIs, fuel and load, safety system states.
- Daily: operator rounds, lubrication, housekeeping, leak tags, HSE checks.
- Weekly: functional tests (brakes, ESD, BOP control), electrical IR spot checks, rope inspections.
- Monthly/Quarterly: route CBM, oil analysis, full thermography, calibration campaigns.
- Per-well/Phase: BOP pressure testing, hoisting NDE, slip-and-cut review.
- Annual: major overhauls, crane certifications, verification audits.
- VI.3 Control Limits and Actions
- Set statistical control limits for vibration/oil KPIs; trigger work orders when trends breach alarms for two consecutive samples.
- Bad actor threshold: top 10% of assets by downtime cost; initiate root cause analysis (RCA) and action plan.
- Monthly performance review: compare to targets; adjust PM frequencies and spares; update risk register.
- VI.4 Documentation & Compliance
- All inspections and tests entered into CMMS with traceability; retain calibration and pressure test certificates.
- Close the loop: feed defects into FMEA updates and training content; audit PTW/LOTO records for 100% compliance.


Collaborate and learn alongside you peers. Professional development on your schedule. API training programs will help you advance your career. Browse our list of courses today.