At-a-Glance: Oilfield maintenance risk centers on loss of containment, hazardous energy, SIMOPS conflicts, and dropped objects. Manage with robust planning, verified isolations, disciplined permits, competent crews, and condition-based strategies measured by uptime, TRIR, PM compliance, and deferment.
I. Objective & KPIs
Minimize HSE and process safety exposure while executing safe, efficient maintenance that sustains production, integrity, and compliance.
- 1.1 Key HSSE KPIs: TRIR; high-potential incidents; Loss of Primary Containment (LOPC) count; permit/LOTO violations; gas exceedance alarms; dropped-object events.
- 1.2 Reliability/Production KPIs: Facility uptime (%); unplanned deferment (boe/d); MTBF, MTTR; availability \(A\); critical SCE impairments; overdue SCE proof tests.
- 1.3 Maintenance Execution KPIs: PM compliance (% on time); corrective-to-preventive ratio; maintenance backlog (man-weeks by craft); wrench time (%); rework rate (%); work-order cycle time (days).
- 1.4 Cost/Emissions KPIs: OPEX per boe; critical spare stockouts; venting/flaring during maintenance (scf); fugitive methane intensity; VOC/H2S exposure counts.
I. Equations
- 1.E.1 Availability: \( A=\dfrac{\text{MTBF}}{\text{MTBF}+\text{MTTR}} \)
- 1.E.2 Risk Priority Number (RPN): \( \text{RPN}=S \times O \times D \) (Severity, Occurrence, Detectability)
- 1.E.3 Production deferment: \( \text{Deferment}=\text{Potential} - \text{Actual} \)
- 1.E.4 Emissions from maintenance leaks: \( E_{\text{total}}=\sum_{i=1}^{n} \dot{m}_i \times t_i \)
- 1.E.5 Basic risk score: \( R=P \times C \) (Probability × Consequence; managed to ALARP)
II. Critical Parameters & Target Ranges
Assumptions marked “estimated”; always follow site standards, codes, and OEM guidance.
| Parameter | Target/Range | Notes |
|---|---|---|
| Gas testing before/hot work | LEL < 10% (estimated); O2 19.5–23.5% (estimated) | Continuous monitoring in enclosed/confined areas |
| H2S action levels | Per site matrix (estimated: alarm around 10 ppm; evacuate per local rules) | Use calibrated detectors and escape sets |
| Pressure/leak test | Per code/OEM (estimated: 1.1–1.5× service/MAOP) | Define hold time, acceptance criteria, and isolation boundaries |
| Energy isolation quality | Zero-energy verified; LOTO 100% matches permits | Try-out/verify for rotating/electrical/hydraulic energies |
| Crane/lifting limits | Within crane charts; wind limits per plan (estimated: 10–15 m/s) | Tag lines; exclusion zones; banksman control |
| Electrical arc-flash boundary | Per arc study; PPE category compliant | Live work only with approved variance |
| Confined space entry | Permit in place; continuous gas/O2 monitoring | Rescue plan and attendant mandatory |
| SIMOPS exclusion zones | Defined on layout/SIMOPS plan | Barriers and radio discipline |
| Torque/tension values | Per OEM/engineering spec | Use calibrated tooling |
| SCE proof-test interval | Per SIL/RBI basis of safety | Track overdue = 0 for critical barriers |
III. Step-by-Step Procedure / Workflow / Checklist
III.A Planning & Risk Identification
- 3.1 Criticality ranking: Classify equipment by HSE, production, and environmental consequence; assign maintenance strategies (run-to-failure, PM, CBM, RBI).
- 3.2 Work scoping: Define job steps, SIMOPS interfaces, isolation points, special tools, spares, access, weather windows (offshore), and vendor support.
- 3.3 Risk assessment: Perform JHA/TRA; quantify with \( R=P \times C \) and RPN; pre-approve mitigation barriers to ALARP.
- 3.4 Materials & spares: Verify BOM, QA/QC certificates, shelf life, and critical spares availability; line up rental/backup equipment.
III.B Permit-to-Work (PTW) & Isolations
- 3.5 PTW: Issue correct permits (hot work, confined space, electrical, excavation, working at height) with gas test, drawings, and isolations attached.
- 3.6 LOTO/Isolation plan: Positive isolation for hydrocarbons (spade/blank, double block and bleed where approved); lock and tag all energy sources; independent verification and sign-off.
- 3.7 SIMOPS control: Review scheduling with drilling/wells/marine; enact exclusion zones and radio channels.
III.C Pre-Job Controls
- 3.8 Toolbox talk: Walk down job, verify hazards, confirm stop-work authority, assign roles, confirm rescue plans.
- 3.9 Condition checks: Bump test gas detectors; calibrate torque/pressure instruments; inspect slings/shackles; verify scaffolds/MEWPs.
- 3.10 Barricade/housekeeping: Establish drop zones and fire watch; position spill kits and extinguishers; stage materials.
III.D Execution
- 3.11 Dynamic risk management: Pause/PTR checks at step changes; update JHA if scope shifts; escalate via MOC if needed.
- 3.12 Quality controls: Apply OEM torques, gasket specifications, cleanliness; witness points; NDE as specified.
- 3.13 Environmental controls: Zero spill mindset; closed-loop depressurization; flare/vent minimization; containment for chemicals/NORM scale.
III.E Reinstatement & Start-Up
- 3.14 Inspections: Independent checks, flange tagging, instrument calibration, relief device re-install verification.
- 3.15 Leak/pressure testing: Execute per procedure; record pressures, temperatures, and hold; verify no pressure entrapment.
- 3.16 Functional testing: Close-out PTW; remove LOTO; conduct bump/functional tests; gradual startup with performance monitoring.
III.F Closeout & Learning
- 3.17 Documentation: Update CMMS history, as-built drawings, corrosion circuits, RBI; attach photos/measurements.
- 3.18 RCA/lessons: Complete post-job review; capture deviations, delays, rework; feed into standard job plans and risk registers.
IV. Risk Categories & Mitigations
IV.A Process Safety & Hazardous Energy
- 4.1 Loss of Containment (hydrocarbons/H2S): Positive isolation, gas testing, double sealing for sour service, purge/inerting, use of intrinsically safe tools.
- 4.2 Overpressure: Verify relief devices installed and in service; bypasses locked; pressure test boundaries clear; staged pressurization; remote position during testing.
- 4.3 Ignition sources: Hot work controls, area classification awareness, bond/grounding, eliminate static, hot-cold work segregation.
- 4.4 Electrical/arc flash: De-energize when possible; tested-for-dead; PPE per arc study; insulated tools; live work permits tightly controlled.
- 4.5 Stored energy: Bleed-down verification; spring/hydraulic accumulators locked and discharged; rotating inertia managed via try-out.
IV.B People & Task Execution
- 4.6 Lifting/dropped objects: Certified lifting plans, pre-use checks, secondary retention for hand tools at height, exclusion zones, weather limits.
- 4.7 Confined space: Entry permit, 4-gas monitor, ventilation, rescue plan, standby attendant with continuous communications.
- 4.8 Working at height: 100% tie-off, inspected anchor points, scaffold tags green, edge protection, MEWP familiarization.
- 4.9 Chemical/NORM exposure: SDS review, correct PPE/respiratory protection, closed handling, dose meters for NORM, waste segregation.
- 4.10 Fatigue/ergonomics: Shift limits, breaks, job rotation, mechanical aids for handling, warm-up routines to avoid strains.
IV.C SIMOPS & Management Controls
- 4.11 SIMOPS conflicts: Central schedule, interface meetings, tie-in isolation maps, marine/well ops control, stop-work alignment across contractors.
- 4.12 Contractor competency: Pre-qualification, certification verification, task-specific assessments, supervision ratios, language/briefing checks.
- 4.13 Documentation & MOC: Changes via formal MOC; P&ID/as-built updates; PSSR for major changes; SCE impairment tracking with compensating measures.
- 4.14 Driving/marine logistics: Journey management, rest rules, weather routing, certified vessels, man-overboard drills, helideck procedures.
V. Optimization Levers
- 5.1 Condition-based maintenance (CBM): Vibration, oil analysis, thermography, ultrasound, motor current; apply Weibull analysis to optimize intervals and reduce failures.
- 5.2 Risk-Based Inspection (RBI): Focus inspection on high consequence/likelihood circuits; update corrosion rates with online thickness monitoring and fluid chemistry.
- 5.3 Digital PTW/CMMS: ePTW integration with isolations, JHAs, and SCE impairments; mobile work packs; barcode/QR for asset history; automated overdue alerts.
- 5.4 Standard job plans: Time-phased steps, tool/spare lists, hold points, torque values; benchmark wrench time; reduce variability and rework.
- 5.5 Spares strategy: Criticality-based stocking, min/max optimization, consignment for long lead items, condition-preservation routines.
- 5.6 Turnaround/STO excellence: Scope freeze, gate reviews, parallel path prefabs, modular spools, pre-commissioning, and readiness KPIs (schedule adherence, cost variance).
- 5.7 Remote/low-touch: Valve actuators with partial-stroke testing, online proof tests for SIFs, remote gas detection telemetry, drones/ROVs for visual/NDE in hard-to-reach areas.
- 5.8 Emissions-aware maintenance: Fugitive leak elimination campaigns, LDAR prioritization, temporary recovery lines, depressurize to process where feasible.
V. Reliability/Inspection Equations
- 5.E.1 Failure rate: \( \lambda = \dfrac{1}{\text{MTBF}} \)
- 5.E.2 Weibull reliability: \( R(t)=e^{-(t/\eta)^{\beta}} \) (scale \( \eta \), shape \( \beta \))
- 5.E.3 Proof test coverage effect: \( \text{PFD}_{\text{avg}} \approx \dfrac{\lambda_{DU} \times T}{2} \) (simplified, for low-demand SIF; \( \lambda_{DU} \)= dangerous undetected failure rate, \(T\)=proof test interval)
VI. Verification & Monitoring Plan
VI.A What to Measure
- 6.1 Safety: TRIR; HiPo counts; LOPC events; gas exceedances; permit and LOTO audits; dropped-object reports.
- 6.2 Reliability: MTBF, MTTR, availability; SCE impairments; overdue proofs; vibration/oil alerts; repeat failures within 90 days.
- 6.3 Execution: PM compliance; backlog (total and aged > 28 days); wrench time; rework rates; plan vs. actual hours.
- 6.4 Production/Cost: Unplanned deferment; OPEX per boe; critical spares stockouts; turnaround schedule/cost adherence.
- 6.5 Environment: Flaring/venting during maintenance; LDAR findings; spill volume and response time.
VI.B How Often
- 6.6 Shift/Daily: PTW/LOTO audits; toolbox talk quality checks; SIMOPS board review; gas detection trending.
- 6.7 Weekly: Backlog and PM compliance review; spare stockout report; review of deviations and open actions.
- 6.8 Monthly: Reliability dashboard (MTBF/MTTR), RCA status, repeat failure analysis, emissions from maintenance activities.
- 6.9 Quarterly: RBI updates with latest inspection/corrosion data; SCE assurance review; functional safety proof test compliance; contractor performance scorecards.
- 6.10 Annually: Program audit of PTW/LOTO/MOC; turnaround readiness review; emergency response drills; competency matrix gap closure.
VI.C Acceptance Targets (typical)
- 6.11 PM compliance: = 95% on-time (critical = 100%).
- 6.12 Overdue SCE tests: 0 critical overdue; non-critical < 1%.
- 6.13 LOPC: Continuous reduction trend; investigate all Tier 1/2 events.
- 6.14 Backlog: 2–4 man-weeks by craft; aged > 90 days < 5%.
- 6.15 Wrench time: 45–55% sustained without compromising safety/quality.
Summary Action List
- S1: Build and maintain a live maintenance risk register with RPNs and controls.
- S2: Enforce PTW + verified LOTO with independent checks and SIMOPS control.
- S3: Standardize high-risk job plans (hot work, vessel entry, lifting, pressure tests).
- S4: Shift to CBM/RBI for critical assets; close gaps on SCE proof testing.
- S5: Measure and manage with leading/lagging KPIs; close RCAs; drive lessons learned into procedures.


Collaborate and learn alongside you peers. Professional development on your schedule. API training programs will help you advance your career. Browse our list of courses today.