Company: BP
Skills: IT - Analysis & Management, IT - Programming & Database, IT - Software Development
Experience: 5+ Years
Education: Bachelors/3-5 yr Degree
Location: Pune, Maharashtra, India

bp is transforming, and at our Digital Hub in Pune we are growing the digital expertise and solutions needed to advance the global energy transition. dataWorx is the name for the data team that is responsible for all data within these areas:
  • Production & Projects including Health, Safety, Environment & Carbon
  • Refining & Operations
  • Wells & Subsurface
  • Business Services including Finance, Procurement, People & Culture, Performance Management
  • Strategy & Sustainability
We are developing deep data capability to transform access to, and the supply, control and quality of, our vast and ever-growing reserves of data. The dataWorx team covers many data sub-disciplines, including data science, analytics, engineering and management, as well as specialist areas such as geospatial, remote sensing, knowledge management and digital twin.
The dataWorx team is looking for outstanding data engineers to drive this transformation and unlock the value of our digital assets, powering our journey to net zero emissions and building a new, sustainable bp.

Key Accountabilities:
  • Architect, design, implement and maintain reliable and scalable data infrastructure
  • Write, deploy and maintain software to build, integrate, manage, maintain, and quality-assure data
  • Design, develop, and deliver large-scale data ingestion, data processing, and data transformation projects on Azure
  • Mentor and share knowledge with customers as well as provide architecture reviews, discussions, and prototypes
  • Work with customers to deploy, manage, and audit best practices for cloud products
  • Adhere to and advocate for software engineering best practices (e.g. technical design and review, unit testing, monitoring, alerting, checking in code, code review, documentation)
  • Deploy secure and well-tested software that meets privacy and compliance requirements; develop, maintain and improve CI/CD pipelines
  • Ensure service reliability and follow site-reliability engineering best practices: join on-call rotations for the services you maintain, and define and maintain SLAs. Design, build, deploy and maintain infrastructure as code. Containerize server deployments
  • Actively contribute to improving developer velocity
  • Work closely with other data engineers, software engineers, data scientists, data managers and business partners

Desirable Education and Experience:
  • BS degree in computer science or related field
  • 8 to 12 years of experience, with a minimum of 5 to 7 years of relevant experience

Required Criteria:
  • Deep and hands-on experience (typically 5+ years) designing, planning, productionizing, maintaining and documenting reliable and scalable data infrastructure and data products in complex environments
  • Hands-on experience with:
    • Databricks and using Spark for data processing
    • Configuring Delta Lake on Azure Databricks
    • Python, Scala, SQL
    • C#, ASP.NET, MVC, .NET Core, .NET Framework (4.6), JSON, and API development (optional)
    • Azure Data Factory
    • Azure Data Lake, Azure SQL DB, Synapse, and Cosmos DB
    • Data Management Gateway, Azure Storage Options, Stream Analytics and Event Hubs
    • Designing data solutions in Azure, including data distributions and partitions, scalability, disaster recovery and high availability
  • Advanced hands-on experience with different query languages
  • Experience designing and implementing large-scale distributed systems
  • Deep knowledge and hands-on experience in technologies across all data lifecycle stages
  • Stakeholder management and ability to lead large organizations through influence