热门公司

Halliburton
Halliburton

Halliburton Company is an American multinational corporation and the world's second-largest oil service company, responsible for most of the world's fracking operations

Brazil - Remote: Data Engineer - Platform & Pipelines

职能数据工程
级别中级
方式现场办公
类型全职
发布2个月前
立即申请

必备技能

Python

SQL

Spark

Airflow

We are looking for the right people — people who want to innovate, achieve, grow and lead. We attract and retain the best talent by investing in our employees and empowering them to develop themselves and their careers. Experience the challenges, rewards and opportunity of working for one of the world’s largest providers of products and services to the global energy industry.

Job Duties

We are implementing a strict Medallion Architecture to organize petabytes of industrial data. This role is for a Data Engineer who excels at transforming raw chaos into structured, queryable assets.

You will build and maintain the ELT pipelines that move data from "Bronze" (Raw) to "Silver" (Cleaned) and "Gold" (Aggregated). You will work with Delta Lake (On-prem/Databricks), Polars and Airflow to ensure data quality and availability for Data Scientists and the Knowledge Graph.

What You’ll Do

  • Pipeline Development: Develop and maintain robust Airflow DAGs to orchestrate complex data transformations.

  • Data Transformation: Use Spark (when scale requires) and Polars to clean, enrich, and aggregate data according to business logic.

  • Architecture Implementation: Enforce the Medallion Architecture patterns, ensuring clear separation of concerns between data layers.

  • Performance Tuning: Optimize processing workflows (Polars/Spark) jobs and SQL queries to reduce costs and execution time; make intelligent decisions on when to use Polars vs. Spark.

  • Deployment & Operations: Manage code deployment to on-prem and cloud infrastructure, including containerization and environment configuration.

  • Data Quality: Implement comprehensive data validation checks and quality gates between medallion layers.

  • Data Cataloging: Maintain the metadata and catalog entries to ensure all data assets are discoverable and documented.

The Technology Stack

  • Orchestration: Apache Airflow.

  • Data Processing: Polars (primary for ETL), Py Spark/SQL (for massive scale)

  • Compute: Single-node workers (Polars), Databricks/Spark clustrers (when scale requires)

  • Storage: Delta Lake, Parquet, S3/Blob Storage, MinIO

  • Language: Python 3.12+ (w/ Polars), SQL.

Qualifications Must Haves:

Complete Bachelor's degree in Computer Science, Engineering, or related.

  • 3+ years of experience in Data Engineering.

  • Strong proficiency in Apache Airflow and Databricks.

  • Experience implementing Medallion/Delta Lake architectures.

  • Strong SQL and Python skills.

  • Advanced English communication skills.

Good to Have:

  • Experience with Unity Catalog or other governance tools.

  • Familiarity with dbt (data build tool).

  • Background in processing telemetry or sensor data.

Knowledge, Skills, and Abilities

  • The Structured Thinker: You love organizing data. You understand the importance of schemas, data typing, and normalization.

  • Quality Obsessive: You don't just move data; you test it. You implement checks to ensure no bad data reaches the Gold layer.

  • Pipeline Builder: You view data engineering as software engineering. You write modular, reusable code for your transformations.

Halliburton is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.Location

Fully Remote position.

Job Details Requisition Number: 205556
Experience Level: Entry-Level
Job Family: Engineering/Science/Technology
Product Service Line: Landmark Software & Services
Full Time / Part Time: Full Time

Employee Group: Temporary

Compensation Information
Compensation is competitive and commensurate with experience.

浏览量

0

申请点击

0

Mock Apply

0

收藏

0

关于Halliburton

Halliburton

Halliburton Company is an American multinational corporation and the world's second-largest oil service company, responsible for most of the world's fracking operations. The company, incorporated in the United States, has dual headquarters located in Houston and in Dubai.

10,001+

员工数

Houston

总部位置

$15B

企业估值

评价

10条评价

3.4

10条评价

工作生活平衡

2.8

薪酬

4.2

企业文化

3.5

职业发展

2.5

管理层

2.3

65%

推荐率

优点

Good benefits and competitive salary

Training programs and learning opportunities

Supportive team environment

缺点

Poor work-life balance and long hours

High pressure and stress during peak times

Disorganized management

薪资范围

446个数据点

Junior/L3

Senior/L5

Junior/L3 · Data Analyst

0份报告

$65,490

年薪总额

基本工资

-

股票

-

奖金

-

$55,666

$75,313

面试评价

2条评价

难度

3.0

/ 5

时长

14-28周

面试流程

1

Application Review

2

HR Screen

3

Technical Assessment

4

Behavioral Interview

5

Hiring Manager Interview

6

Offer

常见问题

Technical Knowledge

Behavioral/STAR

Past Experience

Safety Protocols

Problem Solving