
Global investment banking and financial services
Lead Engineer Bigdata - PySpark
Overview
We are seeking a highly skilled and experienced Senior Bigdata/Py Spark Engineer to join our dynamic Big Data Analytics team. The ideal candidate will have a strong background in Python programming and extensive experience with Apache Spark, particularly Py Spark, for large-scale data processing and analytics. This role involves designing, developing, and optimizing robust and scalable data pipelines, working with vast datasets, and contributing to the architecture of our Big Data solutions.
Responsibilities:
- Design, develop, and maintain efficient, scalable, and reliable data pipelines using Py Spark.
- Implement complex data transformations, aggregations, and data quality checks on large datasets.
- Collaborate with multiple stakeholders (technology and business) to understand data requirements and translate them into technical specifications.
- Optimize Py Spark jobs for performance, efficiency, and cost-effectiveness.
- Develop and maintain documentation for data pipelines, data models, and data processing logic.
- Participate in code reviews, ensuring code quality, best practices, and adherence to established standards.
- Troubleshoot and resolve issues in existing data pipelines and data processing jobs.
- Stay up-to-date with the latest advancements in Py Spark, Apache Spark, and the broader Big Data ecosystem.
- Mentor junior developers and contribute to the continuous improvement of the team's technical capabilities and processes.
Required Qualifications:
- 8-12 years of relevant experience
- Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related field.
- 5+ years of professional experience in software development with a focus on Big Data technologies.
- 5+ years of hands-on experience specifically with Py Spark for large-scale data processing.
- Strong proficiency in Python programming, including object-oriented design and data manipulation libraries (e.g., Pandas, Num Py).
- In-depth understanding of Apache Spark architecture, including Spark Core, Spark SQL, Spark Streaming, and Data Frame API.
- Experience with various data storage technologies such as HDFS, S3, Azure Blob Storage, or similar distributed file systems.
- Solid understanding of relational databases and SQL.
- Experience with version control systems (e.g., Git).
- Excellent problem-solving, analytical, and communication skills.
Preferred Qualifications:
- Experience with cloud platforms (AWS, Azure, GCP) and their Big Data services (e.g., EMR, Databricks, Glue, Azure Synapse, Google Dataproc).
- Familiarity with workflow orchestration tools (e.g., Apache Airflow, Luigi).
- Experience with streaming data processing (e.g., Kafka, Spark Streaming).
- Knowledge of data warehousing concepts and data modeling techniques.
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Understanding of data governance, data security, and compliance best practices.
Job Family Group:
Technology
Job Family:
Applications Development
Time Type:
Full time
Most Relevant Skills
Please see the requirements listed above.
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
浏览量
0
申请点击
0
Mock Apply
0
收藏
0
相似职位

Engineering-L2-Bengaluru-Vice President-Data Governance
Goldman Sachs · Bengaluru, Karnataka, India

Lead Data Engineer
Stryker · Gurugram, India

India Head of Chief Data Office, Technology Executive
Wells Fargo · Bengaluru, India

Vice President, Data Governance
BNY Mellon · Pune, MH, India

DE Manager,eCS Data Engineering and Analytics Team, Amazon
Amazon · Bengaluru, KA, IND
关于Citigroup

Citigroup
PublicCitigroup Inc. or Citi is an American multinational investment bank and financial services company based in New York City. The company was formed in 1998 by the merger of Citicorp, the bank holding company for Citibank, and Travelers; Travelers was spun off from the company in 2002.
10,001+
员工数
New York City
总部位置
$86B
企业估值
评价
10条评价
3.7
10条评价
工作生活平衡
3.8
薪酬
2.5
企业文化
4.0
职业发展
3.2
管理层
3.5
65%
推荐率
优点
Good work-life balance
Supportive management and colleagues
Good benefits
缺点
Low or uncompetitive salary/pay
Long hours during peak times
Poor management and lack of direction
薪资范围
48个数据点
Mid/L4
Senior/L5
Staff/L6
Mid/L4 · Business Analytics Senior Analyst
3份报告
$117,000
年薪总额
基本工资
$120,800
股票
-
奖金
-
$117,000
$117,000
面试评价
3条评价
难度
3.3
/ 5
时长
14-28周
体验
正面 0%
中性 33%
负面 67%
面试流程
1
Application Review
2
Recruiter Screen
3
Technical Interview
4
Panel/Group Interview
5
Final Round
6
Offer
常见问题
Technical Knowledge
Coding/Algorithm
Behavioral/STAR
Past Experience
Culture Fit
最新动态
Citigroup : Citi Announces Senior Leadership Appointments to Strengthen International Franchise - marketscreener.com
marketscreener.com
News
·
1w ago
Citigroup Escapes Ex-Employee's Trade Secret Suit, For Now - Law360
Law360
News
·
1w ago
Citigroup vs. Wells Fargo: Which Bank Stock Is a Smarter Buy Now? - Zacks Investment Research
Zacks Investment Research
News
·
1w ago
Citigroup Issues Pessimistic Forecast for Palantir Technologies (NASDAQ:PLTR) Stock Price - MarketBeat
MarketBeat
News
·
1w ago