HCL Technologies
HCL Technologies

Python Technical Lead - Data Analysis, SQL

RoleData Engineering
LevelLead
LocationHyderabad, India
WorkOn-site
TypeFull-time
Posted2 days ago
Apply now

About the role

Job Summary

Years of Exp Location – Pan India or any constraint Hyderabad Project type (Support / Dev / Maintenance / Testing)Dev Shift involved – If yes please share timing.No Customer interview involved – yes / No NoSalary Range Skills Mandatory Proficiency Level (1-5) GCP Data Orchestration (Composer/Airflow)Yes5Real-time Processing (Dataflow/Pub/Sub)Yes5Big Query & Big Query ML (BQML)Yes5Generative AI Integration (Gemini/AI Studio)Yes4Data Governance & Security Yes5Automated MLOps Pipelines (Vertex AI)Yes4

Responsibilities: Data Pipeline Development: Design, build, and maintain scalable, efficient, and reliable ETL/ELT data pipelines for batch and real-time processing using GCP services (e.g., Dataflow, Dataproc, Cloud Composer, Pub/Sub).

AI/ML Data Preparation: Collaborate closely with Data Scientists and Machine Learning Engineers to understand data requirements for model training, evaluation, and serving. Prepare, transform, and curate large, diverse datasets (structured, unstructured, streaming) to optimize them for AI/ML workloads.

GCP Ecosystem Expertise: Leverage a wide range of GCP data and AI/ML services, including:

Data Warehousing & Storage: Big Query (for analytics and Big Query ML), Cloud Storage, Cloud SQL, Cloud Bigtable.

Data Processing: Dataflow, Dataproc (Spark, Hadoop), Cloud Composer (Apache Airflow), Data Fusion.

AI/ML Services: Gemini AI, AI Studio, Vertex AI (for model training, deployment, MLOps, Pipelines, Workbench, AutoML). Should be able to create prompts and create build applications using AI studio.

Data Governance & Quality: Implement and enforce data quality, security, and governance standards throughout the data lifecycle, ensuring data accuracy, consistency, and compliance with regulations.

Performance Optimization: Monitor, troubleshoot, and optimize the performance and cost-effectiveness of data pipelines and AI/ML infrastructure.

Automation & MLOps: Automate data processes, develop CI/CD pipelines for data and ML models, and contribute to MLOps best practices for seamless deployment and monitoring of AI/ML solutions.

Collaboration & Communication: Work effectively with cross-functional teams, including Data Scientists, Analysts, Software Engineers, and Product Managers, to understand data needs and deliver impactful solutions.

Innovation & Research: Stay up-to-date with the latest advancements in data engineering, AI/ML, and GCP technologies, continuously exploring and recommending new tools and approaches.

Key Responsibilities

null

Skill Requirements

null

Other Requirements

null

Benefits and perks

Learning Budget

Required skills

Python

Airflow

Dataflow

Pub/Sub

BigQuery

Vertex AI

Data governance

About HCL Technologies

Hyderabad

Headquarters