Honeywell
The future is what we make it.

Advanced Data Scientist

Function: Data Science
Level: Mid-level
Location: Bengaluru, Karnataka, India
Work mode: On-site
Type: Full-time
Posted: 2 weeks ago
Job Description

Advanced Data Scientist

Location

Bangalore, India

Role Overview

We are looking for an Advanced Data Scientist who can own end‑to‑end data science and machine learning solutions, from problem formulation to production deployment.
This role requires a strong blend of machine learning expertise, data engineering, MLOps, cloud platforms, and technical leadership.

You will work closely with product, engineering, and business stakeholders to design scalable data and ML systems that drive measurable business impact.

Experience

8–12+ years

Key Responsibilities

Data Science & Machine Learning

  • Translate business problems into data science and ML solutions
  • Perform advanced EDA, feature engineering, and model development
  • Build and optimize:
    • Classical ML models (regression, classification, tree‑based models)
    • Time‑series, anomaly detection, and recommendation systems
  • Develop and fine‑tune deep learning models using PyTorch / TensorFlow
  • Design and evaluate experiments (A/B testing, statistical validation)

GenAI, NLP & LLM Solutions

  • Build NLP and GenAI applications using modern LLMs
  • Implement RAG pipelines, prompt engineering, and vector search
  • Integrate LLMs using OpenAI / Azure OpenAI APIs
  • Evaluate model quality, latency, and cost for production LLM systems

Data Engineering & Pipelines (Good to Have)

  • Design and build scalable data pipelines for batch and streaming use cases
  • Work with distributed processing frameworks like Apache Spark
  • Orchestrate workflows using Airflow / Dagster / Prefect / Azure Data Factory / Databricks
  • Handle real‑time data using Kafka or cloud‑native streaming services
  • Ensure data reliability, quality, and performance at scale

MLOps, Deployment & Production

  • Own the full ML lifecycle: experimentation → training → deployment → monitoring
  • Implement model versioning, reproducibility, and CI/CD pipelines
  • Deploy models using REST APIs or batch inference pipelines
  • Monitor model performance, drift, and data quality in production
  • Work with Docker and Kubernetes for scalable deployments

Cloud & Platform Engineering

  • Build solutions on AWS / Azure / GCP (at least one in depth)
  • Work with cloud data platforms like Databricks, Snowflake, BigQuery
  • Optimize system performance and cloud costs
  • Ensure security, access control, and compliance best practices

Architecture, Collaboration & Leadership

  • Design end‑to‑end data and ML architectures
  • Make trade-offs such as batch vs. streaming and cost vs. performance
  • Mentor junior data scientists and review code and models
  • Set data science and ML best practices across teams
  • Communicate insights clearly to technical and non‑technical stakeholders

Required Skills & Qualifications

Core Technical Skills

  • Strong proficiency in Python and advanced SQL
  • Solid foundation in statistics, probability, and linear algebra
  • Hands‑on experience with XGBoost, LightGBM
  • Experience with PyTorch or TensorFlow

Data Engineering (Good to Have)

  • Strong experience with Spark / PySpark
  • Pipeline orchestration using Airflow or similar tools
  • Experience with relational, NoSQL, and analytical databases
  • Understanding of data lakes and warehouse architectures

MLOps & DevOps (Optional)

  • Experience with MLflow, DVC, or W&B
  • Model deployment using FastAPI
  • Containers and orchestration: Docker, Kubernetes
  • CI/CD and monitoring tools

Cloud Platforms

  • Deep expertise in at least one cloud provider: AWS, Azure, or GCP
  • Experience with managed ML and data services

Preferred / Nice‑to‑Have

  • Experience with LLM frameworks (LangChain, LlamaIndex)
  • Vector databases (FAISS, Pinecone, Weaviate)
  • Streaming frameworks (Flink)
  • Knowledge of data governance, privacy, and compliance
  • Experience leading cross‑functional technical initiatives

Machine Learning Algorithms & Techniques (Hands‑On)

Supervised Learning

  • Linear Models
    • Linear Regression
    • Logistic Regression
    • Regularization (L1, L2, Elastic Net)
  • Tree‑Based Models
    • Decision Trees
    • Random Forest
    • Gradient Boosting (XGBoost, LightGBM, CatBoost)

Clustering Techniques

  • K‑Means
  • Hierarchical Clustering
  • DBSCAN

Dimensionality Reduction

  • PCA (feature reduction)
  • t‑SNE / UMAP (visualization & analysis)

Time Series & Forecasting (Basic–Intermediate)

  • Statistical forecasting:
    • Moving averages
    • ARIMA / SARIMA (conceptual + basic use)
  • ML‑based forecasting using regression and tree‑based models

Model Evaluation & Optimization

  • Cross‑validation techniques
  • Hyperparameter tuning (Grid Search, Random Search)
  • Bias–variance tradeoff
  • Handling class imbalance
  • Selection of appropriate evaluation metrics

About Honeywell

Public

Honeywell International Inc. is an American publicly traded, multinational conglomerate corporation headquartered in Charlotte, North Carolina. It primarily operates in four areas of business: aerospace, building automation, industrial automation, and energy and sustainability solutions (ESS).

Employees: 10,001+
Headquarters: Charlotte
Valuation: $130B

Reviews

Overall rating: 3.7 (10 reviews)
Work-life balance: 4.2
Compensation: 2.8
Company culture: 3.9
Career growth: 2.7
Management: 3.1
Would recommend: 65%

Pros

Good work-life balance

Great benefits and job security

Collaborative and friendly environment

Cons

Low or uncompetitive compensation

Poor management and communication

Limited growth opportunities

Salary range

655 data points

Junior/L3 · AI Engineer II

1 report

Total annual compensation: $136,500
Base salary: $105,000
Stock: -
Bonus: -

Interview reviews

3 reviews

Difficulty: 3.0 / 5
Duration: 14-28 weeks
Offer rate: 33%
Experience: Positive 0%, Neutral 33%, Negative 67%

Interview process

1. Application Review
2. Recruiter Screen
3. Technical Interview
4. Assessment/Testing
5. Final Interview
6. Offer

Common questions

Technical Knowledge

Behavioral/STAR

Past Experience

Problem Solving

Culture Fit