
Senior Data Scientist - Big Data R&D, Identity Graph & KYC
Why Socure?
Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.
We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.
About the Role
The Big Data R&D team develops cutting‑edge big data and graph‑based solutions for entity search, entity resolution, and identity matching that power Socure’s KYC and compliance products.
As a Senior Data Scientist I, you will lead the design and deployment of advanced ML and graph algorithms on large-scale PII datasets, own end‑to‑end projects from problem definition through production validation, and serve as a key technical partner to Product, Engineering, and Client‑facing teams. You will help define standards for feature engineering, experimentation, and data quality across our identity graph stack, with substantial impact on coverage, accuracy, and fairness.
What You'll Do
-
Own the design, development, and evaluation of machine learning, statistical, and graph-based algorithms for entity-resolution, identity trust scoring, and anomaly detection on massive datasets.
-
Architect and optimize graph-based identity representations (identity graph structure, linkage rules, clustering) to improve match rates, reduce false positives/negatives, and support downstream fraud and KYC models.
-
Build and maintain scalable data pipelines and feature stores in Spark/Py Spark (or Scala), including data normalization, deduplication, and feature computation across large PII datasets in AWS/Databricks environments.
-
Lead A/B tests and offline/online experimentation for new models, features, and data sources; define success metrics, design experiments, and ensure rigorous validation before rollout.
-
Evaluate new internal and external data sources: explore signal quality, design backtests, quantify incremental value, and provide clear recommendations on vendor selection and integration.
-
Partner closely with product managers and engineers to translate ambiguous business and regulatory requirements (e.g., KYC coverage, watchlist matching) into concrete modeling and data roadmaps.
-
Provide deep analytical support to Socure’s compliance and regulatory product suite, including investigative analyses, root‑cause analysis for anomalies, and clear narratives for internal and external stakeholders.
-
Contribute to model governance and documentation: clearly explain model logic, data dependencies, limitations, and monitoring plans to internal risk/compliance stakeholders.
-
Mentor junior data scientists and engineers on best practices in data exploration, feature engineering, experimentation, and code quality.
-
Communicate complex technical concepts and trade‑offs in a concise, structured way to both technical and non‑technical audiences (e.g., product reviews, customer meetings, internal briefings).
What You Bring
-
Master’s degree with 3+ years of relevant industry experience, or Ph.D. with 1+ years of experience in applied ML / data science roles; background in Computer Science, Statistics, Mathematics, or related quantitative fields preferred.
-
Strong proficiency in Python (preferred) or Scala, including experience with ML libraries such as scikit‑learn, XGBoost, Tensor Flow or Py Torch.
-
Extensive experience with Spark or Py Spark and distributed data systems (e.g., AWS EMR, Databricks) working on very large, messy datasets.
-
Deep understanding of supervised and unsupervised learning, feature engineering, model evaluation, and experiment design (A/B testing, holdout strategies, stratification).
-
Experience developing production-quality data pipelines and automated workflows using Airflow or similar orchestration tools.
-
Practical familiarity with graph databases and/or graph frameworks (Neo4j, AWS Neptune, Graph Frames, DGL, Py Torch Geometric) and graph algorithms for clustering, link prediction, and community detection is strongly preferred.
-
Solid SQL skills and experience working with large-scale analytical data stores.
-
Experience in at least one of: identity verification, fraud detection, credit risk, or adjacent high‑stakes domains is a plus.
-
Demonstrated ability to lead medium‑to‑large projects end‑to‑end, make sound trade‑off decisions under ambiguity, and influence cross‑functional stakeholders with data and clear reasoning.
Please note that sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered.
Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.
Follow Us!
YouTube | LinkedIn | X (Twitter) | Facebook
전체 조회수
0
전체 지원 클릭
0
전체 Mock Apply
0
전체 스크랩
0
비슷한 채용공고

Data Scientist, Center for Economic Intelligence - Sr. Associate
JPMorgan Chase · New York, NY

Data Governance Lead
Stripe · Remote US

Sr Analyst
HCL Technologies · Hyderabad, India

Senior Computer Scientist - Cloud Software
Adobe · Hamburg

Senior Data Scientist, Data & AI Team
Hearst · New York, NY, United States, US
Socure 소개

Socure
Series DSocure is an identity verification and fraud prevention platform that uses artificial intelligence and machine learning to help organizations verify customer identities in real-time. The company provides digital identity verification solutions for financial services, fintech, gaming, and other industries.
201-500
직원 수
New York
본사 위치
$4.5B
기업 가치
리뷰
10개 리뷰
3.7
10개 리뷰
워라밸
3.2
보상
4.0
문화
4.1
커리어
2.8
경영진
2.3
65%
지인 추천률
장점
Supportive team and colleagues
Good benefits and competitive salary
Flexible hours and work-life balance
단점
Poor management and lack of direction
High pressure and overwhelming workload
Limited career advancement opportunities
연봉 정보
23개 데이터
Junior/L3
Mid/L4
Senior/L5
Senior
Director
Junior/L3 · Data Scientist II
4개 리포트
$195,000
총 연봉
기본급
$150,000
주식
-
보너스
-
$188,500
$201,500
최근 소식
Socure Report Warns of AI-Enabled Fraud Rings Targeting Federal Agencies - ExecutiveGov
ExecutiveGov
News
·
5w ago
Socure launches payment screening as fintechs seek streamlined IDV, risk checks - Biometric Update
Biometric Update
News
·
5w ago
Socure Partners With Checkr Trust to Strengthen Identity and Criminal Risk Decision Making for Businesses - Business Wire
Business Wire
News
·
6w ago
Socure Partners With Checkr Trust to Strengthen Identity and Criminal Risk Decision Making for Businesses - Yahoo Finance
Yahoo Finance
News
·
6w ago