Socure - Staff Data Scientist - Entity Resolution, IDGraph
Requirements
• Strong proficiency in Python and PySpark.
• Classification models
• Learning-to-Rank
• Anomaly Detection
• Statistical Modeling
• Experience building and maintaining production-grade ML systems at scale.
• Familiarity with graph databases and query languages such as NeptuneDB and OpenCypher.
• Experience with graph processing frameworks (e.g., GraphFrames).
• Experience applying LLMs for evaluation, automation, or signal discovery.
• Familiarity with Knowledge Graphs and Graph Neural Networks (GNNs).
• Leadership & Collaboration
• Proven ability to drive cross-functional projects, mentor peers, and influence technical and business outcomes.
• Excellent communication skills, with the ability to present technical concepts to both technical and non-technical audiences.
• Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field.
• 5+ years of experience in applied data science, machine learning, or artificial intelligence, with a focus on graph-based modeling and large-scale data systems.
Responsibilities
Entity Resolution & Graph Evaluation
• Lead the evaluation and continuous improvement of entity resolution and entity linking pipelines.
• Debug new builds, identify anomalies, and recommend modeling or system-level improvements.
• Define, implement, and maintain scalable performance and quality metrics, leveraging automation and LLM-based approaches where appropriate.
• Partner with Engineering to optimize entity linking and ranking systems using Learning-to-Rank and related techniques.
• Design methods to assess and classify entity confidence and quality across the graph.
Data Quality & Modeling Frameworks
• Design and implement a comprehensive data quality framework for graph-based identity data.
• Translate abstract quality concepts (e.g., reliability, stability, consistency) into measurable signals.
• Use data quality insights to guide modeling decisions, experimentation strategy, and product prioritization.
• Identify and operationalize generalized, high-impact predictive signals derived from graph structure, temporal dynamics, and relational patterns.
• Develop scalable approaches to link prediction, label propagation, and semi-supervised learning within the ID Graph.
• Explore and evaluate advanced graph modeling techniques, including graph-based ML, knowledge graph methods, and Graph Neural Networks (GNNs), when appropriate.
• Focus on durable abstractions rather than one-off features, ensuring solutions are explainable, compliant, and reusable across multiple products.
Cross-Functional Collaboration & Technical Leadership
• Collaborate closely with Engineering, Product Management, Compliance, and downstream product teams.
• Act as a technical leader within the Identity organization, influencing modeling standards, experimentation rigor, and best practices.
• Translate complex technical findings into clear insights and recommendations for both technical and non-technical stakeholders.
• Support the launch of new product capabilities built on top of the ID Graph.
Leadership Competencies
• Demonstrate strong ownership, strategic impact, and assertive communication.
• Mentor peers, foster a culture of growth, and build authentic relationships across teams.
• Embrace feedback, adapt resiliently to challenges, and pursue continual self-improvement.
Benefits
• $170K – $205K
• Offers Equity
• Offers Bonus
• This is a base salary range for this job based on the job requirements.
• Base pay is only one component of Socure's compensation and our total rewards package includes equity, benefits, and an annual bonus or a commission plan.
• We like to get together in person when possible! Eligible Hub Locations: San Francisco, CA