Socure - Data Scientist II - Big Data R&D, Identity Graph & KYC
Requirements
• Master’s degree with 2+ years of experience, or Ph.D. with 1+ years of experience in a data science or analytics role, or equivalent practical experience.
• Proficiency in at least one general-purpose programming language used in data science (Python or Scala).
• Solid experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments.
• Hands-on experience with Spark or PySpark and common ML libraries (e.g., scikit-learn, XGBoost; TensorFlow/PyTorch a plus).
• Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks experience is a plus.
• Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics).
• Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus.
• Bonus: experience with Elasticsearch or DynamoDB; workflow tools such as Airflow for automating data pipelines.
• Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback.
• Please note that sponsorship is not available at this time and that you must be located within 45 miles of a talent hub to be considered.
Responsibilities
• Contribute to the design and implementation of machine learning, data mining, statistical, and graph-based algorithms to analyze very large datasets for identity verification and anomaly detection.
• Analyze large datasets to help develop and refine entity-resolution and identity-matching algorithms that drive Socure’s KYC and compliance solutions.
• Build and maintain components of data-processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3).
• Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals.
• Help evaluate new third-party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance.
• Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing.
• Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives.
• Communicate findings in a clear, structured way to peers and cross-functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade-offs.
• Work effectively in a fast-paced, cross-functional environment; demonstrate ownership of well-scoped tasks and follow through to completion.
Benefits
• DS2: $140K – $170K
• Offers equity
• Offers bonus
• This is a base salary range for this job based on the job requirements.
• Base pay is only one component of Socure's compensation; our total rewards package includes equity, benefits, and an annual bonus or a commission plan.
• Please note: we have set limits on applications for this role. Candidates may not apply more than 3 times in any 30-day span for any job at Socure, and may not re-apply to the same role within 30 days.
• You may have to go into the office 2-3 times a week.
• San Francisco, CA
• New York City, NY