Abacus Insights - Principal Site Reliability & Forward Deployed Engineer
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT
Requirements
• Translate client needs into solution deliverables such as reports, dashboards, data extracts, and analytical views • Solution Validation & UAT • Validate delivered solutions against documented requirements and acceptance criteria • Support and coordinate User Acceptance Testing (UAT) with client teams, working in alignment with internal QA resources • Cross‑Functional Collaboration • Partner with Product, Engineering, Account Management, Program Management, and Sales teams to deliver client solutions • Contribute client insights to product roadmap discussions and solution improvements • Client Relationship & Enablement • Act as a trusted client partner throughout implementation and ongoing engagement • Support client onboarding, training, and enablement activities • Assist internal teams in understanding client use cases and solution intent • Product & Process Improvement • Maintain strong awareness of product capabilities, enhancements, and releases • Identify opportunities to improve processes, solution quality, and client satisfaction • Bachelor’s degree in Business, Computer Science, Information Systems, or equivalent relevant professional experience • 5+ years of experience in business analysis, solution management, or client‑facing roles in software or data‑driven environments • Proven experience gathering, documenting, and validating business requirements • Ability to communicate effectively with both technical and non‑technical audiences • Strong organizational skills with the ability to manage multiple priorities • Analytical, detail‑oriented problem solver with a collaborative mindset • Self‑motivated, adaptable, and comfortable working in a fast‑paced environment • What you’ll get in return • Competitive Leave & Benefits • Comprehensive health coverage • Equity for every employee – share in our success • Growth-focused environment – your development matters here • Work arrangements • Work arrangements • Standard hours: 8 hours/day, 5 days/week • Location: Pune, Hybrid (3 days a week in office) • Shift: 12pm-9pm IST
Responsibilities
• Production Operations & Incident Response • Act as a senior technical escalation point during production incidents • Lead real-time incident triage, mitigation, and recovery efforts • Drive root cause analysis (RCA) with a focus on systemic, long-term fixes • Identify recurring failure patterns and push for architectural or operational improvements • Partner with Customer Success and Engineering to manage customer impact during incidents • Sustaining Engineering & Post‑Launch Ownership • Own post-launch reliability, stability, and operational quality of core systems • Investigate and resolve complex field issues and production defects • Ensure fixes developed during incidents or customer escalations are up streamed into the core product • Improve operational readiness of services through better runbooks, monitoring, and alerting • Reduce operational toil by converting repeated manual work into automation • Forward Deployed / Customer‑Facing Engineering • Engage directly with strategic customers to solve real-world, production-grade technical challenges • Support complex deployments, integrations, and escalations in customer environments • Act as a trusted technical partner to customers during high-impact issues • Translate customer learnings into concrete product, platform, and operational improvements • Contribute to reusable tools, playbooks, and best practices that accelerate future deployments • AWS & Databricks Technical Expertise • Serve as a subject matter expert for AWS-hosted production systems • AWS compute, storage, networking, IAM, and security • Databricks jobs, clusters, and Spark-based data pipelines • Debug performance degradation, scalability issues, job failures, and data correctness problems • Partner with platform and data teams to harden systems for reliability, scale, and operability • Software Development & Automation • Automate operational workflows • Improve reliability and observability • Eliminate manual intervention and reduce incident frequency • Contribute primarily in Python, with exposure to JVM-based systems as needed • Review code with a strong emphasis on operability, resiliency, and maintainability • Advocate for “build it so it can be operated” engineering standards • Technical Leadership & Collaboration • Provide technical leadership without formal authority, influencing design and operational decisions • Mentor engineers through pairing, reviews, and incident leadership • Collaborate closely with Product, Engineering, Data, and Customer teams • Operate effectively in high-pressure, ambiguous environments, especially during customer-impacting incidents
Benefits
• What you’ll get in return: • What you’ll get in return • Unlimited paid time off – recharge when you need it • Work from anywhere – flexibility to fit your life • Comprehensive health coverage – multiple plan options to choose from • Equity for every employee – share in our success • Growth-focused environment – your development matters here • Monthly cell phone allowance – stay connected with ease#LI-MS1
No credit card. Takes 10 seconds.