Java R&D Engineer/Expert - Stability Engineering Platform
Responsibilities
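• Research how to use technology to quickly detect problems, locate them, and recover from failures, meeting the 1-5-10 target;
• Own the definition and rollout of SLOs/SLAs, taking a goal-driven approach to ensuring business stability;
• Continuously build the stability assurance tooling platform, including the inspection system, the root-cause diagnosis system, and the risk library, so that problem detection, localization, and analysis become more accurate and efficient;
• Define stability standards and drive their adoption, ensuring that product design and code conform to stability principles;
• Keep track of cutting-edge industry developments, organize team learning, and introduce and advance new technology upgrades when appropriate.
What We Look For In You
• Bachelor's degree or above in Computer Science or a related field, with 7+ years of R&D and architecture experience; experience developing infrastructure or frameworks is a plus;
• Proficient in Java and in applying the Spring Cloud microservices stack, with good coding style and algorithm skills;
• Proficient with data computing and analysis tools such as Flink, Elasticsearch, ClickHouse, SkyWalking, Prometheus/VictoriaMetrics, and Python;
• Experience in RAG/Agent development and tuning is a plus;
• Good at finding, analyzing, and solving problems, with clear analytical logic and big-picture architectural thinking;
• Product-oriented mindset; familiar with the R&D process and with incident analysis and incident handling processes; good at using tools to solve problems;
• Good communication and leadership skills; able to collaborate with cross-functional teams and drive stability-related work; ability to communicate in English is a plus;
• Hands-on experience with stability assurance, inspection systems, root-cause diagnosis systems, or chaos engineering systems is a plus.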
Benefits
• Comprehensive insurance coverage for employees and their dependants
• More that we would love to tell you about along the way!
Disclaimer:
• Please note that Hong Kong is a group-level service hub, and OKX does not carry on a business of operating a virtual asset trading platform in Hong Kong.
• All official OKX vacancies are published on this website. While roles may appear on selected third-party platforms from time to time, information on other sites may be inaccurate or outdated. If in doubt, please apply directly through our official careers website.
• Information collected and processed as part of the recruitment process of any job application you choose to submit is subject to OKX's Candidate Privacy Notice.