wagey.ggwagey.ggv1.0-68eec7a-3-May
Browse Tech JobsCompaniesFeaturesPricingFAQs
Log InGet Started Free
Jobs/Senior Software Engineer Role/Unknown - Careers - Senior Software Engineer, Data Ingestion Platform
Unknown

Unknown - Careers - Senior Software Engineer, Data Ingestion Platform

Unknown$218k - $218k2d ago
In OfficeSeniorWWCloud ComputingData AnalyticsSenior Software EngineerSenior Data ScientistJavaGoPythonScalaKafkaAWSGCPAzureSnowflakeApache SparkDatabricksAirflowTerraform

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click
Apply in One Click

Requirements

• 8+ years of experience in software engineering or data platform development, with a focus on building scalable data systems or distributed infrastructure. • Strong programming proficiency in languages such as Java, Python, Scala, or Go, with experience developing data frameworks, libraries, or services. • Hands-on experience with streaming data systems and technologies such as Apache Kafka, Kafka Connect, or similar distributed messaging platforms. • Solid understanding of Change Data Capture (CDC), database replication patterns, and data lake or Lakehouse architectures. • Experience with modern data storage formats and table formats such as Apache Iceberg or Delta Lake. • Experience with cloud-based data ecosystems (AWS, GCP, or Azure) and infrastructure-as-code tools. • Technologies We Use and Teach • Streaming & Messaging: Apache Kafka, Schema Registry, Kafka Connect, Debezium • Data Platform: Databricks, Snowflake • Data Processing & Storage: Apache Spark, Apache Iceberg, Delta Lake, Apache Airflow • Cloud & Infrastructure: AWS, Terraform

Responsibilities

• Design, build, and operate scalable data replication and ingestion pipelines that move data from production databases, event streams, and third-party sources into Block's Lakehouse. • Develop and enhance Kafka Iceberg connectors and data loading frameworks, enabling reliable, low-latency data delivery to Snowflake and Databricks. • Drive the modernization of Block's CDC platform — evaluating and implementing next-generation approaches for database replication, including cloud-native alternatives, and Iceberg-based ingestion patterns. • Build self-service tooling and observability features that empower internal teams to onboard, monitor, and troubleshoot their own data pipelines with minimal support. • Collaborate with data engineering, platform infrastructure, and product teams to define data contracts, improve service encapsulation, and reduce tight coupling between operational databases and analytics consumers. • Contribute to the unification of Block's data ingestion architecture by identifying opportunities to consolidate overlapping systems and reduce infrastructure complexity. • Design and implement solutions for PII detection, masking, and privacy-compliant data handling within ingestion pipelines, ensuring sensitive data is properly classified, protected, and governed in accordance with Block's privacy policies and regulatory requirements (e.g., GDPR, CCPA). • Establish and promote best practices for data pipeline reliability, cost optimization, schema management, and compliance across the ingestion platform.

Get Started Free

No credit card. Takes 10 seconds.

Privacy·Terms··Contact·FAQ·Wagey on X