Lithosquare - Data Engineer
Requirements
• 5+ years of experience in Data Engineering, with a proven track record of building scalable production systems; • Geospatial & remote sensing expertise: deep proficiency in processing raster, vector, and point cloud data, with a solid understanding of coordinate reference systems (CRS) and geospatial indexing; • Geospatial & remote sensing expertise: • Expertise in python & SQL: ability to write highly optimized code and complex analytical queries; • Expertise in python & SQL: • AI-Driven engineering: proven experience integrating LLMs/GenAI into data pipelines to automate the extraction and classification of complex, unstructured documents; • AI-Driven engineering: • Architectural vision: ability to build a modern analytics and geospatial stack from a blank slate, including tiling services (COG, MVT) for web visualization; • Architectural vision: • Rigorous data modeling: strong foundation in data warehousing concepts and performance optimization; • Rigorous data modeling: • Infrastructure fluency: understanding of Kubernetes and containerized environments for deploying data workloads; • Infrastructure fluency: • Mission-driven: a genuine passion for the energy transition and solving "hard" physical-world problems through digital innovation • Mission-driven:
Responsibilities
• Build intelligent ingestion: design and scale robust pipelines to harvest data from diverse sources, including satellite imagery (multispectral), LiDAR point clouds, and public/private multimodal geological records; • Build intelligent ingestion: • Implement self-adjusting pipelines: integrate GenAI/LLMs into our data workflows to create auto-adjustable pipelines capable of handling schema shifts and unstructured document extraction; • Implement self-adjusting pipelines: • Geospatial processing & tiling: architect high-performance systems for raster processing and vector tiling (COG, GeoJSON) to enable real-time 3D visualization and cartography; • Geospatial processing & tiling: • Own the analytics stack: architect and deploy our internal analytics infrastructure using open-source tools to monitor mining operations and field processes; • Own the analytics stack: • Quantify product value: build data models and dashboards to track platform usage and quantify the scientific and economic value delivered to our geologists; • Quantify product value: • Lead data modeling: design and maintain scalable data schemas that serve as the single source of truth for the entire company; • Lead data modeling: • Cross-functional collaboration: partner with AI engineers and geologists to align on data ingestion requirements, structural modeling, and analytics; • Cross-functional collaboration: • Production ownership: deploy and operate data services in production (cloud services), ensuring high availability, data observability, and strict security for sensitive exploration data; • Production ownership: • Tech advocacy: continuously evaluate and implement emerging open-source data technologies to maintain our competitive edge in data processing. • Tech advocacy: • Technical Stack • Languages: Python (expert level), SQL (GIS), Bash • Languages: • Geospatial Libraries: GDAL/OGR, Rasterio, Shapely, Fiona, PyProj, Geopandas • Geospatial Libraries: • Data Formats & Tiling: GeoTIFF / COG, GeoParquet, LAS/LAZ, Zarr, Vector Tiles • Data Formats & Tiling: • Orchestration: Temporal.io, Airflow or Dagster • Orchestration: • AI Integration: LLM orchestration, vector databases, prompt engineering for ETL • AI Integration: • Cloud & Infrastructure: Docker, kubernetes, terraform • Cloud & Infrastructure: • Analytics & BI: dbt, metabase, open-source observability tools • Analytics & BI:
Benefits
• 🏢 Offices located in the heart of Paris • 🌱 Strong culture of ownership & entrepreneurship, with clear growth paths as the company expand • 🌍 Opportunity to significantly contribute to energy transition • 👥 Collaborative work environment with world-class experts in geology, AI, and data science • 🔄 Flexible work arrangements enabling work-life balance • 🍽️ Meal vouchers and premium health insurance coverage (Alan)
Apply in one click
Upload My Resume
Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT