Lead Python Engineer, Data Infrastructure
Upload My Resume
Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT
Requirements
• 5+ years of experience in Python development, with prior experience in a leadership or senior role. • Strong programming skills and deep knowledge of Python data structures and libraries. • Solid understanding of HTML, CSS, JavaScript, HTTP protocols, cookies, headers, and DOM manipulation. • Experience with data cleaning, processing, and storage in various database systems like PostgreSQL. • Strong problem-solving and analytical skills. • Excellent attention to detail and data accuracy. • Effective communication skills for collaborating with cross-functional teams. • Preferred • Experience with web scraping and data extraction. • Experience using frameworks and libraries such as Scrapy, Crawlee, Playwright, etc. • Familiarity with AWS and containerization technologies (Docker, Kubernetes).
Responsibilities
• Lead the design and implementation of robust, efficient, and large-scale web scraping platforms using Python and associated frameworks. • Mentor junior developers, and provide technical guidance. Conduct code reviews to ensure the delivery of high-quality, maintainable code. • Develop sophisticated strategies to handle and bypass advanced anti-bot countermeasures like CAPTCHAs, Cloudflare, and IP blocking, while ensuring all practices adhere to legal and ethical guidelines and website terms of service. • Collaborate with data analysts and data engineers to define data requirements and ensure seamless integration of scraped data into databases. • Optimize scrapers for speed, performance, and stability; set up real-time monitoring and alerting systems to quickly detect and resolve failures or site changes. • Create clear technical documentation and communicate effectively with cross-functional teams and stakeholders to ensure alignment and manage expectations.
Benefits
• A small, collaborative, and fast-moving team where your contributions will have an outsized impact. • The chance to work on meaningful problems in regulatory technology. • Remote-first culture with flexibility and autonomy. • Recognition in the regtech space for our innovation and customer value.