Senior Software Engineer - Data Infrastructure
Plaid
Responsibilities
- Contribute towards the long-term technical roadmap for data-driven and machine learning iteration at Plaid
- Leading key data infrastructure projects such as improving ML development golden paths, implementing offline streaming solutions for data freshness, building net new ETL pipeline infrastructure, and evolving data warehouse or data lakehouse capabilities.
- Working with stakeholders in other teams and functions to define technical roadmaps for key backend systems and abstractions across Plaid.
- Debugging, troubleshooting, and reducing operational burden for our Data Platform.
- Growing the team via mentorship and leadership, reviewing technical documents and code changes.
Qualifications
- 5+ years of software engineering experience
- Extensive hands-on software engineering experience, with a strong track record of delivering successful projects within the Data Infrastructure or Platform domain at similar or larger companies.
- Deep understanding of one of: ML Infrastructure systems, including Feature Stores, Training Infrastructure, Serving Infrastructure, and Model Monitoring OR Data Infrastructure systems, including Data Warehouses, Data Lakehouses, Apache Spark, Streaming Infrastructure, Workflow Orchestration.
- Strong cross-functional collaboration, communication, and project management skills, with proven ability to coordinate effectively.
- Proficiency in coding, testing, and system design, ensuring reliable and scalable solutions.
- Demonstrated leadership abilities, including experience mentoring and guiding junior engineers.
- [Nice to have] Experience with Databricks, Airflow, AWS EMR