Sanofi Group Senior Data Engineer in Cambridge, Massachusetts
To enable the next generation of big data solutions at Sanofi, we are looking for a senior data engineer to join our Data Engineering team to lead strategic projects in architecting and engineering new computational platforms based on cloud solutions. These core platforms will be strategic for a successful operation of Sanofi's data ecosystem and industrialization of data science.
You will architect advanced technical solutions that enable Sanofi's data scientists and data engineers to quickly process new datasets which will both reduce time-to-product and increase productivity. You will design and develop software and tools to take advantage of on-demand availability of cloud resources, scalability, and parallel processing power. You will help chart the future directions for the Enterprise Data Lake.
Scope, plan and manage cloud infrastructure enhancements and upgrades.
Design and develop custom software solutions for data lake components to unify and standardize use patterns.
Partner with the cross-functional engineering teams and vendors to define and implement highly scalable and reliable global data stores with elastic computational power.
Architect and implement data governance and security for the data platforms.
Conduct code reviews to ensure adherence to coding best practices.
Individual contributor of code.
Work with Cloud Infrastructure DevOps team members to build a robust continuous integration, delivery and deployment platform.
Develop automation tools and services to minimize delivery time and increase data engineer and data scientist productivity.
Work with other Data Engineers to optimize data models and workflows
Significant experience (5+ years) in working with scalable high performance computing systems and public clouds like AWS or GCP.
Proficiency in programming with Python.
Experience in programming with Apache Spark (pyspark) for distributed data processing.
AWS CDK to model and provision cloud application resources or experience with other infrastructure-as-code tools.
Experience in automating operational needs and developing tools at scale.
Ability to create and maintain continuous integration and deployment workflows in GitLab.
Strong understanding of Linux systems, Docker, and container orchestration.
Experience with version control (git), issue tracking, and agile development methodologies.
Worked with remote teams before.
BS, MS or PhD in Computer Science, Engineering or a related technical field.
Proficient in C++, Rust or a similar language.
Sanofi Inc. and its U.S. affiliates are Equal Opportunity and Affirmative Action employers committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race; color; creed; religion; national origin; age; ancestry; nationality; marital, domestic partnership or civil union status; sex, gender, gender identity or expression; affectional or sexual orientation; disability; veteran or military status or liability for military status; domestic violence victim status; atypical cellular or blood trait; genetic information (including the refusal to submit to genetic testing) or any other characteristic protected by law.
At Sanofi diversity and inclusion is foundational to how we operate and embedded in our Core Values. We recognize to truly tap into the richness diversity brings we must lead with inclusion and have a workplace where those differences can thrive and be leveraged to empower the lives of our colleagues, patients and customers. We respect and celebrate the diversity of our people, their backgrounds and experiences and provide equal opportunity for all.