Data Engineering Intern, Staples, Inc., Framingham, MA
Posted January 13, 2020
Position Overview
The Supply Chain Analytics team is looking for an exceptionally talented Data Engineering Intern who has the skills in modernizing and improving our data infrastructure from the ground up.
If you are passionate about working with large data sets (structured/unstructured), building large scale data processing platforms, implementing world class data governance and operational controls, solving complex performance challenges and building robust ETL pipelines then this is the job for you!
You will be responsible for the development and ongoing maintenance of the Supply Chain data mart in the Microsoft Azure cloud platform. Our team will be focused on development in Databricks + ADLS and Snowflake. Your interests in big data will help Staples build large scale distributed applications that serve to accelerate our growth and profitability as well as identify opportunities for optimization
Responsibilities
-
Create and optimize our data pipeline architecture
-
Build the data access platform for our data science and business teams
-
Design and implement modernized ETL through cloud-based solutions (Microsoft Azure)
-
Working on Databricks (supports Python, R, Scala and SQL) and Snowflake (SQL)
-
Design and develop large-scale data structures for business intelligence analytics by using data mining tools
-
Add and improve logging and monitoring to current solutions
-
Provide ad hoc queries and analysis
-
Develop and maintain a data-infrastructure comprising internal, external, and transformed data.
-
Gathering, storing, transforming, and cataloging data and processes
-
Collaborate with Supply Chain teams to ideate and receive instruction on business requirements, to inform the above job duties
-
Maintain strong partnerships and collaborate with other teams
Application Process
Please apply on Handshake by visiting this link. The application deadline is February 29th, 2020.