Senior Data Engineer - OCLC, Inc.
Dublin, OH 43017
About the Job
You have a life. We like that about you.
At OCLC, we believe you'll do the best work of your life when you're living the best life possible.
We work hard to build the technology that connects thousands of today's libraries. But we also work hard to make a job at OCLC a meaningful part of a balanced life- not a substitute for one.
Technology with a Purpose. OCLC supports thousands of libraries in making information more accessible and more useful to people around the world. OCLC provides shared technology services, original research and community programs that help libraries meet the ever-evolving needs of their users, institutions, and communities. With office locations around the globe, OCLC employees are dedicated to offering premier services and software to help libraries.
The Job Details are as follows:
Data Engineers are responsible for bringing robust, efficient, and integrated data models and products to life and sit at the intersection of business teams, Data Analysts, and Software Engineers.Responsibilities
- Collaborate with team members to collect business requirements, design data models and define successful analytics outcomes.
- Build and maintain data pipelines from internal databases and SaaS applications · Design and develop dbt code to extend OCLC’s Enterprise Data Model that meets the OCLC Data Team’s internal standards for style, maintainability, and best practices for a high-scale database environment. Maintain and advocate for these standards through code reviews and best practice socialization
- Create and maintain architecture and systems documentation.
- Provide data modeling expertise to the Data Science and Analytics teams through code reviews, pairing, and training to help deliver optimal, DRY, and scalable database designs and SQL queries
- Develop and maintain data mapping specifications based on the results of data analysis and functional requirements.
- Build automated Extract, Transform & Load (ETL) jobs based on data mapping specifications
- Maintain metadata structures needed for building reusable Extract, Transform & Load (ETL) components.
- Identify and resolve impediments to efficiency and enable the entire Data Program to iterate faster
- Help promote data innovation across OCLC with a willingness to experiment and to confront hard and complex problems
- Profile and analyze data for pipeline quality assurance, and own the QA process
- Conduct exploratory data analysis and generate visual summaries of data. Identify data quality issues proactively.
Qualifications
- 4+ years performing in a Data Engineering or Data Ops role.
- 4+ years’ experience designing, implementing, operating, and extending commercial enterprise dimensional models.
- 4+ years’ working with a large-scale Data Warehouse, preferably in a cloud environment.
- 2+ years building and deploying data solutions with cloud providers such as AWS, Azure, or GCP.
- 2+ years’ experience Big Data / Hadoop / Spark / Hive / NoSQL database engines (i.e. Cassandra or HBase).
- 2+ years experience using ETL tools such as dbt to build and maintain data pipelines.
- 4+ years writing SQL scripts for Snowflake SQL, MySQL, PostgreSQL, etc.
- 1+ year experience writing production-quality code in Python.
- 2+ years creating project plans to identify tasks, milestones, and deliverables.
- Positive, self-starting, solution-oriented mindset.
- Comfort working in a highly agile, iterative environment, and taking work ownership
- Demonstrated capacity to communicate complex technical requirements and business recommendations clearly and concisely.
- Experience working with unstructured datasets and ability to handle XML, JSON file formats.
- Bonus points for experience deploying machine learning or statistical. Models in production using tools like MLFlow or AWS Sagemaker.
This position demands a high level of technical proficiency and expertise, making it ideally suited for individuals who thrive in complex, technically challenging environments.
Working Conditions: Normal office environment.
ADA/EAA: The above statements cover what are generally believed to be the principal and essential functions of this job. Specific circumstances may allow or require some people assigned to the job to perform a somewhat different combination of duties.