Python PySpark with AWS - Tanisha Systems
Plano, DE 10016
About the Job
Job description:-
Title: Python PySpark with AWS
Location: Plano, TX/ Wilmington, DE
Duration: ... 12+ months
Additional Job Details:
Duties and responsibilities:
Collaborate with the team to build out features for the data platform and consolidate data
assets
Build, maintain and optimize data pipelines built using Spark
Advise, consult, and coach other data professionals on standards and practices
Work with the team to define company data assets
Migrate CMS data platform into Chase s environment
Partner with business analysts and solutions architects to develop technical
architectures for strategic enterprise projects and initiatives
Build libraries to standardize how we process data
Loves to teach and learn, and knows that continuous learning is the cornerstone of every
successful engineer
Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and
is able to intelligently convey such knowledge
Implement automation on applicable processes
Mandatory Skills:
5+ years of experience in a data engineering position
Proficiency is Python (or similar) and SQL
Strong experience building data pipelines with Spark
Strong verbal & written communication
Strong analytical and problem solving skills
Experience with relational datastores, NoSQL datastores and cloud object stores
Experience building data processing infrastructure in AWS
Bonus: Experience with infrastructure as code solutions, preferably Terraform
Bonus: Cloud certification
Bonus: Production experience with ACID compliant formats such as Hudi, Iceberg or
Delta Lake
Bonus: Familiar with data observability solutions, data governance frameworks
Requirements
Bachelor s Degree in Computer Science/Programming or similar is preferred
Right to work
Must have legal right to work in the USA
About We, Inc.
We, founded in 2002 in Massachusetts-USA, is a leading provider of Custom Application Development and end-to-end IT Services to clients globally. We use a client-centric engagement model that combines local on-site and off-site resources with the cost, global expertise and quality advantages of off-shore operations. We deliver Custom Application Development, Application Modernization, Business Process Outsourcing and Professional IT Services from office locations in USA and *.
We services clients in Government, Banking & Financial Markets, Insurance, Healthcare, Retail & Consumer Goods, Energy & Utilities, Life Sciences, Telecom, Manufacturing and Transportation Industries around the globe. Our engagement model provides a flexible operational environment that empowers our clients with the right levels of control.
Want to read more about We? Visit us at www.tanishasystems.com
Website Open Jobs
Title: Python PySpark with AWS
Location: Plano, TX/ Wilmington, DE
Duration: ... 12+ months
Additional Job Details:
Duties and responsibilities:
Collaborate with the team to build out features for the data platform and consolidate data
assets
Build, maintain and optimize data pipelines built using Spark
Advise, consult, and coach other data professionals on standards and practices
Work with the team to define company data assets
Migrate CMS data platform into Chase s environment
Partner with business analysts and solutions architects to develop technical
architectures for strategic enterprise projects and initiatives
Build libraries to standardize how we process data
Loves to teach and learn, and knows that continuous learning is the cornerstone of every
successful engineer
Has a solid understanding of AWS tools such as EMR or Glue, their pros and cons and
is able to intelligently convey such knowledge
Implement automation on applicable processes
Mandatory Skills:
5+ years of experience in a data engineering position
Proficiency is Python (or similar) and SQL
Strong experience building data pipelines with Spark
Strong verbal & written communication
Strong analytical and problem solving skills
Experience with relational datastores, NoSQL datastores and cloud object stores
Experience building data processing infrastructure in AWS
Bonus: Experience with infrastructure as code solutions, preferably Terraform
Bonus: Cloud certification
Bonus: Production experience with ACID compliant formats such as Hudi, Iceberg or
Delta Lake
Bonus: Familiar with data observability solutions, data governance frameworks
Requirements
Bachelor s Degree in Computer Science/Programming or similar is preferred
Right to work
Must have legal right to work in the USA
About We, Inc.
We, founded in 2002 in Massachusetts-USA, is a leading provider of Custom Application Development and end-to-end IT Services to clients globally. We use a client-centric engagement model that combines local on-site and off-site resources with the cost, global expertise and quality advantages of off-shore operations. We deliver Custom Application Development, Application Modernization, Business Process Outsourcing and Professional IT Services from office locations in USA and *.
We services clients in Government, Banking & Financial Markets, Insurance, Healthcare, Retail & Consumer Goods, Energy & Utilities, Life Sciences, Telecom, Manufacturing and Transportation Industries around the globe. Our engagement model provides a flexible operational environment that empowers our clients with the right levels of control.
Want to read more about We? Visit us at www.tanishasystems.com
Website Open Jobs
Source : Tanisha Systems