Big Data Architect - ApTask
Not Available, PA
About the Job
About Client:
The Client is a leading global IT services and consulting company, providing a wide range of services to clients in various industries, including banking, financial services, retail, manufacturing, healthcare, and more. It is one of the largest employers in the IT industry and has a vast and diverse workforce. The company places a strong emphasis on employee training and development. Client is known for its commitment to innovation and invests in research and development to stay at the forefront of technological advancements.
It offers a comprehensive set of services, including:
IT Services: Application development, maintenance, and testing.
Consulting: Business consulting, IT strategy, and digital transformation.
Business Process Outsourcing (BPO): Outsourcing of business processes to improve efficiency.
Enterprise Solutions: Implementation and support of enterprise-level software solutions. Digital Services: Services related to digital technologies, such as analytics, cloud, and IoT.
Salary Range: $120K-$160K/Annum
Job Description:
- 8+ years of data engineering/architecture experience in Waterfall and Agile Methodology, with a proven track record of delivering complex data solutions, preferably in the financial sector.
- Proficiency in Python for advanced data analysis, scripting, automation, and integration with Big Data platforms and ecosystems.
- Strong SQL experience (Oracle, Hive, Impala) in designing and implementing complex data models, DDLs, and DMLs, with a focus on performance optimization and scalability.
- Proficiency in shell scripting for automation, task orchestration, and system integration in Unix/Linux environments.
- Expertise in Cloudera Big Data Technologies, including Cloudera Distribution of Hadoop (CDP), Apache Hadoop, Apache Spark, Apache Hive, Apache Impala, and Apache HBase, with hands-on experience in deploying, configuring, and optimizing these technologies for large-scale data processing and analytics.
- In-depth knowledge of data governance policies, metadata management, and regulatory compliance requirements, with a focus on protecting sensitive data and ensuring data privacy and security.
- Proven experience migrating data from relational databases to Hadoop platforms, with a deep understanding of data migration strategies, tools, and best practices.
- Ability to integrate complex big data infrastructure with diverse systems, applications, and data sources, using industry-standard APIs, connectors, and integration patterns.
- Familiarity with data encryption tools and techniques, including Protegrity, for securing data at rest and in transit, and ensuring compliance with industry regulations and data protection standards.
- Experience working with development tools like Jenkins for continuous integration, deployment, and pipeline automation, streamlining the software development lifecycle and ensuring rapid delivery of high-quality solutions.
- Proficiency in Apache Oozie for workflow management, job scheduling, and orchestration of complex data processing workflows in Hadoop ecosystems, ensuring reliable and efficient execution of data pipelines and batch processing jobs.
- Hands-on experience with ELK (Elasticsearch, Logstash, Kibana) stack, including configuration, customization, and management of Kibana dashboards for real-time monitoring and visualization of system metrics and application logs.
- Effective communication, presentation, and organizational skills, with the ability to articulate complex technical concepts to diverse audiences and influence stakeholders at all levels of the organization.
- Proficiency in Microsoft Visio, Excel, PowerPoint, and Word, with the ability to create compelling visualizations, documentation, and presentations to support decision-making and knowledge sharing.
- Demonstrated leadership, analytical, and problem-solving skills, with a track record of driving innovation, fostering collaboration, and delivering results in dynamic, fast-paced environments.
- Excellent written and oral communication skills, with the ability to convey complex technical concepts clearly and persuasively to technical and non-technical audiences.
- Proven ability to work independently and as part of a high-performing team, with a collaborative and customer-centric approach to problem-solving and decision-making.
- Commitment to continuous learning and professional development, with a passion for staying abreast of emerging technologies, trends, and best practices in data management, analytics, and related fields.
- Strong critical thinking and decision-making abilities, with the capacity to prioritize competing demands, discern critical from minor issues, and innovate creative solutions to complex challenges.
- Big Data Architect, Cloudera Big Data Tool stack (Hive ,Impala, OoZie), Pyspark, Teradata, Shell Script, Unix programing, Hadoop
About ApTask:
ApTask is a leading global provider of workforce solutions and talent acquisition services, dedicated to shaping the future of work. As an African American-owned and Veteran-certified company, ApTask offers a comprehensive suite of services, including staffing and recruitment solutions, managed services, IT consulting, and project management. With a focus on excellence, collaboration, and innovation, ApTask provides unparalleled opportunities for professional growth and development. As a member of the ApTask team, you will have the chance to connect businesses with top-tier professionals, optimize workforce performance, and drive success across diverse industries. Join us at ApTask and be part of our mission to empower organizations to thrive while fostering a diverse and inclusive work environment.
Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview.
Candidate Data Collection Disclaimer:
At ApTask, we prioritize safeguarding your privacy. As part of our recruitment process, certain Personally Identifiable Information (PII) may be requested by our clients for verification and application purposes. Rest assured, we strictly adhere to confidentiality standards and comply with all relevant data protection laws. Please note that we only collect the necessary information as specified by each client and do not request sensitive details during the initial stages of recruitment.
If you have any concerns or queries about your personal information, please feel free to contact our compliance team at businessexcellence@aptask.com