Python Developer with Containers (GenAI) - Agama Solutions
New York, NY
About the Job
Python Developer with Containers (GenAI)
Duration: 12-18 months
W2 Candidates only
Bryant Park New York New York 10038
(we have 2 roles 1 Mid/Sr level and other is Lead)
Key Responsibilities:
o Collaborating with team and with partners in QSDG and Platform to define, build, test, and deploy platform meeting requirements.
o Defining & enforcing standards & best practices related to platform management.
o Evaluate third-party products to meet scalability, resiliency, and performance.
o Build new or leverage existing platforms (Lab, SDLC) by automating setup, installation, verification, monitoring & provisioning processes.
o Maintaining a central, version controlled, inventory of all environments, including their current versions and configuration settings.
o Planning & allocating environments to teams depending on their delivery lifecycle.
o Analyzing data to identify and proactively address environment-related issues.
o Working with project teams to manage costs & improve efficiency of environments.
o Partnering closely with Prod Support and Engineering to deploy & support applications.
Required Skills:
o Prior experience designing, implementing, and maintaining end to end environments, from POC to production.
o Deep understanding of hardware, software, network, data & application configuration.
o DevOps processes and CICD tooling (Jira, Git/Bitbucket, Jenkins, Datival, Artifactory, Ansible), orchestration & automation.
o Multi-tier (Python based) web application stack microservices/serverless/loosely coupled architecture.
o Mix of on-premises and cloud based, containerized (Docker/Kubernetes/OpenShift) deployment models.
o Familiarity with no-SQL (MongoDB) and relational (SQL Server/Oracle) databases, and other various forms of Object, Vector, and file stores.
o Unix scripting, SQL, work scheduling tools.
o Setting up infrastructure monitoring & reporting for GPU/CPU & memory consumption, inference latency and model performance.
o Performance profiling & optimization techniques to maximize performance & resource consumption / throughput and minimize latency.
o Load balancing, high availability & backup recovery strategies/techniques.
Non-Technical
o Ability to communicate effectively to a wide range of audience (business stakeholders, developer & support teams).
o Meticulous & highly organized.
o Adaptable to shifting & competing priorities.
o Skilled at delegating, mentoring & setting expectations.
o Critical thinking skills to diagnose & resolve complex issues.
Experience desired:
o Familiarity with AI & Deep learning, modeling techniques, Generative AI application stack.
o Proficiency in Python and familiarity with AI frameworks (TensorFlow/PyTorch).
o GPU cluster management (CUDA/Kubernetes), auto-scaling & scheduling (Triton Inference Server).
Qualifications:
o 7-10 years in similar roles. Preferably in the financial industry.
o Higher education in IT field or relevant previous work experience.