Principal Software Engineer - Kubernetes Expert - Remote - Startup - Living Talent
Brooklyn, NY
About the Job
Principal Software Engineer with Kubernetes Expertise
Build cloud-native solutions optimizing compute for CPU and GPU.
The platform: Autonomous Cloud Orchestration based on K8s control layer.
- Startup (revenue-generating, Series A) - Product Co.
- Company size: 30
- Future unicorn
- REMOTE first culture
- Smart, fun, low-ego team culture
- Compensation: Base Salary 200++, Equity (future Unicorn)
Key Responsibilities
- Lead Design, Architecture & Development of K8s-based cloud infrastructure.
- Use K8s Controllers, Operators & CRs to Implement scalable, high-availability solutions.
- Integrate Karpenter, and/or other advanced tools for infrastructure optimization.
- Architect MLOps Middleware integration (dynamic workload migration, resource disaggregation).
- Build monitoring, logging & alerting systems.
- Drive infrastructure cost optimization through FinOps best practices in K8s deployments.
- Promote K8s best practices & mentor software engineers.
- Collaborate across teams to drive K8s adoption in multi-cloud and hybrid environments.
- Open-Source Contributions in the Kubernetes community.
Qualifications
- Kubernetes Expertise
- Designing, deploying, and managing K8s clusters (AKS, EKS, GKE, OpenStack, etc.).
- Hands-on experience with K8s core components (Karpenter, cluster autoscaler, CNI, CSI, CRI, CRD, operators).
- 5+ years in Kubernetes infrastructure.
- Contributing to open-source Kubernetes projects.
- 10+ years: software engineering experience.
- Go, Python, Bash, etc. (one or more).
- IaC tools proficiency (Terraform, Helm).
- CI/CD pipelines & DevOps practices.
- Excellent communication skills for both technical and non-technical stakeholders.
- Bachelor’s or Master’s degree in Computer Science or related field (preferred).
Preferred Experience
- GPU scheduling, container orchestration, HPC (high-performance computing) workloads.
- Multi-cloud & hybrid cloud deployments familiarity.
- MLOps platforms experience (Kubeflow, TFX, etc.).
- FinOps practices & cloud cost management experience/knowledge
Source : Living Talent