AMAX is seeking a skilled Cloud Engineer with expertise in GPU workloads to join our team. In this role, you will be responsible for designing, deploying, and managing cloud infrastructure specifically tailored for GPU hosting. You will optimize GPU utilization and performance within cloud environments, ensuring that systems run efficiently and securely.
Essential Functions
Design, deploy, and manage cloud infrastructure for GPU hosting.
Optimize GPU utilization and performance in a cloud environment.
Implement and manage containerized workloads using Docker and Kubernetes.
Automate infrastructure deployment using IaC tools like Terraform or CloudFormation.
Ensure the security and compliance of cloud infrastructure.
Monitor system performance and troubleshoot issues as they arise.
Collaborate with cross-functional teams to deliver scalable cloud solutions.
Requirements
Bachelor’s degree in Computer Science, Engineering, or a related field.
3+ years of experience in cloud engineering, with a focus on GPU workloads.
Proficiency with AWS, GCP, or Azure.
Experience with GPU-specific services like AWS EC2 P3 and P4 instances, GCP's NVIDIA Tesla, and Azure's NC-series.
Strong knowledge of GPU technology and programming (CUDA, OpenCL).
In-depth understanding of GPU architecture, specifically NVIDIA GPUs.
Experience with containerization (Docker) and orchestration (Kubernetes).
Familiarity with IaC tools (Terraform, CloudFormation).
Excellent problem-solving skills and ability to work in a fast-paced environment.
Benefits
Medical Insurance
Dental Insurance
Vision Insurance
401(k)
Flexible spending account
Commuter benefits
Disability insurance
We also have a perfect location for all types of commuters: AMAX is located right between I-680 and I-880. Warm Springs/South Fremont BART station and bus stops are within a 10-minute walking distance. 5 grocery stores, 6+ coffee/tea places, and numerous restaurants within 1 mile. Feel free to try the delicious fusions or grab your daily groceries after work!