Job Description
Please find below the complete JD:
Lead design and optimization of scalable, secure AWS cloud architectures across compute, storage, networking, and security services.
Build and manage AI/ML pipelines leveraging AWS services like Sagemaker, Lambda, ECS/EKS, Bedrock, and DynamoDB.
Drive automation using IaC tools (Terraform/CloudFormation) and CI/CD pipelines for rapid, reliable deployments.
Enable MLOps best practices including model training, evaluation, deployment, and monitoring at scale.
Collaborate with product, data science, and engineering teams to deliver AI driven solutions.
Ensure compliance, cost optimization, observability, and high availability across cloud workloads.
Troubleshoot complex distributed systems and lead performance, reliability, and security improvements