AI Enabled Cloud & DevOps Senior Engineer

FiservSunnyvale, NJ
1dOnsite

About The Position

We are seeking an AI Enabled Cloud & DevOps Senior Engineer to revolutionize our infrastructure and operations through AI-powered automation and agentic workflows. You will design and implement intelligent systems that autonomously manage cloud infrastructure, CI/CD pipelines, and operational processes. This includes modernizing legacy infrastructure and build systems using AI capabilities. This role focuses on building AI agents and automation that transform how we provision, monitor, and operate cloud environments—not on building AI/ML models, but on leveraging AI capabilities to create self-managing, intelligent infrastructure.

Requirements

  • 10+ years of DevOps/SRE/Cloud Engineering experience with deep expertise in AWS, GCP, or Azure
  • Strong proficiency in Python and/or Java for automation and tooling development
  • Hands-on experience with major LLM-powered coding assistants and the ability to evaluate emerging tools
  • Experience with agentic AI frameworks (LangChain, LangGraph, CrewAI) or strong interest in learning
  • Expert-level knowledge of infrastructure-as-code (Terraform, CloudFormation, Pulumi)
  • Deep expertise in CI/CD tools (GitHub Actions, GitLab CI, ArgoCD) and GitOps practices
  • Kubernetes expertise including deployment, scaling, operators, and troubleshooting
  • Experience with observability stacks (Datadog, Prometheus, Grafana, ELK) and AIOps concepts

Nice To Haves

  • Experience in fintech or regulated industries with compliance requirements (SOC 2, PCI-DSS, FedRAMP)
  • Experience modernizing legacy infrastructure or migrating from on-premise to cloud
  • Background in prompt engineering or working with LLM APIs
  • Experience with MCP (Model Context Protocol) or A2A protocols for tool integration
  • Cloud certifications (AWS Solutions Architect, GCP Professional Cloud Architect)
  • Experience with chaos engineering, resilience testing, or self-healing infrastructure

Responsibilities

  • Design and build agentic automation systems that autonomously manage cloud infrastructure provisioning, scaling, and optimization across AWS, GCP, or Azure
  • Implement AI-powered CI/CD pipelines with intelligent code review, security scanning, test generation, and automated deployment decisions
  • Create agentic incident response systems that automatically detect, diagnose, and remediate infrastructure issues with appropriate human oversight
  • Leverage major LLM-powered coding assistants to accelerate Terraform, Kubernetes, and configuration management development
  • Use AI tools to document, analyze, and modernize legacy infrastructure, build scripts, and deployment processes
  • Build intelligent cost optimization agents that analyze usage patterns, recommend right-sizing, and automate resource management
  • Develop AI-assisted security and compliance automation including posture management, drift detection, and automated remediation
  • Implement intelligent monitoring and observability systems that use AI to correlate events, predict issues, and surface actionable insights
  • Create self-service infrastructure platforms with AI-powered assistance for development teams
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service