Feature Lead, Platform Engineering

Bank of AmericaCharlotte, NC
1dOnsite

About The Position

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day. Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve. Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us! Job Description: We are seeking a Feature Lead to design, build, and scale a self-service platform supporting high-availability customer and internal applications. This role combines hands-on engineering with feature leadership: you will own platform features end-to-end—from discovery and design through delivery and L2/L3 operational readiness—while guiding a small squad of engineers (app dev, SRE, and infra) to deliver measurable outcomes. The ideal candidate has deep experience across container orchestration (OpenShift), networking and web runtimes (NGINX, Gunicorn, Uvicorn), data stores (PostgreSQL, Cassandra, MongoDB), caching (Redis), observability (Grafana/Prometheus), Linux administration, and Python for automation. This job is responsible for providing leadership, technical direction and oversight to a team delivering technology solutions. Key responsibilities of the job are to provide oversight of the design, implementation, and maintenance of complex computer programs, align technical solutions to business objectives, and ensure that coding practices/quality comply with software development standards. Job expectations include conducting multiple software implementations and applying both depth and breadth in knowledge of several technical competencies.

Requirements

  • 7+ years in platform/SRE/devops or back-end engineering with 4+ years leading features or workstreams
  • Hands-on with OpenShift (projects, operators, routes/ingress, RBAC, quotas, CI/CD integration)
  • Strong Linux administration (networking, systemd, kernel tuning basics, filesystem/permissions, security hardening)
  • Runtime stack expertise: NGINX, Gunicorn, Uvicorn (tuning workers/threads, async I/O, TLS, caching, observability)
  • Data layer familiarity: PostgreSQL, Cassandra, MongoDB (health, scaling, connectivity, basic performance tuning)
  • Redis operations (persistence strategies, clustering, eviction, sizing)
  • Grafana/Prometheus: metrics modeling, alert rules, dashboards, SLO/SLA reporting
  • Python for automation/tooling (APIs/SDKs, CLI tools, scripts) and Git-based workflows
  • Experience running workloads in regulated or enterprise environments (change management, CRQ, evidence, runbooks)

Nice To Haves

  • Bachelors Degree in Technology or related field
  • Service mesh (Istio/Linkerd), API gateways, or ingress controllers beyond NGINX
  • Secrets management (Vault, External Secrets Operator), PKI/TLS automation
  • CI/CD (GitHub Actions, Jenkins, Argo CD/Tekton) and policy-as-code (OPA/Gatekeeper, Kyverno)
  • Chaos engineering, load testing (k6/Locust), and capacity planning
  • Familiarity with Python frameworks (FastAPI/Flask) for building internal platform services
  • Prior experience mentoring L1/L2 teams and formal incident command

Responsibilities

  • Own a platform feature area (e.g., deployment pipelines, runtime platform, or observability stack) from roadmap to release, ensuring clear scope, OKRs, and success metrics
  • Lead a cross-functional squad (platform/SRE/app dev) through discovery, design reviews, technical spikes, delivery, and production readiness
  • Partner with stakeholders (security, networking, DBAs, app teams) to align on requirements, SLAs/SLOs, and compliance needs
  • Design and operate OpenShift-based runtimes: cluster architecture, multi-tenant namespaces, quotas, RBAC, ingress, and service mesh patterns as applicable
  • Engineer performant north–south paths using NGINX and application servers (Gunicorn, Uvicorn)—optimize connection handling, TLS, gzip/brotli, caching, and timeouts
  • Build golden paths and reusable templates/Helm/Kustomize bundles for app onboarding and day‑2 ops
  • Partner with database (DB) platform teams to standardize health checks, backup/restore runbooks, connection pooling, and tuning for PostgreSQL, Cassandra, and MongoDB
  • Design and operationalize Redis usage patterns (eviction policy, persistence, clustering, and sizing guidance) for low-latency services.
  • Implement Grafana/Prometheus dashboards, alerting, and SLOs; drive error budget processes and incident retrospectives.
  • Establish runbooks, diagnostics scripts, and automated remediation for common failure modes
  • Build Python tooling and operators/automation to reduce toil (cluster maintenance, config drift, TLS rotation, secrets management, active checks).
  • Enforce secure-by-default baselines, including image policies, RBAC least privilege, and compliance evidence capture.
  • Serve as Level-Three (L3) escalation for complex incidents; mentor L1/L2 teams through knowledge transfer, documentation, and simulations.
  • Maintain high-quality documentation (architecture decision records, runbooks, playbooks, onboarding guides).
  • Designs, develops and is accountable for feature delivery
  • Applies enterprise standards for solution design, coding and quality
  • Ensures solution meets product acceptance criteria with minimal technical debt
  • Guides the team on work breakdown and execution

Benefits

  • This role is eligible to participate in the annual discretionary plan. Employees are eligible for an annual discretionary award based on their overall individual performance results and behaviors, the performance and contributions of their line of business and/or group; and the overall success of the Company.
  • This role is currently benefits eligible.
  • We provide industry-leading benefits, access to paid time off, resources and support to our employees so they can make a genuine impact and contribute to the sustainable growth of our business and the communities we serve.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service