Senior Technology Engineer (Operations)

CGIDeerfield Beach, FL
1d$89,600 - $176,300Hybrid

About The Position

We're looking for an engineer who thrives at the intersection of platform reliability, modern observability, and intelligent operations, someone who brings a developer mindset to infrastructure and is excited to push systems forward, not just maintain them. In this role, you'll take ownership of critical monitoring and management platforms, including SolarWinds and Azure Sentinel, ensuring they are resilient, scalable, and continuously evolving. You won't just support systems. You’ll optimize signal quality, reduce noise, and enable faster, smarter incident response across the organization. Own and enhance key monitoring platforms, driving reliability, supportability, and continuous improvement. In this role, you will provide hands-on systems administration across both Linux and Windows environments to ensure stable, secure, and high-performing platforms. You will collaborate closely with operations, monitoring, and security teams to improve platform reliability, streamline workflows, and enhance overall system health. You will support and optimize a hybrid infrastructure with a strong emphasis on Microsoft Azure monitoring and logging capabilities. Responsibilities include leading and contributing to platform lifecycle activities such as patching, upgrades, onboarding of new services, and maintaining clear, comprehensive documentation. To drive continuous improvement, you will explore and implement modern capabilities including advanced analytics, AI-driven enhancements, and improvements across event management, observability, and SIEM tooling to strengthen operational visibility and incident response across the organization. This position is located in Fort Lauderdale, FL in a hybrid environment.

Requirements

  • Minimum 4-5 years of experience in infrastructure operations, monitoring, observability, or platform operations roles, supporting enterprise environments
  • Hands on experience with systems administration for Linux and Windows servers, including troubleshooting, configuration, and deployment of monitoring or management agents (e.g., SolarWinds, Datadog, Dynatrace).
  • Foundational networking knowledge, including concepts such as SNMP, network monitoring, LAN/WAN fundamentals, firewalls, and telemetry collection, sufficient to support network centric monitoring platforms like SolarWinds
  • Experience with observability or monitoring platforms, such as SolarWinds, Datadog, Dynatrace, or similar tools, with an understanding of alerting, dashboards, and signal quality.
  • Exposure to cloud environments, preferably Microsoft Azure, including familiarity with monitoring and logging concepts (e.g., cloud based telemetry, logs, metrics, and integrations).
  • Basic understanding of incident and event management practices, including alert triage, escalation, and collaboration with incident response or operations teams.
  • Demonstrated willingness and ability to learn new technologies quickly, with examples of picking up new platforms, tools, or domains outside of prior core expertise.
  • Familiarity with Agile or SAFe ways of working, including collaboration in sprint based delivery models, and cross functional team engagement is a plus.
  • Strong communication and collaboration skills, with the ability to work effectively with platform owners, operations teams, security teams, and external stakeholders.
  • Experience working in a modern Dev workflow using GitHub (branches, pull requests, code reviews, and CI/CD) to manage and deploy scripts/automation used for platform operations
  • Working proficiency in scripting languages such as PowerShell, Python, BASH, or similar scripting languages.

Nice To Haves

  • Not a must but nice to have experience with platform like StruxureWare.
  • Knowledge with Azure, Azure Active Directory (AD), and hybrid cloud environments is a plus.
  • Exposure to SIEM concepts or platforms such as Azure Sentinel, CRIBL, or similar is a plus.
  • Experience with change management practices in an enterprise IT environment is beneficial.

Responsibilities

  • Platform Ownership – Network & Monitoring Tools (Must Have)
  • Hands-on experience with SolarWinds, including Net Path
  • Ownership of platform stability, upgrades, patching, and daily support
  • Understanding of network‑centric monitoring such as SNMP polling, traps, and device visibility
  • Ensure new sites and devices are correctly onboarded
  • Partner with platform and cloud teams to validate monitoring readiness for migrated workloads
  • Systems Administration (Must Have)
  • Sysadmin support for Linux and Windows servers, including:
  • Agent deployment/upgrades (SolarWinds, Datadog, Dynatrace)
  • OS-level troubleshooting and configuration
  • Monitoring and logging enablement
  • Support hybrid environments across on‑prem and Azure
  • Developer mindset with experience in GitHub, PowerShell, and modern Dev workflows
  • Observability & Event Management (Should Have)
  • Experience with observability tools such as Datadog and Dynatrace
  • Collaborate with platform owners to support integrations, data quality, and alert hygiene
  • Support event management workflows to ensure alerts are actionable and routed correctly
  • Participate in efforts to reduce noise and prevent repeat incidents
  • SIEM & Security Visibility (Nice to Have)
  • Working knowledge of SIEM concepts and platforms such as Azure Sentinel and Cribl
  • Support log ingestion, troubleshooting, and coordination with security/IR teams
  • Ensure infrastructure and network telemetry meets security detection requirements
  • Cloud Monitoring & Azure Integration (Should Have)
  • Experience with Azure monitoring and logging, including:
  • Azure Monitor
  • Log Analytics
  • Understanding of observability patterns for Azure workloads
  • Automation, AI & Continuous Improvement (Nice to Have)
  • Explore and apply AI‑assisted monitoring and incident management capabilities
  • Improve signal quality and reduce alert fatigue
  • Support faster incident triage
  • Contribute to documentation, runbooks, and incremental operational improvements
  • Knowledge Transfer & Operational Resilience
  • Participate in knowledge transfer for platform transitions and tool retirements
  • Maintain documentation and support on‑call or escalation rotations as needed

Benefits

  • Competitive compensation
  • Comprehensive insurance options
  • Matching contributions through the 401(k) plan and the share purchase plan
  • Paid time off for vacation, holidays, and sick time
  • Paid parental leave
  • Learning opportunities and tuition assistance
  • Wellness and Well-being programs
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service