Senior Technology Engineer (Operations)

CGI•Deerfield Beach, FL

1d•$89,600 - $176,300•Hybrid

About The Position

We're looking for an engineer who thrives at the intersection of platform reliability, modern observability, and intelligent operations, someone who brings a developer mindset to infrastructure and is excited to push systems forward, not just maintain them. In this role, you'll take ownership of critical monitoring and management platforms, including SolarWinds and Azure Sentinel, ensuring they are resilient, scalable, and continuously evolving. You won't just support systems. You’ll optimize signal quality, reduce noise, and enable faster, smarter incident response across the organization. Own and enhance key monitoring platforms, driving reliability, supportability, and continuous improvement. In this role, you will provide hands-on systems administration across both Linux and Windows environments to ensure stable, secure, and high-performing platforms. You will collaborate closely with operations, monitoring, and security teams to improve platform reliability, streamline workflows, and enhance overall system health. You will support and optimize a hybrid infrastructure with a strong emphasis on Microsoft Azure monitoring and logging capabilities. Responsibilities include leading and contributing to platform lifecycle activities such as patching, upgrades, onboarding of new services, and maintaining clear, comprehensive documentation. To drive continuous improvement, you will explore and implement modern capabilities including advanced analytics, AI-driven enhancements, and improvements across event management, observability, and SIEM tooling to strengthen operational visibility and incident response across the organization. This position is located in Fort Lauderdale, FL in a hybrid environment.

Requirements

Minimum 4-5 years of experience in infrastructure operations, monitoring, observability, or platform operations roles, supporting enterprise environments
Hands on experience with systems administration for Linux and Windows servers, including troubleshooting, configuration, and deployment of monitoring or management agents (e.g., SolarWinds, Datadog, Dynatrace).
Foundational networking knowledge, including concepts such as SNMP, network monitoring, LAN/WAN fundamentals, firewalls, and telemetry collection, sufficient to support network centric monitoring platforms like SolarWinds
Experience with observability or monitoring platforms, such as SolarWinds, Datadog, Dynatrace, or similar tools, with an understanding of alerting, dashboards, and signal quality.
Exposure to cloud environments, preferably Microsoft Azure, including familiarity with monitoring and logging concepts (e.g., cloud based telemetry, logs, metrics, and integrations).
Basic understanding of incident and event management practices, including alert triage, escalation, and collaboration with incident response or operations teams.
Demonstrated willingness and ability to learn new technologies quickly, with examples of picking up new platforms, tools, or domains outside of prior core expertise.
Familiarity with Agile or SAFe ways of working, including collaboration in sprint based delivery models, and cross functional team engagement is a plus.
Strong communication and collaboration skills, with the ability to work effectively with platform owners, operations teams, security teams, and external stakeholders.
Experience working in a modern Dev workflow using GitHub (branches, pull requests, code reviews, and CI/CD) to manage and deploy scripts/automation used for platform operations
Working proficiency in scripting languages such as PowerShell, Python, BASH, or similar scripting languages.

Nice To Haves

Not a must but nice to have experience with platform like StruxureWare.
Knowledge with Azure, Azure Active Directory (AD), and hybrid cloud environments is a plus.
Exposure to SIEM concepts or platforms such as Azure Sentinel, CRIBL, or similar is a plus.
Experience with change management practices in an enterprise IT environment is beneficial.

Responsibilities

Platform Ownership – Network & Monitoring Tools (Must Have)
Hands-on experience with SolarWinds, including Net Path
Ownership of platform stability, upgrades, patching, and daily support
Understanding of network‑centric monitoring such as SNMP polling, traps, and device visibility
Ensure new sites and devices are correctly onboarded
Partner with platform and cloud teams to validate monitoring readiness for migrated workloads
Systems Administration (Must Have)
Sysadmin support for Linux and Windows servers, including:
Agent deployment/upgrades (SolarWinds, Datadog, Dynatrace)
OS-level troubleshooting and configuration
Monitoring and logging enablement
Support hybrid environments across on‑prem and Azure
Developer mindset with experience in GitHub, PowerShell, and modern Dev workflows
Observability & Event Management (Should Have)
Experience with observability tools such as Datadog and Dynatrace
Collaborate with platform owners to support integrations, data quality, and alert hygiene
Support event management workflows to ensure alerts are actionable and routed correctly
Participate in efforts to reduce noise and prevent repeat incidents
SIEM & Security Visibility (Nice to Have)
Working knowledge of SIEM concepts and platforms such as Azure Sentinel and Cribl
Support log ingestion, troubleshooting, and coordination with security/IR teams
Ensure infrastructure and network telemetry meets security detection requirements
Cloud Monitoring & Azure Integration (Should Have)
Experience with Azure monitoring and logging, including:
Azure Monitor
Log Analytics
Understanding of observability patterns for Azure workloads
Automation, AI & Continuous Improvement (Nice to Have)
Explore and apply AI‑assisted monitoring and incident management capabilities
Improve signal quality and reduce alert fatigue
Support faster incident triage
Contribute to documentation, runbooks, and incremental operational improvements
Knowledge Transfer & Operational Resilience
Participate in knowledge transfer for platform transitions and tool retirements
Maintain documentation and support on‑call or escalation rotations as needed