If you are motivated and believe in the credit union philosophy of "People Helping People," join our team! Position Overview: The Senior Site Reliability Engineer mentors more junior SREs and coaches on process improvement opportunities. This role takes responsibility for support and stability of many applications. This role develops, maintains, and build on team standards. This role regularly reviews and collaborates with other SRE for best practices in the development and maintenance of varied technology delivery pipelines. This role actively monitors and ensures application monitoring methods are consistent and optimized. Essential Responsibilities: 40% Maintain, monitor, troubleshoot and optimize systems for reliability and efficient performance on a 24/7 365 days a year model. Partner with development and other teams to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, and capacity planning. Balance feature development speed and reliability with well-defined service-level objectives. Support of Dev, QA, UAT, Production and DR for many team-supported applications. Follows ITSM processes for incident response, change management, and problem investigation. 20% Automate infrastructure and operations tasks, creating sustainable systems and services through automation and uplifts. Leverage data analytics to identify trends, predict and prevent issues, and promote data-driven decision-making. Continuously reviews processes and procedures for improvement opportunities. 10% Ensure systems adhere to relevant security protocols and regulations. 10% Ensure certificates are renewed and maintained within expiration windows. 10% Prepare disaster recovery plans. 10% Provides guidance and coaching to more junior level team members.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
Associate degree