Senior Platform Systems Analyst - Enterprise ...
Darden - Orlando, FL
Apply NowJob Description
JOB OVERVIEW: The Senior Platform Systems Analyst - Enterprise Monitoring Solutions will play a crucial role in implementing and maintaining a robust monitoring solution that ensures high service availability through proactive and predictive alerting. You will oversee monitoring responsibilities across network infrastructure, server hardware, operating systems, applications, and complete business processes in a 24x7 enterprise production environment. Your primary focus will be on unifying current monitoring and automation tools such as AppDynamics, SCOM, Azure Monitor, Azure Application Insights, Azure SQL Insights, Prometheus, BMC AIOps, and xMatters into a cohesive observability platform. This will enhance visibility into business transactions across the enterprise while delivering actionable alerts and management reports on key metrics, trends, and performance indicators. ROLES AND RESPONSIBILITIES: Design, deploy, and manage comprehensive monitoring and observability solutions for infrastructure, applications, databases, and cloud services. Administer, maintain, and support enterprise monitoring tools through routine upgrades, license management, and system tuning. Manage existing monitors across platforms to ensure relevance, accuracy, and alignment with evolving system and business requirements. Review and recommend alert thresholds, KPIs, and escalation criteria for meaningful alerting. Collaborate with application, infrastructure, database, and network teams to define performance, availability, and reliability monitoring strategies. Integrate monitoring tools into a cohesive observability platform. Build and maintain dashboards, metrics visualizations, and alert rules. Support monitoring to evaluate application performance and user experience. Streamline alerting workflows for 24x7 incident response. Identify monitoring gaps, reduce noise and false positives to enhance correlation. Oversee monitoring of SSL certificates, URLs, and Key Vault secrets for proactive renewal and alerting. Analyze telemetry and diagnostic data to assist in troubleshooting and incident analysis. Provide monitoring expertise for IT projects and technology rollouts. Document tool configurations, monitoring policies, and operational procedures. Continuously assess observability coverage for visibility, scalability, and security. Deliver reports on system health, capacity trends, alert volumes, and KPIs to stakeholders. REQUIRED TECHNICAL SKILLS: Bachelor's degree in computer science, Engineering, Information Systems, or a related field, or equivalent experience. 3+ years of experience in enterprise monitoring and observability, including application performance monitoring. Proficiency with tools such as AppDynamics, Prometheus, Azure Monitor, SCOM, and others. Hands-on experience with alert routing and escalation management tools. Knowledge of event correlation platforms. Strong understanding of cloud-native monitoring principles (especially Azure). Familiarity with SNMP and network monitoring principles. Knowledge of Windows and UNIX/Linux server administration. Ability to convert telemetry data into insightful business reports and dashboards. OTHER KEY QUALIFICATIONS: Strong analytical and diagnostic skills for proactive problem prevention. Excellent communication skills for explaining complex issues to diverse audiences. Self-motivated and detail-oriented, capable of independent and team work. Effective collaboration and stakeholder engagement skills. Able to thrive in fast-paced, incident-driven environments. PREFERRED SKILLS AND EXPERIENCE: Familiarity with ITSM platforms. Experience with dashboarding tools such as Grafana. Familiarity with DevOps and Site Reliability Engineering principles. ITIL Certification. #LI-KP1 #LI-Hybrid
Created: 2026-03-13