The Walt Disney Company Lead Platform Engineer - Enterprise Monitoring in Seattle, Washington
The Lead Platform Engineer (Enterprise Monitoring) works as part of a team responsible for end-to-end monitoring of complex web based applications. The ideal candidate would be a performance monitoring expert, supporting operating systems, applications, and middleware monitoring. The role will also interface with Enterprise Engineering, Software Delivery and Testing teams in a Dev-Ops environment. Work closely with other Application Operations Engineering resources and teams to provide monitoring expertise in support of the WDAT application portfolio.
Drive a high standard of quality and consistency across our monitoring environment.
Responsible for creation through delivery of projects on the Monitoring Roadmap.
Maintain and improve the standards within the Monitoring group and to evangelize good practice.
Collaborates with key stakeholders to assess near- and long-term monitoring capacity needs.
Creates and maintains documentation, best practices, and reporting as it relates to Monitoring configuration, mapping, processes, governance, and service records.
Drive and manage the delivery of the monitoring systems, standards and training.
Conducts research on Monitoring products, services, and standards to remain abreast of developments in the various Monitoring industry.
Interacts and assists in negotiations with vendors, outsourcers, and contractors to secure Monitoring products and services are implemented according to business requirements and enterprise standards.
Monitors Systems performance and troubleshoots problem areas as needed.
Plan, engineer and implement robust and cost-effective monitoring environments, exploiting emerging technologies to provide compelling solutions.
Responsible for generating, assessing and improving alerting with a focus on proactive alerts and self-healing.
Responsible for generating, assessing and improving a unified alert dashboard.
Responsible for working with appropriate teams on creating and maintaining other dashboards.
Conducts research on event and visualization products, services, and standards to remain abreast of developments in the industry.
Participate in process development with partners and Service Providers.
Basic Qualifications :
Splunk Certified (Admin or Power User)
Hands on experience with AppDynamics, Sitescope, NewRelic or Datadog
Experience in designing and implementing large, global, multi-tool, multi-platform enterprise monitoring systems.
Expertise with application, remote, logging monitoring solutions
Experience with cloud and complex/multi-Datacenter environments
Expert in the collection of various performance metrics and producing reports. Expertise in utilizing scripting languages (Ruby, Python, Perl, etc..) to analyze data and validate problem statements
Expertise monitoring UNIX and Linux environments.
Expertise in Regular Expression some networking
Demonstrated strong problem solving and analytical skills
Excellent written, verbal and interpersonal communication skills
Proven team player with the ability to guide and influence cross-functional teams Proven ability to mentor Jr. Engineers
Preferred Qualifications :
Splunk Certified (Architect)
Understanding of Machine Learning or other advanced data processing
Hands on experience to monitoring database technologies like Oracle and MSSQL Good knowledge and understanding of SOX or an alternative Security Compliance including data privacy practices and laws.
Experience monitoring Windows environments.
Preferred Education :
- BS Computer Science, Computer Engineering, Information Technology or applicable experience
Job ID: 553441BR
Job Posting Company: Disney Parks & Resorts