The Walt Disney Company Lead Platform Engineer - Enterprise Monitoring in Seattle, Washington

The Lead Platform Engineer (Enterprise Monitoring) works as part of a team responsible for end-to-end monitoring of complex web based applications. The ideal candidate would be a performance monitoring expert, supporting operating systems, applications, and middleware monitoring. The role will also interface with Enterprise Engineering, Software Delivery and Testing teams in a Dev-Ops environment. Work closely with other Application Operations Engineering resources and teams to provide monitoring expertise in support of the WDAT application portfolio.

Responsibilities :

  • Drive a high standard of quality and consistency across our monitoring environment.

  • Responsible for creation through delivery of projects on the Monitoring Roadmap.

  • Maintain and improve the standards within the Monitoring group and to evangelize good practice.

  • Collaborates with key stakeholders to assess near- and long-term monitoring capacity needs.

  • Creates and maintains documentation, best practices, and reporting as it relates to Monitoring configuration, mapping, processes, governance, and service records.

  • Drive and manage the delivery of the monitoring systems, standards and training.

  • Conducts research on Monitoring products, services, and standards to remain abreast of developments in the various Monitoring industry.

  • Interacts and assists in negotiations with vendors, outsourcers, and contractors to secure Monitoring products and services are implemented according to business requirements and enterprise standards.

  • Monitors Systems performance and troubleshoots problem areas as needed.

  • Plan, engineer and implement robust and cost-effective monitoring environments, exploiting emerging technologies to provide compelling solutions.

  • Responsible for generating, assessing and improving alerting with a focus on proactive alerts and self-healing.

  • Responsible for generating, assessing and improving a unified alert dashboard.

  • Responsible for working with appropriate teams on creating and maintaining other dashboards.

  • Conducts research on event and visualization products, services, and standards to remain abreast of developments in the industry.

  • Participate in process development with partners and Service Providers.

Basic Qualifications :

  • Splunk Certified (Admin or Power User)

  • Hands on experience with AppDynamics, Sitescope, NewRelic or Datadog

  • Experience in designing and implementing large, global, multi-tool, multi-platform enterprise monitoring systems.

  • Expertise with application, remote, logging monitoring solutions

  • Experience with cloud and complex/multi-Datacenter environments

  • Expert in the collection of various performance metrics and producing reports. Expertise in utilizing scripting languages (Ruby, Python, Perl, etc..) to analyze data and validate problem statements

  • Expertise monitoring UNIX and Linux environments.

  • Expertise in Regular Expression some networking

  • Demonstrated strong problem solving and analytical skills

  • Excellent written, verbal and interpersonal communication skills

  • Proven team player with the ability to guide and influence cross-functional teams Proven ability to mentor Jr. Engineers

Preferred Qualifications :

  • Splunk Certified (Architect)

  • Understanding of Machine Learning or other advanced data processing

  • Hands on experience to monitoring database technologies like Oracle and MSSQL Good knowledge and understanding of SOX or an alternative Security Compliance including data privacy practices and laws.

  • Experience monitoring Windows environments.

Preferred Education :

  • BS Computer Science, Computer Engineering, Information Technology or applicable experience

Job ID: 553441BR

Location: Seattle,Washington

Job Posting Company: Disney Parks & Resorts