Application Production Support Engineer in Los Angeles, California | DiversityInc Careers
 
This job has expired and you can't apply for it anymore. Start a new search.

Application Production Support Engineer

DIRECTV is looking for a Site Reliability Engineer to join our Operations team. The Video Operations group is responsible for supporting on-air systems that power the DIRECTV platform, including streaming on mobile devices, VOD and satellite TV; all systems that are fast, fault-tolerant and scalable.  The Operations team is responsible for resiliency, swift response, performance and security of DIRECTV’s production infrastructure.
  • We provide support to the Software Engineering teams and drive best practices for DIRECTV/AT&T’s products nationwide.
  • We partner with the development teams to optimize and operationalize their applications correctly
  • We ensure systems are properly monitored, deployed and supported to provide the ultimate experience for our customers
  • We realize that failure is inevitable, so we embrace it and plan for fast recovery.
As a Site Reliability Engineer, you’re curious, with deep technical knowledge. You’re a problem solver and an engineer who uses ingenuity to solve hard problems. You foster a culture of inquisitiveness, collaboration and learning and are able to empathize with others. Your adaptation and evolution are guided by your experiences. You don’t need to be the smartest person in the room, because you know that every interaction is a learning opportunity.We’re looking for an influential decision-maker who’s ready to take on a high level of ownership and responsibility; a forecaster and problem solver for all of Operations.Responsibilities
  • Define and verify standards for configuration, monitoring, reliability and performance
  • Serve as subject matter expert for multiple proprietary and open source technologies
  • Select and develop automation tools and scripts to improve the availability, manageability, scalability and operability of services
  • Provide expert perspective regarding the capabilities and limits of the multi-datacenter production infrastructure in software architecture designs
  • Solve performance and stability issues and prevent their recurrence

Requirements

  • Advanced knowledge of Unix/Linux systems.
  • Ability to write code.  (Java, C++, Python, etc.)
  • Interest or experience in cloud technologies.
  • Skilled in use of automation for job efficiency.
  • A knack for troubleshooting tough problems. Your high level of ownership and curiosity empower this skill.
  • Ability to learn rapidly and communicate value of new technologies to technical and non-technical audiences
  • Meticulous and careful. You identify and consider all risks, and balance those with performing the task efficiently.
  • Thrive in a highly collaborative environment including strong communication skills.