The main focus of this group of techs is managing deployments consisting of hundreds of Linux servers: troubleshooting outages, conducting RCAs, identifying permanent resolutions, and working with System Architects on deployment automation and product development.
Operations ownership of projects using Linux and Linux application stacks (LAMP, Ruby, Postgres, Java, Python, etc.)
Responsible for setup and uptime of large Linux deployments (100s of servers)
Configure and optimize mail servers, web servers, cache servers, database servers, VPS and cloud servers, etc.
Incident analysis/RCAs/troubleshooting and identification of permanent resolutions
Liaise with customer support teams and resolve customer issues.
Recommend/implement automated processes for scaling operations
Manage configuration with Puppet
Plan & automate migrations.
Plan and execute service/site maintenance schedules.