Remote opportunity! Candidates must live in one of the following states to be considered: PA, NJ, NH, DE, CO, SD, TX, WY, AK, FL.
The Senior DevOps Engineer is an expert in the full technology stack, with a deep understanding of infrastructure, networking, databases, applications, and security. They collaborate with the development and operations teams to configure servers and containers, configure and maintain Azure and AWS cloud environments, implement infrastructure as code (Terraform), automate build and release pipelines (Azure DevOps), and configure applications for each environment. They quickly master new technologies and continually evolve their skill set to keep pace with industry standards.
Key Responsibilities:
Implement Infrastructure as Code (IaC) for new and existing services in Azure and AWS
Create and maintain build and release YAML pipelines in Azure DevOps
Apply excellent technical knowledge of IT infrastructure, including network and application load balancers, switches, routers, and IP addressing
Administer, monitor, and deploy systems and services on Azure and AWS
Maintain live services by measuring and monitoring availability, latency, and overall system health to minimize downtime
Collaborate with operations and development teams to inform design decisions that improve maintenance, automation, scaling, performance, cost, and availability
Collaborate with development teams to achieve continuous integration and continuous delivery
Implement monitoring and alerting in New Relic and Site24x7 to proactively mitigate production outages and ensure adherence to SLA requirements
Collaborate with development and operations teams to create, maintain, troubleshoot, and optimize infrastructure and pipelines
Identify and drive opportunities to automate new and existing processes
Attend agile ceremonies to ensure each team's vision aligns with DevOps principles
Create and update documents and diagrams for technical designs, decisions, configurations, and standards
Design, deploy, and maintain scalable Selenium server clusters in Kubernetes to support automated testing, ensuring high availability and optimal performance
Reasonable accommodations may be made to enable individuals with disabilities to perform these essential functions.
Key Requirements:
Expert in creating and maintaining CI/CD pipelines
Expert in Windows/Linux administration tasks and performance tuning
Expert in Azure and AWS Cloud and/or other cloud providers
Expert in Docker/Kubernetes clusters or similar containerization platforms
Expert in load balancing and scaling applications
Excellent written and verbal communication skills
Deep understanding of n-tier applications, microservices, and service-oriented architecture
Knowledge of relational database technologies (MS SQL, MySQL, and PostgreSQL)
Knowledge of networking protocols (DNS, HTTP), load balancing, and web servers
Knowledge of Azure DevOps pipelines
Knowledge of GitHub, Jenkins, Bamboo, TeamCity, GitLab, or Octopus administration is a plus
Proficiency in modern agile SDLC practices and principles, and in scripting, orchestration, and CI/CD tooling (e.g., Python, Java, PowerShell, Bash, Terraform, Ansible, CloudFormation, Puppet, Chef, or similar)
Monitoring expertise (any of New Relic, Site24x7, Datadog, Nagios, Prometheus, Zabbix, Honeycomb, or similar)
Performance tuning of applications and identifying possible bottlenecks/weaknesses in current designs
Proven hands-on experience delivering business impact by combining software engineering and systems engineering skills to solve complex automation and reliability challenges
Ability to quickly receive and process information and make appropriate risk-based decisions
Education and Experience:
Bachelor’s degree or relevant work experience
10+ years of cloud and/or systems administration experience
Enterprise experience implementing IaC
Enterprise experience creating and maintaining CI/CD pipelines