About the Job
As a Site Reliability Engineer at Aspire, you will play a crucial role in ensuring the reliability and scalability of our infrastructure. You will collaborate with development and operations teams to design and implement robust solutions, with a focus on automation to streamline deployment, monitoring, and incident response processes. Your responsibilities will include performing system capacity planning, troubleshooting infrastructure issues, and actively participating in an on-call rotation to address incidents promptly.
What you’ll do
Collaborate with development and operations teams to design and implement highly reliable and scalable infrastructure.
Implement automation solutions to streamline deployment, monitoring, and incident response processes.
Perform system capacity planning and performance analysis to ensure optimal system performance and scalability.
Troubleshoot and resolve infrastructure issues, including performance bottlenecks, network problems, and configuration errors.
Participate in the on-call rotation and respond to incidents in a timely manner, working towards proactive incident prevention.
Conduct post-incident reviews and contribute to the continuous improvement of system reliability.
What you’ll need
Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
Proven experience as a Site Reliability Engineer or in a similar role.
Strong programming and scripting skills (e.g., Python, Go, Shell).
Deep understanding of Linux/Unix systems and experience with system administration tasks.
Experience with containerization technologies such as Docker and container orchestration tools like Kubernetes.
Proficiency in configuration management tools (e.g., Ansible, Puppet, Chef).
Knowledge of cloud platforms such as AWS.
Solid understanding of networking concepts and protocols.
Familiarity with monitoring and logging tools (e.g., Grafana).
Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.
Excellent communication and teamwork skills.
Why Aspire
In addition to competitive long-term total compensation, including salary and a performance-based bonus, we have a reward philosophy that extends beyond this.
Be part of a "remote is here to stay" organization.
Work and learn from great minds.
Explore new opportunities to learn and grow every day by attending technical and non-technical training.
Get market exposure by working with international tech leaders.
Nursery reimbursement benefit.
Aspire Wellness Program.
Attend virtual and onsite international tech conferences.