About the Opportunity
Our client is a well-known professional soccer league, and they are actively seeking a Site Reliability Engineer to join their team and support the TechOps Department in New York City. The SRE will optimize platform operations, promote automation, and refine workflows to minimize manual tasks, ensuring efficiency during both gameday and non-gameday activities while driving continuous innovation. This is a wonderful contract opportunity for anyone who is passionate about sports and eager to learn.
The base pay for this position is $38 to $50 per hour. Actual compensation offered to the successful candidate may vary from posted hiring range based upon geographic location, work experience, education, and/or skill level, among other things. Details about eligibility for bonus compensation (if applicable) will be finalized at the time of offer.
Job Responsibilities
- Develop and implement observability frameworks to monitor the health and performance of services, ensuring uptime and reliability
- Be the first line of defense in troubleshooting and resolving incidents without relying on runbooks, using strong problem-solving skills
- Perform thorough API testing for published content using tools like Postman and Cypress to ensure accuracy and performance
- Utilize Terraform for managing infrastructure, including ServiceNow integrations, and automate workflows
- Leverage Datadog or equivalent tools to set up monitoring, logging, and alerting systems
- Work closely with cross-functional teams to ensure seamless integration and deployment of services
- Manage and optimize AWS resources, including EKS and ECS, to ensure scalability and cost-efficiency
- Use GitLab pipelines for continuous integration and deployment, ensuring smooth and automated delivery of code changes
- Integrate tools like ServiceNow with Slack or Asana to streamline workflows and enhance team communication
Job Requirements
- A bachelor’s degree in a relevant field, such as Computer Science, Information Technology, or a related field.
- 3+ years of experience, with 2+ in Cloud Expertise and Technical Operations.
- Proven background in managing cloud solutions (AWS, Azure, Google Cloud).
- Hands-on experience in complex technology operations environments, including infrastructure, network, security, and incident management.
- Proficiency in implementing automation tools and a proven ability to drive automation excellence.
- Cypress or Postman (API Testing)
- Terraform (infrastructure management)
- ServiceNow (incident management)
- DataDog (Monitoring, logging & alerts)
Nice-To-Haves
- Advanced degrees or certifications (e.g., ITIL, AWS, Azure) are highly desirable
- Familiarity with GCP and Azure is a plus
- Experience with Go, React/React Native is a plus
- ETL experience between third parties is a bonus
- Advanced degrees or certifications (e.g., ITIL, AWS, Azure) are highly desirable
- Familiarity with GCP and Azure is a plus
- Experience with Go, React/React Native is a plus
- ETL experience between third parties is a bonus