About the Opportunity
Our client, a global fintech innovator, is seeking a Lead DevOps Engineer to guide architecture, operations, and team leadership in their New York City engineering hub. This role combines hands-on DevOps and SRE engineering, AWS cloud infrastructure ownership, operational strategy, and direct management of a growing team. The Lead DevOps Engineer will drive secure, cost-efficient infrastructure operations and modern DevOps practices, and be a key voice across platform and product engineering initiatives.
The annual base salary range is $180,000 to $200,000. Actual compensation offered to the successful candidate may vary from posted hiring range based upon geographic location, work experience, education, and/or skill level, among other things. Details about eligibility for bonus compensation (if applicable) will be finalized at the time of offer.
Job Responsibilities
- Lead, mentor, and develop DevOps engineers while fostering a culture of reliability, security, and continuous improvement
- Architect and operate AWS infrastructure using Terraform, CloudFormation, or CDK; ensure scalability, resilience, and cost efficiency
- Design/manage VPCs, subnets, routing, hybrid connectivity, load balancers, API gateways, and DNS
- Build/operate CI/CD pipelines (GitHub, GitHub Actions) enabling automated deployments across teams
- Deploy and manage Kubernetes/EKS clusters and Docker-based environments
- Implement cloud and network security controls: IAM, encryption, secrets, compliance guardrails
- Drive cloud monitoring, metrics, dashboards, and alerting strategy (Datadog) for distributed systems
- Guide incident response, root cause analysis, production troubleshooting, and long-term reliability improvements
- Build automation and developer tooling to enable rapid, reliable deployments
- Manage AWS cost monitoring, resource optimization, cloud architecture efficiency, and forecasting
- Create and maintain technical documentation and operational runbooks
- Oversee DR planning, backup/recovery, and cross-team exercises for operational readiness
- Promote AI-assisted development, SRE/DevOps best practices, and influence cloud engineering decisions at scale
Job Requirements
- Senior-level experience in DevOps/SRE with team leadership/mentoring responsibility
- In-depth knowledge of AWS core services, secure infrastructure, and production environment design
- Strong background in networking, TCP/IP, routing, DNS, VPNs, and firewalls
- Hands-on with cloud security: IAM, secrets, encryption, vulnerability management, and incident response
- Production experience with EKS/Kubernetes and Docker lifecycle management
- Advanced with Datadog for logging, dashboards, alerting, and distributed metrics
- Strong experience with GitHub and automated CI/CD
- Scripting or programming skills in Python, Go, or Bash
- Infrastructure as code (Terraform strongly preferred)
- Familiarity with AI-assisted development and automation workflows
- Ability to lead complex cloud and infrastructure initiatives and shape team standards
Preferred Skills
- Experience with service mesh (Istio, Linkerd), zero-trust networking, or chaos engineering
- SRE background with SLOs, error budgets, or capacity modeling
- Exposure to distributed systems or high-throughput data/event platforms
- AWS professional-level or security certifications
- Experience driving adoption of AI-enhanced engineering tools




