EngrewLabs is an AI-native technology company focused on building intelligent automation, scalable cloud infrastructure, and next-generation AI solutions. We help startups and enterprises deploy reliable, high-performance systems that power modern applications, machine learning platforms, and AI products.
Our team values ownership, automation, and operational excellence. We believe the best engineers build systems that are scalable, observable, and resilient while enabling developers to move fast.
What You’ll Do
- Design, implement, and maintain cloud infrastructure across AWS, GCP, or Azure.
- Build and improve CI/CD pipelines to support rapid and reliable deployments.
- Manage Kubernetes clusters and containerized applications in production environments.
- Develop Infrastructure as Code (IaC) solutions using tools such as Terraform.
- Monitor system performance, availability, and reliability across multiple services.
- Establish observability practices using logging, metrics, and distributed tracing.
- Respond to incidents, perform root cause analysis, and implement preventive measures.
- Improve system security, scalability, and disaster recovery capabilities.
- Collaborate closely with software engineers to optimize deployment workflows and platform reliability.
- Automate repetitive operational tasks and continuously improve engineering productivity.
Requirements
- Strong experience with Linux system administration.
- Experience managing cloud infrastructure on AWS, GCP, or Azure.
- Proficiency with Docker and Kubernetes.
- Experience building and maintaining CI/CD pipelines.
- Knowledge of Infrastructure as Code tools such as Terraform or Pulumi.
- Experience with monitoring and observability platforms.