JOB SUMMARY: We are looking for a DevOps Engineer to support our on-premise and hybrid deployments for AI-powered industrial applications. You will manage Kubernetes infrastructure, support system monitoring, and help build secure, scalable, and observable systems using tools like Fluentbit, Loki, and Grafana.
RESPONSIBILITIES:
- Deploy, configure, and maintain Kubernetes-based infrastructure for AI, data, and analytics systems.
- Set up and monitor logs, metrics, and alerts using Fluentbit, Loki, Grafana, and Elasticsearch.
- Automate system deployments and integrate CI/CD pipelines for smooth software delivery.
- Collaborate with AI and data teams to optimize compute environments for LLMs and ML pipelines.
- Implement RBAC, security protocols, and resource allocation policies.
- Troubleshoot infrastructure issues and perform system maintenance and performance tuning.