
Operations (GCP, Python, Bash) Engineer в Capgemini Engineering, віддалено
- Украина
- Постоянная работа
- Полная занятость
/ /2 вересня 2025Operations (GCP, Python, Bash) EngineerвіддаленоWe are seeking a skilled Operations Engineer to manage the day-to-day operations of cloud infrastructure and applications hosted on Google Cloud Platform (GCP). This role is responsible for system monitoring, incident response, performance optimization, and routine maintenance across production environments. The ideal candidate will have hands-on experience in cloud operations, system administration, and automation, with a strong focus on reliability, security, and operational excellence in 24/7 enterprise environments.Your Role
- Minimum 3+ years in cloud operations, system administration, or infrastructure management.
- Preferred 5+ years in enterprise cloud environments with 24/7 operational responsibilities.
- Operations & Administration
Cloud infrastructure monitoring and management.
Incident response and troubleshooting.
Backup and disaster recovery operations.
Performance tuning and resource optimization.
Security patching and vulnerability management.
Automation & Scripting * Scripting languages: Python, Bash, PowerShell.Configuration management tools (e.g., Ansible, Puppet, Chef).
Monitoring and alerting configuration (e.g., Prometheus, Cloud Monitoring).
Automated deployment and rollback procedures.
Log analysis and event correlation.
Basic Infrastructure as Code (IaC) concepts.
GCP Expertise (Required) * Compute: GCE instance management, auto-scaling, Cloud Functions.
- Storage: Cloud Storage, disk provisioning, backup operations.
- Networking: VPC configuration, firewall rules, load balancer setup.
- Monitoring: Cloud Monitoring, alerting, dashboard creation.
- Security: IAM management, access reviews, patching.
- Day-to-Day Operations
- Clear and professional communication during incidents and status updates.
- Strong time management and task prioritization skills.
- Ability to document operational procedures and share knowledge.
- Willingness to learn new technologies and adapt to evolving environments.
- Dependable and responsive for on-call and operational duties.
- Monitor and maintain GCP infrastructure and services to ensure high availability and performance.
- Respond to incidents, troubleshoot system issues, and perform root cause analysis.
- Manage backups, disaster recovery procedures, and patching schedules.
- Optimize system performance and resource utilization.
- Implement and maintain monitoring, alerting, and logging systems.
- Automate routine operational tasks using scripting and configuration tools.
- Maintain operational documentation, runbooks, and standard operating procedures.
- Collaborate with development and platform teams to support deployments and resolve issues.
- Participate in on-call rotations and ensure timely incident communication.
Dou