Roles & Responsibilities
Job Description & Requirements
We are seeking an experienced DevOps Engineer to join our dynamic team in Singapore. The ideal candidate will be responsible for designing, implementing, and maintaining our cloud infrastructure, CI / CD pipelines, and monitoring systems to ensure high availability, scalability, and security of our services.
Required Technical Skills
Core Programming & Frameworks :
- Proficiency in Python with hands-on experience in FastAPI or Flask frameworks
- Strong understanding of RESTful API development and microservices architecture
Containerization & Orchestration :
Extensive experience with Docker and Docker ComposeContainer lifecycle management and optimizationGoogle Cloud Platform (GCP) Infrastructure :
Google Cloud Storage (GCS) automated backup solutionsGoogle Container Registry for private Docker repositoriesGoogle Artifact Registry for private PyPI package managementCloud IAM for access control and permission managementVPC Firewall Rules configuration and network securityCloud Load Balancing and Cloud CDN setupMySQL / PostgreSQL Database Administration :
Database performance monitoring and query optimizationDisk space management and storage optimizationAutomated backup strategies and data archival / purging policiesDatabase maintenance and cleanup proceduresLogging & Observability :
Implementation of centralized logging solutions (ELK Stack, Cloud Logging)Log aggregation, parsing, and visualization using tools like Grafana, Kibana, or Cloud MonitoringStructured logging best practicesMonitoring & Alerting :
Performance monitoring setup using Prometheus, Grafana, or Cloud MonitoringApplication Performance Monitoring (APM) tools integrationAlert configuration and incident response automationSLA / SLO monitoring and reportingMessage Queue Management :
Apache Kafka cluster setup, configuration, and maintenanceTopic management, partition optimization, and consumer group monitoringKafka Connect and Schema Registry managementCI / CD Pipeline :
Design and implementation of automated deployment pipelinesGitHub CI / CD integration and pipeline optimizationAutomated testing integration and deployment strategiesBlue-green and canary deployment patternsNetwork & Security :
Cloudflare Tunnel (cloudflared) configuration and managementSSH tunneling and secure remote access solutionsNginx web server configuration and optimizationsystemd service management and daemon configurationNetwork troubleshooting and performance optimizationPreferred Additional Skills
Container Orchestration :
Kubernetes (K8s) deployment and cluster management experienceHelm charts creation and managementPod autoscaling and resource optimizationTask Management & Automation :
Python Celery for distributed task queue managementCron job centralization and monitoringWorkflow orchestration tools experienceData Analysis & Documentation :
Jupyter Notebook and pandas for data analysis and reportingData pipeline automation and ETL processesKnowledge Management :
Experience creating and maintaining internal wikis or runbooksTechnical documentation and knowledge sharing platformsGitBook, Confluence, or similar documentation toolsAdditional Technical Areas :
Redis caching solutionsCost optimization and resource management in cloud environmentsQualifications
Bachelor's degree in Computer Science, Engineering, or related field3+ years of experience in DevOps, Site Reliability Engineering, or related rolesStrong problem-solving skills and ability to work in a fast-paced environmentExcellent communication skillsExperience working in Agile / Scrum development environmentsTell employers what skills you have
Confluence
Scalability
Kubernetes
Data Analysis
Pipelines
High Availability
Google Cloud Platform
Knowledge Management
Nginx
Logging
Python
Containerization
Google Cloud
Docker
Orchestration
API