Company Description
TenderBoard empowers businesses by streamlining transactions between buyers and suppliers across the source-to-pay process. The platform offers real-time data, competitive analysis, and business opportunities while ensuring compliance with corporate governance standards. Trusted by prominent companies in the real estate, finance, hospitality, and technology sectors in Asia, TenderBoard is a leading solution for smarter procurement decisions.

Role Description
We are seeking a talented DevOps Engineer to support our production and staging Kubernetes infrastructure. You'll be responsible for maintaining high availability, optimizing performance, and ensuring the security and reliability of our multi-application platform that serves enterprise procurement workflows. This is an excellent opportunity to work with modern cloud-native technologies while collaborating with experienced development teams to deliver seamless deployments and robust infrastructure.

Key Responsibilities
- Maintain and optimize Kubernetes clusters for staging and production environments
- Configure and maintain monitoring stack with custom alerts and dashboards
- Manage CI/CD pipelines for automated deployments across multiple repositories
- Administer database clusters
- Implement and test backup, disaster recovery, and business continuity strategies
- Optimize resource utilization and performance to reduce infrastructure costs
- Harden servers and applications against security threats following security benchmarks
- Manage cloud infrastructure
- Configure server logging and centralized log management systems using Fluent Bit
- Scale applications dynamically based on traffic demands and usage patterns
- Implement and manage canary deployments and A/B testing infrastructure
- Collaborate with development teams to ensure smooth releases and deployments
- Update server software, dependencies, and security patches on regular schedules
- Document infrastructure configurations, procedures, and runbooks for knowledge sharing
- Troubleshoot and resolve production issues as part of on-call rotation
- Document root cause analysis and solutions for all troubleshooting activities
- Manage Helm charts for multi-service deployments
- Dockerize applications and create optimized container images for production deployments
- Configure and maintain service mesh architecture for inter-service communication in the backend
- Implement and manage VPN solutions
- Design and implement zero trust network security principles for remote access

Required Skills & Experience
- 3+ years of DevOps or Site Reliability Engineering experience
- 2+ years of production experience with Kubernetes on AWS or similar managed platforms
- Experience with AWS services and Docker containerization
- Experience with Helm chart management and monitoring tools
- Experience with CI/CD, database administration, and message broker administration
- Experience with infrastructure as code, Bash scripting for deployment and infrastructure automation, and logging
- Experience with VPC networking, security groups, load balancers, VPN, and SSL/TLS
- Good to have: AWS Solutions Architect Associate or Certified Kubernetes Administrator (CKA)
- Good to have: Background in SaaS platforms or enterprise procurement systems

What We're Looking For
- Strong troubleshooting and problem-solving abilities with a systematic debugging approach
- Excellent communication skills for cross-functional collaboration with development teams
- Self-motivated, with the ability to work independently and manage multiple priorities
- Detail-oriented mindset with a commitment to comprehensive documentation practices, including post-incident reports
- Proactive approach to identifying and resolving issues before they impact production