Working at Thinking Machines
Thinking Machines is a technology consultancy building AI & data platforms to solve high-impact problems for our clients. Our vision is for Southeast Asia to become a global hub for data science. To do that, we create data cultures, one organization at a time.
We're a company made up of intellectually curious, civic-minded, forever-learning individuals. We believe that great data science products are built with care for people, and that the best way to drive inclusive innovation is to start with a diverse team.
Our field of work is incredibly dynamic, so we want to work with people who are committed to growing with us. We want to hire people who can demonstrate an ability to learn, then provide them with personalized coaching, growth opportunities, and a great working environment to help them reach a world-class level.
Role Description
As a DevOps Engineer, you will play a critical role in ensuring the robustness, scalability, and security of our enterprise clients' infrastructure. You will work under the guidance of the DevOps Lead and collaborate closely with our clients' IT and technical teams to implement and maintain infrastructure that supports massive amounts of critical data with virtually zero downtime.
You will be responsible for designing, implementing, and managing CI/CD pipelines, content delivery systems for web applications, and ensuring high availability and disaster recovery plans are in place. You will also handle the scaling of web applications, virtual networking, data pipelines, database management, and comprehensive logging and monitoring systems.
Key Responsibilities
Infrastructure Engineering
Design, deploy, and maintain cloud infrastructure on Azure, AWS, and GCP.
Navigate and manage integrations with existing client infrastructure, whether in the cloud or on-prem.
Implement and manage Infrastructure as Code pipelines to automate deployments.
Ensure infrastructure is scalable, secure, and highly available.
Oversee virtual networking and data pipelines at scale.
Manage database systems, ensuring data integrity, availability, and performance.
Site Reliability Engineering (SRE)
Develop and implement strategies for maintaining system reliability and uptime.
Automate manual processes to enhance efficiency and reliability.
Implement monitoring and logging solutions to proactively identify and resolve issues.
Conduct performance tuning, load testing, and capacity planning.
Collaborate with development teams to ensure that applications are designed for reliability and scalability.
DevOps Engineering
Develop and manage deployment procedures to ensure smooth releases.
Implement disaster recovery and business continuity plans.
Conduct load testing of applications to ensure they can handle expected traffic.
Collaboration & Communication
Work closely with the DevOps Lead and other team members to develop internal systems and best practices specialized for TM's core offerings.
Communicate technical solutions effectively to both technical and non-technical stakeholders.
Foster a culture of continuous improvement and innovation within the team.
Requirements
Proven experience as a DevOps Engineer or similar role in a cloud environment (Azure, AWS).
Expertise in CI/CD, content delivery for web apps, high availability, disaster recovery, and scaling web applications.
Strong knowledge of virtual networking, data pipelines at scale, database management at scale, and logging & monitoring.
Experience with site reliability engineering (SRE) practices.
Strong external communication skills to effectively communicate technical solutions to clients.
Quick learning agility and a strong sense of initiative.
Relevant certifications (e.g., Azure Solutions Architect, AWS Certified Solutions Architect) are a plus.
Benefits and Perks
We offer the following compensation and benefits:
Competitive salary — the compensation amount is positively correlated with the difficulty of the job, relevant experience, fit, and skill factors.
Hybrid Set-Up — Hybrid-remote means employees are required to come in an average of two days a week for client engagements and internal in-person days intended for collaboration, socials, and strategic planning.
Individual professional development budget — an annual budget for conferences, training courses, books, and software is available to sharpen your skills and build new ones to help you grow in your role.
Full health benefits — generous health insurance package upon hiring, with options to include dependents.
Apprenticeship and yearly performance reviews with the leadership team to discuss career and personal goals, job progress and any questions and concerns.