Senior Cloud Platform Site Reliability Engineer (Wallet Focus)

Moledao

$5-15K [Monthly]
Remote | 5-10 Yrs Exp | Bachelor | Full-time

Remote Details

Open Country: Worldwide

Language Requirements: English | Chinese

Job Description

We are hiring a Senior SRE (Wallet Operations) to ensure the stability, availability, and performance of core business infrastructure on AWS. The role covers managing global production environments, building scalable, highly available systems, advancing automation and observability platforms, and maintaining security and compliance standards.

Remote work, with optional bases in Singapore, Malaysia, or Abu Dhabi

Job Purpose

  • Responsible for deployment-related tasks
  • Ensure reliable and efficient system operation at scale
  • Develop tools to enhance availability, performance, and incident response capabilities

Responsibilities

  1. Ensure the stability, availability, and high performance of AWS global infrastructure, and own the production environment SLAs.
  2. Design, operate, and troubleshoot cloud-native components such as Kubernetes, Envoy, Service Mesh (Istio/Linkerd), and Ingress.
  3. Improve operational efficiency through automation and platform tools (IaC, CI/CD), building observability, self-healing, and rapid recovery capabilities.
  4. Implement and maintain operational security: access controls (AWS IAM/K8s RBAC), network security policies, vulnerability management, and incident response.
  5. Develop a global operations framework covering capacity planning, monitoring and alerting (Prometheus/ELK), CI/CD (GitLab/Jenkins), disaster recovery, and automated failover.
  6. Gain a deep understanding of the business architecture, participate in designing and reviewing high availability/disaster recovery solutions, and continuously optimize costs.

Qualifications

  • Over 5 years of Linux operations/SRE/DevOps experience, including operating large-scale distributed systems
  • Proficient in core AWS services (EC2/S3/VPC/IAM/ELB/RDS, etc.), with experience in architecture, operations, and cost optimization
  • Deep understanding of Kubernetes, with experience in production operations, performance tuning, and troubleshooting of large-scale clusters
  • Familiar with Envoy, Istio/Linkerd, Nginx/Istio Ingress (L7 traffic management)
  • Strong security awareness, with knowledge of common system/network/application vulnerabilities and mitigation strategies
  • Proficient in at least one scripting/programming language (Go/Python/Shell) for automating and engineering complex operations tasks
  • Experience with observability platforms such as Prometheus and ELK, including capacity planning and performance testing

Preferred Qualifications

  • Experience managing or leading SRE/platform/tooling teams
  • Advanced hands-on experience with Prometheus, Grafana, and ELK
  • Certifications in AWS (SAA/SAP) or Kubernetes (CKA/CKS, etc.)

Dorothy Mole

HR Officer, Moledao

Posted on 25 December 2025

Moledao

<50 Employees

DAOs
