Senior Software Engineer - Reliability, Global E-Commerce at TikTok
San Jose, CA 95101
About the Job
Description
TikTok is the leading destination for short-form mobile video.
At TikTok, our mission is to inspire creativity and bring joy.
TikTok's global headquarters are in Los Angeles and Singapore, and its offices include New York, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.
Why Join Us
Creation is the core of TikTok's purpose.
Our platform is built to help imaginations thrive.
This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim to achieve every day.
To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team.
Status quo? Never.
Courage? Always.
At TikTok, we create together and grow together.
That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
Global e-commerce is a content e-commerce business delivered through an international short-video product.
It is committed to becoming the first choice for users to discover and purchase good products at affordable prices.
The global e-commerce team aims to give users a more tailored and efficient shopping experience and to give merchants reliable platform services across scenarios such as live e-commerce and short-video content e-commerce, so that affordable, high-quality products sell easily and a better life is within reach.
Responsibilities:
1. Be part of the global SRE on-call rotation and be responsible for Tier-1 online incident response and DevOps support.
2. Be responsible for the service levels of a mission-critical, revenue-generating e-commerce platform as well as all supporting infrastructure and services. This role will focus on service reliability, highly scalable design, and release management in a cloud-native environment.
3. Define service level indicators and data-driven objectives, and develop DevOps/SRE standards, processes, and methodologies to uphold and improve the uptime, latency, and system health of a core global e-commerce production platform.
4. Collaborate across teams with engineering and product to ensure that key stability and maintainability requirements, such as capacity planning and launch reviews, are met to enable transparent service delivery to customers.
5. Design strategies for risk detection and mitigation, disaster recovery and simulation, release management, cost optimisation, engineering quality, etc.
6. Build automation geared towards infrastructure-as-code, scalability, and service resiliency.
7. Implement best practices around incident management and post-mortems while being part of on-call rotations.
Qualifications
Minimum Qualifications:
1. Bachelor's or higher degree in Computer Science or a similar technical field of study, or equivalent practical experience.
2. 5 years of experience developing, provisioning, or maintaining production-grade, large-scale distributed systems.
3. High level of proficiency in Linux OS internals, networking, microservices, databases, caches, etc. in cloud-native environments.
4. Demonstrable familiarity with programming or scripting languages (Go, Python, Bash, C++, etc.).
5. Demonstrable experience in the development and implementation of DevOps and SRE methodologies.
Preferred Qualifications:
1. Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
2. Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.
Our platform connects people from across the globe, and so does our workplace.
At TikTok, our mission is to inspire creativity and bring joy.
To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.
We are passionate about this and hope you are too.
TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws.
If you need assistance or a reasonable accommodation, please reach out to us at
https://shorturl.at/cdpT2
Regular
Experienced