As a Site Reliability Engineer at DeviantArt, you will be responsible for ensuring the robustness, scalability, and security of the platform infrastructure that supports over 1.5 billion monthly page views. This involves balancing daily operations (troubleshooting, server maintenance, and small tasks) with architecting, developing, and completing larger infrastructure projects, often in conjunction with other teams and stakeholders.
Infrastructure Scalability and High Load Management:
Maintain and architect a scalable, highly available infrastructure on AWS through load balancing and auto-scaling, capable of handling over 1.5 billion page views monthly with optimal performance
Ensure high availability of site and critical infrastructure, addressing downtime and degradation issues quickly to restore critical systems and services
Maintain a developer environment in parity with production systems, to ensure changes can be appropriately tested before release
Develop and maintain CI/CD pipelines using Terraform and Kubernetes, enhancing deployment strategies for high efficiency and zero downtime
Utilize configuration management tools to automate and streamline infrastructure provisioning and management, including writing tests and documentation
Database Performance and Scalability:
Optimize, maintain, and scale sharded MySQL databases to ensure fast, efficient, and reliable data access and storage amidst increasing data ingest
Troubleshoot slow queries and bottlenecks on MySQL servers to quickly mitigate production issues
Security and DDoS Mitigation:
Develop and enforce stringent security protocols to protect infrastructure from threats, with a particular focus on DDoS attack mitigation
Upgrade AWS components, servers, containers, and packages regularly to proactively and retroactively address any security issues
Continuously monitor, analyze, and improve internal security measures to ensure the highest level of protection against evolving threats
Cost Optimization and Resource Management:
Monitor and optimize cloud resource utilization to ensure cost-effective operation and efficient use of computing resources, aiming for low infrastructure costs without compromising performance
Qualifications:
Minimum of 4 years of experience working with systems at scale in a DevOps, Platform Engineer, or Site Reliability Engineer role.
Excellent analytical skills with the ability to troubleshoot complex problems, analyze system bottlenecks, and implement effective solutions, from the frontend site through to backend systems, sometimes during production degradation or outage.
Exceptional command-line Linux skills.
In-depth knowledge of AWS services, infrastructure as code using Terraform, and container orchestration with Kubernetes.
Proficiency in Python and Bash for scripting and automation.
Experience with sharded MySQL databases.
Proven track record in security compliance standards on large-scale web infrastructures and in-depth knowledge of DDoS mitigation strategies.
A proactive mindset in identifying potential issues and taking pre-emptive actions to prevent downtime or performance degradation.
Excellent communication skills, an open mind, and the ability to collaborate effectively with cross-functional teams and articulate technical concepts to non-technical stakeholders.
Bonus points if you have experience building containers with Docker.
Availability for on-call (including weekends) is a must.
About the team:
Founded in 2000 and a part of Wix since 2017, DeviantArt is the largest online social network for artists and art enthusiasts. For emerging and established artists, DeviantArt is the foremost platform to exhibit, promote, and share works with an enthusiastic, art-centric community. We have over 86 million registered users worldwide, and our users - lovingly referred to as “deviants” - upload tens of thousands of original pieces of art every day, from painting and sculpture to digital art, pixel art, films, and anime.
About Wix:
Wix makes it possible for anyone to succeed online.
Since 2006, we've grown to 5,000 employees in 17 countries, launched over 40 products, and serve over 230 million users and their visitors worldwide.
At Wix, we push you to innovate, evolve in non-traditional ways, and grow outside your comfort zone. We operate in small teams that work closely together to create incredible things.
We're proud to be an equal opportunity employer. Wix was built around the idea that everyone has the right to be successful, online. This same vision defines us as an employer: creating a work environment where everyone is welcome, and anyone has the right to succeed.