עדיין מחפשים עבודה במנועי חיפוש? הגיע הזמן להשתדרג!
במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.
מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.
Job Description
As a Site Reliability Engineer at DeviantArt you will be responsible for ensuring the robustness, scalability, and security of the platform infrastructure that supports over 1.5 billion monthly page views. This involves balancing daily operations of troubleshooting, server maintenance, and small tasks, alongside architecting, developing, and completing larger infrastructure projects often in conjunction with other teams and stakeholders.
Infrastructure Scalability and High Load Management:
- Maintain and architect a scalable, highly available infrastructure on AWS through load balancing and auto-scaling, capable of handling over 1.5 billion page views monthly with optimal performance
- Ensure high availability of site and critical infrastructure, addressing downtime and degradation issues quickly to restore critical systems and services
- Maintain a developer environment in parity with production systems, to ensure changes can be appropriately tested before release
- Develop and maintain CI/CD pipelines using Terraform and Kubernetes, enhancing deployment strategies for high efficiency and zero downtime
- Utilize configuration management tools to automate and streamline infrastructure provisioning and management, including writing tests and documentation
- Optimize, maintain, and scale sharded MySQL databases to ensure fast, efficient, and reliable data access and storage amidst increasing data ingest
- Troubleshoot slow queries and bottlenecks on MySQL servers to quickly mitigate production issues
- Develop and enforce stringent security protocols to protect infrastructure from threats, with a particular focus on DDOS attack mitigation
- Upgrade AWS components, servers, containers, and packages regularly to proactively and retroactively address any security issues
- Continuously monitor, analyze, and improve internal security measures to ensure the highest level of protection against evolving threats
- Monitor and optimize cloud resource utilization to ensure cost-effective operation and efficient use of computing, aiming for low infrastructure costs without compromising performance
- Minimum of 4 years of experience working with systems at scale in either a Dev Ops, Platform Engineer, or Site Reliability Engineer role.
- Excellent analytical skills with the ability to troubleshoot complex problems, analyze system bottlenecks, and implement effective solutions, from frontend site through to backend systems sometimes during production degradation or outage.
- Exceptional command line linux skills.
- In-depth knowledge of AWS services, infrastructure as code using Terraform, and container orchestration with Kubernetes.
- Proficiency in Python and Bash for scripting and automation.
- Experience with sharded MySQL databases.
- Proven track record in security compliance standards on large-scale web infrastructures and in-depth knowledge of DDOS mitigation strategies.
- A proactive mindset in identifying potential issues and taking pre-emptive actions to prevent downtime or performance degradation.
- Excellent communication skills, open minded, and capable of effectively collaborating with cross-functional teams and articulating technical concepts to non-technical stakeholders.
- Bonus points: if you have experience container building with Docker
About The Team
DeviantArt. Founded in 2000 and a part of Wix since 2017, DeviantArt is the largest online social network for artists and art enthusiasts. For emerging and established artists, DeviantArt is the foremost platform to exhibit, promote, and share works with an enthusiastic, art-centric community. We have over 86 million registered users worldwide, and our users -- lovingly referred to as “deviants” -- upload tens of thousands of original pieces of art every day, from painting and sculpture to digital art, pixel art, films, and anime.
About Wix:
Wix makes it possible for anyone to succeed online.
Since 2006, we've grown to 5,000 employees in 17 countries, launched over 40 products, and serve over 230 million users and their visitors worldwide.
At Wix, we push you to innovate, evolve in non-traditional ways, and grow outside your comfort zone. We operate in small teams that work closely together to create incredible things.
We're proud to be an equal opportunity employer. Wix was built around the idea that everyone has the right to be successful, online. This same vision defines us as an employer: creating a work environment where everyone is welcome, and anyone has the right to succeed.
במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.
מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.
משרות נוספות מומלצות עבורך
-
Linux SRE Specialist
-
תל אביב - יפו
comblack
-
-
MATRIX - מהנדס/ת SRE
-
תל אביב - יפו
MATRIX
-
-
Site Reliability Engineering (SRE)
-
תל אביב - יפו
Riskified
-
-
Site Reliability Engineer
-
תל אביב - יפו
NetNut.io
-
-
Senior HPC Site Reliability Engineer
-
תל אביב - יפו
NVIDIA
-
-
Senior HPC Site Reliability Engineer
-
יקנעם עילית
NVIDIA
-