Who We Are:
Inspired by a shared love of fashion, sneakers and the surrounding culture, END. was founded in Newcastle upon Tyne back in 2005. Our first UK store brought together a curated mix of celebrated designers and emerging brands, many of which were previously hard to find outside of London. From day one, we've nurtured a community of like-minded individuals united by a love for this ever-evolving culture.
Today, END. serves over 2 million customers worldwide through a seamless blend of online and physical retail. Our industry-leading stores in Newcastle, Glasgow, Manchester, London, and Milan reflect our commitment to innovation, influence, and inspiration. We offer a carefully curated selection of menswear, womenswear, sneakers, homeware, and lifestyle products to a global audience.
At the heart of END. is our customer and it's our people and culture that make the difference. With over 600 employees across our HQ, offices, and retail locations, our customer-first mindset continues to guide everything we do.
The Role:
Own the reliability, security, performance, and day-to-day operational excellence of our Shopify Plus platform and its critical integration ecosystem (Shopify Patchworks D365). You'll lead observability, incident response, stability and problem management, and the operational governance that keeps trading safe and predictable. The role also takes BAU ownership of vendor-delivered CI/CD pipelines—maintaining release controls, access and configuration, and continuous improvement—while operating effectively within Shopify's SaaS constraints (Shopify-managedhosting/CDN/WAF, with clear escalation paths where required).
Here's a breakdown of what you'll be doing:
Service ownership \& SLOs: define ownership boundaries, SLIs/SLOs (availability, latency, data freshness), alert thresholds, and regular reporting for platform and integration services.
Incident management, stability \& problem management: lead triage, stakeholder comms, mitigation, RCA and preventative actions; maintain a stability backlog and drive repeat-incident themes to closure to improve MTTR and reduce recurrence.
Security \& compliance controls (operational): secrets management, least privilege, auditability, dependency hygiene, and secure configuration for apps/services; coordinate security reviews and remediation plans.
Peak readiness \& resilience: trading-event readiness plans (monitoring uplift, incident playbooks, change risk controls/freeze coordination, validation checklists) and business continuity/degraded-mode procedures for integration failures.
Who we're looking for:
and operational discipline. * Experience maintaining CI/CD pipelines delivered by others (e.g., GitHub Actions/Azure DevOps):
permissions, secrets, environment configs, release controls/quality gates, and troubleshooting. * Strong observability and reliability engineering experience (New Relic/Sentry/Datadog/etc.): defining
SLIs/SLOs, improving alert quality, incident analysis, RCA/problem management, and measurable
MTTR/incident reduction. * Strong integration operations experience across iPaaS/ERP flows: queues/backpressure, rate-limiting,
idempotency, retries/DLQs, replay/backfill, reconciliation, and data integrity monitoring. * Practical understanding of operating within Shopify Plus SaaS constraints (limited edge/WAF control) and
implementing controls in the layers you do own (apps, services, integrations, tooling, processes, vendor
escalation). * Experience operating/supporting D365 environments (Dev/Sandbox, UAT, Prod): environment management,
release coordination, upgrade rehearsals/testing in lower environments, and go/no-go support. * Strong governance capability: access reviews (joiner/mover/leaver), least privilege, audit evidence, and
secure operational practices across Shopify/Patchworks/D365 and monitoring tools. * Experience with D365 licensing and capacity/storage management, plus cost observability and optimisation. * Strong stakeholder management under pressure: clear comms, structured updates, escalation with evidence
packs, and calm coordination during peak/trading events. * Strong Shopify Plus operational knowledge: Admin API/GraphQL + webhooks, rate-limit strategies, bulk
operations, and navigating Shopify Plus support/escalation processes. * Patchworks (or similar iPaaS) experience: mapping/versioning, error handling patterns, replay/reprocessing,
and operational reporting. * Hands-on D365 platform administration: environment refreshes, release validation, upgrade rehearsals,
role/security model familiarity, and capacity/storage optimisation. * Infrastructure-as-Code familiarity (Terraform or equivalent) for the integration/runtime layer (cloud resources,
queues, secrets, monitoring as code). * Experience with SLO tooling and incident/problem management practices (post-incident reviews, stability
backlogs, automation/runbook maturity). * Experience implementing synthetic monitoring and RUM for ecommerce journeys, including peak-readiness
monitoring uplift.
Besides a competitive salary and an engaging and inclusive work place we can offer you:
We know that great talent comes in many forms. So even if you don't meet every single criteria, we would still love to hear from you. If you're passionate, driven, and believe you can contribute to our future success, we encourage you to apply.
Please note: Employment is conditional upon having the legal right to work in the UK for the role offered.