The Platform team owns Recart's cloud infrastructure, deployment pipelines, and internal developer tooling. The layer that makes it possible for everyone else to ship fast and safely.
Recart runs on ~40 micro-services across AWS, Kubernetes, and MongoDB. The Platform team makes sure that infrastructure is reliable, secure, and as invisible as possible to the engineers who depend on it. When a CircleCI build fails, when a dead-letter queue needs to be redriven, when an engineer needs temporary production access to fix a critical bug, Platform handles it.
But they're not just a reactive ops team. They build internal tooling that changes how the whole engineering org works. One of their biggest initiatives: a Windmill-based internal developer platform with a Slack bot interface, so engineers can request and receive temporary AWS IAM permissions or MongoDB access without ever leaving Slack. Approved, provisioned, and auto-revoked, all automated. The result is that the entire engineering team moves faster, with less bottleneck and more audibility.
They're also deep in the AI tooling wave: setting up Claude Code across repositories, generating claude.md files for each service, and integrating MCP servers so engineers can interact with Shortcut and other tools directly from their coding environment.
On any given day, our engineers are fielding questions from CS, syncing with Product, and reviewing each other's scopes on Notion before a single line of code is written. We do standups, run design reviews, and write internal docs when something new gets built, so knowledge doesn't live only in Slack threads.
We move fast, but we're deliberate: new features go through feature flags, changes are scoped before they're picked up, and on-call rotations mean someone's always got the weekend covered.
"Every time an engineer needs temporary production access, it goes through a manual approval chain and takes half a day. Automate it."
"We need to dump the messagelogs MongoDB collection and restore it into SingleStore. Scale up the cluster, manage IOPS, execute the migration without downtime."
"Renovate stopped opening dependency PRs across multiple repos. Figure out why and fix it before the dependency graph drifts."
"Set up Claude Code for the entire engineering org. Generate claude.md files for 40+ services, wire in MCP integrations, make it work."
You'll do well here if you find satisfaction in building systems that make other engineers faster, and don't need credit for it. The Platform team's best work is invisible: nobody notices the deployment pipeline, the access management bot, or the cluster configuration until it breaks. That's the goal.
Deep AWS and Kubernetes knowledge matters, as does a bias toward automation over manual processes. The Windmill bot is a good example of the team's philosophy: if you're doing something manually more than twice, build the tool that does it for you.
Interest in AI developer tooling is increasingly relevant. The team is actively expanding how engineers interact with Claude Code and MCP tooling across the stack.
2 engineers, working closely with the CTO.