Platform Operations ensures the integrity, reliability, and controllability of the live trading environment. This function safeguards uptime and production stability across all user-facing and admin systems. It acts as the operational backbone between engineering, compliance, support, and product — ensuring that every change to the platform is controlled, reversible, and fully documented.
Duties / Responsibilities:
Monitor uptime, latency, and error rates across critical components (e.g. matching engine, swap aggregator, auth/login, wallet APIs)
Coordinate real-time incident response across engineering, support, and compliance teams
Own the platform change calendar, enforcing pre-release QA, rollback plans, and sign-off procedures
Manage all production config changes, including trading pairs, fee tiers, circuit breakers, leverage/margin toggles, etc.
Manage access control across wallet infrastructure, trading admin dashboards, back-office panels, and vendor tools
Maintain logs of all changes made to production environments, with audit trail visibility for Compliance and Risk
Track vendor uptime and coordinate resolution of service disruptions (e.g. third-party vendors such as Fireblocks, Sumsub and others)
Work with Support to identify and eliminate sources of recurring user issues or operational bugs
Define and maintain platform health KPIs: uptime %, MTTR, deployment success rate, config change errors