You’ll be the technical custodian of KwikDone’s live platform: keeping it reliable, secure, and fast; owning releases; and leading the migration to a modern host (Render/Vercel/Railway/Fly, etc.). You’ll work closely with our product/dev partner, tighten our CI/CD, and put monitoring, alerting, and runbooks in place.
What You’ll Do
Ops & Reliability
Own uptime, health checks, error budgets, incident response, and post-mortems.
Instrument logging/metrics (e.g., Sentry, Logtail, Grafana/Prometheus, Datadog) and set actionable alerts.
Implement backups and restore drills for PostgreSQL (Supabase) and any object storage.
Deployment & Environments
Migrate our monolith from Replit to a modern PaaS (Render preferred; open to Vercel/Railway/Fly).
Build CI/CD pipelines (GitHub Actions preferred), blue-green/zero-downtime deploys, and env-specific configs.
Harden secrets management and environment variables.
Web App & APIs
Support a Node/TypeScript Express + Vite/React app with a Supabase (Postgres) backend.
Maintain integrations: Stripe/Paystack (live keys, webhooks), Zoho SMTP, file uploads, and escrow flows.
Triage & fix production issues (CORS, 401s, asset serving, routing, JSON responses) with crisp root-cause notes.
Networking, DNS & Security
Configure GoDaddy DNS, custom domains, HTTPS/SSL, HSTS, security headers, and WAF/CDN where appropriate.
Run basic security reviews: dependency updates, least-privilege access on DB/service accounts, and webhook signature checks.
QA & Support
Run smoke/E2E checks before/after deploys; maintain a release checklist & rollback plan.
Document runbooks, playbooks, and “how to” guides; train non-technical teammates on basic checks.
Must-Have Experience
Migrating Node/TypeScript web apps from Replit (or similar) to Render/Vercel/Railway/Fly with CI/CD.
PostgreSQL/Supabase administration (schemas, migrations, roles, backups).
Stripe and/or Paystack live integrations (keys, webhooks, test plans).
DNS & TLS (GoDaddy, A/CNAME, SSL certs, redirects) and production security basics.
Production debugging of Express/React/Vite apps (routing, asset serving, CORS, auth/session issues).
Excellent documentation habits and clear async communication.
Nice to Have
Docker & containerized deploys; Nginx reverse proxy basics.
Sentry/Datadog/Grafana experience; synthetic checks.
React Native/Expo familiarity (for later mobile builds).
NDPR/GDPR-aware data handling and audit logging.
Success in 30/60/90 Days
30 days: Monitoring & alerts live; deploy pipeline in place; secrets standardized; health checks & rollback doc.
60 days: Replit → chosen PaaS migration completed; stable custom domain + SSL; webhooks verified; backup/restore tested.
90 days: SLOs defined; incident runbooks adopted; performance budget & cost dashboard established.
Quick Screening Questions (please answer briefly)
Which PaaS would you pick for a Node/TS + Postgres monolith and why (Render vs Vercel vs Railway/Fly)?
How do you verify Stripe/Paystack webhooks end-to-end in production?
What’s your go-to checklist before flipping live DNS to a new host?
Outline a minimal backup & restore plan for Supabase Postgres.