🛒 Shopify Store Audit

RL environment with real Shopify product data — 45 products, 184 discoverable issues, shaped rewards

Easy · Full hints

8 issues · 25 steps
Issues listed with suggested commands. Agent fills in params.

Medium · Descriptions only

12 issues · 35 steps
Issue descriptions shown. Agent picks commands & params.

Hard · Explore

20 issues · 50 steps
Only category counts. Agent must discover issues itself.

API Endpoints

GET/health — health check

GET/tasks — enumerate tasks & graders

POST/reset — reset (body: {})

POST/step — action (body: {"action":{"command":"...","params":{}}})

GET/state — current state

GET/schema — action/observation schemas

POST/grade — run grader on sample

WS/ws — persistent WebSocket session