What is Shiplight AI?

Shiplight gives your coding agent eyes and hands in a real browser. The agent becomes the primary author and maintainer of your YAML E2E tests, while humans stay in control by reviewing readable tests and hand-tuning complex flows when needed. It works with Claude Code, Cursor, Codex, and any other MCP-compatible agent.

Agent Workflow

Shiplight V2 is built around an agent-native testing workflow:

  • Use the browser MCP server and Shiplight Skills with your coding agent
  • Let the agent inspect your app in a real browser and verify UI behavior
  • Let the agent author and maintain YAML E2E tests — end-to-end tests written in YAML format with natural language steps
  • Review tests as readable specs, then hand-tune complex UI flows in the visual debugger
  • Keep tests in your repo alongside application code
  • Debug test failures locally with the Shiplight CLI and visual debugger
  • Run the same YAML E2E tests in GitHub Actions or on Shiplight-hosted CI runners
  • For enterprise teams, manage CI test results in Shiplight

How It Works

  1. Install MCP and Skills so your coding agent can launch browsers, inspect pages, and use Shiplight workflows.
  2. Verify UI changes by asking the agent to exercise the feature in a real browser.
  3. Author and maintain YAML E2E tests from the same browser session, using natural language steps that are easy to review.
  4. Review and tune locally when tests need human control, with screenshots, traces, and a visual debugger for complex UI flows.
  5. Run in CI through GitHub Actions or Shiplight-hosted CI runners, then review managed test results in Shiplight.

Why This Workflow

  • Agent-authored — coding agents do the primary test creation and maintenance work
  • Human-controlled — YAML keeps tests easy to review, and the debugger lets you hand-tune complex UI cases
  • Repo-native — YAML E2E tests are versioned with your code and run from your project
  • CI-ready — local tests run the same way in GitHub Actions and Shiplight CI
  • Debuggable — failures include browser evidence, traces, and step-level context

Get started: Setup Guide

Released under the MIT License.