Health

Playwright course expands full-stack testing with mocking, visual/a11y checks, and scale-focused stability

AI-Generated Summary

1 sources

1 day ago

2 views

Playwright course expands full-stack testing with mocking, visual/a11y checks, and scale-focused stability — Photo: Dev.to

Key Points

The course uses separate Playwright projects for API (no browser) and UI (Chromium) and centralizes base URLs in an env helper.
It recommends stable selectors and assertions: use getByRole/getByLabel with strict mode, and prefer expect(locator).to... web-first auto-waiting over waitForTimeout.
It improves reliability by centralizing unique data generation and by waiting for actual page readiness signals in Page Objects.
It adds determinism with page-level network mocking (route.fulfill) for error/edge UI states, rather than relying on hard-to-produce backend conditions.
It extends coverage with visual regression (toHaveScreenshot with font/animation handling) and accessibility scanning (axe-core via @axe-core/playwright), while triaging or excluding third-party components.

Across multiple chapters, the Playwright + TypeScript course describes how to build a full-stack end-to-end test suite that stays reliable as it grows. It starts with a minimal setup for separate API and UI projects, centralized environment URLs, and early smoke tests that reveal real issues like incorrect URL construction and test interference from parallel resets. The framework evolution emphasizes “web-first” assertions and resilient locators (role- and label-based), strict mode to prevent ambiguous element targeting, and avoiding timing-based flakiness by relying on Playwright’s auto-waiting. For stability at scale, it centralizes tricky utilities for unique test data, waits for real readiness signals in page objects instead of using fixed timeouts, and reuses fixtures/page objects to add new UI flows with minimal marginal cost. It also notes specific application-level bugs uncovered by the suite, including an always-true password-hashing condition causing 500 errors on profile updates.

Later chapters add network mocking for deterministic UI states, visual regression via screenshot baselines with platform-specific handling, and automated accessibility checks using axe-core with triage for third-party widgets. The capstone outlines a 100-test suite with isolated identities, API + UI coverage, and fixes for multiple real defects, while also recommending practices like sharding, parallelization, trace-on-retry debugging, and maintaining deterministic baselines.

How Outlets Covered This Story

DEV

Dev.to

Playwright For Full-Stack Testing: Auth, Fixtures, Mocking, Snapshots, And Parallel Runs Without The Flake

Here's a Playwright test that looks completely reasonable and silently lies to you: tests/dashboard.spec.ts import { test, expect } from "@playwright/test"; test.use({ storageState: "playwright/.auth/user.json" }); test("dashboard shows the user's name", async ({ page }) => { await page.goto("/dashboard"); await expect(page.getByTestId("user-name")).toHaveText("Nazar"); }); It logs in once, saves the auth state, reuses it across every test. Textbook. Except storageState saves cookies and localStorage. It does not save sessionStorage. If your app stores its JWT in sessionStorage (which a lot of SPAs do, because it dies on tab close and product wants that), every test in your suite is silently running as an unauthenticated user that happens to land on /dashboard and follow the redirect to /login. Your assertions don't fail loudly. They just match the wrong page. The fix is documented in one sentence on the Playwright auth page. Almost nobody reads it. This is the shape of full-stack testing with Playwright: the surface API is delightful, and the failure modes hide one level below it. Let's walk through what actually keeps tests green in CI (authentication, fixtures, API mocking, visual checks, and parallel runs) and the gotchas that quietly take suites down. Set Up Authentication Once, Not On Every Test The naive approach is beforeEach that fills the login form. Don't. A 60-test suite at 800ms of login per test is 48 seconds of pure setup that you pay every CI run, for nothing. Playwright's storageState lets you log in once, dump cookies and localStorage to a JSON file on disk, and load that file into every test as a starting context. The recommended shape uses project dependencies. You declare a setup project that runs a single auth.setup.ts file before everything else, and your real test projects depend on it: playwright.config.ts import { defineConfig } from "@playwright/test"; export default defineConfig({ projects: [ { name: "setup", testMatch: /.*\.setup\.ts/ }, { name: "chromium", use: { storageState: "playwright/.auth/user.json" }, dependencies: ["setup"], }, ], }); The setup file does the actual sign-in once: tests/auth.setup.ts import { test as setup, expect } from "@playwright/test"; const authFile = "playwright/.auth/user.json"; setup("authenticate", async ({ page }) => { await page.goto("/login"); await page.getByLabel("Email").fill("e2e@example.com"); await page.getByLabel("Password").fill(process.env.E2E_PASSWORD!); await page.getByRole("button", { name: "Sign in" }).click(); // Verify we actually got in before saving — this catches CAPTCHA, MFA, broken envs. await expect(page.getByTestId("user-menu")).toBeVisible(); await page.context().storageState({ path: authFile }); }); The verification line is not optional. If your login flow ever fails (wrong env, expired test account, a new "verify it's you" challenge) and you skip the assertion, you save an unauthenticated state to disk and ship 200 tests that all hit the login page. The whole suite reports green if your assertions happen to also pass on /login. Trust me, this is how zero-coverage suites get born. The sessionStorage / IndexedDB Trap Back to the opener. storageState captures cookies and localStorage by design. If your auth lives anywhere else, you have to do extra work: sessionStorage: never persisted. There's no flag for it. Apps that store tokens here have to script the storage write themselves after loading the saved state, or move the token to localStorage (with the security tradeoff that implies). IndexedDB: added in Playwright 1.51 with storageState({ indexedDB: true }). If your app is built on top of a client database like RxDB, Dexie, or Firebase's offline cache, you want this flag on or your saved state is missing huge chunks of the app's actual state. The fix for sessionStorage looks like this: tests/auth.setup.ts (sessionStorage variant) setup("authenticate", async ({ page }) => { await page.goto("/login"); // ... sign in flow ... await expect(page.getByTestId("user-menu")).toBeVisible(); // Pull the token out so we can replay it later. const token = await page.evaluate(() => sessionStorage.getItem("jwt")); await page.context().storageState({ path: "playwright/.auth/user.json" }); // Stash the token separately — storageState won't save it. await import("node:fs/promises").then((fs) => fs.writeFile("playwright/.auth/token.json", JSON.stringify({ token })) ); }); Then a fixture re-injects it on every test (we'll get to fixtures in a moment). It's ugly, but the alternative is an entire test suite hallucinating signed-in behavior. Multiple Roles Without The Setup Tax Real apps have admin / editor / viewer / billing-only / whatever. The temptation is to chain them all in one setup project. Don't. Every test run pays for every role, even if your shard only touches the admin tests. A cleaner pattern is one storage file per role, each generated lazily by its own fixture, only when a test actually asks for it. That's the topic of the next section, but here's the spoiler: a worker-scoped fixture per role lets each shard pay only for the auth it uses. Use Fixtures To Move The Repetition Out Of Your Tests @playwright/test ships its own fixture system that has almost nothing in common with Jest's beforeEach style. Instead of setup hooks scattered across files, you define a fixture as a function, declare it once, and Playwright wires it into any test that names it. A minimal fixture that gives every test a logged-in API context: tests/fixtures.ts import { test as base, request } from "@playwright/test"; type Fixtures = { api: Awaited<ReturnType<typeof request.newContext>>; }; export const test = base.extend<Fixtures>({ api: async ({}, use) => { const ctx = await request.newContext({ baseURL: process.env.API_URL, extraHTTPHeaders: { Authorization: `Bearer ${process.env.E2E_TOKEN}` }, }); await use(ctx); // tests run here await ctx.dispose(); // teardown after every test }, }); Now every test that imports test from ./fixtures.ts instead of @playwright/test can do async ({ page, api }) => ... and call api.post("/seed/orders", { data: ... }) to set up backend state before driving the browser. No beforeEach, no module-level globals, no leaks between tests. Playwright disposes the context after every test on its own. Test-Scoped vs Worker-Scoped: The Performance Knob By default fixtures are test-scoped: they run before and after every individual test. That's the right default for anything that holds mutable state (an API context, a seeded database row, a temp file). It's the wrong default for expensive read-only setup like "spin up a fresh Postgres schema". For those, mark the fixture as worker-scoped: tests/fixtures.ts (worker-scoped DB) type WorkerFixtures = { dbSchema: string }; export const test = base.extend<{}, WorkerFixtures>({ dbSchema: [ async ({}, use, workerInfo) => { const schema = `e2e_${workerInfo.parallelIndex}`; await execSql(`CREATE SCHEMA ${schema}`); await runMigrations(schema); await use(schema); await execSql(`DROP SCHEMA ${schema} CASCADE`); }, { scope: "worker" }, ], }); workerInfo.parallelIndex is a small integer that's unique per parallel worker but reused across workers as they're recycled. Most "isolate per worker" patterns key off it: schema names, mailbox addresses, port numbers, fake-user emails. The full key with retries is workerInfo.workerIndex, which keeps incrementing; parallelIndex stays bounded. A Per-Worker Auth Fixture For State-Mutating Tests Tests that mutate data (change a user's profile, place an order, archive a workspace) need their own user account, or they race each other. The pattern is one user per worker, authenticated once per worker: tests/fixtures.ts (per-worker auth) export const test = base.extend<{}, { storageState: string }>({ storageState: [ async ({ browser }, use, workerInfo) => { const file = `playwright/.auth/user-${workerInfo.parallelIndex}.json`; if (!existsSync(file)) { const ctx = await browser.newContext(); const page = await ctx.newPage(); await page.goto("/login"); await page.getByLabel("Email").fill(`e2e+${workerInfo.parallelIndex}@example.com`); await page.getByLabel("Password").fill(process.env.E2E_PASSWORD!); await page.getByRole("button", { name: "Sign in" }).click(); await expect(page.getByTestId("user-menu")).toBeVisible(); await ctx.storageState({ path: file }); await ctx.close(); } await use(file); }, { scope: "worker" }, ], }); Now each worker logs in exactly once, for exactly the role its tests need, and never collides with another worker's data. A 5-worker run with admin + viewer + member roles spread across tests pays for 5 logins (one per worker, for whichever role it happens to need first), not 15. Mock The API Layer When It Matters, And Don't When It Doesn't This is where opinions get loud. The orthodox e2e position is "mock nothing, hit the real stack". The CI-cost position is "mock everything, hope your contracts hold". The honest answer is that a full-stack suite needs both, in different tests, deliberately chosen. Playwright's mocking primitive is page.route(pattern, handler). It hooks the browser's network layer and lets you intercept anything before it leaves: tests/checkout-error.spec.ts test("shows a friendly error when payment is declined", async ({ page }) => { await page.route("**/api/payments", (route) => route.fulfill({ status: 402, contentType: "application/json", body: JSON.stringify({ error: "card_declined" }), }) ); await page.goto("/checkout"); await page.getByRole("button", { name: "Pay" }).click(); await expect(page.getByRole("alert")).toHaveText(/card was declined/i); }); That's the move for error-path tests. You cannot reliably trigger a real 402 from Stripe on demand, and you don't want your CI suite making real test-mode charges anyway. Mock the route, drive the UI, assert the user-visible behavior. The same primitive lets you do partial mocking, where the real backend handles most of a response and you patch one field: tests/feature-flag.spec.ts await page.route("**/api/me", async (route) => { const response = await route.fetch(); const body = await response.json(); body.flags = { ...body.flags, new_dashboard: true }; await route.fulfill({ response, body: JSON.stringify(body) }); }); This pattern is gold for testing feature-flagged UI without actually flipping a flag in your config service. Real auth, real user, real DB, one tiny patch on the response. HAR Files: Record Once, Replay Forever For pages that pull from a dozen endpoints, hand-writing mocks is miserable. Playwright's routeFromHAR captures every network request the first time the test runs, stores it in an HTTP Archive file, then replays from disk on subsequent runs: tests/landing.spec.ts test("landing page", async ({ page }) => { // First run: pass { update: true } to record. // After that: omit it, and requests are served from disk. await page.routeFromHAR("hars/landing.har", { url: "**/api/**" }); await page.goto("/"); await expect(page.getByRole("heading", { name: "Welcome" })).toBeVisible(); }); Run it once with { update: true }, commit the HAR file, and the test is now hermetic. No backend dependency, no flake from a slow upstream, no API quota burn. The trap: HAR matching is strict on URL and HTTP method, and for POST requests it also matches the request payload. If your test sends a POST with a timestamp, a UUID, or anything else that changes between runs, the replay misses, and by default Playwright aborts the unmatched request (notFound: 'abort'), so your test dies on a confusing network error. Set notFound: 'fallback' and misses fall through to your other route handlers and, from there, the real network, which is arguably worse because now it's silent. There are long-standing GitHub issues about exactly this failure mode for state-mutating requests. The pragmatic answer is: use HAR for GET-heavy read paths, and write explicit page.route mocks for anything that POSTs. When To Reach For Each Tool A working heuristic: No mocking: happy-path smoke tests that prove the whole stack actually integrates. Keep a handful of these. They're slow, they're flaky, they're worth it. page.route with fulfill: error states, edge cases, anything you can't reliably trigger live. page.route with fetch + patch: feature flags, A/B variants, anything where the response shape is mostly real but one field needs forcing. routeFromHAR: read-heavy pages with lots of upstream calls and stable responses. APIRequestContext: backend-only assertions, or seeding state before a UI test. Doesn't drive a browser, doesn't pay the browser cost. The mistake is going all-in on any one of them. A pure no-mock suite is brittle and slow; a pure mock suite drifts from reality the day your API changes. Pick per-test based on what you're actually trying to verify. Visual Checks Without The Flake toHaveScreenshot is the assertion that tempts you with "just snapshot the page", and then teaches you over the next month why visual diffing is a discipline, not a one-liner. The baseline call is short: tests/visual.spec.ts test("pricing page matches baseline", async ({ page }) => { await page.goto("/pricing"); await expect(page).toHaveScreenshot("pricing.png", { fullPage: true }); }); First run, Playwright writes pricing-chromium-linux.png to your test folder. Every subsequent run, it diffs the live screenshot against that baseline. The match is per-platform: Linux Chromium and macOS Chromium render differently at the subpixel level because of font rendering, so your local-vs-CI snapshots will diverge unless you generate both. The Three Tolerance Knobs The defaults are not generous, and tightening or loosening them without understanding the difference is the most common mistake: threshold (default 0.2): a 0-to-1 color-difference threshold per pixel. 0 means exact pixel match; 1 means anything goes. This controls how different a pixel has to be before it counts as a diff. Anti-aliasing and font hinting move pixels by tiny amounts, so a strict 0 will fail on benign rendering differences. maxDiffPixels: an absolute integer. "Allow up to 500 pixels to differ before failing." Useful when you know your page has a small dynamic region. maxDiffPixelRatio: a fraction of total pixels (0 to 1). "Allow up to 0.1% of pixels to differ." Scales with image size. Setting threshold higher hides real visual bugs because it lets every pixel drift a little. Setting maxDiffPixels higher is usually safer: it caps the area of allowed difference rather than weakening the per-pixel comparison. The two combine: a diff fails only if more than maxDiffPixels pixels each exceed the threshold color delta. Killing The Three Causes Of Flake Visual tests fail for three reasons that have nothing to do with your code: Animations still running: pause them. await page.addStyleTag({ content: "*{animation: none !important; transition: none !important;}" }) is the brutal but effective version. Fonts not loaded: wait for them. await page.evaluate(() => document.fonts.ready) blocks until web fonts have actually rendered. Without it, the first run captures the system fallback font and every subsequent run that loads the web font fails. Dynamic content: timestamps, randomized testimonials, ad slots, the user's own avatar. Mask them with { mask: [page.getByTestId("clock"), page.getByTestId("hero-ad")] }. Playwright paints a solid color over the masked regions on both baseline and live, so they're identical by definition. toHaveScreenshot already auto-retries until the page stabilizes: it takes a screenshot, waits, takes another, and stops when two consecutive captures match. That handles small layout shifts on load. It does not handle any of the three reasons above, because those are deterministic-but-different, not transient. A Sane Visual-Test Default After enough self-inflicted CI fires, the configuration that holds up across teams looks like this: playwright.config.ts export default defineConfig({ expect: { toHaveScreenshot: { threshold: 0.2, // the default — don't lower without a reason maxDiffPixels: 100, // tiny budget for AA/hinting noise animations: "disabled" // auto-stop CSS animations before snapshot }, }, }); animations: "disabled" is a Playwright option, not a CSS hack: it freezes CSS animations and transitions before each screenshot. It's also already the default for toHaveScreenshot (plain page.screenshot() defaults to "allow"), so the config line is less about flipping a switch and more about pinning behavior your suite relies on. Either way, it's the cleanest answer to reason #1, no style injection of your own needed. Parallel Runs And Sharding Without Stepping On Yourself Playwright runs tests in parallel by default. Each worker is a separate OS process with its own browser instance: total isolation, no shared variables, no leaked cookies. The defaults are: Test files run in parallel. Different files go to different workers. Tests within a file run serially. Inside one file, tests share a worker process. That second rule trips people up. A file with 20 tests all hitting the same worker means slow workers and underused parallelism. The fix is one config line: playwright.config.ts export default defineConfig({ fullyParallel: true }); With fullyParallel: true, Playwright distributes individual tests across workers regardless of file. The scheduling unit drops from "file" to "test". On a 4-worker box with 20 tests in one file, you finish in roughly a quarter of the time. Isolating State Per Worker If your tests mutate shared resources (a database, a message queue, a third-party sandbox account), parallelism turns into a race condition factory. The standard pattern is keying per-worker resources off process.env.TEST_WORKER_INDEX (or testInfo.workerIndex inside tests): tests/fixtures.ts export const test = base.extend<{ user: User }>({ user: async ({}, use, testInfo) => { // Each worker gets its own email — no two parallel tests fight over the same row. const email = `e2e-${testInfo.workerIndex}-${Date.now()}@example.com`; const u = await api.createUser({ email }); await use(u); await api.deleteUser(u.id); }, }); workerIndex increments forever (1, 2, 3, ...), so retries land in a fresh worker with a fresh number. parallelIndex cycles through 0..workers-1. Use it when you want a stable index that can be reused (like the auth-per-worker storage files above). Sharding For CI: Split The Suite Across Machines Workers parallelize on one machine. Sharding splits the suite across machines. CLI: npx playwright test --shard=1/4 npx playwright test --shard=2/4 npx playwright test --shard=3/4 npx playwright test --shard=4/4 Four CI jobs, each runs roughly a quarter of the suite. Playwright distributes tests deterministically based on the shard index, so you don't have to coordinate. The official docs explicitly recommend pairing sharding with fullyParallel: true: at the file level, shards risk being uneven because one file with 50 tests counts as one unit. At the test level, work splits much more evenly. The mental model is two-dimensional: shards split tests across machines, workers split tests across CPU cores on each machine. A 4-shard / 4-worker setup gives you 16-way parallelism. The bottleneck flips from CPU to your backend's ability to handle 16 concurrent test users, which is its own conversation. The One CI Setting That Actually Matters: Traces If you change exactly one Playwright config when you wire it into CI, change this: playwright.config.ts export default defineConfig({ retries: process.env.CI ? 2 : 0, use: { trace: "on-first-retry", }, }); trace: 'on-first-retry' tells Playwright to record a full trace (DOM snapshots at every action, network requests, console logs, screenshots before and after each step) only when a test fails and is being retried. The first attempt runs lean. The retry records everything. When the retry passes, the trace is discarded. When it also fails, you get a trace.zip attached to the test report. Open it with npx playwright show-trace trace.zip. You get a timeline of every action, with a DOM snapshot at each step. You can hover the timeline and see the page change. You can click any locator call and see exactly what was on the page at that moment. The Network tab shows every request, including the 401s your auth token didn't survive into CI. The Console tab shows the JS error that fired on a slower machine. This is the difference between "the test failed in CI but I can't reproduce locally" being a half-day investigation and a five-minute one. If you don't have retries enabled at all, swap in trace: 'retain-on-failure': same idea, fires on first failure instead of first retry. Tip The trace file lives in your artifacts. Wire it into your CI job to be uploaded on failure, and the Playwright HTML reporter will surface a "View trace" link in the failure report. The wiring is two lines in most CI systems; the payoff is permanent. What Stays With You Full-stack tests with Playwright work the way furniture works: every piece looks simple in the catalog, and the project succeeds or fails on how the pieces fit. Save authentication once with storageState, mind the sessionStorage blind spot, and prefer project dependencies for the setup step. Push everything you'd otherwise put in beforeEach into a fixture, and pick test scope vs worker scope based on whether the fixture is per-test state or per-process state. Mock the API at the layer that hurts least: page.route for error paths, HAR for read-heavy pages, the real backend for the small set of tests that prove the integration. Treat visual checks as a discipline: kill animations, wait for fonts, mask the volatile bits, leave threshold alone. Lean on fullyParallel and sharding for speed, and key every shared resource off workerIndex so parallelism never silently corrupts your data. Turn on trace: 'on-first-retry' before you ship anything to CI. Do those seven things and the suite stops being a chore you maintain. It starts being the thing that catches the bug you would otherwise have shipped. Originally published at nazarboyko.com.

2 hours ago

DEV

Dev.to

Capstone: A 100-Test Suite, End to End (Playwright + TypeScript, Ch.26)

This is where everything comes together. Over the course we built a layered Playwright TypeScript framework — fixtures, Page Objects, API + UI + integration tests, CI, reporting — against a real dockerized app. The capstone makes it whole: end-to-end journeys, broad coverage, and a suite that's green, fast, and deterministic. ── Run summary ─────────────────────────────── result: passed tests: 100 (✓ 100 ✘ 0 ⤿ flaky 0 – skipped 0) projects: setup 1 api 66 ui 33 Code for this chapter is tagged ch-26 in the repo: https://github.com/aktibaba/playwright-qa-course. End-to-end journeys The headline tests exercise the whole product the way a user does — and each owns a fresh identity, so it's fully isolated: test("a new user signs up, publishes an article, and sees it on their profile", async ({ signUpPage, articleEditorPage, articlePage, page, }) => { const username = uniqueId("author").replace(/-/g, ""); await signUpPage.signUp({ username, email: `${username}@test.io`, password: "Password123!" }); const title = `Capstone article ${Date.now()}`; await articleEditorPage.publishArticle({ title, description: "…", body: "…", tags: "capstone" }); await articlePage.expectTitle(title); await page.goto(`/#/profile/${username}`); await expect(page.getByRole("heading", { name: title })).toBeVisible(); }); Sign up → author → view → profile, all through the UI, reusing every Page Object and fixture we built. The marginal cost of a journey this rich is a dozen readable lines. What the 100 tests cover API (66): articles CRUD, comments, favorites, follows, profiles, tags, pagination, the personalized feed, auth & sessions, validation, and authorization. UI (33): locators & assertions, login/signup/logout, the article editor, comments, settings, profile & feed, tag filtering, network mocking, visual regression, accessibility, and the end-to-end journeys. Cross-cutting: seed-via-API/verify-in-UI, storageState auth, sharded CI, a custom reporter, and unique-data isolation throughout. The bugs the suite found Run honestly at scale, the suite did what good tests do — it found seven real bugs in the application, all fixed in sut/: createArticle crashed when tagList was omitted. A null-author race (un-awaited setAuthor) crashed GET /articles under load. slug wasn't unique → duplicate slugs → favorite primary-key collisions. offset pagination violated the RealWorld contract (offset * limit). WCAG-AA color-contrast failures across the UI. updateUser 500'd on every profile update (|| that's always true) and risked clobbering passwords. An invalid token returned 500 instead of 401. That's the real return on a framework: not just "do the tests pass," but a suite trustworthy enough that when it goes red, you believe it — and it catches what the UI alone never would. Where to take it next More browsers/devices — add WebKit and Firefox projects, and a mobile viewport. Visual coverage in CI — generate Linux baselines in the Playwright Docker image. Data & trends — ship json/blob results to a dashboard; track flaky-rate over time (Chapter 25). Contract testing — assert the API against the published RealWorld OpenAPI spec. Performance budgets — fail a test when a key request blows past a threshold. The framework is the foundation; these are afternoons, not rewrites — because the architecture (Part 2) was built to extend. Thank you That's the course: from "why a framework" to a production-grade, 100-test, API+UI suite that runs in CI and even improved the app it tests. Clone the repo, check out any ch-NN tag, and make it yours. If this series helped, star the repo and tell me what you built with it. Happy testing. 🎭

1 day ago

DEV

Dev.to

Stability & Maintainability at Scale (Playwright + TypeScript, Ch.23)

As a suite grows, two things decide whether it stays an asset or becomes a liability: is it stable (does it fail only for real reasons?) and is it maintainable (can you add the next flow without copy-paste?). This chapter is about the habits that keep both true — demonstrated by adding comment and settings flows. Code for this chapter is tagged ch-23 in the repo: https://github.com/aktibaba/playwright-qa-course — see src/utils/unique.ts, src/pages/SettingsPage.ts, and the new comment-ui / settings-ui specs. Centralize the tricky bits The flaky slug bug a few chapters back came from generating "unique" data that wasn't unique across parallel workers. The lesson isn't "be careful" — it's put the hard thing in one place so nobody gets it wrong again: // src/utils/unique.ts let counter = 0; export function uniqueId(prefix = "id"): string { counter += 1; return `${prefix}-${Date.now()}-${counter}-${Math.floor(Math.random() * 1e9)}`; } Now the article factory and the user factory both call uniqueId() — one proven recipe, zero chances to reintroduce the collision. That's maintainability: the correct way is the only way. Wait for the right signal, not a guess The settings screen loads the current user asynchronously, then fills the form. Editing a field before that load lands would submit empty values over the real ones. The stable fix is never a waitForTimeout — it's waiting for the actual readiness signal: // src/pages/SettingsPage.ts async goto(): Promise<void> { await this.page.goto("/#/settings"); await expect(this.updateButton).toBeVisible(); await expect(this.username).not.toHaveValue(""); // the form has loaded } Encapsulating that wait in the Page Object means every settings test inherits the stability for free — the test just calls goto(). New flows, same machinery Adding comments and settings didn't require new infrastructure — they reuse the fixtures and Page Objects we already have. A comment test reads as behavior: test.use({ storageState: ".auth/playwright.json" }); test("post a comment and see it appear", async ({ makeArticle, articlePage }) => { const article = await makeArticle(); // seed via API await articlePage.goto(article.slug); const body = `Nice article ${Date.now()}`; await articlePage.postComment(body); // act in UI await expect(articlePage.comment(body)).toBeVisible(); // verify in UI }); The settings test goes further on isolation: it registers a fresh user through the API and logs in as them, so changing a profile never contends with other tests on the shared seed user. New surface, but the same registerUser, loginPage, and settingsPage building blocks. That's what "scales" means here — the marginal cost of the next flow is small. …and another real bug Writing the settings flow, the UI test failed — and so did a direct API check. The SUT's update endpoint 500'd on every profile update: // the original, buggy condition if (password !== undefined || password !== "") { // always true! loggedUser.password = await bcryptHash(password); // bcryptHash(undefined) -> 500 } a !== x || a !== y is always true, so every update tried to hash an absent password ("data and salt arguments required") — and on a real save would have clobbered the user's password. One character — || → && — fixed it. The suite didn't just verify the settings screen; it proved the whole feature was broken. Next up Chapter 24 — Framework maturation & docs: we tidy the project, document how to run and extend it, and round out coverage so a newcomer can be productive in minutes. Tag: ch-24. Following along? Star the repo and tell me the one helper that removed the most flakiness from your suite.

1 day ago

DEV

Dev.to

Advanced: Network Mocking, Visual & Accessibility (Playwright + TypeScript, Ch.22)

Welcome to Part 6. The framework is solid; now we add three powerful kinds of test that go beyond "click and assert text." Code for this chapter is tagged ch-22 in the repo: https://github.com/aktibaba/playwright-qa-course — see src/tests/ui/: network-mock.spec.ts, visual.spec.ts, a11y.spec.ts. Network mocking — test the UI in isolation page.route intercepts requests so the UI runs against a response you control. That makes states that are awkward to set up in a real backend — empty, error, exotic data — trivial and deterministic: test("shows the empty state when the feed is empty", async ({ page }) => { await page.route("**/api/articles?*", (route) => route.fulfill({ json: { articles: [], articlesCount: 0 } }), ); await page.goto("/"); await expect(page.getByText("Articles not available.")).toBeVisible(); }); test("survives an API error without crashing", async ({ page }) => { await page.route("**/api/articles?*", (route) => route.fulfill({ status: 500, json: { errors: { body: ["boom"] } } }), ); await page.goto("/"); await expect(page.getByRole("link", { name: "Sign up" })).toBeVisible(); }); These need no database and no auth — the test owns the data. Use mocking for UI behavior on hard-to-produce responses; keep real-backend integration tests (Part 4) for the contract itself. Both, not either. Visual regression — catch the unintended toHaveScreenshot pixel-compares a page against a committed baseline, catching changes no text assertion would — a broken layout, a wrong color, a clipped button: test("login page matches its baseline", async ({ page }) => { await page.goto("/#/login"); await expect(page.getByRole("button", { name: "Login" })).toBeVisible(); await page.evaluate(() => document.fonts.ready); // avoid web-font swap flicker await expect(page).toHaveScreenshot("login.png", { maxDiffPixelRatio: 0.02 }); }); Two things make visual tests trustworthy instead of flaky: Settle the page first. Waiting on document.fonts.ready removes the most common cause of jitter — a screenshot taken mid web-font swap. A small maxDiffPixelRatio absorbs sub-pixel anti-aliasing. Baselines are platform-specific. A macOS baseline won't match Linux CI, so we test.skip visual specs on CI and document generating Linux baselines in the Playwright Docker image. Never commit a baseline from one OS and diff it on another. Accessibility — and real bugs we fixed We scan with @axe-core/playwright and fail on serious/critical violations: const results = await new AxeBuilder({ page }) .withTags(["wcag2a", "wcag2aa"]) .exclude(".pagination") // third-party widget, see below .analyze(); const serious = results.violations.filter( (v) => v.impact === "serious" || v.impact === "critical", ); expect(serious).toEqual([]); The first run failed — and the violations were real: Color contrast. The navbar links (2.1:1), the banner subtitle, muted dates, and the green feed toggle (3.0:1) all fell short of WCAG AA's 4.5:1. We fixed the app (sut/): darkened the brand green and the muted greys to meet AA. Orphaned list items came from the react-paginate widget rendering its <ul> with role="navigation". That's a third-party limitation we can't fix from app code, so we .exclude(".pagination") with a comment and would report it upstream — triaging what you don't own instead of letting it mask your own regressions. This is the realistic a11y workflow: scan, fix what's yours, triage the rest. And fixing contrast is a genuine product improvement, not just a green test. Next up We've widened what we can assert. Chapter 23 — Stability & maintainability at scale: the utilities and habits that keep a large suite trustworthy — taming animations and async, safe waiting, and helpers that stop flakiness before it starts. Tag: ch-23. Following along? Star the repo and tell me which of the three — mocking, visual, or a11y — your suite is missing.

1 day ago

DEV

Dev.to

Debugging & Developer Experience (Playwright + TypeScript, Ch.6)

A test fails on CI with expect(locator).toBeVisible() failed. Now what? Guessing is slow and demoralizing. Playwright ships a genuinely excellent debugging toolchain — this chapter is the one you'll come back to every time something breaks. We added a few scripts to make it one command each. Code for this chapter is tagged ch-06 in the repo: https://github.com/aktibaba/playwright-qa-course — see the scripts in package.json and the reporter/trace config in playwright.config.ts. // package.json "scripts": { "test": "playwright test", "test:ui": "playwright test --ui", // time-travel UI mode "test:debug": "playwright test --debug", // step-through inspector "test:report": "playwright show-report", // open the last HTML report "codegen": "playwright codegen http://localhost:3000" } UI Mode — start here npm run test:ui UI Mode is a watch-mode cockpit: a list of every test, a live browser, and a time-travel timeline. Hover any action and the browser snaps to that moment — DOM, network, console, and the locator that was used, all at that step. Edit a test and it re-runs on save. This is exactly how you'd catch the strict-mode flake from Chapter 3. Stepping to the failing assertion, the timeline shows getByRole('heading', { name: 'inkwell' }) highlighting two elements — the banner and the "Welcome to Inkwell" article — making the substring-match bug obvious in seconds. The Trace Viewer — for failures you didn't watch UI Mode is for local exploration. Traces are for failures you weren't there for — especially on CI. Our config records one automatically: // playwright.config.ts use: { trace: "on-first-retry", screenshot: "only-on-failure" } on-first-retry is the production-friendly setting: no overhead on green runs, but the moment a test fails and retries, Playwright captures a full trace. A trace is a zip with the complete timeline, DOM snapshots, network, console, and source. Open the last HTML report (which embeds traces and the failure screenshot): npm run test:report To force a trace locally even without a retry: npx playwright test article-editor --trace on On CI, upload playwright-report/ (or test-results/) as an artifact and you can open any failure's trace on your own machine — the single biggest upgrade to debugging flaky pipelines. The Inspector — step through live npm run test:debug --debug opens the Playwright Inspector: execution pauses before each action, you step forward one command at a time, and a locator picker lets you hover the page to get the exact recommended locator. Drop a breakpoint anywhere in code with: await page.pause(); // halts here, hands you the Inspector This is how you'd dissect the login redirect race from Chapter 5 — pause right after the submit click and watch the app fire its navigate("/") out from under you. Codegen — record locators, don't guess them npm run codegen codegen opens Inkwell and writes a test as you click, choosing role-based locators automatically. It's not for generating finished tests — it's the fastest way to discover the right locator for a tricky element, which you then lift into a Page Object. Treat its output as a first draft, never a final one. The VS Code extension The official Playwright extension gives you the whole loop inside the editor: run/debug a test from a gutter icon, set breakpoints in test code, and a "Pick locator" / "Record at cursor" button. If you live in VS Code, install it — it replaces most of the CLI flags above with a click. A debugging workflow When something breaks, in order: npm run test:ui — reproduce and time-travel to the failing step. npm run test:report — if it only fails on CI, open the trace from the downloaded artifact. page.pause() + npm run test:debug — when you need to poke the live page. npm run codegen — when the real problem is "what locator should this be?". Notice none of these involve sprinkling console.log or waitForTimeout. The tools show you state directly. Next up That wraps Part 1 — you can locate, assert, model pages, handle forms and dialogs, and debug all of it. Part 2 is the heart of the course: we turn these Page Objects and our repeated setup into fixtures, so a test just asks for loginPage (already constructed) or user (already seeded) and gets it. Chapter 7 — Custom fixtures: beyond beforeEach. Tag: ch-07. Following along? Star the repo and tell me which Playwright debugging tool you reach for first.

1 day ago

DEV

Dev.to

Locators & Web-First Assertions (Playwright + TypeScript, Ch.3)

In Chapter 2 we wrote our first tests and hit two bugs. Before we add more, we need the one skill everything else rests on: finding elements reliably. Get this right and your tests survive redesigns; get it wrong and they break every sprint. Code for this chapter is tagged ch-03 in the repo: https://github.com/aktibaba/playwright-qa-course — see src/tests/ui/locators.spec.ts. Locate the way a user perceives The brittle instinct is to grab elements by their structure — CSS classes, nth-child, XPath. All of that changes the moment a developer touches the markup. Playwright's recommended locators instead target what a user (and a screen reader) perceives: the role, the label, the visible text. Use them in this order of preference: getByRole — the role + accessible name (covers the vast majority of cases) getByLabel — form fields by their <label> getByPlaceholder — inputs without a label getByText — non-interactive content getByTestId — a deliberate data-testid, only when nothing semantic fits Here's the top of the priority list, live against Inkwell's home page: import { test, expect } from "@playwright/test"; test("prefer role-based locators over CSS", async ({ page }) => { await page.goto("/"); await expect(page.getByRole("button", { name: "Global Feed" })).toBeVisible(); await expect(page.getByRole("link", { name: "Sign up" })).toBeVisible(); await expect(page.getByRole("heading", { name: "inkwell" })).toBeVisible(); }); getByRole("button", { name: "Global Feed" }) asserts two things at once — that an element with the button role exists and that its accessible name is "Global Feed". If a dev swaps the <div class="feed-btn"> for a real <button>, this locator keeps working; a CSS selector wouldn't. Strict mode is your friend Playwright locators are strict: if a locator matches more than one element, the action throws instead of silently picking the first. That catches ambiguous tests before they pick the wrong element in production. Inkwell shows the brand "inkwell" as a link in both the navbar and the footer — a perfect example: test("strict mode forces you to disambiguate", async ({ page }) => { await page.goto("/"); const brand = page.getByRole("link", { name: "inkwell" }); await expect(brand).toHaveCount(2); // two matches — a bare click would throw await expect(brand.first()).toBeVisible(); }); .first() is the quick escape hatch, but the better fix is usually to scope the search so it's unambiguous — locate within a region first: // Only the navbar's brand link, unambiguous by construction. const navBrand = page.getByRole("navigation").getByRole("link", { name: "inkwell" }); Scoping (locator chaining) is how you keep locators readable as pages grow. Web-first assertions: stop writing waits The biggest source of flaky tests is timing — asserting before the app has rendered. Playwright's expect(locator) matchers are web-first: they auto-wait and re-poll until the condition is true or a timeout hits. You almost never need waitForTimeout. test("form locators: placeholder, role, and state", async ({ page }) => { await page.goto("/#/login"); const email = page.getByPlaceholder("Email"); const submit = page.getByRole("button", { name: "Login" }); await expect(email).toBeVisible(); // waits for it to appear await expect(submit).toBeEnabled(); // waits for it to become enabled await email.fill("playwright@test.io"); await expect(email).toHaveValue("playwright@test.io"); }); A few you'll reach for constantly: toBeVisible, toHaveText, toHaveValue, toHaveCount, toBeEnabled, toHaveURL. Crucially, pass the locator to expect, not a resolved value — expect(locator).toHaveText("x") re-polls, while expect(await locator.textContent()).toBe("x") snapshots once and reintroduces the flake you were trying to avoid. Filtering and scoping a list Real pages have lists. After a reset, Inkwell's home feed shows the seeded "Welcome to Inkwell" article. You locate within the list rather than by position: // Each article preview is a card; find the one we want by its heading. const card = page .locator(".article-preview") .filter({ hasText: "Welcome to Inkwell" }); await expect(card.getByRole("heading", { name: "Welcome to Inkwell" })).toBeVisible(); await expect(card.getByRole("link", { name: "alice" }).first()).toBeVisible(); Note: .article-preview is a CSS locator — the card wrapper has no semantic role, so this is a legitimate use of CSS to scope, after which we go back to role locators inside it. CSS as a scalpel, not a crutch. The habits to keep Reach for getByRole first; drop down the list only when you must. Let strict mode push you toward scoped, unambiguous locators. Assert on locators, never on snapshotted values — auto-waiting is the whole point. Delete every waitForTimeout you're tempted to write. Run it npx playwright test locators ✓ [ui] prefer role-based locators over CSS ✓ [ui] strict mode forces you to disambiguate ✓ [ui] form locators: placeholder, role, and state 3 passed Next up We now locate and assert cleanly — but the steps still live inside tests. In Chapter 4 we introduce the Page Object Model: wrapping these locators and actions behind names like loginPage.loginAs(user), so tests read as behavior and a UI change has exactly one place to fix. Tag: ch-04. Following along? Star the repo and share the worst CSS-selector test you've had to maintain.

1 day ago

Karmelo Anthony convicted of murder in Austin Metcalf stabbing trial

Karmelo Anthony is found guilty of first-degree murder in the fatal stabbing of 17-year-old Austin Metcalf during an Apr...

7 sources 4 days ago

Health

Judge spares prison for daughter convicted of stealing £43,000 from mother with dementia

A 49-year-old woman, Catherine Barningham, is sentenced for taking control of her 78-year-old mother’s affairs and steal...

2 sources 57 minutes ago

Health

Most GPs report practising defensive medicine amid rising complaints

Multiple reports say a large majority of general practitioners have adjusted their clinical behaviour in response to an...

2 sources 1 hour ago