
Great question. Honestly it’s still our biggest pain point.
Current approach:
- Pre-generate a CSV of test user credentials (10k accounts in our staging environment)
- k6 reads from the CSV using SharedArray so each VU gets a unique user
- Product/inventory data is refreshed from a staging snapshot weekly via a script
It’s not elegant, but it works well enough that our load results are meaningfully realistic. The alternative — fully synthetic data — produces numbers that don’t translate to production at all, in our experience.


The “performance theater” description is painfully accurate and way more common than people admit.
On the staging environment problem — proportional scaling is the key concept that helped us. You don’t need a full production replica, you need a consistent environment and you need to interpret results relatively, not absolutely.
If staging is consistently 2x slower than production under equivalent load, that ratio becomes your calibration factor. You care less about the raw p95 number and more about whether it changed compared to the last run.
The regression detection piece is more valuable than the absolute number anyway.
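To make that concrete, here's a rough sketch of the run-over-run comparison. The function names, numbers, and the 15% tolerance are all illustrative, not anything from a real pipeline:

```javascript
// Relative regression check: compare this run's staging p95 against the
// previous run's baseline, rather than against an absolute production SLO.
function regressionRatio(currentP95Ms, baselineP95Ms) {
  return currentP95Ms / baselineP95Ms;
}

// Flag a regression when latency grows past a tolerance (here, 15%).
// The tolerance absorbs normal run-to-run noise in a shared staging env.
function isRegression(currentP95Ms, baselineP95Ms, tolerance = 0.15) {
  return regressionRatio(currentP95Ms, baselineP95Ms) > 1 + tolerance;
}

// Staging may run ~2x slower than production in absolute terms, but the
// run-over-run ratio is what you act on:
console.log(isRegression(470, 400)); // true  (17.5% slower than last run)
console.log(isRegression(420, 400)); // false (5% is within tolerance)
```

The same idea works on whatever summary metric your k6 output gives you; the point is that the comparison is against your own baseline, so the staging-vs-production gap cancels out.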