8 comments

  • Laurel1234 12 minutes ago
    Seems interesting, but I wonder about this:

    > Traditional E2E tests are slow to set up and expensive to maintain.

    Isn't this just using agents to create e2e tests or is there some better new approach I'm missing?

    • okwasniewski 1 minute ago
      We use agents to navigate the app, making real-time decisions based on its state. I prefer to compare it more to a manual QA engineer than to static e2e tests. We spent a lot of time on the harness to make sure the results are reliable. This allows you to assert on dynamic content like AI-generated content. We also support validation of email flows since the agent can read its own email.
  • dbbk 1 hour ago
    "Traditional E2E tests are slow to set up and expensive to maintain." I don't really understand this. If I'm already using Opus to write the code, surely it would know best what E2E tests to write to be able to verify its own output? This seems like an unnecessary external step.
    • okwasniewski 1 hour ago
      Unfortunately, in our experience, tests don’t scale as well as code. First of all, static tests are very brittle: you rely on selectors, need wait times, and can’t really test a lot of dynamic content (think AI chats/interactions). Then there’s all the infrastructure around it: solving captchas, handling auth, handling email OTP (each of our agents has access to its own inbox), and handling video recording and screenshots. So with the traditional testing approach you end up mocking a lot of services. I highly recommend giving it a try!
  • msencenb 1 hour ago
    Have you been able to nail down a loop where your tool can take an open PR, guess the code path, and do some testing?

    We use Cypress heavily for our core flows. It has a similar AI prompt feature, but it’s not quite ad hoc enough for smaller fixes, which is where the bottleneck still comes in for us.

    • okwasniewski 40 minutes ago
      Yes! We spent quite a lot of time on this, and we are currently creating a test plan based on PR changes and sending an agent to verify it. We have some customers who are only using this feature.
  • yohguy 1 hour ago
    Does it work for native mobile applications or Expo apps that have native modules?

    Pricing question: the usage on the plans seems low. In the demo you said that you have 25 tests per PR, which would mean you get only 10 PRs per month on the hobby plan?

    • okwasniewski 1 hour ago
      Yes, it works for any framework. We just get the built native binary and run it in the cloud.

      Regarding pricing, the self-serve options are currently only for lower usage. We will add more plans further down the line. Currently the most popular one is the startup plan. If you need more usage, I’m happy to discuss it on a call!

  • rpunkfu 29 minutes ago
    Congratulations on the launch! I’ve been tracking your progress since you were accepted into the spring batch.

    Always happy to see cool products from Poland! :)

  • iknownthing 41 minutes ago
    .army?
    • okwasniewski 39 minutes ago
      We are thinking about whether to change this. We also have testerarmy.com/.ai