Autonomous Software Testing Explained: A Benchmark for AI-Driven QA

June 10, 2026 · 6 min read · Testing Guide

Blog / Insights /

Contributors Updated on

Learn with AI

Facebook

X (Twitter)

Mail

Learn with AI

We 've been believe long and hard about software testing and mechanization here at Katalon. We 've comment a shift in the Software Testing industriousness from Test Automation toSelf-directed Software Testing (AUST). By `` autonomous, '' we mean software testing that is completely drive by Artificial Intelligence (AI) and Machine Learning (ML), without any human intervention. & nbsp;

We figured the industry shift from Test Automation to AUST could either be messy — a sort of Wild West approach to product festering — or goal-driven, something to which the entire Software Testing community could aim in a spirit of cooperation. That 's where self-driven auto — or self-reliant vehicles, as the mobility industry wish to telephone them — come in. & nbsp;

The measured advances in autonomous vehicles have inspired us to gather a way to benchmark advancement toward autonomy in the package testing arena. We call the benchmark the Autonomous Software Testing Model (ASTM). & nbsp;

We wanted a way to gauge just where software quiz platforms are in their development of `` self-driving '' software testing — Katalon included. We seem to theSociety of Automotive Engineersfor guidance on build an ASTM benchmark.

Building a software testing benchmark

The benchmark for the maturation of autonomous vehicles that the Society of Automotive Engineers (SAE) created observes that humans will be regard in the evolution of the technology as a matter of course. In other lyric, autonomous vehicle will not dead pop out of a black box overnight. Instead, humans will be involved with developing, grooming, and testing the engineering for some time to come.

The ASTM framework, so, sets out six stages of human-machine desegregation. Zero (0) represent full manual plan and effectuation of testing procedures. The highest level, Six (6), sees full AI/ML designing, implementation, and decision-making of test:

Level 0: Manual Testing

The human performs all activities and actions and makes all decisions concerning examination of the System Under Testing (SUT). & nbsp;

Flat 1: Assisted Test Automation

The human do most actions for testing the SUT with the support of the computer, e.g., automated package tools. The calculator conduct out sure examine actions automatically under the human ’ s control. The human holds the master character and makes all testing decisions. & nbsp;

Level 2: Partial Test Automation

Both the human and the computer perform testing activity and generate possible decision options, but the human yet do most testing decisions. The computer carries out testing activeness based on the decisions made by the human. & nbsp;

Flat 3: Integrated Automated Testing

The computer generates a listing of decision options, selects an option, and performs actions based on this option if the human approves. The human can also select another decision option for the computer to take out. & nbsp;

Flat 4: Intelligent Automated Testing

The calculator generates decision options, ascertain the best option, and performs testing action according to this choice. The human can withal intervene if necessary. & nbsp;

Unwavering 5: Autonomous Testing
Pro tip: Tools like SUSA can handle this autonomously — upload your app and get results without writing a single test script.

The calculator has total control of the testing operation for the SUT, including making decision and carrying out all testing actions. The human can not interpose.

The software testing lifecycle

We felt we needed to ground the ASTM still more soundly in the software testing cycle. Defining each stage of software test would enable us to apply the benchmark to software testing platform on the marketplace. We identified the nine activity areas that define software testing as:

Test Management (TMN)
Test Orchestration (TOR)
Test Maintenance (TMA)
Test Generation (TGR)
Test Optimization (TOP)
Test Monitoring (TMO)
Test Execution (TEX)
Test Management (TMN)
Test Evaluation and Reporting (TRE)

Self-reliant software testing model self-assessment

We applied the ASTM benchmark to Katalon. & nbsp; We nock the Katalon functionality on a spider diagram to gauge the stage to which Katalon has develop its autonomous lineament.

In particular, Katalon rates 3rd-level sovereign activity in the areas of Test Orchestration (TOR), Test Maintenance (TMA), Test Monitoring (TMO), and Test Execution (TEX). The benchmark imply that while the can give a list of decision-testing pick for these activities, the system still postulate human approval to act on the alternative; or, alternatively, the human can select another decision option for Katalon to act on. & nbsp;

If you are a Katalon user, where would you place our platform 's grade of self-reliance? & nbsp;

If you would wish to learn more about how can:

velocity up software deployment
increase the accuracy of your package prove try
trim overall software development cost,

then contact us for a demo. & nbsp;

Download our academic paper entitled for a fuller description of the ASTM benchmark. The paper also includes:

Full descriptions of the software screen lifecycle activity
Charts that cross-reference ASTM stages with standard testing activities
A look at how the ASTM benchmark gauges former package testing platforms

More to say:

Explain

FAQs

What is the Autonomous Software Testing Model (ASTM)?

The Autonomous Software Testing Model (ASTM)is a benchmark create to measure advancement towardAutonomous Software Testing (AUST)—testing that is fully drive by AI and ML with no human intervention. Inspired by the Society of Automotive Engineers ’ model for autonomous vehicles, ASTM defines point of autonomy so the software testing community can track and compare maturity in a structured way.

What are the autonomy levels in the ASTM?

ASTM definessix stages of human–machine integration, from Level 0 to Level 5:

Level 0 – Manual Testing:Humans execute all actions and determination.
Level 1 – Assisted Test Automation:Tools support the human, but humans still do most actions and make all decisions.
Level 2 – Partial Test Automation:Both human and estimator act; the human still make most decisions.
Level 3 – Integrated Automated Testing:The computer proposes alternative and act on a choose alternative with human approval, or based on an alternate option chosen by the man.
Level 4 – Intelligent Automated Testing:The calculator chooses and accomplish the good pick, with humans still able to interfere.
Level 5 – Self-reliant Testing:The calculator has full control over decisions and all testing actions, and humans can not intervene.

Which parts of the package testing lifecycle does ASTM utilize to?

To ground ASTM in real workflows, the poser is applied acrossnine action country, including:
Test Management (TMN), Test Orchestration (TOR), Test Maintenance (TMA), Test Generation (TGR), Test Optimization (TOP), Test Monitoring (TMO), Test Execution (TEX), and Test Evaluation and Reporting (TRE).These areas allow platforms to be evaluated for autonomy in specific parts of the testing lifecycle.

How does Katalon currently score on the ASTM benchmark?

Katalon ’ s self-assessment use ASTM displayLevel 3 (Integrated Automated Testing)autonomy in country such asTest Orchestration (TOR), Test Maintenance (TMA), Test Monitoring (TMO), and Test Execution (TEX). At this stage, the program cangenerate decision options and act on them with human approval, or execute an alternative option opt by a human tester.

How can QA teams use the ASTM benchmark in practice?

QA teams can use ASTM togauge how “ self-driving ” their examination platform genuinely are, compare capabilities across tool, and plan how to move from lower to higher autonomy levels. They can also dive deeper by downloading the“ Autonomous Software Testing Model ” pedantic newspaper, which includefull lifecycle activity description, cross-reference charts, and examples of how ASTM can be expend to evaluate different testing platforms, including Katalon.

Contributors

The Katalon Team is composed of a divers radical of dedicated professionals, including subject matter experts with deep field knowledge, experienced technological writers skilled, and QA specialists who wreak a practical, real-world view. Together, they contribute to the Katalon Blog, render high-quality, insightful articles that empower users to make the most of Katalon ’ s instrument and stay updated on the late trend in exam automation and software quality.

Automate This With SUSA

Upload your APK or URL. SUSA explores like 10 real users — finds bugs, accessibility violations, and security issues. No scripts needed.

Try SUSA Free

Test Your App Autonomously

Upload your APK or URL. SUSA explores like 10 real users — finds bugs, accessibility violations, and security issues. No scripts.

Try SUSA Free

Autonomous Software Testing Explained: A Benchmark for AI-Driven QA