Why Property-based Testing?

Take a look at property-based testing and its advantages over standard testing techniques.

We'll cover the following...

Testing
Property-based testing

Promises of property-based testing

Example 1: Project FIFO
Example 2: Google’s levelDB

Testing

Testing can be a boring task, but it’s a necessity we can’t avoid. Tests are critical to creating a reliable program, especially one that changes over time. They can also prove useful in helping design programs. They are ideal for helping us write tests as users as well as implementers. But mostly, tests are repetitive and sometimes burdensome work.

Take a look at this example test that checks that an Erlang function can take a list of presorted lists and always return them merged as one single sorted list:

merge_test() ->
    [] = merge([]),
    [] = merge([[]]),
    [] = merge([[],[]]),
    [] = merge([[],[],[]]),
    [1] = merge([[1]]),
    [1,1,2,2] = merge([[1,2],[1,2]]),
    [1] = merge([[1],[],[]]),
    [1] = merge([[],[1],[]]),
    [1] = merge([[],[],[1]]),
    [1,2] = merge([[1],[2],[]]),
    [1,2] = merge([[1],[],[2]]),
    [1,2] = merge([[],[1],[2]]),
    [1,2,3,4,5,6] = merge([[1,2],[],[5,6],[],[3,4],[]]),
    [1,2,3,4] = merge([[4],[3],[2],[1]]),
    [1,2,3,4,5] = merge([[1],[2],[3],[4],[5]]),
    [1,2,3,4,5,6] = merge([[1],[2],[3],[4],[5],[6]]), [1,2,3,4,5,6,7,8,9] = merge([[1],[2],[3],[4],[5],[6],[7],[8],[9]]), Seq = seq(1,100),
    true = Seq == merge(map(fun(E) -> [E] end, Seq)),
    ok.

This is slightly modified code taken from the Erlang/OTP test suites for the lists module, one of the most central libraries in the entire language. Here, the developer is trying to think of all the possible ways the code can be used and make sure that the result is predictable. We can probably think of another ten or thirty lines that could be added, that are significant and explore the same code in somewhat different ways. Nevertheless, it’s perfectly reasonable, usable, readable, and effective test code. The problem is that it’s so repetitive that a machine could do it. In fact, that’s exactly the reason why traditional tests are boring. They’re carefully laid out instructions to tell the machine which test to run every time, with no variation, as a safety check.

Property-based testing

This is why property-based testing is one of the software development practices that generated the most excitement in the last few years. It promises better, more solid tests than nearly any other tool out there, with very little code. This means, that the software we develop with it should also get better accordingly. Although this comes with a steep learning curve, property-based testing allows us to automate the boring stuff and get consistent results as our software develops and evolves. Here’s what an equivalent property-based test could look like:

Not only is this test shorter with just four lines of code, but it also covers more cases. In fact, it can cover hundreds of thousands of them. Right now, the property-based test probably looks like a bunch of gibberish that can’t be executed —at least not without the PropEr framework—, but in due time, this should be easier and faster to read than a long-form traditional test.

In this chapter, we’ll see the results we should expect from property-based testing, and we’ll cover the principles behind the practice and how they influence the way we write tests. We’ll also pick the tools we need to get going since, as we’ll see, property-based testing does require a framework to be useful.

Promises of property-based testing

Property-based tests are different from traditional ones and require more thinking. A lot more. Good property-based testing is a learned and practiced skill, much like playing a musical instrument or using a paintbrush. We’ll always have areas to improve and constantly be finding ways to innovate our approach.

Although like any skill, property-based testing can take years to master, there are plenty of benefits for a beginner user. We’ll be able to write simple, short, and concise tests that automatically comb through our code. Our code coverage will be high and consistent even as we modify the program without changing the tests. We’ll even be able to use these tests to find new edge cases without modification.

With a bit more experience, we’ll be able to write straightforward integration tests for stateful systems that find even the most complex and convoluted bugs.

Example 1: Project FIFO

Overall, we’ll find that property-based testing doesn’t just involve using a bunch of tools to automate boring tasks, but is actually a wholly different way to approach testing and software design itself.For example, Thomas Arts’ slide set and presentation from the Erlang Factory 2016 conference mentions using QuickCheck, the canonical property-based testing tool, to run tests on Project FIFO, an open-source cloud project. With a mere 460 lines of property tests, they covered 60,000 lines of production code and uncovered twenty-five important bugs, including:

Timing errors
Race conditions
Type errors
Incorrect use of library APIs
Errors in documentation
Errors in the program logic
System limits errors
Errors in fault handling
One hardware error

Considering that some studies estimate that developers average six software faults per 1,000 lines of code, finding twenty-five important bugs using 460 lines of tests is quite a feat. That’s finding over fifty bugs per 1,000 lines of the test, with each of these lines covering 140 lines of production code.

Example 2: Google’s levelDB

Let’s take a look at some more expert work. Joseph Wayne Norton ran a QuickCheck suite of under 600 lines over Google’s levelDB to find sequences of first seventeen, then another specific sequence of thirty-one specific calls that could corrupt databases with ghost keys. No matter how dedicated someone is to the task, it would have been very difficult to come up with the proper sequence of thirty-one calls required to corrupt a database.

Again, we required a surprisingly low amount of code to find a high number of nontrivial errors on software that was otherwise already tested and running in production. Property-based testing is so impressive that it has wedged itself in multiple industries, including:

Mission-critical telecommunication components
Databases
Components of cloud providers’ routing
Certificate-management layers
IoT platforms
Cars

It’s important to remember property-based testing is not a thing reserved for advanced programmers. The effort required to improve is continuous, but the benefits of property-based testing obviously makes that effort well worth it. Just remember that each wall we hit reveals an opportunity for improvement. We’ll get there together, one step at a time.

Ask

Foundations of Property-Based Testing

Writing Properties

Thinking in Properties

Custom Generators

Responsible Testing

Properties-Driven Development

Shrinking

Stateful Properties

Case Study: Bookstore

State Machine Properties

Conclusion