That TDD Fellow | Tech Blog | Screencasts

Why Best Software Engineering Practices Are Insanely Important, Really

2022-02-05T14:43:04+02:00

“So, what software engineering practices do you like to use?” — I asked another developer the other day. And the answer has wholly shaken me:

“What do you mean by “software engineering practices?” — was the developer’s response.

When I explain what I mean by these three words (apparently, I took for granted), the person starts listing technologies. This conversation has been repeating a bit too frequently for my liking recently.

Look, I get it. Technologies knowledge and experience are vital for a software engineer. However, the fundamental practices of software engineering are insanely important nevertheless.

Let’s discover why.

Holy Grail of Software Development

Why do these practices exist in the first place, and why are countless books written on them? And why is the industry trying to improve on the existing ones and invent new ones?

Why just a bunch of technology skills is not enough?

That’s because we all want to get stuff done fast. And we want it done with a high level of quality. And we want it to be valuable.

The “Holy Grail of Software Development” is the ability to deliver fast and with high quality.

While innovative programming language or framework can improve speed and reduce the chance of making mistakes, the improvement effect is negligible in “speed x quality x value.”

What matters then?

How the software is developed creates a tremendous effect on speed and quality.

It’s the processes applied, techniques used, disciplines practiced.

Whether You Know it or Not, You’re Still Using Practices.

Just not necessarily the good ones.

Slap some code together, use a lot of copy-paste from StackOverflow and your previous project, and get it to production as quickly as possible? — That’s a software engineering practice.

It gives you great speed (at first) and, of course, reduces quality substantially. Do that for a few weeks, and you’ll inevitably slow down because you’ll constantly be debugging all the bugs produced.

Then, on the other hand, there is another practice of unit-testing and integration-testing the code and refactoring it until it becomes “Clean Code.” — This is a simple example of a few good practices applied:

Automated testing.
Refactoring.
Clean Code.

Not Every “Best” Practice is Always Good — Context is King

Let’s say that you’ve committed to using a whole bunch of best practices, such as keeping your code decoupled and flexible using design patterns. Moreover, you’ve decided to exhaustively implement and test all the different scenarios that you think could happen in the future.

These are all that many would consider best practices.

That sounds quite reasonable, however, applied in the wrong context, that could lead to the following:

The business never leveraged that extra flexibility because no need in the future came to pass.
You still had to maintain more code that was more complex to support these flexibilities for many years. That’s an extra cost — WASTE.
It took you originally longer to implement the functionality — WASTE.
Oh, and half of the scenarios you came up with also never happened. The feature changed significantly after customer feedback, so most cases became impossible later on. — WASTE.
Additionally, one of the flexibilities constrained the implementation of another flexibility that became required later on. So the business had to go for a trade-off and deliver less value to the customer. — ACTUAL BUSINESS VALUE DAMAGE.

As you can see, the idea was to increase quality, increase speed in the future, and deliver as much customer and business value as possible.

What happened instead?

The quality was excellent — that’s all good!

Now, the speed was initially slower, and it kept being slower in the future because none of the expected came to pass.

The best thing you can expect in software development is change (not the one you expected). Also, prepare to have your expectations and assumptions hammered all the time.

Finally, too much flexibility harmed the business value.

The same set of practices could’ve been fantastic in another context, where you, for example, precisely know what will happen in the future. An example scenario is when you’re rebuilding an existing successful system.

In that case, you would have gotten lots of speed, quality, and business value throughout the endeavor.

Software Engineering Practices Are Based on Principles

What are these practices based on? How to detect whether the context is proper or not? When to apply and when not? How to apply?

You guide the choice of practices and patterns by the software engineering principles!

When you adopt multiple principles (such as YAGNI, DRY, SOLID, etc.), they will guide you when you need to act in specific ways (e.g., apply one practice or another, or make this or that choice).

The challenge would be when multiple principles you’ve adopted conflict with each other.

That’s when you’ll have to make trade-offs.

Where do Principles Come From? How to choose?

Principles should be based on values. The values that you, your team, and your organization adopts.

For example, when I was talking about the “Holy Grail of SWE,” I’ve mentioned the “speed x quality x value” as the three most important ones. That’s a value statement.

Also, who said that these three values are the silver bullet to success? Nobody knows. Every person and organization can have its own set of values.

However, for any organization or group to achieve a big goal and execute a complex strategy, it would need to align everybody involved on the same set of values.

Once you and your group have agreed on what values you should adopt to achieve the challenging long-term goals, you must pick or create the principles that will get you there.

Picking existing principles that are tried and tested to improve software engineering in terms of the values you’ve chosen is the most straightforward approach. However, that shouldn’t deter you from thinking about a custom set of principles that would work for your group’s values in particular.

Example Values That I Like to Optimize For

I’ll give you a few values that I usually go for in most software products where the only constant is the change itself:

Shortest lead time — how long does it take from idea inception to value delivery to the hands of the user.
Lowest inventory — how much work is “done” but isn’t being used by anybody (by users, other developers, or other team members).
Highest quality — how many bugs or failures the software has. It should behave precisely as the development team expects it to.
Largest valuable impact — how valuable is the software to the user, customer, and the business.

Once these values are defined, it’s pretty simple to pick the principles to guide the decisions.

Example Software Engineering Principles

LEAN product and development
YAGNI/KISS
Evolutionary Design
Simple Design
SOLID
High-coverage automated testing (no manual QA)
No throwing responsibilities over to another department — Extreme ownership
Code stewardship (next level after ownership)

And given the principles, you can pick or design your software engineering practices.

Example Software Engineering Practices

TDD & BDD & ATDD
Continuous Refactoring
CI/CD
Trunk-Based Development (TBD) & “Live on Main”
Clean Code
Slicing work items as small as possible
Instant Code Review (aka Pair-programming)
Ensemble programming
Rolling pair rotation
Spike & PoC (for experimentation)
Design Patterns

Most of these are parts of original Extreme Programming, and this is no surprise because the values I have listed above are almost the same as values of XP.

Moreover, there are a few more principles and practices that are modern, and they could be considered as the evolution of XP.

Values, Principles & Practices Are Technology-Agnostic

Now I want to return to that situation where the developer responded with a list of technologies when asked about practices.

Here’s the thing: the practices, principles, and values are universal. No matter which technology you use, you can rely on your practices, given that your principles and values haven’t changed.

Moreover, when you master the practices and principles, you’ll be able to change technologies and tools like gloves while maintaining high parameters on all your values and delivering excellent results!

Conclusion

Of course, if you are starting in software engineering, knowing specific technologies is most essential for you.

That’s because your effectiveness is a product of your tools experience and the practices you apply. And to use and understand the practices, you need to have at least a bit of skill with the programming language and basic problem-solving.

However, beyond that, the equation:

TOOLS SKILL x PRACTICES SKILL x SOFT SKILLS

kicks in!

So the most effective way to develop as a software engineer is to have both a robust set of hard and soft skills and learn and apply the best software engineering practices (that are applicable in your context).

Thank you for reading!

If you have other ideas on the topic, feel free to send me a Tweet!

Are We Addicted to Complexity?

2021-09-08T14:49:04+02:00

As software engineers, we tend to be lifelong learners. Suppose our current job becomes less challenging, and there is nothing more to learn. In that case, we usually resort to one of the following:

Find a more exciting team or company.
Start something on the side in our free time.
Bring more advanced concepts to the current team and codebase.

It’s the third one that is the most worrying because it happens way too often, and it’s probably more harmful than you think.

This article isn’t preaching against new technologies, architectures, etc.

Instead, I’d like to point out that in about 95% of the cases, they are introduced years before they are actually necessary. Instead of making the software more straightforward, this makes the software more complex.

What should we do instead if we want to learn more new stuff then?

Well, one thing is to invert your focus. Instead of chasing something more complex, take on the challenge of making what you have much simpler.

In fact, making the existing codebase and architecture more straightforward than it already is — is a considerable challenge, in the process of which you are guaranteed to learn new things!

Suppose you feel the urge to learn something new and deal with a more complex task. Perhaps, you should talk to your business stakeholders to see if they have an idea that they don’t even want to mention to product & engineering. (Because they think it’s too challenging or impossible)

These types of ideas are where there is real gold for learning, innovation, and valuable complexity!

If you have other ideas on the topic, feel free to send me a Tweet!

Feedback Time at the End of the Pairing Session

2017-03-22T08:01:29+01:00

Hi there! Today I want to share the technique that we are using to improve the collaboration inside of our teams. This technique is most useful in synergy with the pairing. It is so powerful that it can remove the tension from our work and make us more productive in two or three days.

First, let me give you some context.

Pairing ~100% of the Time

At Pivotal we pair all the time. It is very rare to see anyone solo. Pairing does not end with pair-programming, it also includes other roles such as design and product. Also, pairing is, at times, cross-functional, e.g.: product designer pairs with a software engineer, or product designer pairs with a product manager, etc. I love that.

With pairing, not everything is so shiny.

Chemistry

Now and then, you need to pair with someone, you have never paired with before. That happens in the event of new hires, team member rotations between teams, and cross-team pairing sessions.

We are all humans, so, from time to time, it feels like you are not getting along very well with your pair. There is a certain amount of tension. Of course, that harms your productivity, and, also, drains your energy.

On the contrary side, the chemistry between pairs might be so good, that you are just having fun the whole day and enjoy the pairing session, and the amount of work being done is suboptimal.

The pairing session might go not so well not only because of chemistry. Pairs could have chosen a non-suitable style of pairing (one of: ping-pong, switch on red, driver-navigator, etc.). From time to time, how pairs solve the problem, can also be improved.

Five Minutes Feedback Time

That is a technique that we are utilizing all the time when we have such problems. First, ask your pair if they want to apply this technique and reserve 5 minutes at the end of the pairing session (end of the work day is the most suitable). Then, when the time comes, give each other the feedback in the format of “Pluses and Deltas”:

First, tell what went well.
Second, describe what could be improved.

When both pairs are proficient with this technique, then most of the problems can be resolved in 2-4 pairing sessions. By “proficient” we mean here: both are open to receiving the feedback, and are capable of calling out complicated things without triggering defensiveness on the receiving side.

If the whole team applies this technique daily for two or three weeks, they will have to stop using it because there will be nothing to talk about anymore. That means that the team has solved most of the problems, and there is no longer need in a daily application of such technique.

Conclusion

All these problems are not unique to pairing. They are inherent to the collaboration. Essentially, any team will have these problems. It is just that, in non-pairing environments, these problems will become apparent only after months of work. That is all while they continue harming productivity and people’s happiness for these long months.

With pairing these problems become apparent immediately. So you can start fixing them on the day one, and not after the half a year of the broken collaboration.

If you pair often, or if you have any other sort of collaboration within the team, I recommend trying out this technique. It will take some time to get proficient at giving feedback.

There is a marvelous talk from Dan North on how to provide an effective feedback in different contexts.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks, Reddit, and Hacker News, and follow me on Twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

On Being More Productive in the Morning. Even if You Are an Owl

2017-03-21T07:58:17+01:00

Hi there! Today I want to share my experience of a productivity shift from the late evening/night to the morning. First, let me tell you a story on how my day was structured before for countless years.

Mornings That Start with a Lunch

A year ago and before I was pulling out late nights almost every single day. Making an open-source contribution, working on the astounding side project idea, writing a blog post, watching a conference talk or a fun video. As an owl type, I have a lot of bright ideas at that time of day. Also, my productivity is at its top. Or at least it feels that way.

That usually meant, that I would go to work next day somewhere between 10 AM and 12 PM. Mostly, that was okay because a lot of other developers around me did the same thing.

So, imagine having your work day (for your employer, client or yourself) start this way:

Come to the work
Drink coffee and eat breakfast, because, obviously you didn’t have time to do that at home
Chit chat with your colleagues (offline or online), go through the email inbox
Do some work for half an hour
It is a lunch time
Food coma for 1 hour
Be not-so-productive (because you’re tired and an owl type) for what you have left in a day (2-4 hours)
Maybe pull out overtime because there are things to deliver and you are the most productive in the evening/night

You can see, that my work day wasn’t as productive as it should. One may say that I’m just shifting focus to my activities and invest less productivity into the work for my employer/client/etc. And I have another story to tell here.

A Lot of Rework

I have been noticing for some time that all the work that I do in the late evenings and the night for myself needs to be re-worked all the time. To put it simply, quality of my most “productive” work was not as good as I would expect it to be.

Additionally, since I was working in a pair-programming environment and everybody started their work day at a different time, it was hard to pair-program all the time - there was a lot of time I had to solo. When I have joined Pivotal I was amazed by the fact that they pair-program nearly 100% of the time. I was enjoying that right from the first day. Pairing 100% of the time also means that the whole team has to start the work day and end it at the same time. We are starting early in the morning, and ending our work day at 5 PM every day.

Also, at Pivotal we finish all of our daily meetings in 10 minutes right at the beginning of the day. We’re off to work right after. That builds a crazy momentum of making things done and makes us productive.

Productive Mornings with Momentum

That kind of momentum results in a lot of done work. The amount of work being done before lunch ultimately exceeds the amount of work I remember doing after lunch and, especially, in the evening.

Even though I feel more productive in the late evening, and in the night, practical results show that most productive work is done in the morning if I maintain that kind of momentum.

Maybe it is only about work, what about personal things that I need to do? - One might ask. I have another story for you. A couple of months ago I started developing a “mini habit”: wake up every day at 6 AM. That gives me about 50 minutes of free time before I need to go to the office. Add to that 35-45 minutes commute on the subway. Now and then, when I am talking to my pair during our break (we are using the pomodoro productivity technique), I realize that at 9:05 AM I have done all the personal things that I have to do today. These things include morning exercise, household chores, contributing to open-source, writing a blog post, reading a book, sending a letter, etc. By the way, most of these things are also mini habits.

Another thing that I am experiencing right now is that the quality of the work is much higher than before - I have to do much less reworking. I still get the most remarkable ideas in the evening and before going to sleep, so what do I do about them? Note them down, and go through them next morning or later.

Thanks

I am amazed so far, how waking up in the morning every day, earlier than you normally have to, allows to build up a significant momentum, complete all the personal tasks that need to be done and transfer that increased momentum to my work. It works very well for me, and I recommend trying it, even if you are an owl type.

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on Twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Drake's 24 Hours Challenge or How to Be Less Negative

2017-03-20T07:53:37+01:00

Did you ever look at the chunk of code that is not very pretty, and immediately uttered some bad word or bad remark, such as “Who wrote this?” And not even noticed it?

Or did you ever had an incident happening in production, and half of your vocabulary at that time was not normative? Also, did you feel stressed and down afterward?

Or did you ever had to converse with a difficult person or someone holding a strong opinion, and thought not very good about them? Or even dismissed a valid argument just because you didn’t like the conversation?

Did you ever felt or did something negative, and thought afterward that it would be better not to? Then the technique described in this article is perfect for you!

Shall we get the ball rolling?

Drake’s Challenge

About one year ago I was talking to Daniel Irvine at the Software Craftsmanship meetup in Berlin about the technique that he was applying daily. I was amazed by how challenging it is and what results it can potentially yield. Since then I’m using this technique to increase awareness of how I feel, what I think, and what I am about to say.

It gives me the ability to respond to events around me instead of reacting to them. The reaction is the immediate reply executed by the subconscious part of our brain. The response is the delayed reply produced by the concious part of our brain. Responding is much better because it gives us time to think what would be the best response to the current situation. Especially it is better when something negative or weird is happening around us.

Enough preaching, let me tell you the challenge itself. For twenty-four hours do not say anything negative and do not think negatively.

How I Apply It

Get the stopwatch application running on my smartphone.
Start the stopwatch and leave the app in the background.
Every time I notice a negative thought, or I am saying something negative, reset the stopwatch.

My Experience So Far

Over this year, I felt like I can detect the negative feeling before it manifests as a negative thought, judgement, or a phrase. That helps me to be calmer, be more patient, and not spend time on insignificant things. If I know that something makes me unhappy or confuses me, I can call it out, and fix the problem without causing people around me to feel bad about it. Additionally, it makes me feel better.

Another thing that I have learned that, as soon as I am tired, I lose these abilities and fall back to reacting instead of responding. Good night sleep helps a lot with that ;)

I highly recommend you try this challenge. Caution: it is very hard.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on Twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Build Your Own Testing Framework. Part 6: Test Suite Does Not Run All Tests!

2017-03-13T23:13:28+01:00

Welcome back to the new issue of “Build Your Own Testing Framework” series! When trying to implement better formatting, we have discovered that some of our test suites do not run all tests! Today we are going to fix that, and we will make sure that such test suite will fail if it didn’t execute all tests.

This article is the sixth one of the series “Build Your Own Testing Framework” so make sure to stick around for next parts! Find all posts of these series here.

Shall we get started?

Verify All Tests Run

We will start from the RunTestSuiteTest and run the test suite with the single test. Then we are going to assert that for that the test with the name testOk has been reported as passing:

// test/RunTestSuiteTest.js

this.testItOutputsOkForThePassingTest = function () {
    runTestSuite(function (t) {
        this.testOk = function () {
            t.assertTrue(true);
        };
    }, {reporter: reporter});

    reporter.assertHasReportedPassingTest("testOk");
};

If we run this test suite, we can see that only one test executes!

RunTestSuiteTest
    testItCallsAllTestMethods

Process finished with exit code 0

Oh, that is interesting. This test suite does not run. Upon investigating, it turns out, that process.exit(0) is being called during the runTestSuite(...) function run. That is because of the latest feature that we have implemented - “exit with an appropriate exit code (zero for success, and one for failure).” We should be able to fix that by providing the process spy in the options of the runTestSuite function that we are calling from the inside of the individual tests in the RunTestSuiteTest test suite. And we ought to alleviate this kind of mistake somehow - we need a mechanism that would alert us if not all tests have been run. Maybe something like verifyAllTestsRun: true option for the runTestSuite. For that let’s write a test:

this.testVerifyAllTestsRun = function () {
    t.assertThrow("Expected all tests to run", function () {

        runTestSuite(function SuiteWithTwoTests(t) {

            this.testWithRunTestSuite = function () {
                runTestSuite(function (t) {
                    this.testOne = function () {};
                }, {reporter: reporter});
            };

            this.testThatShouldAlsoRun = function () {};

        }, {reporter: reporter,
            process: process,
            verifyAllTestsRun: true});

    });
};

That might be a bit complex at first. Let’s take a closer look how this test is supposed to work:

First of all, we do assert that there was an assertion failure about all tests required to run.
Inside of the action for this assertion we create and run the new test suite with two tests:
- test with the runTestSuite without process spy provided
- empty test that should also execute

If we run this test, it will pass. That is unexpected because we wanted it to fail. Apparently, most inner runTestSuite is doing process.exit(0).

For that to work, we will need to be able to provide a hook into process.exit(code) function. For that, we would need to create a SimpleProcess class, that allows installation of such hooks. Let’s test-drive it!

`process.exit` with hooks

First, we should start from the normal behavior without any hooks:

// test/SimpleProcessTest.js
var runTestSuite = require("../src/TestingFramework");
var SimpleProcess = require("../src/TestingFramework").SimpleProcess;
var ProcessSpy = require("./ProcessSpy");

runTestSuite(function SimpleProcessTest(t) {
    this.testWithoutHooks = function () {
        var globalProcess = new ProcessSpy();
        var process = new SimpleProcess(globalProcess);

        process.exit(0);

        t.assertEqual(0, globalProcess.hasExitedWithCode);
    };
});

When running this test, we will get a failure about SimpleProcess being undefined. So let’s define it:

// src/TestingFramework.js

// ...
// define the class itself
function SimpleProcess(globalProcess) {

}

// ..
// and don't forget to export it
module.exports.SimpleProcess = SimpleProcess;

If we run our test suite now, we will get an error TypeError: process.exit is not a function. To fix that failure we will have to define the exit(code) method on our newly created class SimpleProcess:

function SimpleProcess(globalProcess) {
    this.exit = function (code) {

    };
}

After doing that we will get an assertion failure Error: Expected to equal 0, but got: null, as expected. To make the test pass, it would be enough to call globalProcess.exit(0):

this.exit = function (code) {
    globalProcess.exit(0);
};

If we run our test suite now, we will get no failures. That is great! Now, we can see that globalProcess.exit(0) is probably not exactly what we want to have there. We ought to pass the code parameter to the exit function. To test-drive this properly, we will have to triangulate, i.e.: add another test with the different value of the code parameter:

this.testWithoutHooks_andDifferentExitCode = function () {
    var globalProcess = new ProcessSpy();
    var process = new SimpleProcess(globalProcess);

    process.exit(1);

    t.assertEqual(1, globalProcess.hasExitedWithCode);
};

That fails as expected: Error: Expected to equal 1, but got: 0. To make it pass we can either write some weird “if” statement or we could pass the code parameter to the globalProcess.exit function. The second option is simpler. According to the third rule of test-driven development, we should go for it:

this.exit = function (code) {
    globalProcess.exit(code);
};

That change makes our test suite pass. We probably should refactor the test suite to reduce the level of the duplication by extracting common variables from the tests:

runTestSuite(function SimpleProcessTest(t) {
    var globalProcess = new ProcessSpy();
    var process = new SimpleProcess(globalProcess);

    this.testWithoutHooks = function () {
        process.exit(0);

        t.assertEqual(0, globalProcess.hasExitedWithCode);
    };

    this.testWithoutHooks_andDifferentExitCode = function () {
        process.exit(1);

        t.assertEqual(1, globalProcess.hasExitedWithCode);
    };
});

At that point, we should move on to tests for the hook installation functionality. Because right now we need only at most one hook we will not support multiple hooks at the same time - only one:

this.testCanInstallOneHook = function () {
    var aSpy = t.spy();

    process.installHook(aSpy);
    process.exit(0);

    aSpy.assertCalled();
};

When we run this test, it fails because installHook function is not defined: TypeError: process.installHook is not a function. So we should define it:

function SimpleProcess(globalProcess) {
    // ..
    this.installHook = function (aHook) {

    };
}

Upon running these tests, we get Error: Expected to be called because we didn’t call this hook yet. The simplest way to make it pass is to just call the hook from the installHook function:

function SimpleProcess(globalProcess) {
    // ...
    this.installHook = function (aHook) {
        aHook();
    };
}

While that will make the tests pass it is not the behavior that we are after. To drive out the correct behavior, we ought to check that the function is being called only after process.exit(..), not earlier. For that we will need to have a sanity-check assertion:

this.testCanInstallOneHook = function () {
    var aSpy = t.spy();

    process.installHook(aSpy);
    aSpy.assertNotCalled();

    process.exit(0);

    aSpy.assertCalled();
};

That fails as expected with the error Error: Expected not to be called. To make it pass we need to store the function in the variable and call it from the process.exit(..):

function SimpleProcess(globalProcess) {
    var hook = null;

    this.exit = function (code) {
        if (hook !== null)
            hook();

        globalProcess.exit(code);
    };

    this.installHook = function (aHook) {
        hook = aHook;
    };
}

All the tests pass now! Finally, we want to be able to uninstall the hook, so let’s write the test for it:

this.testCanUninstallTheHook = function () {
    var aSpy = t.spy();

    process.installHook(aSpy);
    process.uninstallHook();

    process.exit(0);

    aSpy.assertNotCalled();
};

To make it work it is enough to introduce this function and set hook variable back to null in it:

function SimpleProcess(globalProcess) {
    // ...
    this.uninstallHook = function () {
        hook = null;
    };
}

And all the tests will pass. Now we, also, want to replace the default value for the options.process option with the instance of SimpleProcess object. And all the tests should work as they were working before:

var simpleProcess = new SimpleProcess(global.process);

function TestSuiteRunContext(testSuiteConstructor, options) {
    // ...

    var process = options.process || simpleProcess;
    // instead of just "global.process"

    // ...
}

Installing the “verify all tests run” hook

Now, we can get back to our “verify all tests run” test. It still doesn’t fail as expected, so we need to install the hook, count all tests, count tests that had already run and compare them in the hook:

function TestSuiteRunContext(testSuiteConstructor, options) {
    // ...
    var verifyAllTestsRun = options.verifyAllTestsRun || false;
    var testCount = 0;
    var testRun = 0;

    this.invoke = function () {
        if (verifyAllTestsRun)
            installVerifyAllTestsRunHook();  // <---

        reportTestSuite();
        countAllTests();                     // <---
        runAllTests();
        finishTestRun();
    };

    function installVerifyAllTestsRunHook() {
        simpleProcess.installHook(function () {
            if (testRun < testCount) {
                throw new Error("Expected all tests to run");
            }
        });
    }

    // ...

    function countAllTests() {
        for (var testName in createTestSuite())
            if (testName.match(/^test/))
                testCount++;
    }

    // ...

    function handleTest(testName) {
        testRun++;                            // <---
        reportTest(testName);
        runTest(createTestSuite(), testName);
    }
}

At this point, this throws an error Expected all tests to run and finishes the test fully without reaching our assertThrow(..) assertion. That happens because we catch this error in the function runTest, where we mark the test as failed, log the error and ignore the error object itself from there. One way to solve this problem is to have a particular error, that can propagate up the stack:

function installVerifyAllTestsRunHook() {
    simpleProcess.installHook(function () {
        if (testRun < testCount) {
            var error = new Error("Expected all tests to run");
            error.bubbleUp = true;
            throw error;
        }
    });
}

// ...

function runTest(testSuite, testName) {
    try {
        testSuite[testName]();
    } catch (error) {
        if (error.bubbleUp) throw error;  // <---
        if (!silenceFailures) console.log(error);
        status.markAsFailed();
    }
}

Now our current test is passing, and the next test is failing with the error Expected all tests to run. That happens because we have not uninstalled the hook as soon as it has triggered. Let’s do that:

function installVerifyAllTestsRunHook() {
    simpleProcess.installHook(function () {
        if (testRun < testCount) {
            simpleProcess.uninstallHook();   // <---

            var error = new Error("Expected all tests to run");
            error.bubbleUp = true;
            throw error;
        }
    });
}

That makes the next test run, succeed and exit immediately after that with error code zero. Let’s see what will happen if we put verifyAllTestsRun: true on the top test suite here:

runTestSuite(function RunTestSuiteTest(t) {
    // ...
}, {verifyAllTestsRun: true});

That doesn’t work because we re-install different hook inside of this test and as soon as this test finishes, we uninstall it. So we have two ways out of this situation: allow multiple hooks, or move that single test to its own test suite file. I think the second options is much simpler. Also, we will add the test for the negative case, where all tests run correctly (when we provide proper process spy):

// test/VerifyAllTestsRunTest.js
var runTestSuite = require("../src/TestingFramework");
var ReporterSpy = require("./ReporterSpy");
var ProcessSpy = require("./ProcessSpy");

runTestSuite(function VerifyAllTestsRunTest(t) {
    var reporter = new ReporterSpy(t);
    var process = new ProcessSpy(t);

    this.testVerifyAllTestsRun = function () {
        t.assertThrow("Expected all tests to run", function () {

            runTestSuite(function SuiteWithTwoTests(t) {

                this.testWithRunTestSuite = function () {
                    runTestSuite(function (t) {
                        this.testOne = function () {};
                    }, {reporter: reporter});
                };

                this.testThatShouldAlsoRun = function () {};

            }, {reporter: reporter,
                process: process,
                verifyAllTestsRun: true});

        });
    };

    this.testVerifyAllTestsRun_withoutFailure = function () {
        t.assertNotThrow(function () {

            runTestSuite(function SuiteWithTwoTests(t) {

                this.testWithRunTestSuite = function () {
                    runTestSuite(function (t) {
                        this.testOne = function () {};
                    }, {reporter: reporter,
                        process: process});
                };

                this.testThatShouldAlsoRun = function () {};

            }, {reporter: reporter,
                process: process,
                verifyAllTestsRun: true});

        });
    };
});

And this new test suite passes as expected. Just to double-check that these tests verify anything, we can break them (change expected error message and change assertNotThrow to assertThrow) and see if there is a failure and if it looks as expected:

// was: t.assertThrow("some error", ...);
t.assertThrow("some error", function () {
    // ...
});
// => Error: Expected to equal some error,
//    but got: Expected all tests to run

// was: t.assertNotThrow(...);
t.assertThrow("some error", function () {
    // ...
});
// => Error: Expected to throw an error,
//    but nothing was thrown

And it fails as expected, which means that our refactored tests still work as they should.

We have just applied a neat technique here: whenever we do a major refactoring in tests, we need to make sure they are still functioning correctly. For that, we break every single one of them (by changing the assertion or breaking the production code). Then we see if they fail as we expect them to. When they don’t, we know that refactoring didn’t quite work.

Fixing test suites to run all tests

Now we can go back to the RunTestSuiteTest and see if it works as expected without that test. And it does: Error: Expected all tests to run. To fix that we need to provide a process spy in every inner call to runTestSuite. For that we will first extract {reporter: reporter} as a common variable of the test suite:

var options = {reporter: reporter};

// ...
// all the inner calls to "runTestSuite":
runTestSuite(function(t) {
    // ...
}, options);

And to make the error go away, we now can create a process spy and provide it through options:

var process = new ProcessSpy(t);
var options = {
    reporter: reporter,
    process: process
};

If we run tests now, they all pass. And we can see that they all execute. Now we just need to double-check that all tests, that have inner calls to runTestSuite have verifyAllTestsRun option enabled. The only other test suite is the FailureTest. Adding the option does not produce a failure because this test suite already uses process spy in all inner calls to runTestSuite.

Conclusion

Today we learned that it is tricky to work with process.exit or any function that can exit our program in the middle of the test. Such functions need to be mocked out completely inside of the tests. Also, we learned that it is possible to make sure we don’t forget to do that. That is quite important because, if we do forget, everything runs smoothly, and we don’t know that we made a mistake.

There is still a lot to go through. In a few next episodes we will:

Report OK and FAIL for each test;
Output carefully formatted failures to the STDERR;
Enable our testing framework to run multiple test suite files at once;
Enable our testing framework to run in a browser (it is javascript after all).

See you reading the next exciting article of the series: “Formatting the Output”!

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on Twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Learning Test-Driven Development With Javascript: Laws of TDD

2017-02-03T22:19:40+01:00

Level: Beginner

Today we are going to learn the basic principles behind the Test-Driven Development Discipline. We will learn three rules of TDD. We will learn what are the benefits of doing Test-Driven Development. And we will take a look at the example application of these laws.

" feedback " (CC BY-SA 2.0) by Sonti Malonti

Articles of these series have exercises and going through them would make for more effective learning. Also, if you feel stuck with these exercises, fell free to shoot me an email or get in touch on Twitter: @tdd_fellow.

“Learning Test-Driven Development with Javascript” is a series of articles and you, my dear reader, can shape the content by providing an invaluable feedback. To do that, shoot me an email - oleksii@tddfellow.com.

Basics of Test-Driven Development

A test is a single program, procedure or function, that executes our system under the test and verifies that it works as expected. The system under the test is any other program, part of the program or library, procedure or function. The system under the test is called SUT for short.

SUT is the code that executes with a purpose of satisfying needs of our end users. Using the term “end user” we mean actual people using our system via the user interface (graphical and non-graphical), and other automated systems using our system via application programming interface (for short, API). A different term for the “end user” is “consumer of the system.” Another term for the SUT is “production code” - we are going to use the latter, as it is used in Test-Driven Development much more often.

Test-Driven Development is based on a simple concept of writing production code only when there is a failing test that demands that production code for it to pass.

We call a test failing when there is some error happening when we execute it. Possible failures are the following: there is a syntax error, or code does not compile, or there is a runtime failure during test setup or production code execution, or there is an assertion error. The assertion failure is the particular kind of error that happens during the final phase of the test, where we are verifying the outcomes after running the production code under the test. An assertion error signals that the production code executed successfully, but it produced incorrect results or changed the state of the system in a wrong fashion.

We call a test passing when there are no errors happen when we execute it. We, also, call the failing test “a red test” and we call the passing test “a green test.” That is because we can not deliver the code to our customers or consumers when there are “red” tests - think of red traffic light, and we are free to go when all the tests are “green” - think of green traffic light. Most of the testing tools and frameworks format their output to present failing tests in the red color and passing tests in the green color.

Multiple tests aiming to test single SUT or a particular feature of the system are usually called test suite. Depending on the context, test suite could mean that collection of tests all testing the same thing, or it could mean all tests of the entire system. For example, in the sentence “Let’s read User class’ test suite” that phrase means a collection of tests testing class User. On the other hand, in the sentence “Let’s run the whole test suite and see if we can deploy that right now” that phrase means all tests of the entire system. The latter, sometimes, is called “suite of tests.”

When the system behaves in an unexpected way, and the expected behavior was previously defined or present in the code, it is called a “bug.” For example, the behavior that is specified by the development team and not implemented correctly considered a bug. The behavior that is defined by the development team and implemented correctly, but now it is not working, considered a bug. And, finally, the behavior that was not defined may or may not be considered as a bug. The latter depends on the produced results - if it harms or brings any value. This phenomenon is called a “bug” for historical reasons: the first bug in computing was a real bug, that stuck in the computer’s hardware and was causing short circuits which made the computer misbehave.

Three Rules of Test-Driven Development

In the core of Test-Driven Development there are three steps that we need to follow:

We start with a failing test. We consider any error a failure, including compilation and syntax errors. Meaning, that our first test for any part of the production code will fail for the reason that there is no production code to execute yet. For example class, method or function is undefined.
Then, we resolve the failure by writing the production code. For example: in the case of missing class, we would create a class; in the event of a missing method, we would create it; and in the instance of a failing assertion error, we would fix the logic to pass the test.
We write no more than the simplest production code, which makes our current failing test pass, and, also, still passes all other tests.

We repeat these steps over and over until we finish the implementation of the system under the test. Strictly following these steps will lock us in a very tight loop, where we will be switching between test code and production code all the time: write one or two lines of the test code and write or change one or two lines of the production code, repeat. This cycle is, probably, twenty or thirty seconds long. It, of course, depends on how fast we can run our tests. Ideally, we want our test suite for the current system under the test to run as fast as one clap of hands, or blink of an eye.

Feedback Loop Benefits

This tight cycle gives us following benefits:

Alerts us to the mistakes we do while coding immediately. Usually, it is enough to do one or two “Undo” commands in our editor to get back to the “green” state of the system - when all our tests are passing (occasionally, except for the last test we have added, as we will undo the incorrect implementation for it).
Provides the safety net from bugs for the future development thanks to the fact, that no single line of production code is written other than to pass a failing test. Meaning, that whenever production code is broken in any fashion, there is a test that will point it out via failure. When adding new features or modifying the behavior of the existing ones, we might inadvertently introduce a bug. Our suite of tests will detect that bug as soon as we run all our tests, which we would certainly do during development and, especially, before the deployment or release of our systems. This is, basically, a near 100% test coverage. Surely, there are subtle and complex bugs that involve whole sub-systems and 3rd-party systems to come together in some special edge case to appear - rarely, such bugs are not prevented by Test-Driven Development. Of course, we are talking about 1-3 such bugs per year - this is much better than having a bug tracker, that contains thousands of unresolved bugs. Who said it was a good idea to have a database that stores the bugs in our system? - When we apply TDD correctly, and our bug counts are so small, as 1-3 per year, we should treat bugs as “Red Alerts” and concentrate all the effort on shooting them down. Writing a test for these bugs when we already figured them out allows us to programmatically prove, that the system, indeed, does misbehave in a certain edge case, and prevents this bug from reoccurring after we have fixed it.
Gives the confidence to refactor freely and ruthlessly. Essentially, allows changing the software design of the system on the go without applying the big design upfront (BDUF). Let’s take a look at the example. Imagine, we have a system with a perfect design and without any tests. Over time the design will degrade until the code becomes nearly impossible to work with. Now, imagine, we have a system with bad design and close to 100% test coverage. Over time we can improve the design of the application gradually while responding to the changes in the needs of our end users. And we will be faster as the time passes.
Makes the development team always ready to deploy. While practicing test-driven development, development teams will keep latest implemented features in the version control system for the source code in the state, that is well-known to be “green.” This condition of the system is deliverable to the end users with one push of the button. That, also, enables teams to do continuous delivery: as soon as the code is checked-in in the version control system (VCS), it is going to be automatically deployed to production and our end users will get the value out of it sooner.
Promotes simplicity and alleviates the scope creep. “Oh, we don’t need that yet” is the most used phrase in the vocabulary of the typical test-driven development practitioner. Tests that we write specify only the useful functionality that we need to deliver the value to our end users. These tests don’t talk about performance optimizations, security concerns, rare edge cases, “re-usable” abstractions (that, usually, are harder to understand and are difficult to re-use), etc. These things need to be implemented only when there is an actual end user value to be achieved, or the system is obviously running slower than it should, or there is an actual security concern. In that case, a very specific test (or tests) will be written to drive out that additional functionality. Since the most software development cost is in the maintenance of the existing functionality leaving out as much functionality (and code) as possible is a good idea.
Promotes cleaner interfaces. As in test-driven development, we have to write the usage example before we implement (or even design) the API in the production code, it will be optimized for ease and clarity of use and less so with the internal implementation details of that production code. Generally speaking, test-driven production code is easier to use and understand.
Tests that call the production code serve as usage examples of that code. Essentially, in a test-driven code base, to know how a particular API can be used, it is enough to read and understand its test suite. These tests are the perfect low-level documentation for the APIs in our code base that developers can understand and read best. And that documentation is so precise that it can be executed and verified that it is still in sync with the actual code it is describing. Therefore this documentation is always up-to-date.

The most important of these benefits is the confidence to make any change to the software and know in one minute or two, if that change is good to be delivered to the end user or not, with a simple push of the button. No quality assurance (QA) manual testing cycles are required. Let’s take a look at the example of the application of three rules of test-driven development. We will start from a very simple example, so that we don’t have to touch more advanced TDD techniques.

English Numbers Kata

Given an integer number
When I call the system with that number
Then I receive a string representation of that number in an English language

For example:

for number 37 I receive "thirty-seven"

for number -17451 I receive "minus seventeen thousand four hundred fifty-one"

According to the first rule of TDD, we have to start the implementation from the test. In test-driven development it is important to start from the simplest tests, that can be implemented via a small, simple change to production code. For example: with number zero we expect the result to be “zero.” When writing the first test for the new functionality we are going to design its API. In our case we are going to come up with the function name and its argument list:

describe("toEnglishNumber", function() {
    it("converts 0 to zero", function() {
        // ARRANGE
        var number = 0;

        // ACT
        var englishNumber = toEnglishNumber(number);
    });
});

As soon as we write toEnglishNumber(number) we have designed the function’s signature, at least, for the single simplest case. Also, the test suite is failing now, because toEnglishNumber is not a function - in fact, it is undefined. That means that we have entered the red stage of test driven development and according to the second rule of TDD we have to switch back to the production code. And according to the third rule, we have to write just enough of it to make the failing test pass. That means writing the simplest and easiest code possible to make it succeed. In this case, we could just return nothing (null in javascript):

function toEnglishNumber(number) {
    return null;
}

This is going to turn our test suite back to the green stage. At this point it is a good idea to look at both test code and production code and see if there are any opportunities for refactoring, such as better names, extracting methods/functions, clarifying variable names, de-duplication, etc. Because we are currently in a green state, we can safely apply any refactoring, automated or manual, and see if it was successful by running the test again. If for some reason, the test is failing after the refactoring, we always have an option to CTRL/CMD+Z back to the green state, back to safety. At this point, we have finished applying one cycle of rules of TDD. Now we need to start over. We go back to our test code. There we extend the existing test to be more specific. Or we add more tests. Since we didn’t finish writing the test yet - we have only “arrange” and “act” parts, and we are missing the “assert” part of the test, - we ought to extend the current test to make it more specific. So what can we assert about the result of the toEnglishNumber call? We probably ought to return the string “zero”, aren’t we? So, let’s make an appropriate assertion:

it("converts 0 to zero", function() {
    // ARRANGE
    var number = 0;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "zero";
    expect(englishNumber).toEqual(expected);
});

If we run our tests right now, they will fail. We should take a careful look at the failure and see if the failure message is readable and it is what we expected. In the current situation, we will receive an assertion failure, telling us that null was not equal to “zero.” Imagine now, that we are working on some other feature and we are not in the context of the English number conversion feature. What would be our reaction, when we run this test and see this failure? - We will probably be confused for a moment and will resort to jump to the line in the source code of the test suite that produces that error and try to understand what happened there. That is a huge context switch, and it will disrupt our current flow. So we can do better and make the failure much more readable by just adding a small thing - a cue, what that null was supposed to be: “english number”:

expect(englishNumber).toEqual(expected, "english number");

In Jasmine, the second argument to the toEqual matching function is the description of the failure. I think our test looks great, and so is the failure in the test output. And because we are having a good failing test, according to the second rule, we don’t have to write any of test code. Now, we should make the test pass according to the third rule. And what would be the simplest way to make it work? - Return “zero”:

function toEnglishNumber(number) {
    return "zero";
}

At this point, we should look out for the refactoring opportunities, and I don’t think there are any yet. So let’s start the cycle over. To apply the first rule, we might just copy the existing test and change the input and the expected output accordingly. In that case, we would want to test another simple one-digit number - “one”:

it("converts 1 to one", function() {
    // ARRANGE
    var number = 1;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "one";
    expect(englishNumber).toEqual(expected, "english number");
});

Now, because the test is failing, we apply the second rule and switch back to the production code. To make the test pass, according to the third rule, we will need to introduce the simplest change possible: in that case, an “if” statement (and we will use === instead of == for comparison to avoid implicit type conversions in javascript):

function toEnglishNumber(number) {
    if (number === 1) {
        return "one";
    }

    return "zero";
}

That is going to make all our tests pass. If we continue adding tests for the single-digit numbers while following these three rules, we will wind up with something like that:

function toEnglishNumber(number) {
    if (number === 1) {
        return "one";
    }

    if (number === 2) {
        return "two";
    }

    if (number === 3) {
        return "three";
    }

    if (number === 4) {
        return "four";
    }

    if (number === 5) {
        return "five";
    }

    if (number === 6) {
        return "six";
    }

    if (number === 7) {
        return "seven";
    }

    if (number === 8) {
        return "eight";
    }

    if (number === 9) {
        return "nine";
    }

    return "zero";
}

At this point, we can see a clear pattern: one-to-one correspondence of a single-digit integer to the string. This code can be simplified as a pre-defined array of strings at the corresponding indices:

var singleDigitNumbers = [
    "zero",
    "one",
    "two",
    "three",
    "four",
    "five",
    "six",
    "seven",
    "eight",
    "nine"
];

We also include string “zero” in that array because return "zero" is only happening when the number is equal to zero since we don’t have any other tests right now - only from zero to nine. And the function itself will look much simpler:

function toEnglishNumber(number) {
    return singleDigitNumbers[number];
}

Since we are done with the refactoring, we should proceed using the first rule of test-driven development, which means we need to write a test that fails. So let’s increase the complexity of our tests and go for teen numbers. We’ll start from ten:

it("converts 10 to ten", function() {
    // ARRANGE
    var number = 10;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "ten";
    expect(englishNumber).toEqual(expected, "english number");
});

That will fail because so far our production code tries to fetch a string representation of the english number from the array using the number itself as an index. At the index ten, we don’t have anything, so our function returns nothing. According to the second rule of test-driven development, we need to switch to production code to make it pass. There are a few ways to fix the current problem: add specific if statement to the production code, or add a string “ten” to the array. Second seems to be simpler, and we know that we can do it for the teen numbers because they can not be composed of any other small parts. So, according to the third rule, we should go for it because it is a much simpler solution:

var singleDigitNumbers = [
    // ...
    "ten"
];

That makes our tests pass. And I think we have a refactoring opportunity: variable name singleDigitNumbers does not make any sense anymore - it contains not only single digit numbers, but, also, number “ten,” which is a two-digit number. So what is common between single digit numbers and ten? They are simple numbers, i.e.: can not be composed out of other english number string representations. Let’s just call them simpleNumbers in this case:

var simpleNumbers = [
    // ...
];

function toEnglishNumber(number) {
    return simpleNumbers[number];
}

After making this refactoring, we need not forget to run the test suite to see if we didn’t make a mistake. When we run it, then all our tests pass. So we can go back to the first rule of test-driven development again. Going through this cycle again for a few times will produce tests for numbers from eleven to nineteen. The production code will only have those numbers’ english string representations added to the simpleNumbers array:

var simpleNumbers = [
    "zero", "one", "two", "three", "four",
    "five", "six", "seven", "eight", "nine",

    "ten", "eleven", "twelve", "thirteen", "fourteen",
    "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"
];

function toEnglishNumber(number) {
    return simpleNumbers[number];
}

Now it is time to introduce the concept of a complex number, such as twenty-three. It is a number that consists of the “tens” part and single-digit part. According to the first rule we have to start with the failing test, and I think we should just go for the number twenty-three:

it("converts 23 to twenty-three", function() {
    // ARRANGE
    var number = 23;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "twenty-three";
    expect(englishNumber).toEqual(expected, "english number");
});

If we run our test suite, that test will fail. According to the second rule of test-driven development, now we have to switch to the production code. According to the third rule, we will have to choose the simplest code that makes this test pass (and doesn’t break any other test). In our case, we have multiple options, which are similar in their simplicity. One of them is to return “twenty-three” if the number is greater than or equal to twenty:

function toEnglishNumber(number) {
    if (number >= 20) {
        return "twenty-three";
    }

    return simpleNumbers[number];
}

If we run our test suite, all tests will pass. Now we should see if there are any opportunities for making the code more readable and easier to understand. While the whole if statement returning a constant might feel strange, there is a concept that we can already give a name to in there: “three.” We already can obtain a string “three” from number three using the method toEnglishNumber(number). Let’s try this refactoring:

function toEnglishNumber(number) {
    if (number >= 20) {
        return "twenty-" + toEnglishNumber(3);
    }

    return simpleNumbers[number];
}

That code now looks interesting. And it passes all its tests. Since, of course, we are not done yet with the implementation, according to the first rule of test-driven development, we ought to write another failing test. And we have a multitude of choices what it could be, we can just come up with other random two-digit number, such as forty-two, or we could leave the “twenty-” part in and change the “three” part to “seven,” for example. Also, we could change “twenty-” part to “thirty-.” Generally, in test-driven development it is better to go for the test, that will cause the smallest change to the production code, - later we will explore more on why that is. So, we could go for twenty-seven, as it will cause the smallest change to our production code:

it("converts 27 to twenty-seven", function() {
    // ARRANGE
    var number = 27;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "twenty-seven";
    expect(englishNumber).toEqual(expected, "english number");
});

This test is failing, as expected. According to the second rule, we have to switch to the production code and make it pass. Also, the simplest change (third rule) that we could do is to change “3” to the last digit of the number, the remainder of the division by ten - “number % 10”:

function toEnglishNumber(number) {
    if (number >= 20) {
        return "twenty-" + toEnglishNumber(number % 10);
    }

    return simpleNumbers[number];
}

Now if we run our test suite all the tests will pass. “The remainder of division by ten” part looks right and “twenty-” constant still feels like it is not gonna work for every two-digit number. I think it is time to write a new failing test (first rule). That will be the test that will prove that “twenty-” is not correct code. In that case, we just need to change the first digit of the number so that we could go for forty-two:

it("converts 42 to forty-two", function() {
    // ARRANGE
    var number = 42;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "forty-two";
    expect(englishNumber).toEqual(expected, "english number");
});

As soon as we finish writing the assertion we will have a test failure: we are returning “twenty-two” instead of “forty-two.” So it is time to switch to the production code (second rule). And we need to write just enough of it to make this test pass (third rule). We can do that by having yet another if statement:

function toEnglishNumber(number) {
    if (number >= 20) {
        if (number / 10 == 2) {
            return "twenty-" + toEnglishNumber(number % 10);
        }

        return "forty-" + toEnglishNumber(number % 10);
    }

    return simpleNumbers[number];
}

And this will make the test pass. It looks very similar to what we had with one-digit numbers, where we had an “if” statement checking that some value is equal to some number and returning an appropriate string. That is where we converted it to an array with string values, and in the function, we were fetching these strings by their index. To see if that pattern applies here we could write another similar test that will make us write another if statement:

it("converts 39 to thirty-nine", function() {
    // ARRANGE
    var number = 39;

    // ACT
    var englishNumber = toEnglishNumber(number);

    // ASSERT
    var expected = "thirty-nine";
    expect(englishNumber).toEqual(expected, "english number");
});

And this fails as expected because our production code in no case can return “thirty-.” So let’s write the simplest if statement to make it pass. Also, to make it uniform, we would wrap “forty-” case in its appropriate if statement as a refactoring after we have a passing test suite:

function toEnglishNumber(number) {
    if (number >= 20) {
        if (number / 10 == 2) {
            return "twenty-" + toEnglishNumber(number % 10);
        }

        if (number / 10 == 3) {
            return "thirty-" + toEnglishNumber(number % 10);
        }

        if (number / 10 == 4) {
            return "forty-" + toEnglishNumber(number % 10);
        }
    }

    return simpleNumbers[number];
}

Certainly, there is a fair bit of duplication right now: “number / 10” and “number % 10”. Let’s extract them as variables. Also, let’s extract “twenty”, “thirty”, “forty” and “toEnglishNumber(lastDigit)” parts as variables:

function toEnglishNumber(number) {
    if (number >= 20) {
        var firstDigit = number / 10;
        var lastDigit = number % 10;

        var firstPart;
        if (firstDigit == 2) {
            firstPart = "twenty";
        }

        if (firstDigit == 3) {
            firstPart = "thirty";
        }

        if (firstDigit == 4) {
            firstPart = "forty";
        }

        var secondPart = toEnglishNumber(lastDigit);

        return firstPart + "-" + secondPart;
    }

    return simpleNumbers[number];
}

Now, we could extract the function for conversion of the first digit to the first english part, such as twenty or thirty:

function convertTens(digit) {
    if (digit == 2) {
        return "twenty";
    }

    if (digit == 3) {
        return "thirty";
    }

    if (digit == 4) {
        return "forty";
    }
}

function toEnglishNumber(number) {
    if (number >= 20) {
        var firstDigit = number / 10;
        var lastDigit = number % 10;

        var firstPart = convertTens(firstDigit);
        var secondPart = toEnglishNumber(lastDigit);

        return firstPart + "-" + secondPart;
    }

    return simpleNumbers[number];
}

Now, it looks like the function convertTens can be simplified through usage of array in the same way as we did before with toEnglishNumber:

var tens = ["", "", "twenty", "thirty", "forty"];

function convertTens(digit) {
    return tens[digit];
}

At this point, we can write more tests to cover all different first digits, for example fifty-seven, sixty-five, seventy-three, eighty-nine and ninety-one. To make them pass we will have to add corresponding “tens” number to our array:

var tens = [
    "", "",

    "twenty", "thirty", "forty", "fifty",
    "sixty", "seventy", "eighty", "ninety"
];

So, that is how we apply three rules of test-driven development. Here is the full code:

describe("toEnglishNumber", function() {

    it("converts 0 to zero", function() {
        // ARRANGE
        var number = 0;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "zero";
        expect(englishNumber).toEqual(expected);
    });

    it("converts 1 to one", function() {
        // ARRANGE
        var number = 1;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "one";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts other one-digit numbers", function() {
        expect(toEnglishNumber(2)).toEqual("two", "english number");
        expect(toEnglishNumber(3)).toEqual("three", "english number");
        expect(toEnglishNumber(4)).toEqual("four", "english number");
        expect(toEnglishNumber(5)).toEqual("five", "english number");
        expect(toEnglishNumber(6)).toEqual("six", "english number");
        expect(toEnglishNumber(7)).toEqual("seven", "english number");
        expect(toEnglishNumber(8)).toEqual("eight", "english number");
        expect(toEnglishNumber(9)).toEqual("nine", "english number");
    });

    it("converts 10 to ten", function() {
        // ARRANGE
        var number = 10;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "ten";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts other teen numbers", function() {
        expect(toEnglishNumber(11)).toEqual("eleven", "english number");
        expect(toEnglishNumber(12)).toEqual("twelve", "english number");
        expect(toEnglishNumber(13)).toEqual("thirteen", "english number");
        expect(toEnglishNumber(14)).toEqual("fourteen", "english number");
        expect(toEnglishNumber(15)).toEqual("fifteen", "english number");
        expect(toEnglishNumber(16)).toEqual("sixteen", "english number");
        expect(toEnglishNumber(17)).toEqual("seventeen", "english number");
        expect(toEnglishNumber(18)).toEqual("eighteen", "english number");
        expect(toEnglishNumber(19)).toEqual("nineteen", "english number");
    });

    it("converts 23 to twenty-three", function() {
        // ARRANGE
        var number = 23;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "twenty-three";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts 27 to twenty-seven", function() {
        // ARRANGE
        var number = 27;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "twenty-seven";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts 42 to forty-two", function() {
        // ARRANGE
        var number = 42;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "forty-two";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts 39 to thirty-nine", function() {
        // ARRANGE
        var number = 39;

        // ACT
        var englishNumber = toEnglishNumber(number);

        // ASSERT
        var expected = "thirty-nine";
        expect(englishNumber).toEqual(expected, "english number");
    });

    it("converts other two-digit numbers", function() {
        expect(toEnglishNumber(57)).toEqual("fifty-seven", "english number");
        expect(toEnglishNumber(65)).toEqual("sixty-five", "english number");
        expect(toEnglishNumber(73)).toEqual("seventy-three", "english number");
        expect(toEnglishNumber(89)).toEqual("eighty-nine", "english number");
        expect(toEnglishNumber(91)).toEqual("ninety-one", "english number");
    });

});

var simpleNumbers = [
    "zero", "one", "two", "three", "four",
    "five", "six", "seven", "eight", "nine",

    "ten", "eleven", "twelve", "thirteen", "fourteen",
    "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"
];

var tens = [
    "", "",

    "twenty", "thirty", "forty", "fifty",
    "sixty", "seventy", "eighty", "ninety"
];

function convertTens(digit) {
    return tens[digit];
}

function toEnglishNumber(number) {
    if (number >= 20) {
        var firstDigit = number / 10;
        var lastDigit = number % 10;

        var firstPart = convertTens(firstDigit);
        var secondPart = toEnglishNumber(lastDigit);

        return firstPart + "-" + secondPart;
    }

    return simpleNumbers[number];
}

Exercises

Do you see the missing edge case for two-digit numbers? Write a test for it and make it pass using three rules of test-driven development.
Add support for three-digit numbers.
Add support for negative numbers.
Add support for numbers with the floating point.

Conclusion

Today we have learned a lot of concepts from testing and test-driven development. Also, we have learned the essence of TDD - three rules of TDD. We have learned how to apply these rules on a very simple example. We have touched on how beneficial test-driven development can be when applied well.

In the next article of the series, we will discuss what different kinds of tests exist, how and when to write them and how to apply TDD in these tests. Also, we will get back to our application and implement a new feature.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Learning Test-Driven Development With Javascript: End-to-End Testing

2016-12-23T10:13:33+01:00

Level: Beginner.

“Learning TDD with Javascript” is the series of articles where we learn basics of automated testing and test-driven development. While the language of choice for the code examples is Javascript, all described concepts are language-agnostic and are applicable in various technological stacks. In these articles, a reader is expected to do small exercises after each major topic to reinforce that theoretical knowledge with practice. Some of these exercises are practical and will involve coding, or simple writing; others will be food for thought. Also, a reader might want to get their feedback on these exercises, so don’t hesitate to send the results my way: oleksii@tddfellow.com - feedback on the practice is quite important as it helps to improve quicker, when you know what is well and what can be improved and how. Also, don’t hesitate to send any questions and feedback regarding the content of these articles. Your questions, feedback, and your practical results will help authors shape this content better.

Today we are going to learn how to write tests that imitate real user interaction for the whole application. We are going to build a small web application using Vanilla Javascript. Vanilla Javascript is a plain Javascript without any framework or library. Such tests that imitate real user interaction via User Interface (UI) are called End-to-End Tests. These tests are the most simple to write because we only need to think about our application in the same way user does:

Imitating the user clicking on the button would mean for us to trigger a button click;
Imitating the user typing text in the input field would mean for us to change input’s value;
Imitating the user clicking on the link would mean for us to trigger a link click;
And so on.

We don’t need to think about specific implementation details, such as: which functions and classes do we have in our code and how they interact with each other, is there any interaction with the back-end server or 3rd-party API. Also, we don’t need to be proficient with interaction testing - this is the topic for the future series.

Of course, for that kind of simplicity we are trading something off. In the case of End-to-End tests, they are slower, suffer concurrency, wait, and timeout problems, and are harder to maintain in the long run. We don’t have to worry about that just yet because we want to learn how to write tests in general, and this kind of simplicity is perfect for us in this case.

Such simplicity stems from the fact that End-to-End tests, mostly, are direct translations of user stories (use case scenarios) into the UI manipulation code.

User Story

User stories are scenarios describing an individual feature of the software via user story context, sequences of user interactions and user expectations. User story context is the description of the situation the user and the software system are in at the beginning of the scenario. An example of the system context: “user John is registered in the system with password ‘welcome’.” An example of the user context: “user is at the login page.” User interaction is the description of a particular action user takes inside of the system, usually, doing something within the UI, for example: “User enters email ‘john@example.org’ in the email input field” or “User clicks on the submit button.” Finally, user expectation is the description of what particular information user should receive from the system, for example: “User sees the success message on the page” or “User receives the email with the verification code.”

User stories come in different flavor and formats. It can be a free-form text, describing three parts: context, interactions, and expectations; or it can be in a formal “Given-When-Then” form. “Given” part is the sequence of the user story context descriptions, “When” part is the sequence of the user interaction descriptions, and “Then” part is the sequence of user expectation descriptions. Both forms can be used interchangeably, and some software companies use one or another formal version of the user story format consistently.

Let’s take a look at the example of the free-form user story together with the example of the Given-When-Then user story for the same feature:

Free-form	Given-When-Then

User with email ‘john@example.org’ and password ‘welcome’ exists in the system John enters his email ‘john@example.org’ into the email field and his password ‘welcome’ into the password field on the login page After that John submits the login form Finally, John expects to see their profile page with the indication of him being logged in (name ‘John’ is present on the page)	Given User with email ‘john@example.org’ and password ‘welcome’ exists And I am at the login page When I enter ‘john@example.org’ in the email field And I enter ‘welcome’ in the password field And I click on the submit button Then I see the profile page And I see my name as the title of the profile page

As we can see, free-form can be very vague and is very flexible, and formal form is more strict and precise. The free-form on its own doesn’t have much upsides or downsides - it is as good as it is written. On the other hand, the formal form does give us some value and also trades something off for that value. They are generally easy to write and, because they are so specific, are easy to translate to the automated test. On the other hand, they may hamper creativity either while creating the user story or when implementing it.

It is important to mention that these use case scenarios are not full user stories or features. One feature can have multiple scenarios like that - together they are called acceptance criteria. When all such scenarios of the given feature work correctly, the feature is done. There is another vital part of the user story - general description, that should contain the rationale behind the story and the value for the user or any other important actor in the system, such as the stakeholder. Before we write scenarios, we often come up with the rationale like that and it drives us to write a scenario. For example, for the feature above we would have used something like that:

Free-form	Given-When-Then

John needs to authenticate to the system so that he can access his private content	As John, I want to be able to authenticate to the system So that I can access my private content

Because formal form user stories are more precise, easier to write and simpler to translate to the automated test, we are going to use them to learn End-to-End testing in the context of Test-Driven Development.

Exercises

Write a scenario for authentication story, when John enters the wrong password.
Write a user story with rationale and scenarios for the sign-up feature.
Imagine we are developing an instant messaging application. What do you think would be the next feature after login/sign-up? Write the user story with rationale and scenarios for this feature.

Do you have questions? Or do you want to get quick feedback on how you did the exercises? - mail me: oleksii@tddfellow.com

Setting Up the Project

Now we will be writing a simple web application, using Vanilla Javascript (ECMAScript 5), so that our setup is rather simple. For the testing, we will be using a standalone version of the Jasmine testing framework. Also, we will be writing a single page application (SPA), so that we don’t have to worry about rendering different pages in our tests for now. We are aiming for the following directory structure of our project:

.
├── lib
├───── .. external libraries, such as Jasmine ..
├── spec
├───── .. sources with Jasmine automated tests ..
└── src
├───── .. sources with our main code ..
├── index.html       - entry point to the application
├── SpecRunner.html  - entry point to our test suite

First, create required directories: lib, spec and src. Then download the latest standalone Jasmine release here: https://github.com/jasmine/jasmine/releases (jasmine-standalone-{version}.zip file). At the time of writing, the version is 2.5.2. Unzip that file into your project directory. You should get the following files from it:

./lib/jasmine-2.5.2/ directory - contains all the resources required by Jasmine.
./SpecRunner.html - example entry point to our test suite.
./src/Player.js and ./src/Song.js - example source files.
./spec/PlayerSpec.js and ./spec/SpecHelper.js - example automated Jasmine tests.

Now, try to open SpecRunner.html in your browser. It should run these example tests, and they should all pass. Here is how it should look like:

Also, create an empty index.html:

 lang="en">

     charset="UTF-8">
    </span>Title<span class="nt">

And now we should have the desired project structure. So how does that Jasmine testing framework works, anyways?

Crash Course into Jasmine

Let’s take a look at the example test file to get the gist of how Jasmine works:

// spec/PlayerSpec.js

describe("Player", function() {
  var player;
  var song;

  beforeEach(function() {
    player = new Player();
    song = new Song();
  });

  it("should be able to play a Song", function() {
    player.play(song);
    expect(player.currentlyPlayingSong).toEqual(song);

    //demonstrates use of custom matcher
    expect(player).toBePlaying(song);
  });

  describe("when song has been paused", function() {
    beforeEach(function() {
      player.play(song);
      player.pause();
    });

    it("should indicate that the song is currently paused", function() {
      expect(player.isPlaying).toBeFalsy();

      // demonstrates use of 'not' with a custom matcher
      expect(player).not.toBePlaying(song);
    });

    it("should be possible to resume", function() {
      player.resume();
      expect(player.isPlaying).toBeTruthy();
      expect(player.currentlyPlayingSong).toEqual(song);
    });
  });

  // demonstrates use of spies to intercept and test method calls
  it("tells the current song if the user has made it a favorite", function() {
    spyOn(song, 'persistFavoriteStatus');

    player.play(song);
    player.makeFavorite();

    expect(song.persistFavoriteStatus).toHaveBeenCalledWith(true);
  });

  //demonstrates use of expected exceptions
  describe("#resume", function() {
    it("should throw an exception if song is already playing", function() {
      player.play(song);

      expect(function() {
        player.resume();
      }).toThrowError("song is already playing");
    });
  });
});

First important concept here is the describe("...", function () { ... }). describe function is used to describe a certain concept or a certain context. For example, describe("Player", ...) means that we are going to define tests for Player class or some other Player concept. Also, describes can be nested to indicate that we are describing some specific context (describe("when a song has been paused", ...)) or sub-concept of current concept, such as the method of currently described class (describe("#resume", ...)). That is a good example, of what the unit test suite might be describing. In the case of End-to-End tests we would like to describe a full feature, so describe("Login Feature", ...) is a good bet. The second argument for the describe is the function that will contain all tests and sub-describes for the described concept. This function is the capturing closure, so defining variables and functions on the outer-level describe will make them available on the inner-level describes and the tests themselves.

Second important concept here is the it("...", function () { ... }). it function is used to create a test for the currently described concept, context or sub-concept. The first argument is the description of what it does, where it is the described concept. For example, given we describe a music player and our context is when the volume is at max, then we might write the test it("is deafening", ...). In the case of End-to-End tests we are describing a feature, so it will refer to our application, or application’s user interface, for example: it("shows user's nickname", ...). The second argument to the it is the function with the test itself. Here we will setup the stage for the test, call our main code and verify that everything happened as we expect.

Finally, the third important concept is the expect(...).to.... That is Jasmine’s form of assertion. That is where we verify that our code worked as we expect it to. As an argument to expect we provide an actual value. The actual value is something that our code has returned as a result of the function or method execution or something that we have read from the UI using UI manipulation code, or something that we have read from some 3rd party service, such as our back-end server, 3rd-party API or database. Essentially, this is the value that we are verifying to be correct. The second part is the Jasmine matcher - the method defined on the object, that is returned from expect(value) call, that allows us to define what we want to assert about that value. The most used one is toEqual(...), which asserts that the value was equal to some expected value.

These three concepts should be enough to start writing tests. Don’t worry about everything else that you see in this example file from standalone Jasmine distribution. We will discover some of these concepts as we go. Now, let’s remove src/Player.js, src/Song.js, spec/PlayerSpec.js and spec/SpecHelper.js, and run our test suite - it is enough to reload the page to re-run our test suite. Test run should report that No specs found:

To get the gist of how describe("...", function () { ... }), it("...", function () { ... }) and expect(...).toEqual(...) works, let’s write our first failing test. Also, that will let us see if the testing framework is configured correctly and is capable of showing us the test failure. Let’s create a new file called spec/JasmineWorksSpec.js:

// spec/JasmineWorksSpec.js

describe("testing framework", function () {
  it("works", function () {
    expect(2 + 2).toEqual(5);
  });
});

And we need to add this test file to the SpecRunner.html:

And if we run our test suite, we should see a failure:

testing framework works
  Expected 4 to equal 5.

And we ought to make it pass by fixing our incorrect assertion: expect(2 + 2).toEqual(4);. And if we run the test suite again, by reloading the page in the browser, all the test should pass.

Writing our First Simple Tests

Now we can write some real tests to practice usage of describe, it and expect(...).toEqual: let’s create ArithmeticsSpec.js and write some tests for behavior of add function:

// spec/ArithmeticsSpec.js

describe("Arithmetics", function () {

  describe("#add(a, b)", function () {
    it("calculates the sum of two numbers", function () {
      // ARRANGE
      var a = 3;
      var b = 4;
      var expected = 7;

      // ACT
      var actual = Arithmetics.add(a, b);

      // ASSERT
      expect(actual).toEqual(expected);
    });
  });

});

Don’t forget to add the to the SpecRunner.html. This test will fail first because Arithmetics module is not defined. We will define it as an empty object. Next failure is because Arithmetics.add is not a function - it is undefined. We will define that function with two arguments inside of the Arithmetics object. Finally, the test will fail, because we expect the result to be seven, but it was undefined. We will make the simplest thing we can do to pass the failing test - return seven. That will make the test pass. The code will be in src/Arithmetics.js, which we include in our SpecRunner.html, and will look like that:

// src/Arithmetics.js

var Arithmetics = {
  add: function (a, b) {
    return 7;
  }
};

That, of course, is not correct implementation, so we need another test to drive out the proper implementation - test with different inputs and a different result. To make it pass we will have to use a + b:

// spec/ArithmeticsSpec.js
describe("Arithmetics", function () {

  describe("#add(a, b)", function () {
    it("calculates the sum of two numbers", function () { ... });

    it("calculates the sum of two other numbers", function () {
      // ARRANGE
      var a = 73;
      var b = 89;
      var expected = 162;

      // ACT
      var actual = Arithmetics.add(a, b);

      // ASSERT
      expect(actual).toEqual(expected);
    });
  });

});

// src/Arithmetics.js
var Arithmetics = {
  add: function (a, b) {
    return a + b;
  }
};

Three “A”s: Arrange, Act, and Assert

Have you noticed three comments that I have left in the example tests’ code: ARRANGE, ACT and ASSERT? These are three “A”s of writing a good test. Arrange is the part of the test, where we set up the stage: prepare input data, create objects, load resources, change the state of the system - it is the part where we create the context for our test. Act is the part of the test, where we call our system under the test. In the Arithmetics example it was a function Arithmetics.add(a, b). The system under the test can return some useful value or change its state. To verify that either is correct we, finally, use Assert section of our test - part of the test where we verify the outcome of the call to the system under the test.

The sections ARRANGE, ACT and ASSERT are spelled out in the comments only for the reader’s convenience - usually, real projects don’t have such comments. It is worth noting that for learning and practicing purposes it is a good idea to name these sections explicitly in our tests - this develops a habit of recognizing which part of the test should belong to which section. Also, the formula Act -> Arrange -> Assert makes it easier to come up with the test when we lack deep experience in testing, or with the particular testing framework, or an environment.

Going a bit back to our user story scenarios: have you noticed the connection between “Arrange, Act and Assert” and “Given-When-Then”? Arrange part of the test corresponds to the Given part of the scenario, Act part of the test corresponds to the When part of the scenario, and Assert part of the test corresponds to the Then part of the scenario. Let’s see it on of our previous example scenarios:

Test Section	Scenario Step

ARRANGE	Given User with email ‘john@example.org’ and password ‘welcome’ exists And I am at the login page

ACT	When I enter ‘john@example.org’ in the email field And I enter ‘welcome’ in the password field And I click on the submit button

ASSERT	Then I see the profile page And I see my name as the title of the profile page

Exercises

Write tests and implement functions on Arithmetics module: subtract, multiply and divide.
Inline Arrange, Act and Assert in a one-liner. Can you still recognize implicit Arrange, Act and Assert sections in that one line? Is it more or less readable? Is there a middle ground between two versions? Why would you choose one or the other?
Classify parts of the free-form story for the user log in from before as Arrange, Act, and Assert. Was it easier, than classifying formal Given-When-Then form? Was it harder? Or maybe the same? How would it be if you never saw Given-When-Then version of that free-form story?

Writing our First End-to-End Test

Now that we know approximately, how to arrange the steps of our scenario into the test, let’s give it a shot. We will start by creating a new test file for our login feature called LoginFeatureSpec.js. Don’t forget to put an appropriate script tag in the SpecRunner.html. We will start by writing the skeleton of our test suite: describe and it inside of it. Next, we will put scenario steps as comments to our test, and we will split them into three sections: Arrange, Act, and Assert. It will look like this:

// spec/LoginFeatureSpec.js

describe("Login Feature", function () {

  it("allows to login with correct credentials", function () {
    // ARRANGE
    // Given User with email ‘john@example.org’ and password ‘welcome’ exists
    // And I am at the login page

    // ACT
    // When I enter ‘john@example.org’ in the email field
    // And I enter ‘welcome’ in the password field
    // And I click on the submit button

    // ASSERT
    // Then I see the profile page
    // And I see my name as the title of the profile page
  });

});

Next step would be to change the first Given comment to the function call. We should give a good readable name to that function. One straightforward option would be givenUserExists(email, password). Another good option is addUser({email: email, password: password}). While they are not that different, I prefer addUser for its higher conceptual flexibility - we will likely need that function in some different scenario step in the future. While I prefer that, we should not do that yet, because we might never need the function like that, and givenUserExists will do us more good right now since it resembles the scenario step so much. When we need this flexibility, we’ll perform a refactoring. So for now, let’s create an empty function with that name in a new file spec/FeatureSteps.js and load this file from our SpecRunner.html.

This empty function, on its own, doesn’t do us much good because all our tests will pass. If we continue replacing our steps in comments with such functions, we will end up with one big failure at the Assert section, and we will have to write a lot of code at once to fix that. To drive ourselves to implement the function givenUserExists properly right now, we should write an assertion right after the call. This assertion is not part of Assert section - it is a part of test-driving the functionality of our feature steps. A good assertion here will be to ask our user storage mechanism if such user exists right after we created that user. Also, it would be a good idea to check that user does not exist, before we created it. Also, we will extract variables email and password because we have to repeat them all over the place already. Let’s see how it will look like:

// spec/LoginFeatureSpec.js

describe("Login Feature", function () {

  it("allows to login with correct credentials", function () {
    // ARRANGE

    // Given User with email ‘john@example.org’ and password ‘welcome’ exists
    var email = "john@example.org";
    var password = "welcome";
    expect(Users.exists(email, password)).toEqual(false);
    givenUserExists(email, password);
    expect(Users.exists(email, password)).toEqual(true);

    // ...
  });

});

When we run our tests, the breadcrumbs of test failures will drive us to create this basic functionality. First, we will create Users.js with empty Users module and import it from SpecRunner.html. Then, next failure will drive us to add method exists(email, password) on Users module, that will always return false. Next, make function givenUserExists(email, password) call Users.add(email, password), which in turn will make us create a function Users.add(email, password), that will store email-password pair to the list of users in the memory. And, finally, make Users.exists to search for the email-password pair in that in-memory list. And finally, our test will pass. Let’s take a look at how these steps will look in our code:

// => Error: Users is undefined
// in src/Users.js:
var Users = {};

// => Error: Users.exists is not a function
// in src/Users.js:
var Users = {
  exists: function (email, password) {}
};

// => Error: expected undefined to equal false
// in src/Users.js:
var Users = {
  exists: function (email, password) {
    return false;
  }
};

// => Error: expected false to equal true
// in the (second assertion)
// in spec/FeatureSteps.js:
function givenUserExists(email, password) {
  Users.add(email, password);
}

// => Error: Users.add is not a function
// in src/Users.js:
var Users = {
  users: [],

  exists: function (email, password) { ... },

  add: function (email, password) {
    this.users.push({email: email, password: password});
  }
}

// this still fails with:
// => Error: expected false to equal true
// because we need to implement Users.exists "properly"
// in src/Users.js:
var Users = {
  users: [],

  exists: function (email, password) {
    return this.users.length > 0;
  },

  add: function (email, password) { ... }
}

// => All tests PASS

It is funny, how simple Users.exists(email, password) function is. It verifies that we have at least one user. While this is not a correct code, this is good enough for our current test. As we know that this code is not entirely correct, we need to remember to write the test(s) to prove it incorrect, so that we can make it proper with confidence. Since we want first to finish the current test, we should add a to-do list item to write such test. We have two edge cases here that will not work with our implementation: when we have only one user in the system and the credentials we provided do not match and when we have multiple users in the system, and we provide correct credentials for the second one:

xdescribe("when user has different credentials", function () {
  it("does not allow to login with wrong credentials", function () {

  });
});

xdescribe("when there are more than one user", function () {
  it("allows to login with correct credentials", function () {

  });
});

Have you noticed xdescribe? It is a different form of the describe, that allows us to mark the whole context as pending. It won’t run the tests inside, and it will mark them as pending in the test run report. That is the ultimate way to maintain a Test-Driven TODO List. As we test-drive our code, we will find more of these. In the report they look like this:

Let’s finish continue implementing our steps. The comment And I am at the login page transparently becomes a function givenIAmAtTheLoginPage(). As we already have seen, it doesn’t do us any good just to replace the comment with a function call that does nothing - so we should surround it with proper assertions. We know that login page should have some text input for email and another password field for password, and we will need a button to confirm user’s intent to log in. Also, because we are developing a single page application, we would need some container for the currently active page. Let’s say we need these things:

Initially, we will have only one container with id="page" and no content. We probably ought to define it in our HTML file.
When we render our login page, we should have:
- email text input field with id="email",
- password input field with id="password",
- and button with id="do_login".

Now that we have spelt this out, it is fairly straightforward to write assertions surrounding the givenIAmAtTheLoginPage() call:

// Initially, we will have only one container with `id="page"`
var container = document.querySelector("#page");
expect(container).not.toEqual(null);
// and no content
expect(container.innerHTML).toEqual("");

That fails because we don’t have such element in our HTML. We need to create it both in our SpecRunner.html. Also, now it will be important to move all our

Now, our next failure is that givenIAmAtTheLoginPage() function is not defined. We can define it as empty function in our spec/FeatureSteps.js file for now. This will turn our tests green again. We still haven’t made our assertions about the state of the UI after the call to givenIAmAtTheLoginPage - let’s do this now:

var emailInput = container.querySelector("#email");
expect(emailInput.tagName).toEqual("input");
expect(emailInput.type).toEqual("email");

That fails with the error Cannot read property tagName of null, which means that we don’t have #email element inside of the #page container. Simplest thing to do would be to add that element to the #page container in our SpecRunner.html. And it won’t work! Because we have an assertion that verifies, that before calling to the givenIAmAtTheLoginPage we do not have anything in the #page container. Now we have to do something useful in the function givenIAmAtTheLoginPage. For example, we can call LoginPage.render(). Which does not exist yet and we will need to create it in the file src/LoginPage.js and load it from our SpecRunner.html. To fix current failure we will need to create a #email element there and append it to our #page container:

// spec/FeatureSteps.js
function givenIAmAtTheLoginPage() {
  LoginPage.render();
}

// src/LoginPage.js
var LoginPage = {
  render: function () {
    var emailInput = document.createElement("div");
    emailInput.id = "email";

    var container = document.querySelector("#page");
    container.appendChild(emailInput);
  }
};

That makes the current test failure go away, but we have two more: Expected 'DIV' to equal 'input'. and Expected undefined to equal 'email'.. To make these pass, we would need to change document.createElement(...) call to use input tag name and also we will need to set the input type to email. And as we see, the tag name stored in emailInput.tagName is all-caps, so we will have to fix our assertion also to expect that:

// first we'll fix the assertion:
// in spec/LoginFeatureSpec.js
expect(emailInput.tagName).toEqual("INPUT");

// and then we will fix the emailInput creation:
// in src/LoginPage.js
var emailInput = document.createElement("input");
emailInput.id = "email";
emailInput.type = "email";

And if we run our tests, they all pass. Great! We should now do the same for our password input field and the login button:

// in spec/LoginFeatureSpec.js
var passwordInput = container.querySelector("#password");
expect(passwordInput.tagName).toEqual("INPUT");
expect(passwordInput.type).toEqual("password");
var loginButton = container.querySelector("#do_login");
expect(loginButton.tagName).toEqual("BUTTON");
expect(loginButton.textContent).toEqual("Login");

// in src/LoginPage.js
var passwordInput = document.createElement("input");
passwordInput.id = "password";
passwordInput.type = "password";
container.appendChild(passwordInput);

var loginButton = document.createElement("button");
loginButton.id = "do_login";
loginButton.textContent = "Login";
container.appendChild(loginButton);

And all tests pass again. Let’s take another look at how our test looks like. It is quite complicated, and it has so much stuff, that is hugely detailed and precise, that it is not possible to see a user story scenario there anymore. One possible solution to that problem is to push the assertions that are related to the feature scenario step to the respective step functions. Now it looks much better. We should use the same concept for all our further steps. After refactoring test code looks like this:

// in spec/LoginFeatureSpec.js
it("allows to login with correct credentials", function () {
  // ARRANGE
  var email = "john@example.org";
  var password = "welcome";
  givenUserExists(email, password);
  givenIAmAtTheLoginPage();

  // ACT
  // When I enter ‘john@example.org’ in the email field
  // And I enter ‘welcome’ in the password field
  // And I click on the submit button

  // ASSERT
  // Then I see the profile page
  // And I see my name as the title of the profile page
});

// in spec/FeatureSteps.js
function givenUserExists(email, password) {
    expect(Users.exists(email, password)).toEqual(false);

    Users.add(email, password);

    expect(Users.exists(email, password)).toEqual(true);
}

function givenIAmAtTheLoginPage() {
    var container = document.querySelector("#page");
    expect(container).not.toEqual(null);
    expect(container.innerHTML).toEqual("");

    LoginPage.render();

    var emailInput = container.querySelector("#email");
    expect(emailInput.tagName).toEqual("INPUT");
    expect(emailInput.type).toEqual("email");

    var passwordInput = container.querySelector("#password");
    expect(passwordInput.tagName).toEqual("INPUT");
    expect(passwordInput.type).toEqual("password");

    var loginButton = container.querySelector("#do_login");
    expect(loginButton.tagName).toEqual("BUTTON");
    expect(loginButton.textContent).toEqual("Login");
}

Now, let’s follow the same pattern for our Act section. First, we will deal with When I enter ‘john@example.org’ in the email field. This seems to be a call to a function whenIEnterInTheField("#email", email) - we will implement it using Javascript’s APIs. We will do the same for the And I enter ‘welcome’ in the password field, which will use the same function whenIEnterInTheField("#password", password). Finally, we will implement whenIClickOn("#do_login") as a replacement for the comment And I click on the submit button. We will also sprinkle assertions inside of the steps to make sure that we are using the Javascript APIs correctly. The code will look like this:

// in spec/LoginFeatureSpec.js
it("allows to login with correct credentials", function () {
    // ARRANGE
    var email = "john@example.org";
    var password = "welcome";
    givenUserExists(email, password);
    givenIAmAtTheLoginPage();

    // ACT
    whenIEnterInTheField("#email", email);
    whenIEnterInTheField("#password", password);
    whenIClickOn("#do_login");

    // ASSERT
    // Then I see the profile page
    // And I see my name as the title of the profile page
});

// in spec/FeatureSteps.js
function whenIEnterInTheField(selector, value) {
    var element = document.querySelector(selector);
    expect(element).not.toEqual(null);
    expect(element.value).toEqual("");

    element.value = value;

    var elementAfterChange = document.querySelector(selector);
    expect(element.value).toEqual(value);
}

function whenIClickOn(selector) {
    var element = document.querySelector(selector);
    expect(element).not.toEqual(null);

    element.click();
}

Now comes the most interesting part of writing this feature test - Assert section. So far, Arrange and Act sections were driving us to create an infrastructure-like code of our application. Now, with Assert section we will have to implement more of our domain logic. Let’s start with the Then I see the profile page. Let’s try to figure our what that could mean:

We no longer have a login page in our #page container.
#page container should somehow indicate that we are on the Profile page:
- could be achieved by adding a sub-container with id="profile_page" to it.
We don’t know much more about what the profile page is. What we do know is:
- the profile page has a name of the user as the title of the page.

Interesting, so far, we didn’t have a concept of the Name of the User. I guess it is time to create one in our arrange block, with all the changes and additional assertions in our feature steps that we have to do:

// in spec/LoginFeatureSpec.js
// ARRANGE
var email = "john@example.org";
var password = "welcome";
var name = "John Smith";
givenUserExists(email, password, name);

// in spec/FeatureSteps.js
function givenUserExists(email, password, name) {
  expect(Users.exists(email, password)).toEqual(false);

  Users.add(email, password, name);            // <=

  expect(Users.exists(email, password)).toEqual(true);

  expect(Users.nameOf(email)).toEqual(name);   // <=
}

// also we need to add name field to the user data
// and implement nameOf(email) function
// in src/Users.js
add: function (email, password, name) {
  this.users.push({
    email: email,
    password: password,
    name: name
  });
},

nameOf: function (email) {
  return this.users[0].name;
}

And the tests will pass again. And now we have a tested concept of the name in our code. Tested - to the extent required for this test, where we have only one user in the system. Now we can replace the comment Then I see the profile page with the scenario step function call thenISeeTheProfilePage(). The implementation of it will verify that #email, #password and #do_login are no longer present on the page and it will verify that container #page-profile is present and it has #title element in it. Making it pass will require us to add a click event listener to the loginButton in the LoginPage, that will remove contents of the #page container and will call to ProfilePage.render() following the analogy of LoginPage. That drives us to create this module and its render() function. According to our next test failure, this function should create #profile_page sub-container in #page container, so we do that. Finally, we have one last failure, that drives us to create #title element in ProfilePage.render() function. And all tests are green again. Let’s take a look at these changes:

// in spec/LoginFeatureSpec.js
// ASSERT
// Then I see the profile page
thenISeeTheProfilePage();

// in spec/FeatureSteps.js
function thenISeeTheProfilePage() {
  // assert login page is gone
  expect(document.querySelector("#email")).toEqual(null);
  expect(document.querySelector("#password")).toEqual(null);
  expect(document.querySelector("#do_login")).toEqual(null);

  // assert profile page is present
  expect(document.querySelector("#page #profile_page")).not.toEqual(null);
  expect(document.querySelector("#profile_page #title")).not.toEqual(null);
}

// in src/LoginPage.js at the end of render() function:
render: function() {
  // ...

  loginButton.addEventListener("click", LoginPage.onLogin);
},

onLogin: function () {
  container.innerHTML = "";
  ProfilePage.render();
}

// in src/ProfilePage.js
var ProfilePage = {
  render: function () {
    var profileContainer = document.createElement("div");
    profileContainer.id = "profile_page";
    container.appendChild(profileContainer);

    var title = document.createElement("h1");
    title.id = "title";
    profileContainer.appendChild(title);
  }
};

At last, we can implement the last step - And I see my name as the title of the profile page. A good guess for the step name would be thenISeeTextAt("#title", name). The function will simply select element #title and verify its element.textContent. As expected, it fails with the error Expected '' to equal 'John Smith' and we should fix that within the ProfilePage.render() function by assigning title.textContent to the Users.currentUser().name. This fails because we didn’t define Users.currentUser() function and this is simple to do for our current test - just return the first user. After that all our tests pass. The code will look like this:

// in spec/LoginFeatureSpec.js
thenISeeTextAt("#title", name);

// in spec/FeatureSteps.js
function thenISeeTextAt(selector, text) {
  var element = document.querySelector(selector);
  expect(element.textContent).toEqual(text);
}

// in src/ProfilePage.js
title.textContent = Users.currentUser().name;

// in src/Users.js
currentUser: function () {
    return this.users[0];
}

We have finally, implemented our first feature test. That was quite some work. Also, the functionality of Users module is way incomplete - we need to write more tests to cover different cases. That is how our test code and production code looks like:

// spec/LoginFeatureSpec.js
describe("Login Feature", function () {

    it("allows to login with correct credentials", function () {
        // ARRANGE
        var email = "john@example.org";
        var password = "welcome";
        var name = "John Smith";
        givenUserExists(email, password, name);
        givenIAmAtTheLoginPage();

        // ACT
        whenIEnterInTheField("#email", email);
        whenIEnterInTheField("#password", password);
        whenIClickOn("#do_login");

        // ASSERT
        thenISeeTheProfilePage();
        thenISeeTextAt("#title", name);
    });

    xdescribe("when user has different credentials", function () {
        it("does not allow to login with wrong credentials", function () {

        });
    });

    xdescribe("when there are more than one user", function () {
        it("allows to login with correct credentials", function () {

        });
    });

});

// spec/FeatureSteps.js
function givenUserExists(email, password, name) {
    expect(Users.exists(email, password)).toEqual(false);

    Users.add(email, password, name);

    expect(Users.exists(email, password)).toEqual(true);
    expect(Users.nameOf(email)).toEqual(name);
}

function givenIAmAtTheLoginPage() {
    var container = document.querySelector("#page");
    expect(container).not.toEqual(null);
    expect(container.innerHTML).toEqual("");

    LoginPage.render();

    var emailInput = container.querySelector("#email");
    expect(emailInput.tagName).toEqual("INPUT");
    expect(emailInput.type).toEqual("email");

    var passwordInput = container.querySelector("#password");
    expect(passwordInput.tagName).toEqual("INPUT");
    expect(passwordInput.type).toEqual("password");

    var loginButton = container.querySelector("#do_login");
    expect(loginButton.tagName).toEqual("BUTTON");
    expect(loginButton.textContent).toEqual("Login");
}

function whenIEnterInTheField(selector, value) {
    var element = document.querySelector(selector);
    expect(element).not.toEqual(null);
    expect(element.value).toEqual("");

    element.value = value;

    var elementAfterChange = document.querySelector(selector);
    expect(element.value).toEqual(value);
}

function whenIClickOn(selector) {
    var element = document.querySelector(selector);
    expect(element).not.toEqual(null);

    element.click();
}

function thenISeeTheProfilePage() {
    // assert login page is gone
    expect(document.querySelector("#email")).toEqual(null);
    expect(document.querySelector("#password")).toEqual(null);
    expect(document.querySelector("#do_login")).toEqual(null);

    // assert profile page is present
    expect(document.querySelector("#page #profile_page")).not.toEqual(null);
    expect(document.querySelector("#profile_page #title")).not.toEqual(null);
}

function thenISeeTextAt(selector, text) {
    var element = document.querySelector(selector);
    expect(element.textContent).toEqual(text);
}

// src/Users.js
var Users = {
    users: [],

    exists: function (email, password) {
        return this.users.length > 0;
    },

    add: function (email, password, name) {
        this.users.push({
            email: email,
            password: password,
            name: name
        });
    },

    nameOf: function (email) {
        return this.users[0].name;
    },

    currentUser: function () {
        return this.users[0];
    }
};

// src/LoginPage.js
var container = document.querySelector("#page");

var LoginPage = {
    render: function () {
        var emailInput = document.createElement("input");
        emailInput.id = "email";
        emailInput.type = "email";
        container.appendChild(emailInput);

        var passwordInput = document.createElement("input");
        passwordInput.id = "password";
        passwordInput.type = "password";
        container.appendChild(passwordInput);

        var loginButton = document.createElement("button");
        loginButton.id = "do_login";
        loginButton.textContent = "Login";
        container.appendChild(loginButton);

        loginButton.addEventListener("click", LoginPage.onLogin);
    },

    onLogin: function () {
        container.innerHTML = "";
        ProfilePage.render();
    }
};

// src/ProfilePage.js
var ProfilePage = {
    render: function () {
        var profileContainer = document.createElement("div");
        profileContainer.id = "profile_page";
        container.appendChild(profileContainer);

        var title = document.createElement("h1");
        title.id = "title";
        title.textContent = Users.currentUser().name;
        profileContainer.appendChild(title);
    }
};

Now we could add

to our index.html, and load our src/* scripts after it, and add at the end to start our application. Enjoy the application that implements one happy path of our feature (we still have quite a few different paths to cover). Also, it might be a good idea to style the application slightly better, than plain inputs and buttons, but we are not going to cover that in these series. This single feature test takes a lot of time to write because it is the first feature test in the empty application. Essentially, it has driven a lot of different architectural decisions, which don’t necessary need to be done the way they were done in this article. Writing a second test for the same feature is much easier and third one, and all consecutive are also easier.

Exercises

Remove x prefix from the xdescribe tests and implement them using techniques described in this article - write a user story scenario, translate it to the test code and make sure to fix all the test failures one feature step at a time.
Discover more interesting edge cases. Make the tests for them and make them all pass.
Write sign-up feature’s user story and implement it with feature tests.

Bottom Line

Today we have learned how to write tests for the whole application that imitate real user interaction. We have seen how, test-driving the functionality can help discovering new user story scenarios. Essentially, every time we write simple, but not so correct code that makes our test pass, we need to think what would be the scenario to prove that code wrong and add that scenario on our to-do list, which is represented by pending tests, which have only their descriptions.

In the next article of the series, we will dig deeper on what exactly we did today, what it means to Test-Drive the code, what are the laws, rules, tips and tricks of Test-Driven Development. Next article of the series assumes, that login and sign-up features are fully test-driven and implemented. We will be implementing a brand new feature of our application.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Mobile Waterfall. Being Agile Again

2016-12-07T19:54:07+01:00

Have you ever worked with mobile platforms, such as Apple Store or Google Play? How much time it takes to release a new version of the application? They have ~2-4 days manual review of your mobile application. What do you think that means? - You can release your application to production no often than two times a week. It became much better around this year - before it was 1-1.5 weeks.

Would you like to deploy a bug fix? - Half a week. Unless it makes your application crash all the time. Then you can get it going in half a day or one day. Which is still pretty slow.

Would you like to make a canary release to 1% of your users and test an assumption quickly using your analytics tools? - Forget it. Waiting for three days to get results on your assumption negates the benefits of the Canary release. You should be getting your feedback in minutes, not days!

At such disadvantage deploy takes half a week and makes software development teams switch to much more defensive mode:

more extensive end-to-end testing (slow), and
more extensive manual QA before the release.

That is Waterfall. Right there.

Being Agile Again

In that environment, how could we get our quick feedback back? - What if we had the ability to send logic in the form of Domain-Specific Language (DSL) from our server, that we can deploy to whenever we wish to? What if our mobile application could have interpreted this DSL and could have updated itself every time user starts the application while connected to the Internet?

We would be able to get a quick feedback! It would make fast deploys possible and also it would allow us to do canary releases, which can enable us to be LEAN again - To be Agile again.

This approach has a few problems:

Reviewer can reject the application release. In this case, the developer has to negotiate with reviewers and explain that their business process requires such capabilities for their application to be quickly deployable. Exchange of few email and you can be Agile again.
You would have to implement that DSL. Both: design the DSL and write the interpreter on the mobile application side. That is quite a significant investment of time. It is reasonable to implement only tiny part of such DSL and apply it only in places that need to be often changed, such as UI and business logic that everyone tries to fiddle with to optimize the KPIs of the mobile application.
The approach does not guarantee quick deployability of 100% of the changes you need to make. Some changes will require an extension of DSL, and its interpreter, and native bindings. Surely, such change will have to be deployed via regular submit -> review -> wait four days -> release cycle. The tricky challenge is to balance what gets implemented in a DSL and what gets implemented in the native bindings for that DSL to achieve high enough portion of changes to be quickly deployable: about 90-95%; and keeping the DSL and interpreter complexity as low as you can.

If you take the idea of such DSl and the interpreter to the extreme, you will wound up with the programming language. There is already such programming language and platform that has the same characteristics - Javascript + React Native. Nowadays, application stores allow to download javascript code updates from your server, but you have to deploy all native bindings via application store with a manual review; also, one can not change the essence of their application.

Why would you want to go with your DSL instead of React Native?:

You already have a native mobile application, and you have identified places that change just too often.
The performance of the application is critical.

Example of the DSL

DSL might be as simple as Abstract Syntax Tree represented in JSON. Let’s imagine that we have an application, where users can buy some items and now we want to contact the recommendation service and present them a NEW view with the list of recommendations. Normally, you would have to do a full development and full deployment via application store. With DSL you might end up just writing some JSON:

// chunk of logic A
{
  "subscribe": {
    "event": "user_has_liked_item",
    "actions": [
      {
        "request": "/api/v1/recommendations/${event.item_id}",
        "publish": "received_recommendations"
      }
    ]
  }
}

// chunk of logic B
{
  "subscribe": {
    "event": "received_recommendations",
    "actions": [
      {
        "render": {
          "view": "item_list",
          "data": "${event.response.items}"
        }
      }
    ]
  }
}

Of course, parser and interpreter for this JSON, and actions (such as: request and render) are written in the native code and, therefore, have to be deployed via application store every time you change them or add a new capability. One can implement views in the native code, or represent them in DSL (aka simplified HTML + CSS).

Bottom Line

As an industry, we understand why different application store vendors want to review every release of every application:

To maintain quality of the applications and avoid possible malware, and
To preserve their revenue streams intact (paid application payments have to go through vendors, so they will be able to take a commission).

Nevertheless, we need to reject manual review as a bad practice and aim for fully automated deployments; that deliver our new application releases to users in minutes. As an industry, we need to push companies running application stores to improve their process to enable us to do that.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Understanding Legacy Code Using Explorative Test-Driven Development Technique

2016-12-05T23:23:36+01:00

Today we are going to learn how to eliminate the fear of changing legacy code. We will learn how to confidently and in small iterations understand the legacy code better while increasing the test coverage in the process. While code examples are in Ruby programming language, the technique applied is language-agnostic.

For this article, we will need to define what Legacy Code means.

Legacy Code

Legacy code is challenging to understand when reading. Such code has no or close to no tests. Also, any legacy code brings the value to the business and customers.

Let’s give an outline of what we will be going through today:

We will define “Knowledge” and “Mutation” concepts in the context of the production code.
We will take a look into the relation between the production system and its test suite. While the connection from test suite to the production system is simple, the reverse connection is subtle and have some unusual and unexpected properties.
We will dismantle different test coverage metrics and outline the most valuable and useful one.
We will explore existing technique called Mutational Testing, that is simple to apply to the untested code to increase its test coverage.
We will introduce the technique called Explorative Test-Driven Development, that is an improvement of Mutational Testing method, which allows us to increase understanding of the legacy code in small steps - confidently and incrementally.
We will look at the example legacy code and apply Explorative TDD to it.
We will see the opportunities to use Explorative TDD technique outside the context of the Legacy Code.

Shall we get the ball rolling?

Knowledge in Production Code

“Knowledge in Production Code” is any small bit of functionality that represents any part of the business rule or underlying infrastructure rule. Example bits of knowledge in production code:

a variable assignment (or binding): a_variable = ...,
the presence of the if statement: if ... end,
an if condition: if has_certain_property(),
an if body: if ... do_something_interesting end,
the presence of the else clause: if ... else ... end,
an else body: if ... else do_something_different end,
function (or method) call: a_function(arguments), receiver.a_method(arguments),
every argument of the function (or method) call (including receiver),
a constant: 42,
the fact that function (or method) returns early: if ... return 42 end,
what the function (or method) returns,
the presence of the iteration: ...each do |x| ... end,
what we iterate through: list.each do ...,
and how we are iterating: ...each do |x| do_something_with(x) end,
and so on.

I think the idea “Knowledge in Production Code” should be more or less precise. More interesting is what we can do with knowledge in our system: we can re-organize knowledge differently keeping all the behaviors of the system - everyone calls this Refactoring nowadays; or do the opposite: change bits of knowledge without modifying the structure of the code - we will call one such change a Mutation:

Mutation

Mutation - granular change of the knowledge in the system that changes the behavior of the application. Let’s take a look at the simple example:

if cell_is_alive
  do_this
else
  do_some_other_thing
end

This code is maybe a part of some cell organism simulation (like Game Of Life or similar). Let’s see which different mutations can be applied here:

change the if condition always to be true: if true ...,
change the if condition always to be false: if false ...,
invert the if condition: if !cell_is_alive,
commenting out the if body: # do_this,
commenting out the else body: # do_some_other_thing.

With that done, let’s take a look at how production code and its test suite relate to each other.

Code and Test Suite Relationship

So, how does the test suite affect production code? First, it makes sure the production code is correct. Also, good test suite enables quick and ruthless refactoring by eliminating (or minimizing) the risks of breaking it. Well-crafted test suite gives us the power and courage to introduce changes. Also, test code always couples to the production code it is testing in one way or another.

Okay, how does the production system affect its test suite? As tests couple to the production code they test, the changes in production system may cause ripple effects on its test suite. Practically speaking, a mutation should always lead to a test failure if the test suite is good enough because its test suite should verify every tiny bit of knowledge in the production code (except, maybe, some configuration).

Such knowledge change is an act of assertion about the presence of the test. When information is covered by test suite well, there should be a test failure. If, after the introduction of the mutation, there is no test failure, this is a failed assertion about test presence or correctness. So one might say:

Knowledge Change is a Test for the Test

That is a fascinating idea since it implies we can use production code can as a test suite for its test suite, which may enable TDD-like iterative development of the test suite that does not exist.

So far, we have covered the idea of knowledge in the production code, explored ways of modifying this information in a way that changes the behavior - we call it a mutation, and also we explored the mirror-like relation between production code and its test suite. We have still much ground to cover, let’s dive in:

Most Useful Coverage Metric

There is a few well-known test coverage metrics that are being used quite often by software engineering teams, such as:

Line coverage, and
Branch coverage.

There is another one, called Path coverage - it is about coverage of all possible code paths in the system, which quickly becomes impractical as the application size grows because of the exponential growth of the amount of these different code paths.

Line coverage and Branch coverage (also, path coverage) all share one major problem - covered line/branch/path does not mean test suite verifies it - only executes it. Great example: remove all the assertions from your tests and the coverage metric will stay the same.

So, what if we could introduce all possible and sane mutations to our code and count how much of them cause test failure? - We will get the knowledge coverage metric. Another name for it is Test Semantic Stability, and it can range from 0% to 100%. Even 100% line/path coverage can easily yield 0% Test Semantic Stability. This metric proves that code is, indeed well-tested and verified (although, it does not say anything about tests’ design and cleanliness): make one assertion incorrect, or not precise enough and the metric will go down by a few mutations.

That makes Test Semantic Stability the most useful coverage metric.

So, how do we check if our test(s) cover well some bit of knowledge in the system? We break it! - Introduce a tiny granular breaking change to that bit of knowledge. The test suite should fail. If it does not - information is not covered well enough. That leads us to the technique that allows us to keep Semantic Test Stability up high:

Mutational Testing

Narrow the scope of work to a single granular piece of knowledge.
Break this knowledge (introduce simple granular breaking change - mutation).
Make sure there is a test suite failure.
Restore the knowledge to its original state (CTRL+Z, ideally).

Let’s see it in action:

if cell_is_alive
  do_this
else
  do_some_other_thing
end

First, we need to narrow our scope to a single bit of knowledge. For example, the if condition: if cell_is_alive. Then we need to introduce the mutation if true, and we need to make sure that there is a test failure. Let’s run the test suite:

$ rake test
....

Finished in 0.02343 seconds (files took 0.11584 seconds to load)
4 examples, 0 failures

Oh no! It did not fail anywhere! That means that we have a “failing test” for our test suite. In this case, we need to add the test for the negative case:

cell_is_alive = false
expect(did_some_other_thing).to eq(true)

When we run the test suite:

$ rake test
....F

Finished in 0.02343 seconds (files took 0.11584 seconds to load)
5 examples, 1 failure

It fails! Great - that means that our test for the test suite is passing now. As the last step of this mutational testing iteration we have to return the code to its original state:

if cell_is_alive
  do_this
else
  do_some_other_thing
end

After doing this, our tests should pass!:

$ rake test
.....

Finished in 0.02343 seconds (files took 0.11584 seconds to load)
5 examples, 0 failures

They do. That concludes one iteration of the mutational testing. Usually, to accomplish any useful behavior we would like to combine many bits of knowledge. If we want to understand better how the system works, we need to focus on groups of bits of knowledge. This is what Explorative TDD technique is about:

Explorative Test-Driven Development

The technique used to increase our understanding of the Legacy Code while enhancing its Test Semantic Stability (the most useful coverage metric). The process roughly looks like that:

Narrow scope to some manageable knowledge and isolate it (manageable knowledge = method/function/class/module).
Read, try to understand, pick a granular piece of knowledge, and make an assumption to which behavior it contributes and how.
Write a test to verify this assumption.
Make sure test passes (by altering the assumption or fixing production code (bugs)). PS: be careful with bugs, since they might be weird behaviors that are bringing someone tremendous value. When finding one of these, consult with stakeholders if that is a bug or a feature.
Apply Mutational Testing to each related granular piece of knowledge to verify that the understanding (and the test) is correct (this may introduce more tests).
Go back to 2

At this point, a nice example would help understand that technique:

Step-by-Step Example

Let’s imagine that we have some legacy system, that is a social network and allows for users to receive notifications on things that happened. You need to change slightly what “Followed” notification means. The code looks like this, and it does not have any tests:

class User
  def notifications
    notifications = Database
      .where("notifications") do |x|
        (x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||
        (x[1][0] == "favorited_notification" && StatusUpdate.find(x[1][2].to_i).owner_id == id) ||
        (x[1][0] == "replied_notification" && StatusUpdate.find(x[1][2].to_i).owner_id == id) ||
        (x[1][0] == "reposted_notification" && StatusUpdate.find(x[1][2].to_i).owner_id == id)
      end.map do |row|
        id, values = row
        kind = values[0]

        if kind == "followed_notification"
          {
            kind: kind,
            follower: User.find(values[1].to_i),
            user: User.find(values[2].to_i),
          }
        elsif kind == "favorited_notification"
          {
            kind: kind,
            favoriter: User.find(values[1].to_i),
            status_update: StatusUpdate.find(values[2].to_i),
          }
        elsif kind == "replied_notification"
          {
            kind: kind,
            sender: User.find(values[1].to_i),
            status_update: StatusUpdate.find(values[2].to_i),
            reply: StatusUpdate.find(values[3].to_i),
          }
        elsif kind == "reposted_notification"
          {
            kind: kind,
            reposter: User.find(values[1].to_i),
            status_update: StatusUpdate.find(values[2].to_i),
          }
        end
      end

    Analytics.tag({name: "fetch_notifications", count: notifications.count})
    notifications
  end
end

Narrow & Isolate

The first step is to isolate this code and make it testable. For this we need to find a low-risk way to refactor all dependencies that this code has:

Database.where,
StatusUpdate.find,
User.find, and
Analytics.tag.

We can promote these things to the following roles:

Database.where => @table_reader.where,
StatusUpdate.find => @status_update_finder.where,
User.find => @user_finder.find, and
Analytics.tag => @event_tagger.tag.

We should be able to have these default to their original values and also allow to substitute different implementation from the test. Also, it is helpful to pull out this method into the clean environment, where accessing a dependency, without us substituting it - is not possible, for example in a separate code-base, so that we can write a test “it works” and see what fails. The first failure is, of course, all our referenced classes are missing. Let’s define all of them without any implementation and make them fail at runtime if we ever call them from our testing environment:

class Database
  def self.where(table_name)
    fail "Database:nope"
  end
end

class Analytics
  def self.tag(event)
    fail "Analytics:nope"
  end
end

class StatusUpdate
  def self.find(id)
    fail "StatusUpdate:nope"
  end
end

class User
  # .. def notifications ..

  def self.find(id)
    fail "User:nope"
  end
end

In our tests, we need to implement our substitutes. For now, they all should be just simple double/stubs:

class FakeTableReader
  def where(table_name, &filter)
    [[nil, ["favorited_notification"]]]
  end
end

class FakeEventTagger
  def tag(event)

  end
end

class FakeUserFinder
  def find(id)
    User.new
  end
end

class FakeStatusUpdateFinder
  def find(id)
    StatusUpdate.new
  end
end

Then, we should write the simplest test, that sets up the stage and substitutes all the collaborators and runs the function under the test (no assertion, we are just verifying that we indeed replaced everything right):

it "works" do
  fake_table_reader = FakeTableReader.new
  fake_event_tagger = FakeEventTagger.new
  fake_user_finder = FakeUserFinder.new
  fake_status_update_finder = FakeStatusUpdateFinder.new

  user = User.new
             .with_table_reader(fake_table_reader)
             .with_event_tagger(fake_event_tagger)
             .with_user_finder(fake_user_finder)
             .with_status_update_finder(fake_status_update_finder)

  user.notifications
end

Since we have not defined all the with_* methods yet, let’s define them now and also define getters for particular instance variables (properties):

class User
  # ...

  def table_reader
    @table_reader ||= Database
  end

  def event_tagger
    @event_tagger ||= Analytics
  end

  def user_finder
    @user_finder || User
  end

  def status_update_finder
    @status_update_finder || StatusUpdate
  end

  def with_table_reader(table_reader)
    @table_reader = table_reader
    self
  end

  def with_event_tagger(event_tagger)
    @event_tagger = event_tagger
    self
  end

  def with_user_finder(user_finder)
    @user_finder = user_finder
    self
  end

  def with_status_update_finder(status_update_finder)
    @status_update_finder = status_update_finder
    self
  end
end

If we run our test, it should fail with RuntimeError: Database:nope in here:

def notifications
  notifications = Database            # <<<<<<
    .where("notifications") do |x|

To fix that, we will need to replace Database with table_reader getter. That will correct the current error, and we will get the next one: RuntimeError User:nope. Following all these failures and replacing direct dependencies with getters we will finally get a Green Bar (passing the test). Our function under the test will look like that:

class User
  def notifications
    notifications = table_reader
      .where("notifications") do |x|
        (x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||
            (x[1][0] == "favorited_notification" && status_update_finder.find(x[1][2].to_i).owner_id == id) ||
            (x[1][0] == "replied_notification" && status_update_finder.find(x[1][2].to_i).owner_id == id) ||
            (x[1][0] == "reposted_notification" && status_update_finder.find(x[1][2].to_i).owner_id == id)
      end.map do |row|
        id, values = row
        kind = values[0]

        if kind == "followed_notification"
          {
              kind: kind,
              follower: user_finder.find(values[1].to_i),
              user: user_finder.find(values[2].to_i),
          }
        elsif kind == "favorited_notification"
          {
              kind: kind,
              favoriter: user_finder.find(values[1].to_i),
              status_update: status_update_finder.find(values[2].to_i),
          }
        elsif kind == "replied_notification"
          {
              kind: kind,
              sender: user_finder.find(values[1].to_i),
              status_update: status_update_finder.find(values[2].to_i),
              reply: status_update_finder.find(values[3].to_i),
          }
        elsif kind == "reposted_notification"
          {
              kind: kind,
              reposter: user_finder.find(values[1].to_i),
              status_update: status_update_finder.find(values[2].to_i),
          }
        end
      end

    event_tagger.tag({name: "fetch_notifications", count: notifications.count})
    notifications
  end

  # ...
end

Structure and logic of the function did not change at all, but now all the dependencies are injectable and can be used to test it nicely. That concludes the first step - narrow & isolate. Now it is time to select a group of knowledge bits that we would like to cover with tests. Since we want to change how followed_notification is behaving, we might as well start checking there.

Trying to Understand & Writing 1st Test

The group of knowledge bits that are related to followed_notification looks like this:

notifications = table_reader
  .where("notifications") do |x|
    (x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||
    # ...
  end.map do |row|
    id, values = row
    kind = values[0]

    if kind == "followed_notification"
      {
          kind: kind,
          follower: user_finder.find(values[1].to_i),
          user: user_finder.find(values[2].to_i),
      }
    elsif #...
      # ...
    end
  end

# ...
notifications

Now we want to write a test. At the first thought, something like:

it "obtains followed notifications for the user" do
  # first create a user with all fakes (extracted to a helper method)
  user = create_user_with_fakes

  # then instruct our table reader fake to return prepared data
  fake_table_reader
      .insert("notifications",
              [1001, ["followed_notification", 2001, 3001]])

  # and expect that we have exactly one notification
  expect(user.notifications.count).to eq(1)
end

def create_user_with_fakes
  User.new(567)
      .with_table_reader(fake_table_reader)
      .with_event_tagger(fake_event_tagger)
      .with_user_finder(fake_user_finder)
      .with_status_update_finder(fake_status_update_finder)
end

class FakeTableReader
  def insert(table_name, row)
    tables(table_name) << row
  end

  def tables(table_name)
    @tables ||= {}
    @tables[table_name] ||= []
  end

  def where(table_name, &filter)
    tables(table_name).select(&filter)
  end
end

Making It Pass

This test fails right away - we don’t have any notifications. This is strange. Let’s take a closer look on the filtering that we are doing:

(x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||

I believe, we have satisfied the first part of this condition, but not the second one. The user id is not the same as the 3rd element of this row. Let’s make them same:

fake_table_reader
    .insert("notifications",
            [1001, ["followed_notification", 2001, 567]])
                                               # ^ here ^

This fails again! This code just keeps proving our assumptions wrong. I think we need to take a careful look at that it.to_s. .to_s is a conversion to string, so the foreign key is stored as a string (who could have thought?). Let’s try to make it work:

fake_table_reader
    .insert("notifications",
            [1001, ["followed_notification", 2001, "567"]])
                                                # ^ here ^

Applying Mutational Testing

If we run our tests, they pass! Great, now we know that this function is capable of obtaining some followed notifications. Of course, our coverage right now is super small. Let’s apply mutational testing to it. We should start from the condition:

(x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||

First, let’s replace the whole thing with false:

false ||

The test fails - mutant does not survive - our tests are covering for this mutation. Let’s try another one: replace the whole thing with true:

true ||

Our tests pass - mutant survives - this is a failing test for our tests. In this case, it is reasonable to write a new test for a case, when the full filtering expression should yield false - when we have notifications of an invalid kind:

it "ignores notifications of an invalid kind" do
  user = create_user_with_fakes

  fake_table_reader
      .insert("notifications",
              [1001, ["invalid", 2001, "567"]])

  expect(user.notifications.count).to eq(0)
end

As a result, we should not get any notifications. After running, we see that our test fail. Great! This mutant no longer survives. Let’s see if our tests will pass when we undo the mutation:

(x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||

And they all pass! Next mutation is inverting the whole condition:

! (x[1][0] == "followed_notification" && x[1][2] == id.to_s) ||

All our tests are RED. Which means that this mutant does not survive and the test for our test is green. Now, we should dig deeper into the parts of the condition itself:

x[1][0] == "followed_notification": replacing with true, false, and inverting it; also, changing numeric and string constants; These all changes did not produce any surviving mutants, so we do not need to introduce new tests.
x[1][2] == id.to_s: replacing with true, false and inverting it; also, changing numeric constants.

Replacing x[1][2] == id.to_s with true, apparently, leaves all our tests passing - a mutant that survives - a failing test for our test suite. It is time to add this test - when we have notifications of some different user:

it "ignores notifications of different user" do
  user = create_user_with_fakes

  fake_table_reader
      .insert("notifications",
              [1001, ["followed_notification", 2001, "other user"]])
                                                   # ^   here   ^

  expect(user.notifications.count).to eq(0)
end

As you can see, having a record with the different user id (in this case, even nonsensical user id) makes our test fail, which means that this mutant no longer survives. Let’s see if undoing the mutation will turn our tests GREEN:

(... && x[1][2] == id.to_s) ||

All our tests pass again. I think we have finished testing the condition in the filter. I would not touch the conditions that are related to different kinds of notifications, as we want to introduce changes only to “Followed” notifications. So we can dig further into the logic of our group of knowledge bits:

id, values = row
kind = values[0]

if kind == "followed_notification"
  {
      kind: kind,
      follower: user_finder.find(values[1].to_i),
      user: user_finder.find(values[2].to_i),
  }
elsif #...
  # ...
end

So, we can see that we split the row into its id and all the other values of the notification record. Apparently, the first value is responsible for the kind, where we are switching on it to construct correct object (in this case just a lump of data - hash map). So let’s try to mutate the numeric constant in kind = values[0]:

kind = values[1]
          #  ^^^

All our tests still pass. That is a failing test for our test suite. We ought to write a new test now. Where we should verify that it constructs correct lumps of data:

it "constructs correct followed notification" do
  user = create_user_with_fakes

  fake_table_reader
      .insert("notifications",
              [1001, ["followed_notification", 2001, "567"]])

  expect(user.notifications[0][:kind]).to eq("followed_notification")
end

This test fails, because our user.notifications[0] Is nil, because none of if or elsif matched the kind variable and in Ruby, by default any function returns a nil value. This failing test means that we no longer have surviving mutant and let’s see if undoing that mutation will make our tests pass:

kind = values[0]
          #  ^^^

It does, all our tests are green now. We should continue like this until we understand code enough and have enough confidence in our tests so that we can make our desired change to the system. When we think we have finished, we should integrate isolated code back to the legacy system, leaving all the fakes and injection capabilities in place. We were separating this code only to make sure, that we are not calling any dependencies on accident (while they just work silently). While integrating it back we, of course, get rid of fail "NAME:nope" implementations of collaborators. With such approach, integrating the code back should be as simple as copy-pasting the test suite code and production code (function under the test, and injecting facilities) without copying always-failing collaborators.

We will have to wrap up the example, and if you, my reader, would like to continue applying Explorative TDD to this code, you can find the code here: https://github.com/waterlink/explorative-tdd-blog-post (specifically, spec/user_spec.rb). The function originates from this example project: https://github.com/waterlink/lemon

Can Explorative TDD Help Me Outside of Legacy Code?

The answer is yes! I use Explorative TDD (as well as mutational testing) in following cases:

During big refactorings, such as Extract class/module/package. The technique helps you quickly understand which tests have to be moved as well to the new test suite (only if you want to transfer them).
When refactoring tests. The technique helps you to verify if your tests are still working as they are intended to and if they are still semantically stable (they catch a majority of mutants).
To measure rigidity of test-to-code coupling. If a single mutation leads to half of your test suite failing (even irrelevant tests) - tests need refactoring.

Bottom Line

Today we have learned about concepts like “Knowledge in production code” and “Mutation.” Also, we learned what Test Semantic Stability is the best code coverage metric. We have seen Mutational Testing and Explorative TDD techniques at work. We could start applying these techniques (after some practice) to stop fearing the legacy code and just handle it as some tedious routine operation.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Build Your Own Testing Framework. Part 5

2016-11-13T12:50:51+01:00

Welcome back to the new issue of “Build Your Own Testing Framework” series! Did you notice, that out testing framework quits on the first failure? It probably should run all tests, collect all failures and present them nicely. This is what we are going to accomplish today:

Make sure all tests run even when there is a failure.
Make sure exit code is correct.

This article is the fifth one of the series “Build Your Own Testing Framework” so make sure to stick around for next parts! Find all posts of these series can here.

Shall we get started?

Catch and report a test failure

Our test suite should no longer bubble up any exceptions. We can achieve that by making an appropriate assertion. And also we should verify that other tests execute after the failure:

runTestSuite(function FailureTest(t) {
    this.testItDoesNotBubbleUpExceptions = function () {
        var aSpy = t.spy();

        t.assertNotThrow(function () {
            runTestSuite(function (t) {
                this.testFailure = function () {
                    t.assertTrue(false);
                };

                this.testSomething = aSpy;
            });
        });

        aSpy.assertCalled();
    };
});

As expected, this fails with an appropriate error Error: Expected not to throw error, but thrown 'Expected to be true, but got false' indicating that we are bubbling up all errors at the moment. Also, notice how the execution of the whole test suite stops at that point, and it just exits the program with error code 1. A simple try .. catch block will fix the issue:

// in runTestSuite function
    for (var testName in testSuitePrototype) {
        if (testName.match(/^test/)) {
            reporter.reportTest(testName);
            var testSuite = createTestSuite(testSuiteConstructor);

            try {
                testSuite[testName]();
            } catch (error) {
                // do nothing, for now
            }
        }
    }

All tests now run successfully. This code is starting to become unreadable, so it is a good point to refactor. We will:

Extract whole try .. catch as a function runTest. Its current responsibility is only to run the test and ignore any failure;
Extract contents of if statement that matches the test name as a function handleTest. Its responsibility is to report the test, create a fresh testSuite and kick off runTest;
Extract the whole for statement as runAllTests.

Here is the final snippet of code:

function runTest(testSuite, testName) {
    try {
        testSuite[testName]();
    } catch (error) {
        // do nothing, for now
    }
}

function handleTest(reporter, testName, testSuiteConstructor) {
    reporter.reportTest(testName);
    runTest(createTestSuite(testSuiteConstructor), testName);
}

function runAllTests(reporter, testSuitePrototype, testSuiteConstructor) {
    for (var testName in testSuitePrototype) {
        if (testName.match(/^test/)) {
            handleTest(reporter, testName, testSuiteConstructor);
        }
    }
}

function runTestSuite(testSuiteConstructor, options) {
    options = options || {};
    var reporter = options.reporter || new SimpleReporter();

    var testSuitePrototype = createTestSuite(testSuiteConstructor);

    reporter.reportTestSuite(
        getTestSuiteName(testSuiteConstructor, testSuitePrototype)
    );

    runAllTests(reporter, testSuitePrototype, testSuiteConstructor);
}

Exit with code 1

Now, when at least one test fails in a suite of tests, the whole suite should fail (after running the rest of its tests). And the indicator of such failure should be an exit code of the process. Let’s write a test:

runTestSuite(function FailureTest(t) {
    // ...

    this.testItExitsWithProcessCodeOne = function () {
        var processSpy = new ProcessSpy();

        runTestSuite(function (t) {
            this.testFailure = function () {
                t.assertTrue(false);
            };
        }, {process: processSpy});

        t.assertEqual(1, processSpy.hasExitedWithCode);
    };
});

As you might guess, we will need another object. It will be responsible for interaction with our process, i.e.: something that we can ask to “exit with code 1.” Because we can not ask our process to exit within the test run, we will have to create a spy. And we shall test-drive its functionality. There is something interesting that we should worry about before that - our test suite is passing currently.. but it shouldn’t be!

Let’s step back and think what just happened: clearly, we are writing the test, that can not possibly pass because we do not have ProcessSpy yet. So we are expecting a failure - we are expecting a thrown exception. That expectation is an important part of test-driven development: at all times we expect a very specific failure or we expect our tests to pass; if we do not receive a failure when expected and receive an unexpected failure, we should stop right there and think which part of our thinking and our assumptions is incorrect.

Right now, tests do not fail, because we are ignoring all exceptions in our try .. catch that we introduced a couple of minutes ago. If we want to see failures again, let’s modify catch block to just log all errors it receives:

function runTest(testSuite, testName) {
    try {
        testSuite[testName]();
    } catch (error) {
        console.log(error);
    }
}

Now our test suite outputs an expected error: ReferenceError: ProcessSpy is not defined. Also, it outputs some other failures that happen in our nested runTestSuite calls - we should fix them by providing silenceFailures option for nested runTestSuite call. We can focus now on the ProcessSpy failure and test-drive it:

runTestSuite(function ProcessSpy_BehaviorTest(t) {
    var processSpy = new ProcessSpy();
});
// => ReferenceError: ProcessSpy is not defined

function ProcessSpy() {}
// => PASS

    this.testHasExitedWithCode_initiallyIsNull = function () {
        t.assertEqual(null, processSpy.hasExitedWithCode);
    };
// => Error: Expected to equal null, but got: undefined

function ProcessSpy() {
    this.hasExitedWithCode = null;
}
// => PASS

    this.testHasExitedWithCode_isZero_afterExitZeroCall = function () {
        processSpy.exit(0);
        t.assertEqual(0, processSpy.hasExitedWithCode);
    };
// => TypeError: processSpy.exit is not a function

// in ProcessSpy
    this.exit = function (code) {
        this.hasExitedWithCode = 0;
    };
// => PASS

    this.testHasExitedWithCode_isOne_afterExitOneCall = function () {
        processSpy.exit(1);
        t.assertEqual(1, processSpy.hasExitedWithCode);
    };
// => Error: Expected to equal 1, but got: 0

// in ProcessSpy
    this.exit = function (code) {
        this.hasExitedWithCode = code;
        // changed 0 to code      ^here^
    };

I think we have finished test-driving the functionality of ProcessSpy. It is time to get back to our failing test for a failure resulting in an exit with code 1. When we run this test suite, we are getting the following error message: Error: Expected to equal 1, but got: null.‘ To pass this test, we will need to store the fact that we had a failure somewhere and at the end of the test suite run we can trigger exit with code 1 or 0, respectively. We could pass around a status object with boolean property status.failed and set it to true in our catch block:

} catch (error) {
    if (!silenceFailures) console.log(error);
    status.failed = true;
}

And at the end of runTestSuite function we could call process.exit(1) if status.failed was true:

function runTestSuite(testSuiteConstructor, options) {
    // ...

    if (status.failed) {
        process.exit(1);
    }
}

While this works (as in “tests pass after providing fakeProcess where needed for nested failing runTestSuite calls”) state changes in this code are starting to be hard to follow and function signatures remind me of some horror movie:

function getTestSuiteName(testSuiteConstructor, testSuitePrototype)
function runTest(testSuite, testName, silenceFailures, status)
function handleTest(reporter, testName, testSuiteConstructor, silenceFailures, status)
function runAllTests(reporter, testSuitePrototype, testSuiteConstructor, silenceFailures, status)

These signatures smell like objects are hiding there in these functions. Let’s find them!

Quest for hidden objects

First, let’s extract the method object from the function runTestSuite. We will give it a name TestSuiteRunContext:

function TestSuiteRunContext(testSuiteConstructor, options) {
    options = options || {};
    var reporter = options.reporter || new SimpleReporter();
    var process = options.process || global.process;
    var silenceFailures = options.silenceFailures || false;

    var status = {failed: false};

    var testSuitePrototype = createTestSuite(testSuiteConstructor);

    this.invoke = function () {
        reporter.reportTestSuite(
            getTestSuiteName(testSuiteConstructor, testSuitePrototype)
        );

        runAllTests(
            reporter,
            testSuitePrototype,
            testSuiteConstructor,
            silenceFailures,
            status
        );

        if (status.failed) {
            process.exit(1);
        }
    };
}

function runTestSuite(testSuiteConstructor, options) {
    new TestSuiteRunContext(testSuiteConstructor, options).invoke();
}

Now, if we were to move function runAllTests inside of this class, we would not need all these arguments (and all other functions we call):

this.invoke = function () {
    reportTestSuite();
    runAllTests();
    finishTestRun();
};

function reportTestSuite() {
    reporter.reportTestSuite(getTestSuiteName());
}

function getTestSuiteName() {
    if (typeof(createTestSuite().getTestSuiteName) !== "function") {
        return testSuiteConstructor.name;
    }

    return createTestSuite().getTestSuiteName();
}

function createTestSuite() {
    return new testSuiteConstructor(assertions);
}

function runAllTests() {
    for (var testName in createTestSuite()) {
        if (testName.match(/^test/)) {
            handleTest(testName);
        }
    }
}

function handleTest(testName) {
    reportTest(testName);
    runTest(createTestSuite(), testName);
}

function reportTest(testName) {
    reporter.reportTest(testName);
}

function runTest(testSuite, testName) {
    try {
        testSuite[testName]();
    } catch (error) {
        if (!silenceFailures) console.log(error);
        status.failed = true;
    }
}

function finishTestRun() {
    if (status.failed) {
        process.exit(1);
    }
}

It already looks very nice. The only thing that I do not like about this object yet is that it has stateful properties and stateless properties. I like to have my objects separated by this concern. Let’s extract status mutable property as a proper TestSuiteRunStatus object:

function TestSuiteRunStatus() {
    var failed = false;

    this.markAsFailed = function () {
        failed = true;
    };

    this.hasFailed = function () {
        return failed;
    };
}

function TestSuiteRunContext(testSuiteConstructor, options) {
  // ...
  var status = new TestSuiteRunStatus();

  // ...
  function runTest(testSuite, testName) {
        try {
            testSuite[testName]();
        } catch (error) {
            if (!silenceFailures) console.log(error);
            status.markAsFailed();
        }
    }

    function finishTestRun() {
        if (status.hasFailed()) {
            process.exit(1);
        }
    }
}

I think we have finished the refactoring. Now we should verify that test suite exits with the code 0 when everything passes:

this.testItExitsWithProcessCodeZero_onSuccess = function () {
    runTestSuite(function (t) {
        this.testFailure = function () {
            t.assertTrue(true);
        };
    }, {process: processSpy, silenceFailures: true});

    t.assertEqual(0, processSpy.hasExitedWithCode);
};
// => Error: Expected to equal 0, but got: null

function finishTestRun() {
    if (status.hasFailed()) return process.exit(1);
    process.exit(0);
}
// => PASS

Bottom Line

I think we have finished implementing exit code reporting. The code can be found here: https://github.com/waterlink/BuildYourOwnTestingFrameworkPart5

There is still a lot to go through. In a few next episodes we will:

Report OK and FAIL for each test;
Output carefully formatted failures to the STDERR;
Enable our testing framework to run multiple test suite files at once;
Enable our testing framework to run in a browser (it is javascript after all).

Stay tuned!

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on Twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Do More With Baby-Steps TDD

2016-10-19T23:15:12+02:00

Hello everyone! I’m usually advocating for the Baby-Steps Test-Driven Development with Triangulation. On the first encounter, this technique seems very verbose and everybody wonders how can it possibly work and why I am very productive with it. Let me tell you about that. First, let’s quickly recap both techniques:

Baby-Steps TDD

In Baby-Steps TDD the basic strategy is to get to the green state ASAP. If you can pass all tests with return 42, you should! While the benefit of the approach is not directly obvious exploring the alternative shows its value. One possible alternative is to write a bunch of tests for the software and then make them all pass. This results in a lot of changes made to the software under the test while tests are failing (are in red state). This provides very slow feedback and high risks because with every decision in the code complexity grows exponentially and problems are hard to find when you only know that software worked 1 hour ago and there is one mistake in a whole 1 hour worth of work. The same effect can be seen if the most complex test is written first so that it forces the engineer to implement the whole solution or big part of it in one go.

Baby-Steps TDD mitigates the issue by ensuring everything worked one or two minutes ago. At least to the extent the software is specified by currently written tests. So if something does not work as expected it is most probably a mistake in these last 2-3 lines of code that we have written. And we can even discard them entirely with “undo” command and start over from the green state without losing much work and saving a whole lot of time debugging.

Baby-Steps TDD provides faster feedback and fewer risks for the cost of a bit more overall effort. Let’s take a look into the triangulation technique:

Triangulation Technique

In essence, Triangulation Technique takes ideas of Baby-Steps TDD further and reduces step size even further. For example, when usually with Baby-Steps TDD you would need one test to introduce the correct if statement, with Triangulation Technique and Baby-Steps TDD combined you would use multiple tests for this:

The first test for the most degenerated case which requires one to write a simple return CONSTANT statement to pass:

return 1

The second test for the same case where the result will be different which requires one to promote CONSTANT to some sort of calculation (variable, formula or function call):

return n

The third test for the other case which requires one to write a specific if (argument == SPECIFIC_VALUE) with another return ANOTHER_CONSTANT:

if (n == 7)
  return 42

return n

The fourth test for the same case where the result will be different which requires one to promote ANOTHER_CONSTANT to some sort of calculation (variable, formula or function call):

if (n == 7)
  return m * 6

return n

The fifth test for the same case where the condition has to be different which requires one to promote specific condition to the proper one:

if (n >= 7)
  return m * 6

return n

In normal Baby-Steps TDD that would probably have been only 2 or 3 test cases. With Triangulation it is 5 and to make every one of them pass requires a simple transformation of the production code.

Now let’s see why these techniques combined make me more productive.

Willpower Depletion

Did you know that every decision you make costs you some willpower? For example: choosing what to wear in the morning, refusing to eat this tasty cake or to make a design choice in your code. This phenomenon is known as Ego Depletion (see in wiki) and it has an experimental evidence. According to this phenomenon self-control and willpower both draw upon a limited pool of mental resources and it can be used up. Usually, these resources are recovered greatly during good night’s sleep or slightly after consuming food. Cost per each made decision differs also and even for the same kind of decision can depend on various factors:

perceived complexity of the problem,
mood, physical state and perceived fatigue of the person (tired, angry, confused, happy, energized, etc.),
required effort to make and execute the decision,
blood glucose levels.

Baby-Steps TDD combined with Triangulation technique optimize for “perceived complexity of the problem” so that every decision is nearly obvious to make and the effort required to make it and execute on it is stupidly small. While this increases the amount of decisions that I need to make it also decreases the complexity and willpower cost of each decision to the point, where after completing the same amount of work I still have plenty of energy and willpower to make any other decisions at work and outside of it.

It is worth noting that these techniques need to be practiced quite a bit to enable this effect - with such incremental design it is important to avoid getting stuck. Definitely, try it out and see if it works for you!

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Acknowledgements

Thanks to David Völkel for the great presentation about Baby-Steps TDD. Slides can be found here.

Thanks to Stephen Guise for the great book “Mini Habits” that has opened my eyes to the reasons why I like these techniques so much and why I love designing software in tiny increments.

HighScore Kata

2016-09-30T18:23:21+02:00

Hello, everyone. Today we will take a look into a little problem involving high scores in some sort of game. The game has only one high score and when current game’s score exceeds that number, it gets updated. Example acceptance test would read like this:

Given high score is 174
When player scores 191
Then high score is 191

Current implementation stores high score in the web browser’s local storage. This detail does not change the purpose of this Kata very much since any other platform and language can have its own analog of local storage (file system, in-memory or local database, application settings, etc.). HighScore object looks like this:

function Highscore() {
    // initially, load high score value from the local storage
    this.load();
}

Highscore.prototype.updateHighscore = function (score) {
    // check if we need to update high score
    if (score > this.highscore) {
        this.highscore = score;
        this.save();
    }

    // render the high score in the
    // id="highscore" element in the browser
    $("#highscore").text("HIGHSCORE: " + this.highscore);
};

Highscore.prototype.load = function () {
    // localStorage is storing everything as strings,
    // so we need to convert it to number
    this.highscore = parseFloat(localStorage.highscore);
};

Highscore.prototype.save = function () {
    localStorage.highscore = this.highscore;
};

Task for the Kata:

Write tests for this class.
Fix the bug: when a game is launched on the new client (without the high score stored), it renders HIGHSCORE: NaN. (NaN is javascript’s abbreviation for “not a number”). parseFloat most probably is a culprit for this.
Extract storing mechanisms, so that class can be re-used with different storage mechanisms (for example local database, or external REST API).
Make all tests runnable outside of the context of the browser (for example, on nodejs).

The focus of this Kata is on architectural boundaries, that this little innocent class spans.

Questions to ask yourself:

How much distinct responsibilities this class has?
What architectural boundaries should I draw through this class?
- How do I split this class according to these boundaries?
What is the easiest way and what is the proper way to make tests runnable outside of the context of the browser?
How would this code look like in different kinds of languages? (static, dynamic, object-oriented, functional, strong-typed, etc.).
- And how the solution for the Kata will look like for these?

Next time we will take a look at one possible solution for this Kata. Try to solve it on your own, my dear reader, and please share the code and insights!

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Meet Duck Type

2016-09-18T16:37:33+02:00

Duck type

Duck type is the concept in the domain of the type safety that represents objects, that pass a so-called “Duck Test”:

If it looks like a duck, swims like a duck, and quacks like a duck, then it probably is a duck.

In terms of programming language, it might look like this:

function Duck() {
    this.swim = function (coordinates) { ... };
    this.quack = function (sentence) { ... };
}

function RoboDuck() {
    this.swim = function (coordinates) { ... };
    this.quack = function (sentence) { ... };
}

// .. and so on ..

The point is that the public interface has methods swim() and quack(). This is how you identify the duck in a programming language. This concept is very similar to the concept of the interface in programming languages that have one, but it is not enforced in any way by the programming language.

Duck typing is mostly natural in dynamic languages, where it is possible to send any message to any object and the check if that is something possible will happen at runtime. In static languages, it is still possible to use duck typing via some sort of Reflection.

Contract test for duck types

In a dynamic language, it is important to make it obvious, that something is implementing certain duck type by writing one test suite for all implementers and executing it against them. For example:

[Duck, RoboDuck].forEach(function (duckType) {
    runTestSuite(function (t) {
        this.testItSwimsLikeADuck = function () {
            var duck = new duckType();
            duck.swim({x: 5, y: 7});
            t.assertThat(duck).swamTo({x: 5, y: 7});
        };

        this.testItQuacksLikeADuck = function () {
            var duck = new duckType();
            duck.quack("hello world");
            t.assertThat(duck).quacked("hello world");
        };
    });
});

This test suite has to go only through the Duck type public interface. If it is not possible to test behavior through it, one should test at least the function signatures, for example:

this.testItSwimsLikeADuck = function () {
    var duck = new duckType();
    var swim = duck.swim;
    t.assertEqual("function", typeof(swim));
    t.assertEqual(1, swim.length);
};

Not doing contract tests for your ducks may result in a passing test suite and broken production code. For example, when one duck and its test suite have been updated, but others haven’t.

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Introducing Test Doubles

2016-09-18T16:31:48+02:00

A test double is a test object or a test function, that looks and behaves like its production counterpart, but is actually a simplified version that reduces the complexity and enables simpler testing. One can represent all types of test double as an inheritance tree like this:

Where Double is an abstract test double, which has no functionality - it is a general concept to talk about test doubles.

Dummy - is a test double, that is used to fill parameter lists, in cases where these parameters are not used by production code. Simplest Dummy would look like this:

function ExampleDummyObject() {
  this.doSomething = function () {};

  this.getSomething = function () {
    return null;
  };
}

function exampleDummyFunction() {
  return null;
}

// Usage example in test
someObject.someMethod(dummyObject);
someFunction(exampleDummyFunction);

Stub - is a test dummy, additionally, providing an indirect input for the production code from the test. “Indirect” means here via a method call on the stub object or a call of the stub function. For example:

function ExampleStubObject() {
  var something = null;

  this.getSomething = function () {
    return something;
  };

  this.stubSomething = function(somethingValue) {
    something = somethingValue;
  };
}

var someValue = null;
function exampleStubFunction() {
  return someValue;
}

// Usage example in the test
stubObject.stubSomething("a value from the test");
anObject.aMethod(stubObject);

someValue = "a value from the test";
someFunction(exampleStubFunction);

Spy - is a test stub, additionally, verifying an indirect output of the production code, by asserting afterward, without having defined the expectations before the production code is executed. For example:

function ExampleSpyObject(assertions) {
  var didSomethingWithName = null;
  var somethingValue = null;

  this.doSomethingWith = function (name) {
    didSomethingWithName = name;
    return somethingValue;
  };

  this.stubSomething = function (something) {
    somethingValue = something;
  };

  this.assertDidSomethingWithName = function(expectedName) {
    assertions.assertTrue(
      didSomethingWithName === expectedName,
      "Expected to do something with '" + expectedName + "'"
    )
  };
}

var withName = null;
var exampleValue = null;
function exampleSpyFunction(name) {
  withName = name;
  return exampleValue;
}

function verifyExampleSpyFunction(expectedName) {
  t.assertTrue(
    withName === expectedName,
    "Expected to be called with '" + expectedName + "'"
  )
}

// Usage in the test
spyObject.stubSomething("a value from the test");
anObject.aMethod(spyObject);
spyObject.assertDidSomethingWithName("helloWorld");

exampleValue = "a value from the test";
someFunction(exampleSpyFunction);
verifyExampleSpyFunction("helloWorld");

Mock - is a stub, but the expectations are defined before the execution of the production code and it can verify itself after the execution. A simple example:

function ExampleMockObject() {
  var expectedName = null;
  var fulfilled = false;
  var somethingValue = null;

  this.expectWillDoSomethingWithName = function (name) {
    expectedName = name;
  };

  this.doSomething = function (name) {
    assertions.assertTrue(
      name === expectedName,
      "Unexpected name '" + name + "',"
        + " expecting: '" + expectedName + "'"
    );

    fulfilled = true;
    return somethingValue;
  };

  this.stubSomething = function (value) {
    somethingValue = value;
  };

  this.verify = function () {
    assertions.assertTrue(
      fulfilled,
      "Expected to receive name '" + expectedName + "', "
        + "but got nothing"
    );
  };
}

// And the usage from the test:
mockObject.stubSomething("a value from the test");
mockObject.expectWillDoSomethingWithName("helloWorld");
anObject.aMethod(mockObject);
mockObject.verify();

Mocks can be much more complex (verifying order of messages, allowing multiple messages to be sent, etc.). So it is recommended to either:

avoid them and use simpler test doubles, or
use a full-blown well-tested mocking framework.

And if you do have to use your own custom mocks, please, write tests for them, since they can have a lot of logic inside of them.

And, finally, Fake - is a test double providing a simpler implementation used in the tests instead of the real thing. A good example is an in-memory database gateway, that behaves the same way the real one would, but it stores all the data in the memory. A very simple example:

function FakeDatabase() {
  var objects = {};

  this.save = function (id, object) {
    objects[id] = object;
  };

  this.findById = function (id) {
    return objects[id];
  };

  this.findByName = function (name) {
    for (var id in objects) {
      if (objects.hasOwnProperty(id)) {
        if (objects[id].name === name) {
          return objects[id];
        }
      }
    }

    return null;
  };
}

Obviously, fakes require full-blown testing for them. And if the real implementation is testable (even if it is slow), it is a good idea to have the same test suite for both: fake and real implementation. This way we can really be sure, that the fake behaves the same way as the real thing. And don’t forget about the edge cases, for example, if the real thing can throw a ConnectionError, the fake should be able too (after being instructed to do so via a special method in the tests).

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Build Your Own Testing Framework. Part 4

2016-09-17T10:00:32+02:00

Welcome back to the new issue of “Build Your Own Testing Framework” series! As you might have noticed, currently, our testing framework only outputs failures and nothing else. It is impossible to know if it actually runs any tests when they all pass because there is no output. Today we will implement a simple reporter for our testing framework. It will report the name of the test suite and names of the tests that are being executed, for example:

SpyTest
    testIsNotCalledInitially
    testAssertNotCalledFailsWhenWasCalled
    testIsCalledAfterBeingCalled
    testAssertCalledFailsWhenWasNotCalled

This article is the fourth one of the series “Build Your Own Testing Framework”, so make sure to stick around for next parts! All articles of these series can be found here.

Shall we get started?

Render the name of the test suite

So where should the name of the test suite come from? Probably it should be a test suite class name. Currently, all of them are anonymous classes and therefore don’t have a name:

runTestSuite(function () {
  //         ^          ^
  //       - no name here -
  // ...
});

We would like all test suites to have that name, for example:

runTestSuite(function SpyTest() {
  //                 ^       ^
  //            - here is the name -
  // ...
});

We should write a test for this case:

Create a test suite with the name
Run the test suite with function runTestSuite
Assert that the test suite name is reported

Let’s try to write a test in a RunTestSuiteTest.js test suite for that:

this.testItOutputsNameOfTheTest = function () {
  runTestSuite(function TestSuiteName(t) {});

  // TODO: assert that the test suite name is reported
};

Now it is problematic: how are we going to assert that something is reported? Should we replace console.log(message) or process.stdout.write(message) with our own implementation, so that we can test it?:

var logged = "";
var oldConsoleLog = console.log;

console.log = function (message) {
  logged = logged + message + "\n";
};

And then we should be able to assert with: t.assertTrue(logged.indexOf("TestSuiteName") >= 0). Finally we will need to restore the old console.log function:

this.testItOutputsNameOfTheTest = function () {
  var logged = "";
  var oldConsoleLog = console.log;

  console.log = function (message) {
    logged = logged + message + "\n";
  };

  runTestSuite(function TestSuiteName(t) {});

  t.assertTrue(logged.indexOf("TestSuiteName" >= 0));

  console.log = oldConsoleLog;
};

While this code works, it has multitude of problems:

If the test fails then oldConsoleLog function is not restored;
It has too much setup (which we could extract as a function);
It has teardown (which would be nice to avoid if we could);
It is hard to read because from 8 lines of code only 2 are delivering the core intent;
And it is testing how exactly test suite name is being reported, which is basically a View-like concern.

And fixing the last problem will actually fix everything else because this problem causes others. We can fix it by introducing some sort of Reporter type, that can respond to reportTestSuite(name) message:

this.testItOutputsNameOfTheTest = function () {
  runTestSuite(function TestSuiteName(t) {
  }, {reporter: reporter});

  t.assertTrue(reporter.hasReportedTestSuite("TestSuiteName"));
  // or even better:
  reporter.assertHasReportedTestSuite("TestSuiteName");
};

reporter in this case is some sort of test double. And what are they? - Find out here: Introducing Test Doubles.

Implementing the reporter spy

So our reporter object in the test seems terribly like a Spy Double to me, let’s test-drive it:

// test/ReporterSpyTest.js
var runTestSuite = require("../src/TestingFramework");
var ReporterSpy = require("./ReporterSpy");

runTestSuite(function ReporterSpy_BehaviorTest(t) {
  var reporter = new ReporterSpy(t);

  // Let's write our first test:
  this.testAssertHasReportedTestSuite_whenFailing = function () {
    t.assertThrow(
      "Expected test suite 'HelloWorld' to be reported",
      function () {
        reporter.assertHasReportedTestSuite("HelloWorld");
      }
    );
  };
});

// Error: Cannot find module './ReporterSpy'

// Create file test/ReporterSpy.js

Now we are getting the following error:

//     var reporter = new ReporterSpy(t);
//                    ^
//
// TypeError: ReporterSpy is not a function

We need to create ReporterSpy object now:

module.exports = function ReporterSpy(assertions) {

};

Now we are getting:

// Error: Expected to equal
//   Expected test suite 'HelloWorld' to be reported,
// but got:
//   reporter.assertHasReportedTestSuite is not a function

Now we need to create a function assertHasReportedTestSuite(name) for out ReporterSpy:

this.assertHasReportedTestSuite = function (expectedName) {
  assertions.assertTrue(
    false,
    "Expected test suite 'HelloWorld' to be reported"
  );
};

Next we need to make sure, that expectedName is actually present in the error message by triangulating with different name:

this.testAssertHasReportedTestSuite_whenFailing_withOtherName = function () {
  t.assertThrow("Expected test suite 'OtherTestSuite' to be reported", function () {
    reporter.assertHasReportedTestSuite("OtherTestSuite");
  });
};

// Error: Expected to equal
//   Expected test suite 'OtherTestSuite' to be reported,
// but got:
//   Expected test suite 'HelloWorld' to be reported

// And we need to change the respective string:
"Expected test suite '" + expectedName + "' to be reported"

Then we need to make sure that we do succeed when the message is received:

this.testAssertHasReportedTestSuite_whenSucceeding = function () {
  t.assertNotThrow(function () {
    reporter.reportTestSuite("HelloWorld");
    reporter.assertHasReportedTestSuite("HelloWorld");
  });
};

// Error:
//   Expected not to throw error,
// but thrown
//   'reporter.reportTestSuite is not a function'

// So we need to define this function in ReporterSpy:
this.reportTestSuite = function (name) {

};

// Error:
//   Expected not to throw error,
// but thrown
//   'Expected test suite 'HelloWorld' to be reported'

// Now we need to provide the simplest implementation we can,
// we can do that by introducing the boolean variable:

module.exports = function ReporterSpy(assertions) {
  // initially nothing is reported
  var hasReported = false;

  this.assertHasReportedTestSuite = function (expectedName) {
    assertions.assertTrue(
      // we should fail only when nothing was reported
      hasReported,
      "Expected test suite '" + expectedName + "' to be reported"
    );
  };

  this.reportTestSuite = function (name) {
    // and we mark it as reported when we do receive the message
    hasReported = true;
  };
};

And all our tests pass. Now, when the wrong name is getting reported we should still fail:

this.testAssertHasReportedTestSuite_whenReporting_andFailing = function () {
  t.assertThrow("Expected test suite 'HelloWorld' to be reported", function () {
    reporter.reportTestSuite("OtherTestSuite");
    reporter.assertHasReportedTestSuite("HelloWorld");
  });
};

// Error: Expected to throw an error,
// but nothing was thrown

// Now we need to actually store the name of reported test suite:

module.exports = function ReporterSpy(assertions) {
  // initially, we didn't receive any reports
  var testSuiteName = null;

  this.assertHasReportedTestSuite = function (expectedName) {
    assertions.assertTrue(
      // we fail only if received testSuiteName is not right
      testSuiteName === "HelloWorld",
      "Expected test suite '" + expectedName + "' to be reported"
    );
  };

  this.reportTestSuite = function (name) {
    // and we need to store the reported name
    testSuiteName = name;
  };
};

And all tests pass again. Although, we should notice this weird condition:

testSuiteName === "HelloWorld"

Looks like our current production code is not generic enough, it will work well only with the expectedName equal to "HelloWorld". Let’s fix that by triangulating over this parameter:

this.testAssertHasReportedTestSuite_whenReporting_andFailingWithDifferentName = function () {
  t.assertThrow("Expected test suite 'OtherTestSuite' to be reported", function () {
    reporter.reportTestSuite("HelloWorld");
    reporter.assertHasReportedTestSuite("OtherTestSuite");
  });
};

// Error: Expected to throw an error,
// but nothing was thrown

// And we should fix it by actually using the `expectedName`:

assertions.assertTrue(
  testSuiteName === expectedName,
  //               ^ fixed here ^
  "Expected test suite '" + expectedName + "' to be reported"
);

And all the tests pass. Now we can get back to our failing test for the runTestSuite:

Implementing rendering of the name of the test suite

this.testItOutputsNameOfTheTest = function () {
  runTestSuite(function TestSuiteName(t) {
  }, {reporter: reporter});

  reporter.assertHasReportedTestSuite("TestSuiteName");
};

To implement this, first we will need to accept options parameter with sane defaults:

function runTestSuite(testSuiteConstructor, options) {
  options = options || {};
  var reporter = options.reporter || new SimpleReporter();

  // ...
}

// We have to implement this, otherwise our test suite will fail
function SimpleReporter() {
    this.reportTestSuite = function (name) {
        process.stdout.write("\n" + name + "\n");
    };
}

After making the failing test pass and triangulating over the name of the test suite:

function runTestSuite(testSuiteConstructor, options) {
  options = options || {};
  var reporter = options.reporter || new SimpleReporter();

  reporter.reportTestSuite(testSuiteConstructor.name)

  // ...
}

And all tests pass now. Unfortunately, this is the output that we see now:

Yeah, empty lines. This is because (function () {}).name is equal to "". We need to give proper names to all our anonymous constructors for the test suites:

runTestSuite(function RunTestSuiteTest(t) { ... });
runTestSuite(function AssertEqualTest(t) { ... });
// .. and so on ..

And now we should see the correct output:

AssertEqualTest

AssertNotEqualTest

AssertNotThrowTest

AssertThrowTest

AssertTrueTest

FizzBuzzKataTest

.. and so on ..

Great, now we would like to render the name of the executed test:

Render the name of the executed test

this.testItOutputsNameOfTheTest = function () {
  runTestSuite(function TestSuiteName(t) {
    this.testSomeTestName = function () {};
    this.testSomeOtherTestName = function () {};
  }, {reporter: reporter});

  reporter.assertHasReportedTestSuite("TestSuiteName");
  reporter.assertHasReportedTest("testSomeTestName");
  reporter.assertHasReportedTest("testSomeOtherTestName");
};

Of course this fails, because we need to implement assertHasReportedTest(name) now for our ReporterSpy. Let’s test-drive it:

// test/ReporterSpyTest.js
this.testAssertHasReportedTest_whenFailing = function () {
  t.assertThrow("Expected test 'testName' to be reported", function () {
    reporter.assertHasReportedTest("testName");
  });
};

// Error: Expected to equal
//   Expected test 'testName' to be reported,
// but got:
//   reporter.assertHasReportedTest is not a function

// We need to define assertHasReportedTest(name) method:
this.assertHasReportedTest = function (expectedName) {

};

// Error: Expected to throw an error,
// but nothing was thrown

// We need to make it throw the expected error:
this.assertHasReportedTest = function (expectedName) {
  assertions.assertTrue(
    false,
    "Expected test 'testName' to be reported"
  );
};

// And the test passes. Message hard-codes `testName` -
// we should triangulate over it:

this.testAssertHasReportedTest_whenFailing_withDifferentName = function () {
  t.assertThrow("Expected test 'testDifferentName' to be reported", function () {
    reporter.assertHasReportedTest("testDifferentName");
  });
};

// Error: Expected to equal
//   Expected test 'testDifferentName' to be reported,
// but got:
//   Expected test 'testName' to be reported

// And to fix it:
"Expected test '" + expectedName + "' to be reported"

// Next test will force us to implement simple reportTest function:
this.testAssertHasReportedTest_whenSucceeding = function () {
  t.assertNotThrow(function () {
    reporter.reportTest("testName");
    reporter.assertHasReportedTest("testName");
  });
};

// Error: reporter.reportTest is not a function

// After fixing this and triangulating a bit, we get:

module.exports = function ReporterSpy(assertions) {
  var testName = null;
  // ...
  this.assertHasReportedTest = function (expectedName) {
    assertions.assertTrue(
      testName === expectedName,
      "Expected test '" + expectedName + "' to be reported"
    );
  };
  this.reportTest = function (name) {
    testName = name;
  };
}

// Finally we need ability to report multiple tests:

this.testAssertHasReportedTest_whenSucceeding_withMultipleReports = function () {
  t.assertNotThrow(function () {
    reporter.reportTest("testName");
    reporter.reportTest("testOtherName");
    reporter.assertHasReportedTest("testName");
  });
};

// Error: Expected not to throw error,
// but thrown 'Expected test 'testName' to be reported'

// And to implement this:
module.exports = function ReporterSpy(assertions) {
  // we will store all reported names,
  // initially no names are reported
  var testNames = [];
  // ...
  this.assertHasReportedTest = function (expectedName) {
    assertions.assertTrue(
      // check if expectedName was reported
      testNames.indexOf(expectedName) >= 0,
      "Expected test '" + expectedName + "' to be reported"
    );
  };
  this.reportTest = function (name) {
    // store the reported test name
    testNames.push(name);
  };
}

Unfortunately, this does not pass our tests, because this test fails now:

this.testAssertHasReportedTest_whenReporting_andFailing = function () {
  t.assertThrow("Expected test 'testName' to be reported", function () {
    reporter.reportTest("testOtherName");
    reporter.assertHasReportedTest("testName");
  });
};

After an investigation, it becomes clear, that this happens because we can not re-use reporter variable defined at the higher level since all tests share the same testSuite object at the moment. We will have to move the creation of the reporter variable inside of each test:

this.testAssertHasReportedTest_whenReporting_andFailing = function () {
  var reporter = new ReporterSpy(t);
  // ...
};

this.testAssertHasReportedTest_whenReporting_andFailing_withOtherName = function () {
  var reporter = new ReporterSpy(t);
  // ...
};

// .. and so on ..

And this makes all our tests pass.

Stateless tests

This is quite a noticeable problem, that our users can be frustrated with, so we probably should make it easy on them and allow such variables to be fresh for every test. This can be achieved quite easy if we were to create a new testSuite for each test. Let’s write a simple test to show the problem:

// test/StatelessTest.js
var runTestSuite = require("../src/TestingFramework");

runTestSuite(function StatelessTest(t) {
  var answer = 41;

  this.testItCanMutateVariable_andImmediatelyUseNewValue = function () {
    answer++;
    t.assertEqual(42, answer);
  };

  this.testItCanMutateVariableAgain_andGetTheSameResult = function () {
    answer++;
    t.assertEqual(42, answer);
  };
  // this fails as expected:
  // Error: Expected to equal 42, but got: 43
});

And now let’s implement it by creating the testSuite for every test:

function runTestSuite(testSuiteConstructor, options) {
  options = options || {};
  var reporter = options.reporter || new SimpleReporter();

    reporter.reportTestSuite(testSuiteConstructor.name);

    var testSuitePrototype = createTestSuite(testSuiteConstructor);
    // ^ we change this from `testSuite` to `testSuitePrototype`  ^

  for (var testName in testSuitePrototype) {
    if (testName.match(/^test/)) {
      var testSuite = createTestSuite(testSuiteConstructor);
            // ^   and we create our testSuite every time here   ^
      testSuite[testName]();
    // ^  and run test on it ^
    }
  }
}

function createTestSuite(testSuiteConstructor) {
    return new testSuiteConstructor(assertions);
}

After doing this, we can move var reporter = new ReporterSpy(t); to the top level of the ReporterSpyTest suite again. And all the tests pass.

Implementation of the rendering of the test name

Finally, we need to make sure that the test suite, that we have written before will pass:

this.testItOutputsNameOfTheTest = function () {
    runTestSuite(function TestSuiteName(t) {
        this.testSomeTestName = function () {};
        this.testSomeOtherTestName = function () {};
    }, {reporter: reporter});

    reporter.assertHasReportedTestSuite("TestSuiteName");
    reporter.assertHasReportedTest("testSomeTestName");
    reporter.assertHasReportedTest("testSomeOtherTestName");
};

As expected it fails with Error: Expected test 'testSomeTestName' to be reported. After fixing it and applying triangulation once, we would end up with the following implementation:

// src/TestingFramework.js in runTestSuite function:
for (var testName in testSuitePrototype) {
    if (testName.match(/^test/)) {

        reporter.reportTest(testName);
// ^  here is our implementation  ^

        var testSuite = createTestSuite(testSuiteConstructor);
        testSuite[testName]();
    }
}

function SimpleReporter() {
    // ...
    // and we should not forget to implement it for real reporter
  this.reportTest = function (name) {
    process.stdout.write("\t" + name + "\n");
  };
}

Now, it seems that both ReporterSpy and SimpleReporter are implementing the same Duck type - Reporter. What Duck Type is? - find out here: Meet Duck Type.

Contract testing all Reporter duck types

So we should test all our ducks that their public API don’t get out of sync:

var TestingFramework = require("../src/TestingFramework");
var runTestSuite = TestingFramework;
var SimpleReporter = TestingFramework.SimpleReporter;

var ReporterSpy = require("./ReporterSpy");

const IMPLEMENTATIONS = [
    SimpleReporter,
    ReporterSpy
];

IMPLEMENTATIONS.forEach(function (ReporterImplementation) {
  runTestSuite(function (t) {
    var reporter = new ReporterImplementation();

    this.testDefines_reportTestSuite = function () {
      var reportTestSuite = reporter.reportTestSuite;
      t.assertEqual("function", typeof(reportTestSuite));
      t.assertEqual(1, reportTestSuite.length);
    };

    this.testDefines_reportTest = function () {
      var reportTest = reporter.reportTest;
      t.assertEqual("function", typeof(reportTest));
      t.assertEqual(1, reportTest.length);
    }
  });
});

All the tests pass. Unfortunately, the output regarding this test suite looks weird:

    testDefines_reportTestSuite
    testDefines_reportTest


    testDefines_reportTestSuite
    testDefines_reportTest

The test suite name is empty. I think we need an ability to define a custom and dynamic test suite name:

Custom name for the test suite

We can achieve this by allowing any test suite to define special hook method, that will return its custom name, like testSuite.getTestSuiteName(). Let’s write a test for this:

this.testItCanHaveCustomNameOfTheTestSuite = function () {
  runTestSuite(function (t) {
    this.getTestSuiteName = function () {
      return "CustomNameOfTheTestSuite";
    };
  }, {reporter: reporter});

  reporter.assertHasReportedTestSuite("CustomNameOfTheTestSuite");
};

After implementing it and triangulating over the name once the code looks like this:

function runTestSuite(testSuiteConstructor, options) {
  options = options || {};
  var reporter = options.reporter || new SimpleReporter();

  var testSuitePrototype = createTestSuite(testSuiteConstructor);

  reporter.reportTestSuite(
    getTestSuiteName(testSuiteConstructor, testSuitePrototype)
// ^ this is the function that we introduced here to make it pass ^
  );

    for (var testName in testSuitePrototype) { ... }
}

function getTestSuiteName(testSuiteConstructor, testSuitePrototype) {
    if (typeof(testSuitePrototype.getTestSuiteName) !== "function") {
        return testSuiteConstructor.name;
    }

    return testSuitePrototype.getTestSuiteName();
}

Now, if we were to use this feature in our duck type tests:

IMPLEMENTATIONS.forEach(function (ReporterImplementation) {
  runTestSuite(function (t) {
    this.getTestSuiteName = function () {
      return ReporterImplementation.name + "_ReporterTest";
    };

    // ...
});

Then we are getting the proper output:

SimpleReporter_ReporterTest
    testDefines_reportTestSuite
    testDefines_reportTest

ReporterSpy_ReporterTest
    testDefines_reportTestSuite
    testDefines_reportTest

Bottom Line

I think we are done with implementing our first simple reporter. Now we can see that the tests are actually executing and passing. The code can be found here: https://github.com/waterlink/BuildYourOwnTestingFrameworkPart4

There is still a lot to go through. In a few next episodes we will:

Make sure that first failure does not cause test suite to stop running;
Make sure the exit code is right;
Report OK and FAIL;
Output carefully formatted failures to the STDERR.

Stay tuned!

Thanks

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Getting Stuck While Doing TDD. Part 3: Triangulation to the Rescue!

2016-08-31T02:35:32+02:00

Welcome back to the “Getting Stuck While Doing TDD” series. Today we are going to learn the Golden Rule of TDD and how to not get stuck while doing TDD.

TL;DR

“As tests get more specific, production code gets more generic”.
RED is as important as other in Red-Green-Refactor cycle. If next test does not fail, it is either: already implemented, or has to wait until a later time (until it will fail).
At its core the Triangulation Technique has the following idea:

After implementing one business rule (with Red-Green-Refactor) make sure to find all “weirdnesses” or non-generalities in the production code and one-by-one eliminate them by writing a test, that proves such non-generality, and then making it pass while removing non-generality. This is the third cycle of TDD - Mini Cycle.

This is a series of articles:

Part 1: Example
Part 2: Buggy Code and Forcing Our Way Through
Part 3: Triangulation to the Rescue! (reading this)

Shall we get started?

Specific/Generic Rule of TDD

As tests get more specific, production code gets more generic.

When making the next failing test pass, our production code should also pass a whole class of similar tests. Best shown in the very simple example. The task at hand is to write the function sum(a, b) that will add two numbers. Let’s see us a violation of the Specific/Generic rule:

expect(sum(2, 2)).to eq(4)
# => NoMethodError: undefined method `sum'

def sum(a, b); end
# => expected: 4, got: nil

def sum(a, b)
  4
end
# => PASS

expect(sum(2, 3)).to eq(5)
# => expected: 5, got: 4

def sum(a, b)
  if b == 2
    4
  else
    5
  end
end
# => PASS

The production code to make this last test pass is as specific as the failing test now. The test of the same class (where we change the value of the b parameter) will fail for it:

expect(sum(2, 42)).to eq(44)
# => expected: 44, got: 5

To follow the Specific/Generic rule we ought to make 4 into 2 + b like that:

def sum(a, b)
  2 + b
end
# => PASS

This way, when we change b to any value it will still pass the test, aside from the fact, that we didn’t do anything about a. This is because we still don’t have any test showing us, that parameter a is important, like the following one:

expect(sum(4, 7)).to eq(11)
# => expected: 11, got: 9

Again we can make it pass in a very specific fashion by introducing specific if statement, or we could do it to pass the whole class of such tests:

def sum(a, b)
  a + b
end
# => PASS

Have you noticed, that from the test suite side we had to “prove” that some knowledge in the system is important and had to be used? This technique is called Triangulation.

Triangulation Technique

In the essence, Triangulation technique has a very simple idea at its core:

Change certain important* knowledge in the system.
Assert that the production code behaves in an accordingly expected manner.

* - important from the perspective of the system or unit under the test

Red-Green-Refactor has to have all stages

One Red-Green-Refactor cycle really has to have all stages in it. And I’m not ranting right now about “Refactor” stage, that is a given. Rather, I insist on the “Red” stage - in TDD, when we write a new test, it has to fail. Writing tests that do not fail is another way to get ourselves stuck while doing TDD. One could ask: “If I can’t write this test because it does not fail, what should I do about the requirement it represents?”, and the answer is rather simple - either this requirement is already implemented and tested by other tests, or we still need this test and we will get back to it later when it actually will fail.

As we can remember, in the first part of these series, we were going through an OrderKindValidator example, and we were writing multiple tests in a row, that were all expecting the same outcome and of course they didn’t fail, because we had one line in our function that made them all pass. If we were to sprinkle some other tests, that do fail (like a test for a valid order kind), after making it pass, all of these tests will now be failing and therefore they are good candidates for our next test. Let’s see it with our own eyes:

it_fails_with("Order kind can not be empty")
  .when_order_kind_is_absent
# => expected InvalidOrderError with "Order kind can not be empty",
# => got #
# =>      undefined method `validate' for #>

class OrderKindValidator
  def validate(order)
  end
end
# => expected InvalidOrderError with "Order kind can not be empty"
# => but nothing was raised

def validate(order)
  raise InvalidOrderError, "Order kind can not be empty"
end
# => PASS

Now is the point, where we have to choose our next test, and last time we have chosen the test with the same outcome and it did not go so well. Let’s choose a test with different outcome, e.g.: when valid order kind is provided:

it_does_not_fail
  .when_order_kind_is(%w(private))
# => expected no Exception,
# => got #

Now, we have 2 options, to either check for order[:kind] == %w(private) or to check for order[:kind] being absent. It does not matter what we choose at this point, so let’s go with the first one:

def validate(order)
  if order[:kind] == %w(private)
    return
  end

  raise InvalidOrderError, "Order kind can not be empty"
end
# => PASS

Now let’s apply Triangulation technique. We should always ask ourselves the question: “What is weird about this code?” and “What failing test should I write to point out this weirdness?”. First weirdness we can spot is that the validator currently accepts only one order kind - private. According to our requirements it should also accept corporate:

it_does_not_fail
  .when_order_kind_is(%w(corporate))
# => expected no Exception,
# => got #

def validate(order)
  kinds = order[:kind]
  if kinds == %w(private) || kinds == %w(corporate)
    # ...
end
# => PASS

We also know, that our system should handle duplicate entries in order[:kind]:

it_does_not_fail
  .when_order_kind_is(%w(private private))
# => expected no Exception,
# => got #

def validate(order)
  kinds = order[:kind]
  if kinds.include?("private") || kinds == %w(corporate)
    # ...
end
# => expected InvalidOrderError with "Order kind can not be empty",
# => got #
# => for nil:NilClass>

Wow! We, of course, can check for kinds to not be nil, but I would rather listen to this test failure and put a check for kinds being absent (and this makes for our second check, that we could have chosen from):

def validate(order)
  kinds = order[:kind]

  if kinds.nil?
    raise InvalidOrderError, "Order kind can not be empty"
  end

  if kinds.include?("private") || kinds == %w(corporate)
    return
  end

  raise InvalidOrderError, "Order kind can not be empty"
end
# => PASS

So this passes all our tests. It may look weird, and this is exactly the pointer for us which test to write next to prove, that this weirdness is incorrect:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
  .when_order_kind_is(%w(invalid))
# => expected InvalidOrderError
# => with "Order kind can be one of: 'private', 'corporate', 'bundle'",
# => got #

def validate(order)
  # ...

  raise InvalidOrderError,
    "Order kind can be one of: 'private', 'corporate', 'bundle'"
end
# => PASS

Production code starts looking not so clean and I think it is time to give things proper names:

class OrderKindValidator
  def validate(order)
    kinds = order[:kind]

    if empty?(kinds)
      fail_with("Order kind can not be empty")
    end

    unless valid?(kinds)
      fail_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    end
  end

  def valid?(kinds)
    kinds.include?("private") || kinds == %w(corporate)
  end

  def empty?(kinds)
    kinds.nil?
  end

  def fail_with(message)
    raise InvalidOrderError, message
  end
end

There is only one weirdness, that is left for triangulation in current production code, before we can move on to the next requirement - private can be duplicated while corporate can not:

it_does_not_fail
  .when_order_kind_is(%w(corporate corporate))
# => expected no Exception,
# => got #

def valid?(kinds)
  kinds.include?("private") ||
      kinds.include?("corporate")
end
# => PASS

Great, now we can safely go back to our empty order kind edge cases:

it_fails_with("Order kind can not be empty")
  .when_order_kind_is([])
# => expected InvalidOrderError with "Order kind can not be empty",
# => got #

def empty?(kinds)
  kinds.nil? ||
      kinds.empty?
end
# => PASS

it_fails_with("Order kind can not be empty")
  .when_order_kind_is([nil])
# => expected InvalidOrderError with "Order kind can not be empty",
# => got #

def empty?(kinds)
  kinds.nil? ||
      kinds.empty? ||
      kinds[0].nil?
end
# => PASS

it_fails_with("Order kind can not be empty")
  .when_order_kind_is([""])
# => expected InvalidOrderError with "Order kind can not be empty",
# => got #

def empty?(kinds)
  kinds.nil? ||
      kinds.empty? ||
      kinds[0].nil? ||
      kinds[0].empty?
end
# => PASS

And it is a good opportunity to eliminate some duplication:

def empty?(kinds)
  empty_value?(kinds) ||
      empty_value?(kinds[0])
end

def empty_value?(value)
  value.nil? || value.empty?
end

Now, it is a good time to triangulate, because we have a weirdness in our code: kinds[0]. To prove that this is too specific we can write another test:

it_fails_with("Order kind can not be empty")
  .when_order_kind_is(["private", ""])
# => expected InvalidOrderError with "Order kind can not be empty"
# => but nothing was raised

def empty?(kinds)
  empty_value?(kinds) ||
      kinds.any? { |kind| empty_value?(kind) }
end
# => PASS

Notice, how every single test that we have written was failing and how easy it was to make it pass. This suggests that we are probably moving in the right direction. Let’s test our next requirement - we can combine private and bundle:

it_does_not_fail
  .when_order_kind_is(%w(private bundle))
# => PASS

Wait a minute. This is really bad. We should have a failing test here. This happened because we are checking only for the inclusion of private or corporate and we do not care about anything else in the order[:kind] array. We have to discard this test and try to go with failing version of the same business rule - invalid order kind can not be combined with private:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
  .when_order_kind_is(%w(private invalid))
# => expected InvalidOrderError
# => with "Order kind can be one of: 'private', 'corporate', 'bundle'"
# => but nothing was raised

def valid?(kinds)
  return false if kinds[1] == "invalid"

  kinds.include?("private") ||
      kinds.include?("corporate")
end
# => PASS

While this works, it leads to two other weirdnesses: kinds[1] and "invalid", let’s the latter first:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
  .when_order_kind_is(%w(private another_invalid))
# => expected InvalidOrderError
# => with "Order kind can be one of: 'private', 'corporate', 'bundle'"
# => but nothing was raised

def valid?(kinds)
  return false if kinds[1] && kinds[1] != "private"

  kinds.include?("private") ||
      kinds.include?("corporate")
end
# => expected no Exception,
# => got #
# .. and more failures ..

Other tests fail now, from them it is possible to see, that second kind should be either private or corporate:

def valid?(kinds)
  return false if kinds[1] &&
      kinds[1] != "private" &&
      kinds[1] != "corporate"

  kinds.include?("private") ||
      kinds.include?("corporate")
end
# => PASS

This looks rather clunky, we should make it a bit cleaner:

ALLOWED_ORDER_KINDS = %w(private corporate)

def valid?(kinds)
  return false if kinds[1] &&
      !ALLOWED_ORDER_KINDS.include?(kinds[1])

  kinds.include?("private") ||
      kinds.include?("corporate")
end

Let’s eliminate the other weirdness - kinds[1], it probably should verify all kinds in the array:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
  .when_order_kind_is(%w(invalid private))
# => expected InvalidOrderError
# => with "Order kind can be one of: 'private', 'corporate', 'bundle'"
# => but nothing was raised

def valid?(kinds)
  return false if kinds.any? { |kind|
    !ALLOWED_ORDER_KINDS.include?(kind)
  }

  kinds.include?("private") ||
      kinds.include?("corporate")
end
# => PASS

And now this can be greatly simplified by inverting the boolean logic:

def valid?(kinds)
  kinds.all? { |kind|
    ALLOWED_ORDER_KINDS.include?(kind)
  }
end

Now that we have dealt with all weirdnesses in our production code, let’s get back to our requirement:

it_does_not_fail
  .when_order_kind_is(%w(private bundle))
# => expected no Exception,
# => got #

Wow! Now it fails exactly as it should. This means that it is now the right time for this test! Let’s make it pass by adding bundle to the list of allowed order kinds:

ALLOWED_ORDER_KINDS = %w(private corporate bundle)
# => PASS

Nice! Our next requirement is about bundle not being used on its own, i.e.: either private or corporate is required:

it_fails_with("Order kind should be 'private' or 'corporate'")
  .when_order_kind_is(%w(bundle))
# => expected InvalidOrderError with "Order kind should be 'private' or 'corporate'"
# => but nothing was raised

def validate(order)
  # ...

  if kinds == %w(bundle)
    fail_with("Order kind should be 'private' or 'corporate'")
  end
end
# => PASS

And this is good enough, because that is really the only case, when this can happen until the list of allowed order kinds is extended by future business requirements. We should at least give this condition a proper name:

unless has_required?(kinds)
  fail_with("Order kind should be 'private' or 'corporate'")
end

# ...

def has_required?(kinds)
  kinds != %w(bundle)
end

Except, that we could provide duplicated bundle:

it_fails_with("Order kind should be 'private' or 'corporate'")
  .when_order_kind_is(%w(bundle bundle))
# => expected InvalidOrderError
# => with "Order kind should be 'private' or 'corporate'"
# => but nothing was raised

def has_required?(kinds)
  # Easy to fix if we de-duplicate it with #uniq:
  kinds.uniq != %w(bundle)
end
# => PASS

Now it is time to move on to the final requirement about conflicts between private and corporate:

it_fails_with("Order kind can not be 'private' and 'corporate' at the same time")
  .when_order_kind_is(%w(private corporate))
# => expected InvalidOrderError
# => with "Order kind can not be 'private' and 'corporate' at the same time"
# => but nothing was raised

def validate(order)
  # ...

  if kinds == %w(private corporate)
    fail_with("Order kind can not be 'private' and 'corporate' at the same time")
  end
end
# => PASS

Of course, kinds == %w(private corporate) can be considered too specific for production code, we should triangulate it:

it_fails_with("Order kind can not be 'private' and 'corporate' at the same time")
  .when_order_kind_is(%w(corporate private))
# => expected InvalidOrderError
# => with "Order kind can not be 'private' and 'corporate' at the same time"
# => but nothing was raised

if kinds.include?("private") &&
    kinds.include?("corporate")
  fail_with("Order kind can not be 'private' and 'corporate' at the same time")
end
# => PASS

And, finally, let’s give this condition a proper name:

if has_conflicts?(kinds)
  fail_with("Order kind can not be 'private' and 'corporate' at the same time")
end

# ...

def has_conflicts?(kinds)
  kinds.include?("private") &&
    kinds.include?("corporate")
end

I believe we are done now. Source for this example can be found in an open pull request here.

Let’s recap how Triangulation technique worked for us here.

Triangulation Technique in Depth

The main goal of triangulation is to prove that the code is not general enough along some axis (class of tests) by writing a test and then making sure it passes. Effective application of the technique requires to prove and eliminate all such “weirdnesses” or non-generalities from the production code after each Red-Green-Refactor cycle for business requirements. This is, in fact, the 3rd cycle of Test-Driven-Development called Mini Cycle of TDD, it should be executed about every 10 minutes.

Another observation is that following this technique we are introducing only one small piece of knowledge into our production code, for example:

When writing a test for next business requirement, we are introducing the fact that we need an if statement with a certain body (in this example it was a raise error statement). Since we can not introduce the if statement without a condition we need to put some condition there and we put a very specific condition on purpose since we know that it is tested and it is simple.
Next, we are proving that this condition is too specific by writing a test, and then making it pass with a more generic solution. This way we are introducing a tiny little bit more knowledge in our production code.
We are repeating this iterative process until the production code is generic enough for the current specification (test suite). And we start over. This is the Mini Cycle of TDD.

Bottom Line

Today we have learned the Golden Rule of TDD - “As tests get more specific, production code gets more generic”, and we have learned the Triangulation Technique, that allows us to follow this rule in an incremental and confident way. Additionally, we have learned, that following Red-Green-Refactor strictly is important, and this includes even the RED stage of this cycle - when the test for business requirement does not fail, it is either: already implemented or it has to wait for later.

This is a series of articles:

Part 1: Example
Part 2: Buggy Code and Forcing Our Way Through
Part 3: Triangulation to the Rescue! (reading this)

You would not want to miss next articles on this tech blog, we still have a lot to talk about:

Continuous Integration and Continuous Delivery - importance of not impeding others,
Open-Closed Principle - changing behavior by adding new code,
Mutational Testing, “Build Your Own Testing Framework” series, 4 Cycles of TDD, Test-Driven Development screencasts and so much more!

Thanks!

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Getting Stuck While Doing TDD. Part 2: Buggy Code and Forcing Our Way Through

2016-08-31T02:35:06+02:00

Welcome back to the “Getting Stuck While Doing TDD” series. Today we are going to see the results of getting stuck while doing TDD and scratch the surface of how to avoid this outcome.

Code examples today will be in Ruby programming language. The technique itself is, of course, language-agnostic.

TL;DR

It is painful and difficult to force your way through when getting stuck in TDD.
It results in degraded guarantees from TDD (such as test coverage, semantical stability, and confidence).

Ways to avoid this outcome:

do not write tests that will not fail with the current production code
choose next test to write that will address particular detail about production code that is wrong or not general enough (Triangulation)

Finally, do not forget to remove redundant tests if any.

This is a series of articles:

Part 1: Example
Part 2: Buggy Code and Forcing Our Way Through (reading this)
Part 3: Triangulation to the Rescue!

Buggy if-riddled code

Buggy if-riddled code is what we’ve got. It is even not so easy to read. While we can refactor it to be more readable that won’t change the presence of bugs, though. Let’s still do it to understand what happens in this code better:

class OrderKindValidator
  def validate(order)
    kinds = order[:kind]

    validate_only_known(kinds)
    validate_has_required(kinds)
    validate_no_conflicting(kinds)
    validate_non_empty(kinds)
  end

  private

  def validate_non_empty(kinds)
    if empty?(kinds)
      fail_with("Order kind can not be empty")
    end
  end

  def validate_no_conflicting(kinds)
    if has_conflicting(kinds)
      fail_with("Order kind can not be 'private' and 'corporate' at the same time")
    end
  end

  def validate_has_required(kinds)
    if has_no_required(kinds)
      fail_with("Order kind should be 'private' or 'corporate'")
    end
  end

  def validate_only_known(kinds)
    if invalid?(kinds)
      fail_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    end
  end

  def empty?(kind)
    kind != ["private"] && kind != ["corporate"] &&
        kind != %w(private bundle) &&
        kind != %w(corporate bundle)
  end

  def has_conflicting(kind)
    kind == %w(private corporate)
  end

  def has_no_required(kind)
    kind == ["bundle"]
  end

  def invalid?(kind)
    kind == ["invalid"]
  end

  def fail_with(message)
    raise InvalidOrderError.new(message)
  end
end

Structure of the class, actually, sounds just right, but conditions are not good:

def empty?(kind)
  kind != ["private"] && kind != ["corporate"] &&
      kind != %w(private bundle) &&
      kind != %w(corporate bundle)
end

Really? It does not do what it says. At all. It basically just solves the problem very specifically to the tests. I can easily come up with a test that will break it:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    .when_order_kind_is ["almost anything"]

# Error: expected InvalidOrderError with
#    "Order kind can be one of: 'private', 'corporate', 'bundle'",
# got #

# or other test
it_does_not_fail.when_order_kind_is %w(corporate corporate)

def has_conflicting(kind)
  kind == %w(private corporate)
end

This at least does what it says. But only for one specific case, instead of general one. One test that I can come up with right away:

it_fails_with("Order kind can not be 'private' and 'corporate' at the same time")
    .when_order_kind_is %w(private corporate bundle)

# and another one:
it_fails_with("Order kind can not be 'private' and 'corporate' at the same time")
    .when_order_kind_is %w(corporate private)

def has_no_required(kind)
  kind == ["bundle"]
end

While this may work for our current requirements, it is really confusing for the reader. Method name says: “has no required kind” while method body checks if it is only bundle. And it does not work well with that edge case:

it_fails_with("Order kind should be 'private' or 'corporate'")
    .when_order_kind_is %w(bundle bundle)

While this case is quite unlikely, nothing in business rules forbid that and some other part of the system may as well duplicate bundle kind for some reason or it may be a user input mistake.

def invalid?(kind)
  kind == ["invalid"]
end

This method, indeed, checks that kind is invalid. Literally "invalid". Which would mean, that all kinds except exactly "invalid" are allowed. This is not true according to our business rules. In fact, we have already written the failing test for this some moments ago:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    .when_order_kind_is ["almost anything"]

Let’s comment out these failing tests and try to force-TDD our way through these bugs by uncommenting and fixing them one-by-one following Red-Green-Refactor loop:

Forcing our way through

So, let’s uncomment our first failing test:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    .when_order_kind_is ["almost anything"]

We are expecting validate_only_known to fail with its message and that means invalid?(kinds) should return true. To make it return true in this case and preserve its old behavior we will need to remove private, corporate and bundle from kinds and check that it is not empty:

def invalid?(kinds)
  (kinds - %w(private corporate bundle)).any?
end

See how we had to write the whole thing in one go. There is no chance to write it incrementally because there will be a bunch of tests that fail. Wait! While it does not fail for any tests related to invalid kinds, it fails for all tests related to emptiness:

OrderKindValidator
  fails with message "Order kind can not be empty"
    when order kind is [""] (FAILED - 1)
    when order kind is ["", ""] (FAILED - 2)
    when order kind is ["private", ""] (FAILED - 3)
    when order kind is absent (FAILED - 4)
    when order kind is nil (FAILED - 5)

So we need to change more production code to make this one tiny test pass. It looks like validate_non_empty is a culprit now - it is being called after validate_only_known. It should be the other way around:

def validate(order)
  kinds = order[:kind]

  validate_non_empty(kinds)
# ^ we moved this up here ^

  validate_only_known(kinds)
  validate_has_required(kinds)
  validate_no_conflicting(kinds)
end

Oh! Now a bunch of other tests fails:

OrderKindValidator
  fails with message "Order kind should be 'private' or 'corporate'"
    when order kind is ["bundle"] (FAILED - 1)

  fails with a message "Order kind can not be 'private' and 'corporate' at the same time"
    when order kind is ["private", "corporate"] (FAILED - 4)

  fails with a message "Order kind can be one of: 'private', 'corporate', 'bundle'"
    when order kind is ["almost anything"] (FAILED - 2)
    when order kind is ["invalid"] (FAILED - 3)

From failure messages it is possible to guess, that the culprit is empty?(kinds) function that fails in too much cases now, such as: ["bundle"], ["private", "corporate"], ["almost anything"] and ["invalid"]. This is because it was not doing what it said it was:

def empty?(kinds)
  kinds != ["private"] && kinds != ["corporate"] &&
      kinds != %w(private bundle) &&
      kinds != %w(corporate bundle)
end

And this is why it was hard to change the order of validations. We will have to completely rewrite this function. Let’s start small and see which tests fail:

def empty?(kinds)
  false
end

The failures are:

OrderKindValidator
  fails with message "Order kind can not be empty"
    when order kind is ["private", ""] (FAILED - 1)
    when order kind is [nil] (FAILED - 2)
    when order kind is ["", ""] (FAILED - 3)
    when order kind is nil (FAILED - 4)
    when order kind is [""] (FAILED - 5)
    when order kind is absent (FAILED - 6)
    when order kind is [nil, nil] (FAILED - 7)
    when order kind is ["private", nil] (FAILED - 8)
    when order kind is [] (FAILED - 9)

Good, only tests related directly to this case are failing. So one-by-one we can construct our condition while fixing these test failures:

kinds.nil?
|| kinds.empty?
|| kinds[0].nil? (turned out to be redundant in the end)
|| kinds[0].empty? (turned out to be redundant in the end)
|| kinds.any? { |k| k.nil? || k.empty? }

After refactoring empty? the function now is looking this way:

def empty?(kinds)
  absent_or_empty?(kinds) ||
      kinds.any? { |kind| absent_or_empty?(kind) }
end

def absent_or_empty?(value)
  value.nil? || value.empty?
end

And all tests, finally, pass. It took a lot of effort and re-writing to get this one little test to pass. This is what we call “Getting Stuck” in TDD. There is always an order of tests that will lead to this result almost for any somewhat complex problem.

The code can be found in GitHub repository in an open pull request here.

Almost guaranteed ways to get stuck in TDD:

write tests that do not fail,
do not address weird results of “simplest thing that could possibly work” to make the test pass and moving on to the next business rule,
make production code a mirror of the tests and too specific, not general.

And to not get stuck is to do the opposite:

do not write the test that will not fail (wait until later, when it will fail), and
always first write the test that will point out next deficiency in the current production code (in TDD this is called Triangulation), and
while making some failing test pass, make sure that the change in production code covers not only this one specific test, rather, a whole class of tests (Golden Rule of TDD: As tests get more specific, production code gets more generic).

Bottom Line

Today we have seen how bad the results of getting stuck while doing TDD can be. In the next article of these series, we will explore Golden Rule of TDD and the technique called Triangulation, that allows us to incrementally test-drive code in a way, that it will always be conforming to the Golden Rule of TDD and therefore will never get us stuck. Stay tuned!

This is a series of articles:

Part 1: Example
Part 2: Buggy Code and Forcing Our Way Through (reading this)
Part 3: Triangulation to the Rescue!

You would not want to miss next articles on this tech blog, we still have a lot to talk about:

Triangulation technique in Test-Driven Development - overlooking this technique might cause one fail at doing TDD (these series),
Continuous Integration and Continuous Delivery - importance of not impeding others,
Open-Closed Principle - changing behavior by adding new code,
Mutational Testing, “Build Your Own Testing Framework” series, Test-Driven Development screencasts and so much more!

Thanks!

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Getting Stuck While Doing TDD. Part 1: Example

2016-08-30T15:27:30+02:00

Following 3 rules of TDD sounds really simple at first. In practice, there is a moment when one has to implement the whole algorithm at once to make currently failing test pass. This is called “getting stuck” in TDD. In this article, we will explore how exactly this happens and how to prevent that.

Code examples today will be in Ruby programming language. The technique itself is, of course, language-agnostic.

TL;DR

“Getting stuck” happens for a couple of reasons:

wrong order of tests
production code is not getting more general with each test

This is a series of articles:

Part 1: Example (reading this)
Part 2: Buggy Code and Forcing Our Way Through
Part 3: Triangulation to the Rescue!

“Getting Stuck” in TDD

Usually “Getting Stuck” follows this pattern:

write some test and implement it via “simplest thing that might possibly work”,
write another test and implement it again in a non-general manner,
write some more tests in that fashion, while never addressing the fact that production code now looks completely wrong from what it should probably be looking like,
write a new test, that forces us to completely rewrite production code in a complete algorithm just to make it pass.

This last step usually takes minutes to hours depending on the complexity of the problem at hand. Additionally, the first few tests are basically wasted time since they did not produce any bits of knowledge in the production code that persisted in production code in the end. Even worse, chances are that the algorithm that we have just written is not fully covered by current tests, since we have written it in one go just to make current failing test pass - this is no longer correct TDD and can not guarantee high test coverage, and, therefore, can not guarantee high confidence anymore.

Let’s go through a small example on how one can get stuck in TDD:

Order Kind Validation - Getting Stuck

Let’s define the problem at hand first. We have some sort of order request as an input to our system and we need to validate that its kind is correct:

valid order kinds: private, corporate, bundle,
order kinds can be combined,
private and corporate order kinds can not be combined, otherwise InvalidOrderError with message Order kind can not be 'private' and 'corporate' at the same time,
either private or corporate should be always present, otherwise InvalidOrderError with message Order kind should be 'private' or 'corporate',
if order kind is not in the above list, then we need to raise InvalidOrderError with message Order kind can be one of: 'private', 'corporate', 'bundle',
if order kind is not present or an empty string, then we need to raise InvalidOrderError with message Order kind can not be empty.

This is a fairly simple problem and it is easy to get stuck while doing TDD here. So let’s write our first test: “When order has no order_kind, then we should get InvalidOrderError with message ‘Order kind can not be empty’”:

RSpec.describe OrderKindValidator do
  it "fails with a message about order kind being empty when it is absent" do
    validator = OrderKindValidator.new

    expect { validator.validate({ items: 42 }) }
        .to raise_error(InvalidOrderError, "Order kind can not be empty")
  end
end

And the simplest implementation possible:

class OrderKindValidator
  def validate(order)
    raise InvalidOrderError.new("Order kind can not be empty")
  end
end

class InvalidOrderError < StandardError
end

Next test is our next simplest edge case - when kind’s value is nil:

it "fails with a message about order kind being empty when it is nil" do
  validator = OrderKindValidator.new

  expect { validator.validate({items: 42, kind: nil }) }
      .to raise_error(InvalidOrderError, "Order kind can not be empty")
end

It does not fail at all, so we don’t have any reason to change the production code. We can already spot a little duplication - validator variable. Let’s extract it as a named subject of the test suite:

subject(:validator) { OrderKindValidator.new }

And OrderKindValidator can be replaced with described_class (RSpec feature), so that we will not have to change too much in case we wanted to change name of the class:

subject(:validator) { described_class.new }

Next simplest edge case - when kind is an empty array:

it "fails with a message about order kind being empty when it has zero elements" do
  expect { validator.validate({items: 42, kind: [] }) }
      .to raise_error(InvalidOrderError, "Order kind can not be empty")
end

I believe I am spotting annoying pattern now:

it "fails with message MESSAGE when it is KIND_CASE" do
  expect { validator.validate({items: 42, kind: KIND_VALUE}) }
    .to raise_error(InvalidOrderError, MESSAGE)
end

It would be really nice to write it in this fashion:

it_fails_with("Order kind can not be empty").when_order_kind_is_absent
it_fails_with("Order kind can not be empty").when_order_kind_is nil
it_fails_with("Order kind can not be empty").when_order_kind_is []

And as another duplication piles up:

it_fails_with_order_kind_not_empty = it_fails_with("Order kind can not be empty")

it_fails_with_order_kind_not_empty.when_order_kind_is_absent
it_fails_with_order_kind_not_empty.when_order_kind_is nil
it_fails_with_order_kind_not_empty.when_order_kind_is []

Now the next tests look very easy and simple:

it_fails_with_order_kind_not_empty.when_order_kind_is [nil]
it_fails_with_order_kind_not_empty.when_order_kind_is [""]
it_fails_with_order_kind_not_empty.when_order_kind_is [nil, nil]
it_fails_with_order_kind_not_empty.when_order_kind_is ["", ""]
it_fails_with_order_kind_not_empty.when_order_kind_is ["private", ""]
it_fails_with_order_kind_not_empty.when_order_kind_is ["private", nil]

And they all pass right from the go. The implementation for the it_fails_with is looking like this:

RSpec.describe OrderKindValidator do
  class ItFailsWith
    def initialize(spec, expected_message)
      @spec = spec
      @expected_message = expected_message
    end

    def when_order_kind_is_absent
      expect_failure("absent", {items: 42})
    end

    def when_order_kind_is(value)
      expect_failure(value.inspect, {items: 42, kind: value})
    end

    private

    def expect_failure(feature, order, expected_message = @expected_message)
      @spec.it("fails with message #{expected_message.inspect} when order kind is #{feature}") do
        expect { validator.validate(order) }
            .to raise_error(InvalidOrderError, expected_message)
      end
    end
  end

  def self.it_fails_with(message)
    ItFailsWith.new(self, message)
  end
end

So, let’s write our next edge case - when order kind is invalid:

it_fails_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    .when_order_kind_is ["invalid"]

Pretty neat! And oh, it fails:

expected InvalidOrderError with
  "Order kind can be one of: 'private', 'corporate', 'bundle'",
got #

And the fix:

def validate(order)
  if order[:kind] == ["invalid"]
    raise InvalidOrderError.new(
        "Order kind can be one of: 'private', 'corporate', 'bundle'"
    )
  end

  raise InvalidOrderError.new("Order kind can not be empty")
end

Let’s write our next test - when order kind is private:

it_does_not_fail.when_order_kind_is ["private"]

This fails as expected with expected no Exception, got #. And to make it pass we need to wrap second raise statement in the if condition:

if order[:kind] != ["private"]
  raise InvalidOrderError.new("Order kind can not be empty")
end

The implementation for it_does_not_fail looks like that:

class ItDoesNotFail
  def initialize(spec)
    @spec = spec
  end

  def when_order_kind_is(value)
    @spec.it("does not fail when order kind is #{value.inspect}") do
      expect { validator.validate({items: 42, kind: value}) }
        .not_to raise_error
    end
  end
end

def self.it_does_not_fail
  ItDoesNotFail.new(self)
end

Let’s write our next test:

it_does_not_fail.when_order_kind_is ["corporate"]

And it fails with the expected error: expected no Exception, got #. The fix is to amend our if condition with that case:

if order[:kind] != ["private"] && order[:kind] != ["corporate"]
                                # ^  we have added this case  ^
  raise InvalidOrderError.new("Order kind can not be empty")
end

And the tests pass. Our next business rule is that one of private and corporate should be always present:

it_fails_with("Order kind should be 'private' or 'corporate'")
    .when_order_kind_is ["bundle"]

As expected the test fails:

expected InvalidOrderError with
  "Order kind should be 'private' or 'corporate'",
got #

And to fix it we just need to sprinkle another if statement in the middle of the function:

if order[:kind] == ["bundle"]
  raise InvalidOrderError.new("Order kind should be 'private' or 'corporate'")
end

As expected, the test passes. Now we should test the next business rule - order can not be of private and corporate kind at the same time:

it_fails_with("Order kind can not be 'private' and 'corporate' at the same time")
    .when_order_kind_is %w(private corporate)

This, as expected, fails with error message:

expected InvalidOrderError with
  "Order kind can not be 'private' and 'corporate' at the same time",
got #

And easiest way to fix that is to add another if statement:

if order[:kind] == %w(private corporate)
  raise InvalidOrderError.new(
      "Order kind can not be 'private' and 'corporate' at the same time"
  )
end

And it passes. Let’s test that we can combine private or corporate with bundle order kinds:

it_does_not_fail.when_order_kind_is %w(private bundle)

And it fails with error: expected no Exception, got #. To fix this we will have to amend our last if condition in the function even more:

if order[:kind] != ["private"] && order[:kind] != ["corporate"] &&
    order[:kind] != %w(private bundle)
  # ^    this is our new condition    ^
  raise InvalidOrderError.new("Order kind can not be empty")
end

And the test passes. Let’s refactor the code a bit:

First, we should extract order[:kind] duplication to a local variable kind
Extract common parts of raise statement to the private method

After this, OrderKindValidator will look a bit cleaner:

class OrderKindValidator
  def validate(order)
    kind = order[:kind]

    if kind == ["invalid"]
      fail_with("Order kind can be one of: 'private', 'corporate', 'bundle'")
    end

    if kind == ["bundle"]
      fail_with("Order kind should be 'private' or 'corporate'")
    end

    if kind == %w(private corporate)
      fail_with("Order kind can not be 'private' and 'corporate' at the same time")
    end

    if kind != ["private"] && kind != ["corporate"] &&
        kind != %w(private bundle)
      fail_with("Order kind can not be empty")
    end
  end

  private

  def fail_with(message)
    raise InvalidOrderError.new(message)
  end
end

Let’s write our next test for the same business rule (now a corporate bundle):

it_does_not_fail.when_order_kind_is %w(corporate bundle)

And it fails with error: expected no Exception, got #. To fix this we need to add && kind != %w(corporate bundle) to our last if condition again.

The code can be found in GitHub repository in an open pull request here.

Now it seems that we have implemented all the business rules (we have all tests for them). Or did we?

Bottom Line

Buggy if-riddled code is what we’ve got. We will see why in the next part of “Getting Stuck While Doing TDD” series. Stay tuned!

This is a series of articles:

Part 1: Example (reading this)
Part 2: Buggy Code and Forcing Our Way Through
Part 3: Triangulation to the Rescue!

Today we have implemented our not-so-complex problem at hand while following 3 rules of TDD. The result was not of the best quality and we will take a look why in further articles of these series. You would not want to miss next articles on this tech blog, we still have a lot to talk about:

Triangulation technique in Test-Driven Development - overlooking this technique might cause one fail at doing TDD (these series),
Continuous Integration and Continuous Delivery - importance of not impeding others,
Open-Closed Principle - changing behavior by adding new code,
Mutational Testing, “Build Your Own Testing Framework” series, Test-Driven Development screencasts and so much more!

Thanks!

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

Eliminating 'if' Statements: Legacy Endpoint Primer

2016-08-20T11:59:37+02:00

if statements tend to duplicate throughout the code base. This may lead to subtle mistakes and bugs. One way to avoid that problem is to eliminate if statement completely. Today we are going to take a look at one example of such elimination. Code examples today will be in Kotlin.

Problem at hand

Our API has an endpoint for issuing some sort of verification token given device id and phone number of the user’s mobile device.
We need to integrate these verification tokens with 3rd party API.
The format of verification token is fairly standardized.
After initial research, it turned out that issuer field of verification token has to be URL of the API that has issued that token and 3rd party API in question validates this fact.
Currently, issuer field gets generated as com.tddfellow. According to this standard, it has to be https://tddfellow.com.
Additionally, we have to support old versions of mobile clients for next 6 months, that are validating issuer to be com.tddfellow, we can not change them as they are already installed on users’ mobile devices.

Solution: bump the version of our API from v1 to v2 and use v1 for integration with old mobile clients and use v2 for integration with 3rd party API and all new clients.

Current Relevant Code

Main program, containing routing information:

// Main.kt

fun main(args: Array<String>) {
    val secureTokenSource = SimpleSecureTokenSource()
    val verificationTokenGateway = DummyVerificationTokenGateway()

    val issueVerificationTokenUseCase = UseCase(secureTokenSource, verificationTokenGateway)
    val issueVerificationTokenEndpoint = Endpoint(issueVerificationTokenUseCase)

    Spark.get("/api/v1/issueVerificationToken") { request, response ->
        issueVerificationTokenEndpoint.issueVerificationToken(
                deviceId = request.queryParams("deviceId"),
                phoneNumber = request.queryParams("phoneNumber")
        )
    }
}

Endpoint that issues verification token:

// ApiEndpoints/IssueVerificationToken/Endpoint.kt

class Endpoint(private val useCase: UseCase) {

    fun issueVerificationToken(deviceId: String, phoneNumber: String): IssueVerificationTokenEndpointResponse {
        val issueVerificationToken = useCase.issueVerificationToken(deviceId, phoneNumber)

        return IssueVerificationTokenEndpointResponse(
                issuer = issueVerificationToken.issuer,
                token = issueVerificationToken.secureToken
        )
    }

}

And the UseCase itself:

// IssueVerificationToken/UseCase.kt

open class UseCase(private val secureTokenSource: SecureTokenSource,
                   private val verificationTokenGateway: VerificationTokenGateway) {

    open fun issueVerificationToken(deviceId: String, phoneNumber: String): VerificationToken {
        val verificationToken = VerificationToken(
                issuer = "com.tddfellow",
                deviceId = deviceId,
                phoneNumber = phoneNumber,
                secureToken = secureTokenSource.generateToken()
        )

        verificationTokenGateway.persist(verificationToken)
        return verificationToken
    }

}

Code can be found here.

Solution With Awkward `if` Statement

Easiest solution using passed in apiVersion from the Main program and switch on it being old or new in the use case to determine which issuer to generate:

// Main.kt

Spark.get("/api/v1/issueVerificationToken") { request, response ->
    issueVerificationTokenEndpoint.issueVerificationToken(
            deviceId = request.queryParams("deviceId"),
            phoneNumber = request.queryParams("phoneNumber"),

            // here we are passing "old" version to the endpoint
            apiVersion = "v1"
    )
}

Spark.get("/api/v2/issueVerificationToken") { request, response ->
    issueVerificationTokenEndpoint.issueVerificationToken(
            deviceId = request.queryParams("deviceId"),
            phoneNumber = request.queryParams("phoneNumber"),

            // here we are passing "new" version to the endpoint
            apiVersion = "v2"
    )
}

And the endpoint just passes this value through to the use case:

// ApiEndpoints/IssueVerificationToken/Endpoint.kt

fun issueVerificationToken(deviceId: String,
                           phoneNumber: String,
                           apiVersion: String): IssueVerificationTokenEndpointResponse {

    val issueVerificationToken = useCase.issueVerificationToken(
            deviceId = deviceId,
            phoneNumber = phoneNumber,

            // here we are passing API version through
            apiVersion = apiVersion
    )

    return IssueVerificationTokenEndpointResponse(
            issuer = issueVerificationToken.issuer,
            token = issueVerificationToken.secureToken
    )
}

And finally the if statement in the use case:

// IssueVerificationToken/UseCase.kt

private val OLD_ISSUER = "com.tddfellow"
private val NEW_ISSUER = "https://tddfellow.com"
private val OLD_API_VERSION = "v1"

fun issueVerificationToken(deviceId: String, phoneNumber: String, apiVersion: String): VerificationToken {
    val verificationToken = VerificationToken(
            issuer = getIssuerFor(apiVersion),
            deviceId = deviceId,
            phoneNumber = phoneNumber,
            secureToken = secureTokenSource.generateToken()
    )

    verificationTokenGateway.persist(verificationToken)
    return verificationToken
}

private fun getIssuerFor(apiVersion: String): String {
    if (apiVersion.equals(OLD_API_VERSION)) {
        return OLD_ISSUER
    }
    return NEW_ISSUER
}

The full code change available here (via open Pull Request).

This solution has quite a few problems:

if statement smells a bit
use case probably should not have any knowledge of apiVersion, since APIs is not our domain, it is just a delivery mechanism

If we were to pass some object, like TokenIssuer, it would probably be more appropriate to have use case know of it. Let’s try to refactor:

Refactoring `if` Statement Using Polymorphism

First, let’s start passing in the token issuer in the routing:

// Main.kt

Spark.get("/api/v1/issueVerificationToken") { request, response ->
    issueVerificationTokenEndpoint.issueVerificationToken(
            deviceId = request.queryParams("deviceId"),
            phoneNumber = request.queryParams("phoneNumber"),

            // here we pass a specific object now for old token issuer:
            tokenIssuer = OldTokenIssuer()
    )
}

Spark.get("/api/v2/issueVerificationToken") { request, response ->
    issueVerificationTokenEndpoint.issueVerificationToken(
            deviceId = request.queryParams("deviceId"),
            phoneNumber = request.queryParams("phoneNumber"),

            // here we pass an object that returns URL according to the standard
            tokenIssuer = UrlTokenIssuer()
    )
}

And this is how TokenIssuer and its derivatives are looking like:

// IssueVerificationToken/TokenIssuer.kt

interface TokenIssuer {

    fun getName(): String

}

// IssueVerificationToken/OldTokenIssuer.kt

class OldTokenIssuer : TokenIssuer {

    override fun getName() = "com.tddfellow"

}

// IssueVerificationToken/UrlTokenIssuer.kt

class UrlTokenIssuer : TokenIssuer {

    override fun getName() = "https://tddfellow.com"

}

As you might guess, endpoint just passes this object through to the use case. And the use case itself just calls getName() on it when generating issuer:

// IssueVerificationToken/UseCase.kt

fun issueVerificationToken(deviceId: String, phoneNumber: String, tokenIssuer: TokenIssuer): VerificationToken {
    val verificationToken = VerificationToken(

            // here is how much simpler it becomes
            issuer = tokenIssuer.getName(),

            deviceId = deviceId,
            phoneNumber = phoneNumber,
            secureToken = secureTokenSource.generateToken()
    )

    verificationTokenGateway.persist(verificationToken)
    return verificationToken
}

Full code change can be seen here (in the Pull Request).

Bottom Line

This code may be refactored further so that even Endpoint class will not have to know about tokenIssuer and pass it through. I will leave that as an exercise to you, my dear reader.

You would not want to miss next articles on this tech blog, we still have a lot to talk about:

Continuous Integration and Continuous Delivery - importance of not impeding others,
Open-Closed Principle - changing behavior by adding new code,
Triangulation technique in Test-Driven Development - overlooking this technique might cause one fail at doing TDD,
Mutational Testing, “Build Your Own Testing Framework” series, Test-Driven Development screencasts and so much more!

Thanks!

Thank you for reading, my dear reader. If you liked it, please share this article on social networks and follow me on twitter: @tdd_fellow.

If you have any questions or feedback for me, don’t hesitate to reach me out on Twitter: @tdd_fellow.

That TDD Fellow | Tech Blog | Screencasts

Why Best Software Engineering Practices Are Insanely Important, Really

Holy Grail of Software Development

Whether You Know it or Not, You’re Still Using Practices.

Not Every “Best” Practice is Always Good — Context is King

Software Engineering Practices Are Based on Principles

Where do Principles Come From? How to choose?

Example Values That I Like to Optimize For

Example Software Engineering Principles

Example Software Engineering Practices

Values, Principles & Practices Are Technology-Agnostic

Conclusion

Are We Addicted to Complexity?

Feedback Time at the End of the Pairing Session

Pairing ~100% of the Time

Chemistry

Five Minutes Feedback Time

Conclusion

Thanks

On Being More Productive in the Morning. Even if You Are an Owl

Mornings That Start with a Lunch

A Lot of Rework

Productive Mornings with Momentum

Thanks

Drake's 24 Hours Challenge or How to Be Less Negative

Drake’s Challenge

How I Apply It

My Experience So Far

Thanks

Build Your Own Testing Framework. Part 6: Test Suite Does Not Run All Tests!

Verify All Tests Run

process.exit with hooks

Installing the “verify all tests run” hook

Fixing test suites to run all tests

Conclusion

Thanks

Learning Test-Driven Development With Javascript: Laws of TDD

Basics of Test-Driven Development

Three Rules of Test-Driven Development

Feedback Loop Benefits

English Numbers Kata

Exercises

Conclusion

Thanks

Learning Test-Driven Development With Javascript: End-to-End Testing

User Story

Exercises

Setting Up the Project

Crash Course into Jasmine

Writing our First Simple Tests

Three “A”s: Arrange, Act, and Assert

Exercises

Writing our First End-to-End Test

Exercises

Bottom Line

Thanks

Mobile Waterfall. Being Agile Again

Being Agile Again

Example of the DSL

Bottom Line

Thanks

Understanding Legacy Code Using Explorative Test-Driven Development Technique

Legacy Code

Knowledge in Production Code

Mutation

Code and Test Suite Relationship

Most Useful Coverage Metric

Mutational Testing

Explorative Test-Driven Development

Step-by-Step Example

Narrow & Isolate

Want more articles like this delivered to your inbox?

Trying to Understand & Writing 1st Test

Making It Pass

Applying Mutational Testing

Can Explorative TDD Help Me Outside of Legacy Code?

Bottom Line

Thanks

Build Your Own Testing Framework. Part 5

Catch and report a test failure

`process.exit` with hooks

Solution With Awkward `if` Statement

Refactoring `if` Statement Using Polymorphism