Notes on B. Meyer's "On Formalism in Specifications"

SE 507
Notes on Bertrand Meyer's On Formalism in Specifications from January 1985 issue of IEEE Software (pp. 6-26)

Note that the full text of this paper is available at the IEEE Computer Science Digital Library, a hyperlink to which can be found on the U of Scranton Library's list of databases.

Overview

Specification is the phase of the software lifecycle concerned with precise definition of the tasks to be performed by the system. (Notice on page 7 the figure depicting Royce's waterfall model of the software life cycle, the phases of which are requirements, specification, design ("global" and then "detailed"), implementation, validation, distribution, and operation.)

Although SE textbooks emphasize its importance, in practice the specification phase is often overlooked, being confused with either the preceding phase, definition of system objectives (during which a natural-language requirements document is produced), or the following phase, design.

In the former case, the requirements document is deemed sufficient to proceed to system design without further specification activity.

Meyer's paper emphasizes the drawbacks of such an informal approach and attempts to show the usefulness of formal specifications as a complement to, not a replacement for, natural-language requirements. It also attemps to show how a formal specification can be used to improve/clarify the natural-language descriptions of requirements.

Seven Sins of the Specifier

In natural-language requirements, one can find recurring patterns/classes of deficiencies, or "sins". Among the most common and damaging are

Noise: Presence in the text of an element that fails to carry information relevant to any feature of the problem. Variants include
- redundancy: in which old information is repeated, but using different terms/language, thereby giving the impression that something new is being introduced, and
- remorse: in which the meaning of a term defined earlier is qualified, as though the author was sorry for the original definition.
Silence: Existence of a feature of the problem that is not covered by any element of the text.
Overspecification: Presence in the text of an element that corresponds not to a feature of the problem but to features of a possible solution.
Contradiction: Presence in the text of two or more elements that define a feature of the system in incompatible ways.
Ambiguity: Presence in the text of an element making it possible to interpret a feature of the problem in at least two different ways.
Forward Reference: Presence in the text of an element that uses features of the problem not defined until later in the text.
Wishful Thinking: Presence in the text of an element that defines a feature of the problem in such a way that a candidate solution cannot realistically be validated with respect to this feature.

This classification is interesting for at least two reasons:

It gives some weight to the thesis that formal specifications are needed as an intermediate step between requirements and design.
It provides a checklist of common mistakes for those who write natural-language requirements (which, after all, are still necessary), which may help to prevent them from making such mistakes.

Question: Suppose that writers of natural-language requirements were to stop committing such sins and write only requirements documents of very high quality. Would this solve the problem?

Answer: Meyer thinks not! In his view, a natural-language description of any significant system, even a description of good quality, exhibits deficiencies making it unacceptable for rigorous software development.

Illustration of a Particular Requirements Document

To illustrate the point, Meyer chooses a very simple text-formatting problem that was described (in natural language) in a 1969 paper by Peter Naur (Programming by Action Clusters, BIT, Vol. 9, No. 3, 1969, pp. 250-258.) The main point of Naur's paper was to present an algorithm that solves the problem and to prove the algorithm's correctness.

Naur's description was as follows:

Given a text consisting of words separated by BLANK or NEWLINE (newline) characters, convert it to a line-by-line form in accordance with the following rules:

line breaks must be made only where the given text has BLANK or NEWLINE;

each line is filled as far as possible, as long as

no line will contain more than MAXPOS characters.

Goodenough and Gerhart (henceforth, G&G) subsequently wrote two papers about program testing that addressed Naur's problem, criticizing not only his description but also his (very flawed) solution.

The historical backdrop is that G&G were defending the notion of testing as a useful technique, in oppostion to those (e.g., Dijkstra) who put more emphasis on proving the correctness of programs. (Dijkstra famously said in his 1972 ACM Turing Award lecture, "Testing can be a very effective way to show the presence of bugs, but it is hopelessly inadequate for showing their absence.")

G&G not only found several deficiencies in Naur's problem description, but also found that his solution had major flaws —including that it would terminate only if the input data were invalid (in a particular way)! This demonstrated that even a program that had been "proved correct" could be incorrect! (This is not a contradiction; rather, it reminds us that proofs can possess errors, too.)

G&G offered an improved (but much longer) description of the problem in their first paper ("Towards a Theory of Test Data Selection", by J.B. Goodenough and S. Gerhart, IEEE Transactions on SE, Vol. Se-1, No. 2, June 1975, pp. 156-173). In their second paper ("Towards a Theory of Test: Data Selection Criteria", in Current Trends in Programming Methodology, Vol. 2, edited by R.T. Yeh, Prentice-Hall, 1977, pp. 44-79.), they acknowledge that their improvement of Naur's description still left something to be desired, so they gave yet another one, which appears in Figure 2, page 11, of Meyer's paper.

Analysis of Goodenough's & Gerhart's Specification

Meyer's first observation is that G&G's specification is four times as long as Naur's (resulting, no doubt, from their efforts to "leave no stone unturned" and to eliminate all ambiguity), and seems inappropriately lengthy for such a simple problem.

Meyer then goes on to point out several examples of where G&G commit six of the seven "sins" described earlier.

Noise: Noise isn't always bad; sometimes it can play the same role in a specification as comments do in a program. But often, noise elements obscure the text in that a reader, upon first encountering such an element, thinks it brings new information, but upon closer examination realizes that it only repeats known information in a new way.

Example: nonempty sequence (in line 8) is the same thing as sequence of one or more characters (9). In technical writing, it is better to use the same term each time the same concept is referred to.

Remorse: This is a variant of noise in which a term is used, but qualified in order to restrict or modify its earlier definition, as though the author suddenly regretted the initial definition.

Example: the output text, if any (20). Up to this point, the specification freely used the notion of output text (12,17) without hinting that it might not exist. Even here, no criterion is given for determining whether or not the output text exists. (Note: Meyer seems to be playing dumb here; a more likely interpretation is that a distinction is being made between output text that is empty (i.e., of length zero) and output text that is nonempty (i.e., of length one or more).)

Silence: Often a specifier will fail to address some vital features of the problem, or address them inadequately.

Example 1: line, which is not really defined except in a parenthetical bit of remorse in (24), where it is described as a sequence of characters "between successive NEWLINE characters".
An interesting point here is the cultural background necessary to understand this concept. In ASCII-oriented environments, newline (often denoted by \n) is an (ordinary) character signaling the end of a line of text. In an IBM-like environment, a text file is a sequence of records (each one corresponding to a "line"), not just a sequence of characters (some of which happen to be newlines).
Besides, the late definition of line is wrong insofar as it captures only the notion of "interior" lines (ones not occurring at the beginning or end). The first and last lines, after all, are not sandwiched by NEWLINE's.
If we accept G&G's definition of line, the first and last lines of output can be arbitrarily long! Hence, if the output contains no NEWLINE's at all, it is acceptable!
Example 2: Line (16) says that variable Alarm should be set to TRUE in case of an error, but nothing is said about what value should be assigned to it in other cases. The answer is obvious, but it remains unstated.
Note: Meyer observes that the problem being addressed might better be viewed as two separate problems, one of which is to compress runs of break characters into a single character and the other of which is to divide the text into maximal-length "lines" bounded in length by a parameter (MAXPOS). (If we can solve each of these problems, we can use piping, as is commonly employed in UNIX environments, to solve the original problem. Piping refers to feeding the output of one program as input to another program.) End of Note

Contradictions: arise from elements of the text that result in incompatible interpretations.

Example 1: Of what form is the input? In line (1), it is a stream of characters, but in line (10) it can be viewed as a sequence of words and breaks. These correspond, respectively, to seq[CHAR] and seq[seq[CHAR]] (each word and break being of type seq[CHAR], of course).
Meyer illustrates the difference with an example that any LISP programmer would appreciate:
(a b a c c a) vs. ((a) (b a) (c c a))

The former sequence is composed of CHAR's whereas the latter is composed of sequences, each of which is composed of CHAR's.
Example 2: Similarly, it is not clear whether the data type of the output text should be taken to be
- seq[CHAR], as lines (21-22) suggest, or
- seq[WORD], that is, seq[seq[CHAR]], as lines (12-13) suggest (interpreting WORD to be seq[CHAR]),or
- seq[LINE], that is, seq[seq[seq[CHAR]]], which would be reasonable even if it is not really suggested by any particular statement in the specification. This would correspond to interpreting the output to be a sequence of lines, each of which is a sequence of words (or of both words and breaks).
Meyer asserts that lines (12-13) (he mistakenly identifies them as (13-14)), which say
The program's output should be the same sequence of words as in the input...
are remarkable in that neither the input nor the output is a sequence of words! Meyer's language seems a bit strong in that he has entertained at least the possibility that they are, indeed, sequences of words.
Meyer goes on to say (at the top of page 12) that if the input is interpreted to be a sequence of words, in order to produce correct output we must have two additional pieces of information: whether there was a leading blank and whether there was a trailing blank. Apparently, Meyer's interpretation of the specification is that the output must include a leading blank if the input does, and similarly for a trailing blank. (What if the input had a leading or trailing NEWLINE?)
Example 3: Line (11) gives rise to another contradiction in that it says that the input ends with ET and at the same time says that the input can have trailing blanks. (If there can be blanks following the occurrence of ET, then apparently the input need not end with ET!!)

Overspecification: The reader is told too much about a possible solution. Programmers, understandably, tend to make this mistake in writing requirements documents.

Example 1: The ET character, which is mentioned in (2,6,7,11), is an implementation detail, relevant to programming languages that have no explicit means for detecting end-of-text (and hence rely upon there being a sentinel character to signal the end). Also, the specification, despite assuming that the input ends with ET, fails to say that the output should, too, which means that the output text will not be suitable for use as input for any similarly specified textual manipulation. (This is not necessarily wrong, but one suspects that it is an oversight.)
Rather than talk about an ET character marking the end of input, it would be more useful to talk about input as being a finite sequence of characters. Exactly how one detects the end of the sequence is an implementation matter.
Example 2: Error exit (line 16), causing (variable) ALARM to assume the value TRUE. But a variable is internal to the program unit to which it belongs; indeed, the notion of a variable belongs to the world of programs, not specifications (at least the kind of specification that this is intended to be). It would have been better to talk in terms of an exception being thrown, perhaps.

Ambiguities:

Example 1: Lines (16-17) say that the output text should satisfy properties 1 to 4 up to the point of an error. Which is when? For example, in Figure 3, is the point of an error [row 4, column 10] or [row 3, column 7]?
Example 2: On line (23), which says that as many words as possible should be placed on each line, it should be qualified to say that the last line is an exception. (Example: WHO WHAT WHEN and MAXPOS = 10).
Example 3: Use of stream in some places and sequence in others.
Example 4: (15) talks about an error exit, which is explained in (16) as ALARM taking on the value TRUE. But how does the latter action imply that the program exits?
Example 5: Line is not well-defined (24).
Example 6: The relationship between "new line" as used in (5) and as used in (19) is not clear.

Forward References:

Example 1: ET is used three times (2,3,6) before it is defined (7).
Example 2: The notion of line is defined at (24) but used in (19-20).
Example 3: (instructor's example, not Meyer's) The notion of "as many words as possible should be placed on each line" (23) makes no sense without the condition stated afterwards (25-26) regarding the limits on the length of a line.

So what??

If great care can be taken to describe such a simple problem, and it still comes out bad, imagine how much more difficult it is to give a good description of a complicated problem, possibly one related to something that puts lives and/or property at stake, like nuclear reactor control, or missile guidance, or even payroll.

In Meyer's opinion, the situation can be improved significantly by a reasoned use of more formal specifications, which would serve as a complement to (but not a replacement for) natural-language documents. Indeed, one can often use a formalized specification (and the insights that arose during its development) to formulate a better natural language version.

Elements for a formal specification

Most languages/notations for expressing specifications formally are based upon well-known mathematical concepts such as sets, functions, relations, and sequences. So, rather than choose any particular formal specification language (e.g., Z, B, Larch), Meyer uses traditional mathematical notation (for sets, functions, etc.) to develop a formal specification for Naur's text formatting problem.

There are essentially three aspects to solving this problem:

reducing each break (in the input text) to a break of length one (in the output text)
ensuring that no "line" (in the output text) exceeds MAXPOS characters
filling each "line" (in the output text) as much as possible

It will simplify matters to think of these three semi-independently.

Note: Meyer seems to be guilty here of using language that is suggestive of a method of solution, when he should be describing (only) the desired relationship between input text and output text, as well as any additional conditions that the output text must meet. End of note.

As for the first item, (informally) define the binary relation short_breaks ⊆ seq[CHAR] × seq[CHAR] by

short_breaks ::= { (x,y) | y can be obtained from x by removing break characters until each break has length one }

Recall that a break, within a sequence of characters (i.e., a value of type seq[CHAR]), is a maximal (contiguous) subsequence of characters in the set BREAK_CHAR = { BLANK, NEWLINE }.

As for the second item, (informally) define the binary relation limited_length ⊆ seq[CHAR] × seq[CHAR] by

limited_length ::= { (x,y) | no "line" in y exceeds length MAXPOS ∧ y can be obtained from x by replacing zero or more occurrences of NEWLINE by BLANK and zero or more occurrences of BLANK by NEWLINE }

If we take the relation product/composition limited_length º short_breaks, we get the set

{ (x,z) | z can be obtained from x by replacing each break in x by a break of length one ∧ no "line" in z exceeds length MAXPOS }

Note that the first conjunct says not only that each break in z is of length one but also that the sequence of "words" (those contiguous subsequences of characters appearing between breaks!) in z corresponds to those in x. And the latter is the relationship that we want between input text and output text!

Note: Different authors use different notations for the relation product/composition operation. Here I have used notation consistent with Meyer. However, Gries & Schneider would have written it with the two operands in the opposite order. (See Chapter 14 of their book.) End of note.

Note: Had we strengthened the definition of short_breaks to say that y is obtained by replacing every break in x by a single occurrence of BLANK, then we could have simplified the description of limited_length by omitting the part allowing NEWLINEs being replaced by BLANKs. End of Note.

For a relation R⊆A×B and x∈A, define R.x = { y | (x,y)∈R }.

Note: Using a slightly generalized definition of relation composition (from that usually found in textbooks), what we are here calling R.x is just R º {x}.
End of note.

Let's give the name ll_sb to the relation limited_length º short_breaks. Then, for x∈seq[CHAR],

ll_sb.x = { z | (x,z) ∈ ll_sb }

is the set of texts that can be obtained by replacing breaks in x by breaks of length one and ensuring that no "line" in the resulting text has a length exceeding MAXPOS.

Now consider the function FEWEST_LINES: P(seq[CHAR]) → P(seq[CHAR]), which, given a set X of texts, yields the subset of X containing precisely those texts having the fewest number of "lines". (The number of lines in a text could be defined as being one more than the number of occurrences of NEWLINE in that text.) Note that P(A) denotes the set of all subsets of A. Hence, a function having domain P(A) is one that "expects" to be given an argument that is a set whose members are elements of A, and a function having P(A) as its range is one that, when applied to an element of its domain, yields as a result a set whose members are elements of A.

Then FEWEST_LINES(ll_sb.x) is the set of texts that

can be obtained by replacing breaks in x by breaks of length one,
have no lines exceeding length MAXPOS, and
have the fewest number of lines among all texts satisfying the previous two conditions.

According to Meyer, for any input text x, any member of FEWEST_LINES(ll_sb.x) is an acceptable output text. (Do you agree? If not, where is the flaw in Meyer's thinking?)

That is, the relation goal ⊆ seq[CHAR] × seq[CHAR] that contains precisely those pairs (x,y) such that y is an acceptable output for input x is

goal ::= { (x,y) | y ∈ FEWEST_LINES(ll_sb.x) }

What we have done so far is to give a semi-formal specification of the problem.

A More Formal Specification

Sequences and Subsequences

Meyer's formal specification relies very much upon the concepts of sequence and subsequence, which he defines on page 19 in a manner similar to how they are defined in the specification language Z.

Definition: A sequence S of elements of type A having length n is a function with domain 1..n (the set of natural numbers {1,2,...,n}) and range A. For example, the sequence

S = < spock, kirk, gorn, spock, uhura, mccoy >

is a function with domain 1..6 and range

star_trek_character = { kirk, spock, mccoy, uhura, gorn, sulu, checkov, ... }

such that S(1) = spock, S(2) = kirk, ..., S(6) = mccoy.

The set of sequences of elements of type A is denoted by seq[A]. Our problem is with respect to an input text and an output text, each of type seq[CHAR].

Informally, if x,y ∈ seq[A] and y can be obtained by "erasing" zero or more elements from x, we say that y is a subsequence of x. For example,

T = < kirk, gorn, spock, mccoy >

is a subsequence of S that was obtained by erasing the first and fifth elements of S.

Formally, y is a subsequence of x if there exists an increasing function f : 1..m → 1..n (i.e., a sequence of natural numbers!), where length(y) = m and length(x) = n, such that, for all i in 1..m, y(i) = x(f(i)). That is,

isSubsequenceOf(y,x) ::= (∃f : 1..length(y) → 1..length(x) |: (∀i | 1<i≤length(y) : f(i-1) < f(i)) ∧ (∀i | 0<i≤length(y) : y(i) = x(f(i))))

In our S/T example, such a function is f(1) = 2, f(2) = 3, f(3) = 4, f(4) = 6.

Following Meyer's approach, define the (family of) function(s) SUBSEQUENCE : seq[A] → P(seq[A]) by

SUBSEQUENCE(x) ::= { y |: isSubsequenceOf(y,x) }

Defining `short_breaks` Formally

Define SINGLE_BREAKS : seq[CHAR] → P(seq[CHAR]) as follows:

SINGLE_BREAKS(x) ::= { y ∈ SUBSEQUENCE(x) | (∀i | 1<i≤length(x) : y(i-1) ∈ BREAK_CHAR ⇒ y(i) ∉ BREAK_CHAR) }

In other words, y ∈ SINGLE_BREAKS(x) iff y is a subsequence of x having breaks of length one.

The trouble with SINGLE_BREAKS(x) is that it includes subsequences of x obtained by removing not just "extra" break characters, but also non-break characters or entire breaks (so that adjacent words have joined to become one word)! What we really want (in order to realize the short_breaks relation) are all members of SINGLE_BREAKS(x) having maximum length. (Such texts must have been obtained without erasing any non-break characters and without erasing entire breaks.)

To achieve this, Meyer uses the function MAX_SET, which takes as arguments a set A of texts and a function f (where f maps texts to numbers) and yields that subset of A containing precisely those members having maximum value when f is applied to them. That is,

MAX_SET(A,f) ::= { x∈A | f(x) = (max z | z∈A : f(z)) }

This gives rise to the definition

COMPACTED(x) ::= MAX_SET( SINGLE_BREAKS(x), length)

which yields the set containing all subsequences of x obtained by erasing from x only "extra" break characters (so that no break is erased entirely and no non-break characters are erased at all).

Now we can define the relation short_breaks:

short_breaks(x,y) ::= y ∈ COMPACTED(x)

Defining `limited_length` Formally

Define EQUIVALENT ⊆ seq[CHAR] × seq[CHAR] by

EQUIVALENT ::= { (u,v) | length(u) = length(v) ∧ (∀i | 1≤i≤length(u) : u(i) ≠ v(i) ⇒ u(i)∈BREAK_CHAR ∧ v(i)∈BREAK_CHAR) }

This says that (u,v) ∈ EQUIVALENT holds iff u and v are identical, except where one has an occurrence of NEWLINE, the other may have an occurrence of BLANK.

Define noLinesLongerThan : seq[CHAR] × Z → BOOLEAN informally by

noLinesLongerThan(u,k) ::= true if every substring of u of length k+1 includes at least one occurrence of NEWLINE, false otherwise.

It is left to the reader to provide a formal definition.

Now we can formally define limited_length:

limited_length ::= { (u,v) ∈ EQUIVALENT | noLinesLongerThan(v,MAXPOS) }

What mostly remains is to give a formal definition of FEWEST_LINES:

FEWEST_LINES(A) = MIN_SET(A, #new_lines)

where

#new_lines(u) = (#i | 1≤i≤ length(u) : u(i) = NEWLINE)

Improved Natural Language Specification

Using the insights gained from developing the formal specification, Meyer offers an improved informal specification in Figure 5 (and modified slightly here by the instructor):

Given are a nonnegative integer MAXPOS and a character set including two "break" characters, BLANK and NEWLINE. A substring is defined to be a contiguous subsequence of a sequence of characters. A break is defined to be a maximal substring containing only break characters. (Here, "maximal" means that any character preceding or following the substring is a non-break character.)
The program shall accept as input a finite sequence of characters and produce as output a sequence of characters satisfying the following conditions:

It differs from the input only insofar as each break in the latter is replaced by a single break character in the former.

Any substring of the output of length MAXPOS+1 includes at least one occurrence of NEWLINE.

The number of occurrences of NEWLINE in the output is minimal (among all sequences of characters satisfying the previous two conditions).

SE 507 Notes on Bertrand Meyer's On Formalism in Specifications from January 1985 issue of IEEE Software (pp. 6-26)