Top banner
MY AMERICAN SCIENTIST
SEARCH

COMPUTING SCIENCE

# Why W?

No, not that W. I won't be drawn into presidential politics here. The W I want to discuss is something else entirely: the Lambert W function, a mathematical contrivance that has been getting a fair amount of attention lately. The buzz began in the world of computer-algebra systems such as Macsyma, Maple and Mathematica, but word of W has also been spreading through journal articles, preprints, conference presentations and Internet news groups. The W function even has its own poster (see http://www.orcca.on.ca/LambertW).

The concept at the root of W can be traced back through more than two centuries of the mathematical literature, but the function itself has had a name only for the past 10 years or so. (A few years longer if you count a name used within the Maple software but otherwise unpublished.) When it comes to mathematical objects, it turns out that names are more important than you might guess.

Without further ado, here is the definition of  Lambert W: It is the inverse function associated with the equation:

WeW = x.

What does that mean? Some readers of this column will grasp it instantly, but I am not going to pretend that I am one of them. It took me a while to figure out how W works, and even longer to see why the concept might be considered interesting or important. At the risk of inflicting severe tedium on those who are more adept at algebra and analysis, I want to retrace my own path toward understanding what W is all about. It's a fairly long and wiggly path.

Single U and W

In trying to make sense of the expression WeW , the first question is not "why W?" but "why e?" The e in the formula is Euler's number, the second-most-famous constant in all of mathematics, often introduced as "the base of the natural logarithms"; but then the natural logarithms are usually defined as "logarithms taken to the base e," which is not much help. Another way of defining e, originally derived from the study of compound interest, avoids this circularity: e is the limiting value of the expression (1+1/n) n as n tends to infinity. Thus we can approximate the value of e by setting n equal to some arbitrary, large number. With n=1,000,000, for example, we get six correct digits of e: 2.71828.

Now, with e in hand, consider an equation somewhat simpler than the one for W; we might call it single-U:

eU = x.

This equation defines the exponential function,  also written exp(U). The function maps each given value of U to a corresponding value of x, namely e raised to the power U. If U is a positive integer, we can calculate the function's value by simple arithmetic: Just multiply e by itself U times. For nonintegral U, the procedure is not quite so obvious but is still well-defined; the left part of Figure 1 shows the elegant curve generated.

The equation eU = x also defines an inverse function; we just need to read the equation backward. Whereas the forward function maps a value of U to a value of x, the inverse function takes a value of x as input and returns the corresponding value of U. In other words, the inverse function finds the power to which e must be raised to yield a given value of x. This is another well-known, textbook function: the natural logarithm, written log(x) or ln(x). The log function has the same graph as the exponential function but reflected across the diagonal, as shown in the right part of Figure 1.

Sometimes it's helpful to think of functions like these as if they were machines. The exp function works like a meat grinder: Dump a U into the input hopper, turn the crank, and out comes an x equal to eU . The same machine can calculate logarithms if we run numbers through it backwards and turn the crank the other way, but there is an important caveat: When the machine runs in reverse, some inputs can jam the gears. For example, what output should the machine produce if you ask it for the logarithm of 0 or -1? (If you have a scientific calculator handy, see how a real machine answers these questions.)

The problem is that the logarithm function works only over a limited domain. Exponentiation is defined over the entire real number line; any real value of U, whether positive or negative, produces a value of exp(U), and that value is always a positive real number. The inverse function is not so well-mannered: As the right side of Figure 1 suggests, log(x) is defined only if x is positive. This limitation can be sidestepped by venturing off the real number line into the wilds of the complex plane. If the value of log(x) is allowed to be a complex number, with both a real and an imaginary part, then log(-1) has a definite value: According to a famous formula of Euler, it is equal to π i, where i is the imaginary unit, the square root of -1. The W function is also defined throughout the complex plane, but in this article I shall confine myself to the straight and narrow path of real numbers. However, see Figure 4.

One more simple fact about logarithms will be needed below. The logarithm of the product of two numbers is equal to the sum of the logarithms of the factors: log(xy)=log(x)+log(y). Likewise for quotients: log(x/y)=log(x)-log(y). These relations were the main reason for inventing logarithms in the first place: They convert multiplication and division into the easier tasks of addition and subtraction.

W Coming and Going

There is an obvious family resemblance between single-U and W, between the equations eU = x and WeW = x. In the case of the forward W function, if we know how to calculate eW , then it's a trivial matter to calculate WeW : just multiply by W. The resulting curve is shown in the left part of Figure 2. In overall shape it looks much like the exponential curve, although for large W it rises more steeply. Where eW and WeW really part company is to the left of W=0. Whereas eW is always positive, WeW dips into negative territory, reaching a minimum at the point W=-1, x=-1/e. As W tends toward negative infinity, both eW and WeW approach 0, but one from above and the other from below.

Taking the inverse of this function—solving WeW = x for W instead of for x—finally brings us to the Lambert W function. Just how to solve for W is a matter I'll return to below, but for now it's enough to flip the graph of the function about its diagonal, as in the right side of Figure 2; the inverse graph is drawn in more detail in Figure 3. Just as the forward function resembles the exponential curve, the inverse function appears similar to the logarithm. The curves for log(x) and W(x) cross at x=e, where both are equal to 1. Where things get most interesting, again, is to the left of x=0. Whereas log(x) is undefined for any x ≤ 0, W(x) continues to have a value down to x=-1/e, or about -0.37. Indeed, when x lies in the range between -1/e and 0, W(x) has not just a value but two values. For example, W(-0.2) could be equal to either -0.26 or -2.54. Plugging either of these W values into the formula WeW yields the x value -0.2.

For a mathematical function, multiple values are an embarrassment of riches; a well-bred function is supposed to map each value in its domain to a single value in its range. But in practice multiple values are not uncommon, particularly with inverse functions. The square root is a familiar example: Whereas squaring 2 yields the unique result 4, the square root of 4 could be either +2 or -2. Some of the trigonometric functions are even worse. Every angle has just one sine, but the inverse function, the arc sine, wraps around to produce infinitely many values.

The problem with multivalued functions is knowing which value, or branch, to choose. Most calculators and programming languages give precedence to positive roots and to arc sine values between -90 and +90 degrees, but there is no fundamental justification for these choices. In the case of Lambert W, the part of the curve with W>-1 has been labeled the "principal branch," but again this is mainly a matter of convention. (In the complex plane, W has infinitely many branches.)

W—What Is It Good For?

The Lambert W function may make a pretty curve, but what's it good for? Why should anyone care? By mixing up a few symbols we could generate an endless variety of function definitions. What makes this one stand out from all the rest?

If you ask the same question of more familiar functions such as exp and log and square root, the answer is that those functions are tools useful in solving broad classes of mathematical problems. With just the four basic operations of arithmetic, you can represent the solution of any linear equation. Adding square roots to the toolbox allows you to solve quadratic equations as well. Expanding the kit to include the trigonometric, exponential and logarithmic functions brings still more problems within reach. All of these well-known functions, and perhaps a few more, are classified as "elementary." The exact membership of this category is not written in stone, but it excludes more specialized tools such as Bessel functions.

A few years ago, a brief, unsigned editorial in Focus, the newsletter of the Mathematical Association of America, asked: "Time for a new elementary function?" The function proposed for promotion to the core set was Lambert W. Whether W ultimately attains such canonical status will depend on whether the mathematical community at large finds it sufficiently useful, which won't be clear for some years. In the meantime, I can list a few applications of W discovered so far.

One place where W turns up in pure mathematics is the "power tower," the infinitely iterated exponential

For large x, this expression soars off to infinity faster than we can follow it, but Euler showed that the tower converges to a finite value in the domain between x = e-e (about 0.07) and x = e1/e (about 1.44). Within this realm, the value to which the infinite tower converges is W(-log(x))/-log(x).

W has another cameo role in the "omega constant," which is a distant of cousin of the golden ratio. The latter constant, with a value of about 1.618, is a solution of the quadratic equation 1/x= x-1. The omega constant is the solution of an exponential variant of this equation, to wit: 1/ex = x. And what is the value of that solution? It is W(1), equal to about 0.567143.

Of more practical import, W also appears in solutions to a large family of equations known as delay differential equations, which describe situations where the present rate of change in some quantity depends on the value of the quantity at an earlier moment. Behavior of this kind can be found in population dynamics, in economics, in control theory and even in the bathroom shower, where the temperature of the water now depends on the setting of the mixing valve a few moments ago. Many delay differential equations can be solved in terms of W; in some cases the two branches of the W function correspond to distinct physical solutions.

A recent article by Edward W. Packel and David S. Yuen of Lake Forest College applies the W function to the classical problem of describing the motion of a ballistic projectile in the presence of air resistance. In a vacuum, as Galileo knew, the ballistic path is a parabola, and the maximum range is attained when the projectile is launched at an angle of 45 degrees. Air resistance warps the symmetry of the curve and greatly complicates its mathematical description. Packel and Yuen show that the projectile's range can be given in terms of a W function, although the expression is still forbiddingly complex. (They remark: "Honesty compels us to admit at this point that the idea for using Lambert W to find a closed-form solution was really Mathematica's and not ours.")

Still another example comes from electrical engineering, where T. C. Banwell of Telcordia Technologies and A. Jayakumar of Anadigics show that a W function describes the relation between voltage, current and resistance in a diode. In a simple resistor, this relation is given by Ohm's law, I=V/R, where I is the current, V the voltage and R the resistance. In a diode, however, the relation is nonlinear: Although current still depends on voltage and resistance, the resistance in turn depends on current and voltage. Banwell and Jayakumar note that no explicit formula for the diode current can be constructed from the elementary functions, but adding W to the repertory allows a solution.

Other applications of W have been discovered in statistical mechanics, quantum chemistry, combinatorics, enzyme kinetics, the physiology of vision, the engineering of thin films, hydrology and the analysis of algorithms.

Evaluating W

It's all very well to express the solutions of problems in terms of W, but then how do we find the value of the resulting function? In the case of logarithms and trigonometric functions, the standard method for many years was to look up the answer in a big printed table; now we push the appropriate button on a calculator. For W, however, there are no published tables, and so far no scientific calculator has a built-in Lambert W key. Several computer algebra systems know how to evaluate the W function, but if you don't have access to such software, you're on your own.

Suppose we already know how to calculate exponentials and logarithms; can we then solve the equation WeW = x? As noted above, the forward version is easy: just evaluate eW and then multiply by W. At first glance, the inverse function looks like it might be wrestled to submission by a similar tactic. If we can solve for x by calculating an exponential and then multiplying, can't we solve for W by dividing and then taking a logarithm?

Dividing both sides of the equation by W gives eW = x/W. Then, taking the logarithm of both sides produces log(eW) = log(x/W). On the left hand side, the logarithm of eW is simply W. On the right hand side we can rewrite the logarithm of a quotient as the difference of two logarithms, and so we wind up with this equation:

W = log(x) - log(W).

We have succeeded in getting W off by itself on the left side, but unfortunately there's still a log(W) on the right. Thus we don't have a closed-form solution, a formula that would allow us to plug in an x and immediately get back the corresponding W. This failure is not merely a result of my ineptitude; no algebraic wizardry will yield a finite closed-form solution.

On the other hand, the equation above is not totally worthless. If we have a guess about the value of W, then we can plug it into the right hand side of the equation to get an even better guess, then repeat the process until we're satisfied with the accuracy of the approximation. For some values of x—well away from 0—this simple iterative scheme converges quickly on the correct result. The algorithms used in computer-algebra software are more efficient, accurate and robust, but they still rely on successive approximations.

Whence and Whither W?

The modern history of Lambert W began in the 1980s, when a version of the function was built into the Maple computer-algebra system and given the name W. Why W? An earlier publication by F. N. Fritsch, R. E. Shafer and W. P. Crowley of the Lawrence Livermore Laboratory had written the defining equation as wew = x. The Maple routine was written by Gaston H. Gonnet of the Institut für Wissenschaftliches Rechnen in Zurich, who adopted the letter w but because of typographic conventions in Maple had to capitalize it.

A few years later Robert M. Corless and David J. Jeffrey of the University of Western Ontario launched a discussion of W and its applications in what has turned out to be a long series of journal articles and less-formal publications. The most influential paper, issued as a preprint in 1993 but not published until 1996, was written by Corless and Jeffrey in collaboration with Gonnet, David E. G. Hare of the University of Waterloo and Donald E. Knuth of Stanford University. This was the paper that named the function in honor of the 18th-century savant Johann Heinrich Lambert.

Lambert, who wrote on everything from cartography to photometry to philosophy, never published a word on the function that now bears his name. It was his eminent colleague Leonhard Euler who first described a variant of the W function in a paper published in 1779, two years after Lambert's death. So why isn't it called the Euler W function? For one thing, Euler gave credit to Lambert for the earliest work on the subject. Perhaps more to the point, Corless, Jeffrey and Knuth note that "naming yet another function after Euler would not be useful."

In the years between Euler and Maple, the W function did not disappear entirely. The Dutch mathematician N. G. de Bruijn analyzed the equation in 1958, and the British mathematician E. M. Wright wrote on the subject at about the same time. In the 1970s and 80s there were several more contributions, including that of Fritsch, Shafer and Crowley. Nevertheless, the literature remained widely scattered and obscure until the function acquired a name. In a 1993 article, Corless, Gonnet, Hare and Jeffrey remark: "For a function, getting your own name is rather like Pinocchio getting to be a real boy."

Some of the recent publications on W go beyond mere explication of mathematics; they carry a whiff of evangelical fervor. Those for whom W is a favorite function want to see it elevated to the canon of standard textbook functions, alongside log and sine and square root. I am reminded of another kind of canonization—a campaign for the recognition of a local saint, with testimonials to good works and miracles performed.

The advocates of W do make a strong case. In a 2002 paper, Corless and Jeffrey argue that W is in some sense the smallest step beyond the present set of elementary functions. "The Lambert W function is the simplest example of the root of an exponential polynomial; and exponential polynomials are the next simplest class of functions after polynomials."

But the elevation of W has not won universal assent. R. William Gosper, Jr., has suggested that a better choice might be the square of W, that is, WeW2 = x, which eliminates the multivalued branching on the real line. (In a play on "Lambert W," Gosper calls this the Dilbert lambda function.) And Dan Kalman of American University has suggested a formulation based on eW/W = c, with an inverse function he calls glog.

Woo Woo Woo

My own misgivings about Lambert W pertain not to the function itself but to the name. Again: Why W? Over the years, English-speaking people have inflicted far too many Ws on the rest of the world, from the Wicked Witch of the West to the W boson to the World Wide Web. (Again I forgo comment on the current occupant of the White House.) We purse our lips painfully to pronounce doubleyou, doubleyou, doubleyou. With 26 letters to choose from, why do we keep fixing upon the only letter in the English alphabet with a polysyllabic name? (I acknowledge that I have made matters worse by writing this column, in which every sentence of the text includes at least one instance of the letter w.)

It's not too late to right the wrong. On a bus in Italy—a country that doesn't even have a w in its alfabeto—I overheard a fragment of a conversation: Someone was reading a URL and pronounced the first part "woo woo woo." It's a shrewd accommodation to linguistic wimperialism. We should all adopt it. Let us keep the letter but change the way we say it. Whether it's Lambert W or George W. or www, it's woo all the way.

Acknowledgments
For assistance with this article I offer warm thanks to Jonathan M. Borwein, David W. Cantrell, Robert M. Corless, David J. Jeffrey and Tim Royappa.