3.1 Theory of Linear Equations

INTRODUCTION

We turn now to differential equations of order two or higher. In this section we will examine some of the underlying theory of linear DEs. Then in the five sections that follow we learn how to solve linear higher-order differential equations.

3.1.1. Initial-Value and Boundary-Value Problems

Initial-Value Problem

In Section 1.2 we defined an initial-value problem for a general nth-order differential equation. For a linear differential equation, an nth-order initial-value problem (IVP) is

(1)

Recall that for a problem such as this, we seek a function defined on some interval I containing x0 that satisfies the differential equation and the n initial conditions specified at x0: y(x0) = y0, y′(x0) = y1, …, y(n−1)(x0) = yn−1. We have already seen that in the case of a second-order initial-value problem, a solution curve must pass through the point (x0, y0) and have slope y1 at this point.

Existence and Uniqueness

In Section 1.2 we stated a theorem that gave conditions under which the existence and uniqueness of a solution of a first-order initial-value problem were guaranteed. The theorem that follows gives sufficient conditions for the existence of a unique solution of the problem in (1).

THEOREM 3.1.1 Existence of a Unique Solution

Let an(x), an−1(x), …, a1(x), a0(x), and g(x) be continuous on an interval I, and let an(x) ≠ 0 for every x in this interval. If x = x0 is any point in this interval, then a solution y(x) of the initial-value problem (1) exists on the interval and is unique.

EXAMPLE 1 Unique Solution of an IVP

The initial-value problem

3y‴ + 5y″y′ + 7y = 0, y(1) = 0, y′(1) = 0, y″(1) = 0

possesses the trivial solution y = 0. Since the third-order equation is linear with constant coefficients, it follows that all the conditions of Theorem 3.1.1 are fulfilled. Hence y = 0 is the only solution on any interval containing x = 1.

EXAMPLE 2 Unique Solution of an IVP

You should verify that the function y = 3e2x + e−2x − 3x is a solution of the initial-value problem

y″ − 4y = 12x, y(0) = 4, y′(0) = 1.

Now the differential equation is linear, the coefficients as well as g(x) = 12x are continuous, and a2(x) = 1 ≠ 0 on any interval I containing x = 0. We conclude from Theorem 3.1.1 that the given function is the unique solution on I.

The requirements in Theorem 3.1.1 that ai(x), i = 0, 1, 2, …, n be continuous and an(x) ≠ 0 for every x in I are both important. Specifically, if an(x) = 0 for some x in the interval, then the solution of a linear initial-value problem may not be unique or even exist. For example, you should verify that the function y = cx2 + x + 3 is a solution of the initial-value problem

x2y″ − 2xy′ + 2y = 6, y(0) = 3, y′(0) = 1

on the interval (−∞, ∞) for any choice of the parameter c. In other words, there is no unique solution of the problem. Although most of the conditions of Theorem 3.1.1 are satisfied, the obvious difficulties are that a2(x) = x2 is zero at x = 0 and that the initial conditions are also imposed at x = 0.

Boundary-Value Problem

Another type of problem consists of solving a linear differential equation of order two or greater in which the dependent variable y or its derivatives are specified at different points. A problem such as

is called a two-point boundary-value problem, or simply a boundary-value problem (BVP). The prescribed values y(a) = y0 and y(b) = y1 are called boundary conditions (BC). A solution of the foregoing problem is a function satisfying the differential equation on some interval I, containing a and b, whose graph passes through the two points (a, y0) and (b, y1). See FIGURE 3.1.1.

Five curves are graphed on an x y coordinate plane. The graph has a shaded region of width I marked on the x axis. The curves are inside the region. The first curve starts just above the x axis, goes up and to the right through the marked point (a, y subscript 0), reaches a high point, goes down and to the right through the marked point (b, y subscript 1), and ends on the x axis. The second curve starts above the first curve, goes down and to the right through the marked point (a, y subscript 0), reaches a low point, goes up and to the right through the marked point (b, y subscript 1), and ends above the first curve. The third curve starts above the second curve, goes up to the right, then goes down to the right and ends above the second curve. There are two curves above the third curve. The first one starts above the third curve, goes down to the right, reaches a low point and then goes up to the right. The second one starts at the top, goes up to the right, and goes down and to the right. The curve is labeled: solutions of the D E.

FIGURE 3.1.1 Colored curves are solutions of a BVP

For a second-order differential equation, other pairs of boundary conditions could be

y′(a) = y0, y(b) = y1

y(a) = y0, y′(b) = y1

y′(a) = y0, y′(b) = y1,

where y0 and y1 denote arbitrary constants. These three pairs of conditions are just special cases of the general boundary conditions

A1y(a) + B1y′(a) = C1

A2y(b) + B2y′(b) = C2.

The next example shows that even when the conditions of Theorem 3.1.1 are fulfilled, a boundary-value problem may have several solutions (as suggested in Figure 3.1.1), a unique solution, or no solution at all.

EXAMPLE 3 A BVP Can Have Many, One, or No Solutions

In Example 10 of Section 1.1 we saw that the two-parameter family of solutions of the differential equation x″ + 16x = 0 is

x = c1 cos 4t +c2 sin 4t. (2)

(a) Suppose we now wish to determine that solution of the equation that further satisfies the boundary conditions x(0) = 0, x(π/2) = 0. Observe that the first condition 0 = c1 cos 0 + c2 sin 0 implies c1 = 0, so that x = c2 sin 4t. But when t = π/2, 0 = c2 sin 2π is satisfied for any choice of c2 since sin 2π = 0. Hence the boundary-value problem

(3)

has infinitely many solutions. FIGURE 3.1.2 shows the graphs of some of the members of the one-parameter family x = c2 sin 4t that pass through the two points (0, 0) and (π/2, 0).

(b) If the boundary-value problem in (3) is changed to

, (4)

then x(0) = 0 still requires c1 = 0 in the solution (2). But applying x(π/8) = 0 to x = c2 sin 4t demands that 0 = c2 sin(π/2) = c2 · 1. Hence x = 0 is a solution of this new boundary-value problem. Indeed, it can be proved that x = 0 is the only solution of (4).

(c) Finally, if we change the problem to

x″ + 16x = 0, x(0) = 0, x(π/2) = 1, (5)

The graph has four quadrants and consists of four curves and a line on the t x coordinate plane. The curves follow wave pattern. The first curve enters the third quadrant, goes up and to the right through the origin (0, 0), and reaches a high point in the first quadrant. Then, it goes down and to the right through the positive t axis, reaches a low point in the fourth quadrant, again goes up and to the right through the point (pi over 2, 0), and exits the first quadrant. The curve is labeled: c subscript 2 = 1. The second curve enters the third quadrant, goes up and to the right through the origin (0, 0), and reaches a high point in the first quadrant below the first curve. Then, it goes down and to the right through the positive t axis, reaches a low point in the fourth quadrant above the first curve, again goes up and to the right through the point (pi over 2, 0), and exits the first quadrant. The curve is labeled: c subscript 2 = 1 over 2. The third curve enters the third quadrant, goes up and to the right through the origin (0, 0), and reaches a high point in the first quadrant below the second curve. Then, it goes down and to the right through the positive t axis, reaches a low point in the fourth quadrant above the second curve, again goes up and to the right through the point (pi over 2, 0), and exits the first quadrant. The curve is labeled: c subscript 2 = 1 over 4. The line is horizontal and enters the left on the negative t axis, goes to the right through the points (0, 0) and (pi over 2, 0), and exits the right. It is labeled: c subscript 2 = 0. The fourth curve enters the second quadrant, goes down and to the right through the origin (0, 0), and reaches a low point in the fourth quadrant. Then, it goes up and to the right through the positive t axis, reaches a high point in the first quadrant, again goes down and to the right through the point (pi over 2, 0), and exits the fourth quadrant. The curve is labeled: c subscript 2 = negative 1 over 2.

FIGURE 3.1.2 The BVP in (3) of Example 3 has many solutions

we find again that c1 = 0 from x(0) = 0, but that applying x(π/2) = 1 to x = c2 sin 4t leads to the contradiction 1 = c2 sin 2π = c2 · 0 = 0. Hence the boundary-value problem (5) has no solution.

3.1.2 Homogeneous Equations

A linear nth-order differential equation of the form

Note y = 0 is always a solution of a homogeneous linear equation.

(6)

is said to be homogeneous, whereas an equation

(7)

with g(x) not identically zero is said to be nonhomogeneous. For example, 2y″ + 3y′ − 5y = 0 is a homogeneous linear second-order differential equation, whereas x2y‴ + 6y′ + 10y = ex is a nonhomogeneous linear third-order differential equation. The word homogeneous in this context does not refer to coefficients that are homogeneous functions as in Section 2.5; rather, the word has exactly the same meaning as in Section 2.3.

We shall see that in order to solve a nonhomogeneous linear equation (7), we must first be able to solve the associated homogeneous equation (6).

To avoid needless repetition throughout the remainder of this section, we shall, as a matter of course, make the following important assumptions when stating definitions and theorems about the linear equations (6) and (7). On some common interval I,

Remember these assumptions in the definitions and theorems of this chapter.

  • the coefficients ai(x), i = 0, 1, 2, …, n, are continuous;
  • the right-hand member g(x) is continuous; and
  • an(x) ≠ 0 for every x in the interval.

Differential Operators

In calculus, differentiation is often denoted by the capital letter D; that is, dy/dx = Dy. The symbol D is called a differential operator because it transforms a differentiable function into another function. For example, D(cos 4x) = −4 sin 4x, and D(5x3 − 6x2) = 15x2 − 12x. Higher-order derivatives can be expressed in terms of D in a natural manner:

where y represents a sufficiently differentiable function. Polynomial expressions involving D, such as D + 3, D2 + 3D − 4, and 5x3D3 − 6x2D2 + 4xD + 9, are also differential operators. In general, we define an nth-order differential operator to be

L = an(x)Dn + an−1(x)Dn−1 + … + a1(x)D +a0(x). (8)

As a consequence of two basic properties of differentiation, D(cf(x)) = cDf(x), c a constant, and D{f(x) + g(x)} = Df(x) + Dg(x), the differential operator L possesses a linearity property; that is, L operating on a linear combination of two differentiable functions is the same as the linear combination of L operating on the individual functions. In symbols, this means

L{αf(x) + βg(x)} = αL(f(x)) + βL(g(x)), (9)

where α and β are constants. Because of (9) we say that the nth-order differential operator L is a linear operator.

Differential Equations

Any linear differential equation can be expressed in terms of the D notation. For example, the differential equation y″ + 5y′ + 6y = 5x − 3 can be written as D2y + 5Dy + 6y = 5x − 3 or (D2 + 5D + 6)y = 5x − 3. Using (8), the linear nth-order differential equations (6) and (7) can be written compactly as

L(y) = 0 and L(y) = g(x),

respectively.

Superposition Principle

In the next theorem we see that the sum, or superposition, of two or more solutions of a homogeneous linear differential equation is also a solution.

THEOREM 3.1.2 Superposition Principle—Homogeneous Equations

Let y1, y2, …, yk be solutions of the homogeneous nth-order differential equation (6) on an interval I. Then the linear combination

y = c1y1(x) + c2y2(x) + … + ckyk(x),

where the ci, i = 1, 2, …, k are arbitrary constants, is also a solution on the interval.

PROOF:

We prove the case k = 2. Let L be the differential operator defined in (8), and let y1(x) and y2(x) be solutions of the homogeneous equation L(y) = 0. If we define y = c1y1(x) + c2y2(x), then by linearity of L we have

L(y) = L{c1y1(x) + c2y2(x)} = c1L(y1) + c2L(y2) = c1 · 0 + c2 · 0 = 0.

Corollaries to Theorem 3.1.2

(a) A constant multiple y = c1y1(x) of a solution y1(x) of a homogeneous linear differential equation is also a solution.

(b) A homogeneous linear differential equation always possesses the trivial solution y = 0.

EXAMPLE 4 Superposition—Homogeneous DE

The functions y1 = x2 and y2 = x2 ln x are both solutions of the homogeneous linear equation x3y‴ − 2xy′ + 4y = 0 on the interval (0, ∞). By the superposition principle, the linear combination

y = c1x2 + c2x2 ln x

is also a solution of the equation on the interval.

The function y = e7x is a solution of y″ − 9y′ + 14y = 0. Since the differential equation is linear and homogeneous, the constant multiple y = ce7x is also a solution. For various values of c we see that y = 9e7x, y = 0, , …, are all solutions of the equation.

Linear Dependence and Linear Independence

The next two concepts are basic to the study of linear differential equations.

DEFINITION 3.1.1 Linear Dependence/Independence

A set of functions f1(x), f2(x), …, fn(x) is said to be linearly dependent on an interval I if there exist constants c1, c2, …, cn, not all zero, such that

c1f1(x) + c2f2(x) + … + cnfn(x) = 0

for every x in the interval. If the set of functions is not linearly dependent on the interval, it is said to be linearly independent.

In other words, a set of functions is linearly independent on an interval if the only constants for which

c1f1(x) + c2f2(x) + … + cnfn(x) = 0

for every x in the interval are c1 = c2 = … = cn = 0.

It is easy to understand these definitions in the case of two functions f1(x) and f2(x). If the functions are linearly dependent on an interval, then there exist constants c1 and c2 that are not both zero such that for every x in the interval c1f1(x) + c2f2(x) = 0. Therefore, if we assume that c1 ≠ 0, it follows that f1(x) = (−c2/c1)f2(x); that is,

If two functions are linearly dependent, then one is simply a constant multiple of the other.

Conversely, if f1(x) = c2f2(x) for some constant c2, then (−1) · f1(x) + c2f2(x) = 0 for every x on some interval. Hence the functions are linearly dependent, since at least one of the constants (namely, c1 = −1) is not zero. We conclude that:

Two functions are linearly independent when neither is a constant multiple of the other on an interval.

For example, the functions f1(x) = sin 2x and f2(x) = sin x cos x are linearly dependent on (−∞, ∞) because f1(x) is a constant multiple of f2(x). Recall from the double angle formula for the sine that sin 2x = 2 sin x cos x. On the other hand, the functions f1(x) = x and f2(x) = | x | are linearly independent on (−∞, ∞). Inspection of FIGURE 3.1.3 should convince you that neither function is a constant multiple of the other on the interval.

Part (a) has the graph of f subscript 1. A line is graphed on an x y coordinate plane. The line enters the bottom left in the third quadrant, goes up and to the right through the origin, and exits the top right in the first quadrant. It is labeled: f subscript 1 = x. Part (b) has the graph of f subscript 2. Two linear parts are graphed on an x y coordinate plane. The first part enters the top left in the second quadrant, goes down and to the right, and ends at the origin. The second part starts at the origin, goes up and to the right, and exits the top right in the first quadrant. It is labeled: f subscript 2 = mod(x).

FIGURE 3.1.3 The set consisting of f1 and f2 is linearly independent on (−∞, ∞)

It follows from the preceding discussion that the ratio f2(x)/f1(x) is not a constant on an interval on which f1(x) and f2(x) are linearly independent. This little fact will be used in the next section.

EXAMPLE 5 Linearly Dependent Functions

The functions f1(x) = cos2x,f2(x) = sin2x,f3(x) = sec2x,f4(x) = tan2x are linearly dependent on the interval (−π/2, π/2) since

c1 cos2x + c2 sin2x + c3 sec2x + c4 tan2x = 0,

when c1 = c2 = 1, c3 = −1, c4 = 1. We used here cos2x + sin2x = 1 and 1 + tan2x = sec2x for every number x in the interval.

A set of n functions f1(x), f2(x), …, fn(x) is linearly dependent on an interval I if at least one of the functions can be expressed as a linear combination of the remaining functions. For example, three functions f1(x), f2(x), and f3(x) are linearly dependent on I if at least one of these functions is a linear combination of the other two, say,

f3(x) = c1f1(x) + c2f2(x)

for all x in I. A set of n functions is linearly independent on I if no one function is a linear combination of the other functions.

EXAMPLE 6 Linearly Dependent Functions

The functions f1(x) = + 5, f2(x) = + 5x, f3(x) = x − 1, f4(x) = x2 are linearly dependent on the interval (0, ∞) since f2 can be written as a linear combination of f1, f3, and f4. Observe that

f2(x) = 1 · f1(x) + 5 · f3(x) + 0 · f4(x)

for every x in the interval (0, ∞).

Solutions of Differential Equations

We are primarily interested in linearly independent functions or, more to the point, linearly independent solutions of a linear differential equation. Although we could always appeal directly to Definition 3.1.1, it turns out that the question of whether n solutions y1, y2, …, yn of a homogeneous linear nth-order differential equation (6) are linearly independent can be settled somewhat mechanically using a determinant.

DEFINITION 3.1.2 Wronskian

Suppose each of the functions f1(x), f2(x), …, fn(x) possesses at least n −1 derivatives. The determinant

where the primes denote derivatives, is called the Wronskian of the functions.

The Wronskian determinant is named after the Polish philosopher, inventor, lawyer, physicist, and mathematician Józef Maria Hoëné-Wronski (1776–1853).

THEOREM 3.1.3 Criterion for Linearly Independent Solutions

Let y1, y2, …, yn be n solutions of the homogeneous linear nth-order differential equation (6) on an interval I. Then the set of solutions is linearly independent on I if and only if W(y1, y2, …, yn) ≠ 0 for every x in the interval.

It follows from Theorem 3.1.3 that when y1, y2, …, yn are n solutions of (6) on an interval I, the Wronskian W(y1, y2, …,yn) is either identically zero or never zero on the interval. Thus, if we can show that W(y1, y2, …, yn) ≠ 0 for some x0 in I, then the solutions y1, y2, …, yn are linearly independent on I. For example, the functions

are solutions of the differential equation

x2y″ + 7xy′ + 13y = 0

on the interval (0, ∞). Note that the coefficient functions a2(x) = x2, a1(x) = 7x, and a0(x) = 13 are continuous on (0, ∞) and that a2(x) ≠ 0 for every value of x in the interval. The Wronskian is

Rather than expanding this unwieldy determinant, we choose x = 1 in the interval (0, ∞) and find

The fact that W(y1(1), y2(1)) = 2 ≠ 0 is sufficient to conclude that y1(x) and y2(x) are linearly independent on (0, ∞).

A set of n linearly independent solutions of a homogeneous linear nth-order differential equation is given a special name.

DEFINITION 3.1.3 Fundamental Set of Solutions

Any set y1, y2, …, yn of n linearly independent solutions of the homogeneous linear nth-order differential equation (6) on an interval I is said to be a fundamental set of solutions on the interval.

The basic question of whether a fundamental set of solutions exists for a linear equation is answered in the next theorem.

THEOREM 3.1.4 Existence of a Fundamental Set

There exists a fundamental set of solutions for the homogeneous linear nth-order differential equation (6) on an interval I.

Analogous to the fact that any vector in three dimensions can be expressed uniquely as a linear combination of the linearly independent vectors i, j, k, any solution of an nth-order homogeneous linear differential equation on an interval I can be expressed uniquely as a linear combination of n linearly independent solutions on I. In other words, n linearly independent solutions y1, y2, …, yn are the basic building blocks for the general solution of the equation.

THEOREM 3.1.5 General Solution—Homogeneous Equations

Let y1, y2, …, yn be a fundamental set of solutions of the homogeneous linear nth-order differential equation (6) on an interval I. Then the general solution of the equation on the interval is

y = c1y1(x) + c2y2(x) + … + cnyn(x),

where ci, i = 1, 2, …, n are arbitrary constants.

Theorem 3.1.5 states that if Y(x) is any solution of (6) on the interval, then constants C1, C2, …, Cn can always be found so that

Y(x) = C1y1(x) + C2y2(x) + … + Cn yn(x).

We will prove the case when n = 2.

PROOF:

Let Y be a solution and y1 and y2 be linearly independent solutions of a2y″ + a1y′ + a0y = 0 on an interval I. Suppose x = t is a point in I for which W(y1(t), y2(t)) ≠ 0. Suppose also that Y(t) = k1 and Y′ (t) = k2. If we now examine the equations

it follows that we can determine C1 and C2 uniquely, provided that the determinant of the coefficients satisfies

But this determinant is simply the Wronskian evaluated at x = t, and, by assumption, W ≠ 0. If we define G(x) = C1y1(x) + C2y2(x), we observe that G(x) satisfies the differential equation, since it is a superposition of two known solutions; G(x) satisfies the initial conditions

Y(x) satisfies the same linear equation and the same initial conditions. Since the solution of this linear initial-value problem is unique (Theorem 3.1.1), we have Y(x) = G(x) or Y(x) = C1y1(x) + C2y2(x).

EXAMPLE 7 General Solution of a Homogeneous DE

The functions y1 = e3x and y2 = e−3x are both solutions of the homogeneous linear equation y″ − 9y = 0 on the interval (−∞, ∞). By inspection, the solutions are linearly independent on the x-axis. This fact can be corroborated by observing that the Wronskian

for every x. We conclude that y1 and y2 form a fundamental set of solutions, and consequently y = c1e3x + c2e−3x is the general solution of the equation on the interval (−, ).

EXAMPLE 8 A Solution Obtained from a General Solution

The function y = 4 sinh 3x − 5e3x is a solution of the differential equation y″ − 9y = 0 in Example 7. (Verify this.) In view of Theorem 3.1.5, we must be able to obtain this solution from the general solution y = c1e3x + c2e−3x. Observe that if we choose c1 = 2 and c2 = −7, then y = 2e3x − 7e3x can be rewritten as

The last expression is recognized as y = 4 sinh 3x − 5e−3x.

EXAMPLE 9 General Solution of a Homogeneous DE

The functions y1 = ex, y2 = e2x, and y3 = e3x satisfy the third-order equation

y‴ − 6y″ + 11y′ − 6y = 0.

Since

for every real value of x, the functions y1, y2, and y3 form a fundamental set of solutions on (−∞, ∞). We conclude that y = c1ex + c2e2x + c3e3x is the general solution of the differential equation on the interval (−, ).

3.1.3 Nonhomogeneous Equations

Any function yp free of arbitrary parameters that satisfies (7) is said to be a particular solution of the equation. For example, it is a straightforward task to show that the constant function yp = 3 is a particular solution of the nonhomogeneous equation y″ + 9y = 27.

Now if y1, y2, …, yk are solutions of (6) on an interval I and yp is any particular solution of (7) on I, then the linear combination

y = c1y1(x) + c2y2(x) + … + ckyk(x) + yp(x) (10)

is also a solution of the nonhomogeneous equation (7). If you think about it, this makes sense, because the linear combination c1y1(x) + c2y2(x) + … + ckyk(x) is mapped into 0 by the operator L = anDn + an−1Dn−1 + … + a1D + a0, whereas yp is mapped into g(x). If we use k = n linearly independent solutions of the nth-order equation (6), then the expression in (10) becomes the general solution of (7).

THEOREM 3.1.6 General Solution—Nonhomogeneous Equations

Let yp be any particular solution of the nonhomogeneous linear nth-order differential equation (7) on an interval I, and let y1, y2, …, yn be a fundamental set of solutions of the associated homogeneous differential equation (6) on I. Then the general solution of the equation on the interval is

y = c1y1(x) + c2y2(x) + … + cnyn(x) + yp(x),

where the ci, i = 1, 2, …, n are arbitrary constants.

PROOF:

Let L be the differential operator defined in (8), and let Y(x) and yp(x) be particular solutions of the nonhomogeneous equation L(y) = g(x). If we define u(x) = Y(x) − yp(x), then by linearity of L we have

L(u) = L{Y(x) − yp(x)} = L(Y(x)) − L(yp(x)) = g(x) − g(x) = 0.

This shows that u(x) is a solution of the homogeneous equation L(y) = 0. Hence, by Theorem 3.1.5, u(x) = c1y1(x) + c2y2(x) + … + cnyn(x), and so

Y(x) − yp(x) = c1y1(x) + c2y2(x) + … + cnyn(x)

or

Y(x) = c1y1(x) + c2y2(x) + … + cnyn(x) + yp(x).

Complementary Function

We see in Theorem 3.1.6 that the general solution of a nonhomogeneous linear equation consists of the sum of two functions:

y = c1y1(x) + c2y2(x) + … + cnyn(x) + yp(x) = yc(x) + yp(x).

The linear combination yc(x) = c1y1(x) + c2y2(x) + … + cnyn(x), which is the general solution of (6), is called the complementary function for equation (7). In other words, to solve a nonhomogeneous linear differential equation we first solve the associated homogeneous equation and then find any particular solution of the nonhomogeneous equation. The general solution of the nonhomogeneous equation is then

EXAMPLE 10 General Solution of a Nonhomogeneous DE

By substitution, the function yp = − x is readily shown to be a particular solution of the nonhomogeneous equation

y‴ − 6y″ + 11y′ − 6y = 3x. (11)

In order to write the general solution of (11), we must also be able to solve the associated homogeneous equation

y‴ − 6y″ + 11y′ − 6y = 0.

But in Example 9 we saw that the general solution of this latter equation on the interval (−∞, ∞) was yc = c1ex + c2e2x + c3e3x. Hence the general solution of (11) on the interval is

y = yc + yp = c1ex + c2e2x + c3e3x

Another Superposition Principle

The last theorem of this discussion will be useful in Section 3.4, when we consider a method for finding particular solutions of nonhomogeneous equations.

THEOREM 3.1.7 Superposition Principle—Nonhomogeneous Equations

Let yp1, yp2, …, ypk be k particular solutions of the nonhomogeneous linear nth-order differential equation (7) on an interval I corresponding, in turn, to k distinct functions g1, g2, …, gk. That is, suppose ypi denotes a particular solution of the corresponding differential equation

an(x)y(n) + an−1(x)y(n−1) + … + a1(x)y′ + a0(x)y = gi(x), (12)

where i = 1, 2, …, k. Then

yp(x) = yp1(x) + yp2(x) + … + ypk(x) (13)

is a particular solution of

an(x)y(n) + an−1(x)y(n−1) + … + a1(x)y′ + a0(x)y

= g1(x) + g2(x) + … + gk(x). (14)

PROOF:

We prove the case k = 2. Let L be the differential operator defined in (8), and let yp1(x) and yp2(x) be particular solutions of the nonhomogeneous equations L(y) = g1(x) and L(y) = g2(x), respectively. If we define yp(x) = (x) + (x), we want to show that yp is a particular solution of L(y) = g1(x) + g2(x). The result follows again by the linearity of the operator L:

L(yp) = L{(x) + (x)} = L((x)) + L((x)) = g1(x) + g2(x).

EXAMPLE 11 Superposition—Nonhomogeneous DE

You should verify that

yp1 = −4x2 is a particular solution of y″ − 3y′ + 4y = −16x2 + 24x − 8,

yp2 = e2x is a particular solution of y″ − 3y′ + 4y = 2e2x,

yp3 = xex is a particular solution of y″ − 3y′ + 4y = 2xexex.

It follows from Theorem 3.1.7 that the superposition of , , and ,

y = yp1 + yp2 + yp3 = −4x2 + e2x + xex,

is a solution of

If the are particular solutions of (12) for i = 1, 2, …, k, then the linear combination

This sentence is a generalization of Theorem 3.1.7.

where the ci are constants, is also a particular solution of (14) when the right-hand member of the equation is the linear combination

c1g1(x) + c2g2(x) + … + ckgk(x).

Before we actually start solving homogeneous and nonhomogeneous linear differential equations, we need one additional bit of theory presented in the next section.

REMARKS

This remark is a continuation of the brief discussion of dynamical systems given at the end of Section 1.3.

A dynamical system whose rule or mathematical model is a linear nth-order differential equation

an(t)y(n) + an−1(t)y(n−1) + … + a1(t)y′ + a0(t)y = g(t)

is said to be a linear system. The set of n time-dependent functions y(t), y′(t), …, y(n−1)(t) are the state variables of the system. Recall, their values at some time t give the state of the system. The function g is variously called the input function or forcing function. A solution y(t) of the differential equation is said to be the output or response of the system. Under the conditions stated in Theorem 3.1.1, the output or response y(t) is uniquely determined by the input and the state of the system prescribed at a time t0; that is, by the initial conditions y(t0), y′(t0), …, y(n−1)(t0).

In order that a dynamical system be a linear system, it is necessary that the superposition principle (Theorem 3.1.7) hold in the system; that is, the response of the system to a superposition of inputs is a superposition of outputs. We have already examined some simple linear systems in Section 2.7 (linear first-order equations); in Section 3.8 we examine linear systems in which the mathematical models are second-order differential equations.

3.1 Exercises Answers to selected odd-numbered problems begin on page ANS-5.

3.1.1 Initial-Value and Boundary-Value Problems

In Problems 1–4, the given family of functions is the general solution of the differential equation on the indicated interval. Find a member of the family that is a solution of the initial-value problem.

  1. y = c1ex + c2ex, (−∞, ∞); y″y = 0, y(0) = 0, y′(0) = 1
  2. y = c1e4x + c2ex, (−∞, ∞); y″ − 3y′ − 4y = 0, y(0) = 1, y′(0) = 2
  3. y = c1x + c2x ln x, (0, ∞); x2y″xy′ + y = 0, y(1) = 3, y′(1) = −1
  4. y = c1 + c2 cos x + c3 sin x, (−∞, ∞); y‴ + y′ = 0, y(π) = 0, y′(π) = 2, y″(π) = −1
  5. Given that y = c1 + c2x2 is a two-parameter family of solutions of xy″y′ = 0 on the interval (−∞, ∞), show that constants c1 and c2 cannot be found so that a member of the family satisfies the initial conditions y(0) = 0, y′(0) = 1. Explain why this does not violate Theorem 3.1.1.
  6. Find two members of the family of solutions in Problem 5 that satisfy the initial conditions y(0) = 0, y′(0) = 0.
  7. Given that x(t) = c1 cos ωt + c2 sin ωt is the general solution of x″ + ω2x = 0 on the interval (−∞, ∞), show that a solution satisfying the initial conditions x(0) = x0, x′(0) = x1, is given by

    x(t) = x0 cos ωt + sin ωt.

  8. Use the general solution of x″ + ω2x = 0 given in Problem 7 to show that a solution satisfying the initial conditions x(t0) = x0, x′(t0) = x1, is the solution given in Problem 7 shifted by an amount t0:

    x(t) = x0 cos ω (t − t0) + sin ω (t − t0).

In Problems 9 and 10, find an interval centered about x = 0 for which the given initial-value problem has a unique solution.

  1. (x − 2)y″ + 3y = x,y(0) = 0, y′(0) = 1
  2. y″ + (tan x)y = ex, y(0) = 1, y′(0) = 0
    1. Use the family in Problem 1 to find a solution of y″y = 0 that satisfies the boundary conditions y(0) = 0, y(1) = 1.
    2. The DE in part (a) has the alternative general solution y = c3 cosh x + c4 sinh x on (−∞, ∞). Use this family to find a solution that satisfies the boundary conditions in part (a).
    3. Show that the solutions in parts (a) and (b) are equivalent.
  3. Use the family in Problem 5 to find a solution of xy″y′ = 0 that satisfies the boundary conditions y(0) = 1, y′(1) = 6.

In Problems 13 and 14, the given two-parameter family is a solution of the indicated differential equation on the interval (−∞, ∞). Determine whether a member of the family can be found that satisfies the boundary conditions.

  1. y = c1ex cos x + c2ex sin x; y″ −2y′ + 2y = 0
    1. y(0) = 1, y′(π) = 0
    2. y(0) = 1, y(π) = −1
    3. y(0) = 1, y = 1
    4. y(0) = 0, y(π) = 0
  2. y = c1x2 + c2x4 + 3; x2y″ − 5xy′ + 8y = 24
    1. y(−1) = 0, y(1) = 4
    2. y(0) = 1, y(1) = 2
    3. y(0) = 3, y(1) = 0
    4. y(1) = 3, y(2) = 15

3.1.2 Homogeneous Equations

In Problems 15–22, determine whether the given set of functions is linearly dependent or linearly independent on the interval (−∞, ∞).

  1. f1(x) = x, f2(x) = x2, f3(x) = 4x −3x2
  2. f1(x) = 0, f2(x) = x, f3(x) = ex
  3. f1(x) = 5, f2(x) = cos2x, f3(x) = sin2x
  4. f1(x) = cos 2x, f2(x) = 1, f3(x) = cos2x
  5. f1(x) = x, f2(x) = x − 1, f3(x) = x + 3
  6. f1(x) = 2 + x, f2(x) = 2 + | x |
  7. f1(x) = 1 + x, f2(x) = x, f3(x) = x2
  8. f1(x) = ex, f2(x) = ex, f3(x) = sinh x

In Problems 23–30, verify that the given functions form a fundamental set of solutions of the differential equation on the indicated interval. Form the general solution of the equation.

  1. y″y′ − 12y = 0; e−3x, e4x, (−∞, ∞)
  2. y″ − 4y = 0; cosh 2x, sinh 2x, (−∞, ∞)
  3. y″ − 2y′ + 5y = 0; ex cos 2x,ex sin 2x, (−∞, ∞)
  4. 4y″ − 4y′ + y = 0; ex/2, xex/2, (−∞, ∞)
  5. x2y″ − 6xy′ + 12y = 0; x3, x4, (0, ∞)
  6. x2y″ + xy′ + y = 0; cos(ln x), sin(ln x), (0, ∞)
  7. x3y‴ + 6x2y″ + 4xy′ − 4y = 0; x, x−2, x−2 ln x, (0, ∞)
  8. y(4) + y″ = 0; 1, x, cos x, sin x, (−∞, ∞)

3.1.3 Nonhomogeneous Equations

In Problems 31–34, verify that the given two-parameter family of functions is the general solution of the nonhomogeneous differential equation on the indicated interval.

  1. y″ − 7y′ + 10y = 24ex;
    y = c1e2x + c2e5x + 6ex, (−∞, ∞)
  2. y″ + y = sec x;
    y = c1 cos x + c2 sin x + x sin x + (cos x) ln(cos x),
    (− π/2,π/2)
  3. y″ − 4y′ + 4y = 2e2x + 4x − 12;
    y = c1e2x + c2xe2x + x2e2x + x − 2, (−∞, ∞)
  4. 2x2y″ + 5xy′ + y = x2x;
    y = c1 + c2x−1 + x2 x, (0, ∞)
  5. (a) Verify that = 3e2x and = x2 + 3x are, respectively, particular solutions of

    y″ − 6y′ + 5y = −9e2x

    and

    y″ − 6y′ + 5y = 5x2 + 3x −16.

    (b) Use part (a) to find particular solutions of

    y″ − 6y′ + 5y = 5x2 + 3x − 16 − 9e2x

    and

    y″ − 6y′ + 5y = −10x2 − 6x + 32 + e2x.

  6. (a) By inspection, find a particular solution of

    y″ + 2y = 10.

    (b) By inspection, find a particular solution of

    y″ + 2y = −4x.

    (c) Find a particular solution of y″ + 2y = −4x + 10.

    (d) Find a particular solution of y″ + 2y = 8x + 5.

Discussion Problems

  1. Let n = 1, 2, 3, …. Discuss how the observations Dnxn−1 = 0 and Dnxn = n! can be used to find the general solutions of the given differential equations.
    1. y″ = 0
    2. y‴ = 0
    3. y(4) = 0
    4. y″ = 2
    5. y‴ = 6
    6. y(4) = 24
  2. Suppose that y1 = ex and y2 = ex are two solutions of a homogeneous linear differential equation. Explain why y3 = cosh x and y4 = sinh x are also solutions of the equation.
    1. Verify that y1 = x3 and y2 = | x |3 are linearly independent solutions of the differential equation x2y″ −4xy′ + 6y = 0 on the interval (−∞, ∞).
    2. For the functions y1 and y2 in part (a), show that W(y1, y2) = 0 for every real number x. Does this result violate Theorem 3.1.3? Explain.
    3. Verify that Y1 = x3 and Y2 = x2 are also linearly independent solutions of the differential equation in part (a) on the interval (−∞, ∞).
    4. Besides the functions y1, y2, Y1, and Y2 in parts (a) and (c), find a solution of the differential equation that satisfies y(0) = 0, y′(0) = 0.
    5. By the superposition principle, Theorem 3.1.2, both linear combinations y = c1y1 + c2y2 and Y = c1Y1 + c2Y2 are solutions of the differential equation. Discuss whether one, both, or neither of the linear combinations is a general solution of the differential equation on the interval (−∞, ∞).
  3. Is the set of functions f1(x) = ex+2, f2(x) = ex−3 linearly dependent or linearly independent on the interval (−∞, ∞)? Discuss.
  4. By substituting into the differential equation

    find four linearly independent solutions of the equation. Give the general solution of the equation on (−∞, ∞).

  5. Suppose y1, y2, …, yk are k linearly independent solutions on (−∞, ∞) of a homogeneous linear nth-order differential equation with constant coefficients. By Theorem 3.1.2 it follows that yk+1 = 0 is also a solution of the differential equation. Is the set of solutions y1, y2, …, yk, yk+1 linearly dependent or linearly independent on (−∞, ∞)? Discuss.
  6. Suppose that y1, y2, …, yk are k nontrivial solutions of a homogeneous linear nth-order differential equation with constant coefficients and that k = n + 1. Is the set of solutions y1, y2, …, yk linearly dependent or linearly independent on (−∞, ∞)? Discuss.
  7. If and are particular solutions of the nonhomogeneous linear differential equation (7), then show that the difference of these particular solutions is a solution of the associated homogeneous differential equation (6).