Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Chapter 5: Equation Solving

Equations express the equality of two expressions and are essential tools for modeling and solving real-world problems. While a function describes the relationship between variables, solving an equation means finding the variable values that make the equality true. This section focuses on linear and quadratic equations, highlighting how their solutions, i.e., the roots, connect algebraic methods with geometric interpretation. We will also extend these ideas to equation-solving for other basic classes of functions introduced earlier.

Earlier, we described functions and particular points associated with the graph of a function that are typically of interest due to what they represent. We described - and -intercepts, where:

  • The -intercepts are the points at which the output value is zero.
  • The -intercept is the point at which the function has an input value of zero.

Analytically, these points can be found by solving:

  • -intercepts: solve
  • -intercept: solve

Both of these tasks are examples of equation solving, i.e., we set a function equal to a specific value (often zero) and find the corresponding input(s). The process of solving an equation, therefore, has both an algebraic side (manipulating numbers and symbols to isolate the variable) and a geometric side (finding where a graph meets a horizontal or vertical axis).

Concept: Solving an Equation

Using algebraic properties, we "isolate" a particular variable on one side of the equality sign so that we obtain a solution in the form:

where "stuff" can be an expression containing numbers, constants, other variables, and mathematical operators such as addition, subtraction, multiplication, division, square root, and the like.

Solutions to Equations as Roots

The concept of a root is central to solving many types of equations, fundamentally linking algebraic solutions to graphical interpretations.

Definition: Root (or Zero) of a Function

A root of a function is a point such that .

Graphically, these are the points where the function's graph intersects the -axis (i.e., a root is synonymous with the function's -intercepts).

Any equation of the form can be transformed into the problem of finding the roots of a new function:

This means that solving for the equality of two functions is equivalent to finding the -intercepts of their difference.

Solving Linear Equations

In this section, we illustrate the equation-solving process for the case where the resulting function is linear. In such cases, solving is equivalent to finding the root of the linear function .

Example 1: Roots of a Linear Function

Left: Graphs of the given functions and . Right: Solving as a root-finding problem for the resulting linear function .

Consider the functions:

Here, is linear and is a constant function.

Finding the intersection of the graphs means determining such that:

We can convert this into a root-finding problem by moving all terms to one side, expressing the equation in the standard form :

Here, the left-hand side can be regarded as a new function . Finding its root is equivalent to solving the original equation:

The solution represents the point where the graphs of and intersect. In terms of the root-finding approach, this is the zero of , i.e., the value of for which crosses the -axis.

Solving Quadratic Equations

In this section, we illustrate the equation-solving process for the case where the resulting difference is quadratic. In such cases, solving is equivalent to finding the root of the quadratic function .

Example: Roots of a Quadratic Function

Left: Graphs of the given functions and . Right: Solving as a root-finding problem for the resulting quadratic function .

Consider the functions:

Here, and are both quadratic.

Finding the intersection of the graphs means determining such that:

We convert this to a root-finding problem by moving everything to one side:

At this stage, we have reduced the problem to solving a quadratic equation:

There are two standard ways to find its roots:

  1. By factoring the quadratic expression into a product of two linear factors.
  2. By applying the quadratic formula, which works even when factoring is not straightforward.

In the examples that follows, we will illustrate both approaches, using the same function .

Solving Via Factorization

Factoring a quadratic expression means expressing it as a product of two linear factors. If this is possible, the zero product property can be applied:

This allows us to solve a quadratic equation by setting each factor equal to zero.

Example: Solving by Factorization

We are given the quadratic polynomial

and want to factorize it using the grouping method, which we learned about in the previous Chapter 4.

The expression contains three terms, but the grouping method requires four. Thus, the first step is to rewrite the trinomial as a four-term polynomial. We can do this using Algorithm 1 from Chapter 4.

Step 1:: Identify coefficients:

  • is the coefficient of the higest-order term
  • is the coefficient of the second-highest-order term
  • is the constant term

Step 2: Find two integers , such that

Choosing and satisfies these conditions since and .

Step 3: Rewrite the middle term using and :

Now we can apply the grouping method as described in Algorithm 2.

Step 1: Group the terms into pairs:

Step 2: Factor out the greatest common factor (GCF) from each group:

Step 3: A Common binomial factor appears:

Finally, we can now apply the zero product property to solve for :

Solving Via The Quadratic Formula

Another way to find the roots of is to apply the quadratic formula.

Definition: The Quadratic Formula

Consider the quadratic equation:

where . The solutions of this equation is given by the quadratic formula:

The discriminant determines the number of real solutions:

  • If : two distinct real solutions.
  • If : one real (repeated) solution.
  • If : no real solutions.

Note: the symbol in the formula above means that we consider the expression both when the square root positive and negative.

Example: The Quadratic Formula

To solve the quardratic equation

we set , , and in the formula:

Hence, we get:

These match the solutions obtained by factoring.

Factorized Form and Roots of a Polynomial

Just as quadratic equations can be expressed in factorized form as

higher-order polynomials can likewise be written as a product of linear factors. This leads us to the following definition.

Definition: Factorized Form of a Polynomial

A polynomial function of degree can be expressed as

where is the leading coefficient and each is a root (or zero) satisfying .

This form reveals several geometric features of the polynomial:

  • The number of factors equals the degree of the polynomial
  • Each root corresponds to an -intercept of the graph
  • The coefficient determines the vertical stretch and orientation of the curve. For example, changing its sign reflects the graph across the -axis.

The following examples illustrate how these properties appear graphically.

Examples: Factorized Form of a Polynomial

Polynomials in factorized form. Each root corresponds to an -intercept where .

Consider the first polynomial in the plot:

This function has three linear factors, so the polynomial is of degree three. The zeros, listed in the order they appear in the algebraic expression, are , , and . At each of these points, one factor becomes zero, defining an -intercept where the graph meets the -axis.

Now look at the second polynomial in the plot:

The only difference is the sign of the leading coefficient. Changing it from to reflects the entire graph across the -axis, while the zeros remain in the same order and at the same positions.

Finally, consider the third polynomial in the plot:

Here we have four linear factors, so the polynomial is of degree four. The zeros, again listed in the order of the factors, are , , , and . As before, each root defines an -intercept where the graph meets the -axis.

The Sign and Behavior of a Function Around Its Roots

Finding the roots of a function does more than just tell us where it intercepts the -axis: It also reveals where the function takes on positive or negative values.

By analyzing the sign of between its roots, we can determine on which intervals the function lies above or below the -axis, and thus describe its overall behavior.

The concepts of a function being Increasing on an Interval and Decreasing on an Interval further describe how the function behaves within those intervals, i.e., whether it rises or falls as changes.

These ideas are closely related: once the roots are known and the sign of is determined, examining whether the function is increasing or decreasing helps us describe its overall shape and how it varies. Together, they provide a more complete picture of a function’s behavior, even without graphing it.

Definition: Positive and Negative Intervals

Let be a real-valued function. We say that:

  • is positive on an interval if for all in that interval.
  • is negative on an interval if for all in that interval.

Graphically, this corresponds to whether the graph of the function lies above (positive) or below (negative) the -axis.

Because the sign of a function can only change at its roots, we can use the roots to divide the real line into intervals and then determine the sign of within each one.

Example: Determining the Function Sign

For our quadratic function , we found earlier, that the roots are:

These roots divide the real line into three intervals:

By testing a single point in each interval (for instance, ), we find:

IntervalTest ValueSign of Behavior
is positive
is negative
is positive

Inverse Functions

When solving an equation of the form:

we often want a general way to determine the input for any output value (in 's codomain). For some functions, it is possible to find another function that "reverses" the mapping performed by . This reversing function is called the inverse function.

Definition: Inverse of a Function

Let be a function. An inverse function satisfies:

for all and . This means that applying followed by (or vice versa) brings us back to the original value.

The inverse function essentially allows us to solve equations by applying to both sides:

However, not every function has an inverse. Understanding when an inverse exists is thus essential.

Warning: Existence Conditions and Common Mistakes

  • A function has an inverse only if it is bijective, that is:

    Otherwise, the mapping cannot be uniquely reversed.

  • The notation represents the inverse function, not the reciprocal: The superscript indicates reversal of the mapping, not exponentiation.

Definition: Identity Property of Inverses

The composition of a function and its inverse returns the identity function on the respective domains:

That is, the inverse of a function reverses the domain and codomain of . Graphically, the inverse corresponds to reflecting the graph of across the line .

Example: Finding an Inverse Function

Suppose is defined by

To find , solve for in terms of :

This expression tells us how to recover from a given output , so:

Now, suppose we want to solve the equation:

To determine for which value of the function is euqal to , we need to isolate on one side of the equality sign. Since we have already found the inverse of the function we can achieve this by applying to both sides:

Since reverses the action of , the left-hand side simplifies to :

In general, this is the reason for applying to both sides: it "undoes" on the side of the equality containing , essentially leaving alone.

Example: Verifying an Inverse Function

Consider the function in the earlier example, along with its inverse:

We can always check our work, by verifying the inverse properties, that is:

Doing so, we indeed see that:

Moreover, we see that:

Thus, and are true inverses: each "undoes" the other's operation. In particular, multiplies by and adds , while subtracts and divides by , reversing the steps in the opposite order.

Common Inverses

Each of the examples given below show frequently used function-inverse pairs with their domains and ranges.

Example: Linear Shift

Example: Linear Scaling

Example: Power and Root Functions

Logarithm Rules

Since logarithms are inverse functions of exponentials, each rule in the table above can be derived directly from the exponent rules defined in Chapter 5.

Example: ...

Function Inverse Domain of Range of
,
,
, odd
, even (principal root)
,
,

Logarithm Rules

For , , , and , the most important rules are given in the following table.

RuleFormulaDescription
Product RuleThe logarithm of a product equals the sum of the logarithms.
Quotient RuleThe logarithm of a quotient equals the difference of the logarithms.
Power RuleA power in the argument becomes a multiplier in front of the logarithm.
Logarithm of 1Any base raised to the power 0 equals 1.
Logarithm of the BaseAny base raised to the power 1 equals itself.
Inverse PropertyExponential and logarithmic functions cancel each other.
Natural Log of Since means .
Change of BaseConverts a logarithm from one base to another.

Note: The natural logarithm is simply , where is Euler's number. All these rules work the same way for as for for any base , .

Since logarithms are inverse functions of exponentials, each rule in the table above can be derived directly from the exponent rules defined in Chapter 5.

Solving Non-Linear Equations

Many equations in mathematics involve non-linear functions such as exponentials and logarithms. The solving principles remain the same: we transform the equation into an equivalent one where the variable of interest is isolated, checking that the solution satisfies any domain restrictions.

A common strategy for solving these equations is to undo an operation using its inverse. In this context, we can make direct use of the inverse function pairs introduced earlier. In particular, when the variable appears in an exponent, we apply a logarithm to both sides, and when it appears inside a logarithm, we apply an exponential.

Example: Solving an Exponential Equation

Let us solve the equation , for .

Example: Solving a Logarithmic Equation

Solve the equation for . Assume that , as the logarithm otherwise is not defined. We obtain:

Determining Whether a Relation is a Function

Graphically

A relation in which each -coordinate is matched with exactly one -coordinate is said to describe as a function of . This also means that, if the same -coordinate is associated with two different -coordinates, then the relation is not a function.

Example: Checking Functional Relations

Which of the following relations descbribe as a function of ?

Inspecting the points of reveals that the -coordinate is matched with two different -coordinates: Namely and . Hence in , y is not a function of . On the other hand, every -coordinate in occurs only once which means each -coordinate has only one corresponding -coordinate. So, does represent as a function of . We can verify this graphically as well:

fails (same with different ); passes (each has one )

The Vertical Line Test

More generally, this also leads to the vertical line test, which is a quick graphical method to decide whether a relation is a function.

Definition: Vertical Line Test

Polynomial (function) passes the test; circle (not a function) fails. Intersection points are marked.

A relation is a function if and only if every vertical line intersects its graph at most once.

If a vertical line intersects more than once, the relation assigns more than one output to the same input thus violating the definition of a function.

It is important to note that equations can describe valid relationships—like the shape of a circle—but do not define a function. Recognizing this helps us understand both the limits of function notation and the situations where we need use other representations (such as parametric or implicit forms).

Algebraically

We can also check whether an equation defines a function by solving for one variable in terms of the other. If solving produces more than one output value for the same input, then the relation is not a function.

Example: Equation That Is Not a Function

Does the equation represent a function with as input and as output? If so, express the relationship as a function .

Solution:

First we subtract from both sides:

We now try to solve for in this equation:

so, and . We get two outputs corresponding to the same input, so this relationship cannot be represented as a single function .