False position method: Difference between revisions

Revision as of 17:15, 26 April 2014

This article is about a root-finding algorithm. See all root-finding algorithms

Definition

False position method is a root-finding algorithm that is qualitative similar to the bisection method in that it uses nested intervals based on opposite signs at the endpoints to converge to a root, but is computationally based on the secant method.

Unless otherwise specified, the function will be denoted $f$ .

Initial exploratory phase

The exploratory phase of the false position method involves finding a pair of input values at which the function has opposite signs. This could be done by running the usual secant method and evaluating at each stage until we get to opposite signs, or by some other means. Once we have found two points in the domain at which the function value has opposite signs, we are ready to begin the false position method proper.

In the point estimate version of the iterative step, we will label these two initial guesses as $x_{0}$ and $x_{1}$ (so that $f (x_{0})$ and $f (x_{1})$ have opposite sign). In the nested interval version, we will label the smaller of them as $a_{0}$ and the larger as $b_{0}$ .

Iterative step

At stage $n$ , for $n \geq 2$ , we find the largest $k$ for which $f (x_{k})$ has sign opposite to $f (x_{n - 1})$ . We then define:

$x_{n} : = \frac{x_{k} f (x_{n - 1}) - x_{n - 1} f (x_{k})}{f (x_{n - 1}) - f (x_{k})}$

More details are below.

Prior knowledge: Prior to beginning stage $n$ :

We have a previous set of guesses $x_{0}, x_{1}, \dots, x_{n - 1}$ .
We know that $f (x_{0})$ and $f (x_{1})$ have opposite sign to each other (this is the initial condition). Therefore, among the values $f (x_{0}), f (x_{1}), \dots, f (x_{n - 1})$ , we know that there are both positive and negative values.

Iterative step:

We find the largest $k$ such that $f (x_{k})$ and $f (x_{n - 1})$ have opposite sign.
We compute:

$x_{n} : = \frac{x_{k} f (x_{n - 1}) - x_{n - 1} f (x_{k})}{f (x_{n - 1}) - f (x_{k})}$

Iterative step in nested interval description

We start with an initial interval $[a_{0}, b_{0}]$ (here $a_{0}$ is the smaller of $x_{0}$ and $x_{1}$ from the original setup, and $b_{0}$ is the larger of them).

At stage $n$ for $n$ a positive integer

Prior knowledge:

$f (a_{n - 1})$ and $f (b_{n - 1})$ are both numerically distinguishable from zero (so they have defined signs, positive or negative) and they have opposite signs.
Combining that with the fact that $f$ is a continuous function, the intermediate value theorem tells us that $f$ has a root on $[a_{n - 1}, b_{n - 1}]$ .

Iterative step:

Compute:

$x_{n} : = \frac{b_{n - 1} f (a_{n - 1}) - a_{n - 1} f (b_{n - 1})}{f (a_{n - 1}) - f (b_{n - 1})}$

Note that since $f (a_{n - 1})$ and $f (b_{n - 1})$ have opposite signs, this is a convex combination, and therefore $x_{n} \in [a_{n - 1}, b_{n - 1}]$ .

In the case that $f (x_{n}) = 0$ (or is numerically indistinguishable from zero), we declare that as the root and terminate the algorithm.
In the case that $f (x_{n})$ has opposite sign to $f (a_{n - 1})$ (and therefore the same sign as $f (b_{n - 1})$ ), the new interval $[a_{n}, b_{n}] = [a_{n - 1}, x_{n}]$ . Explicitly, $a_{n} = a_{n - 1}$ and $b_{n} = x_{n}$ .
In the case that $f (x_{n})$ has opposite sign to $f (b_{n - 1})$ (and therefore the same sign as $f (a_{n - 1})$ ), the new interval $[a_{n}, b_{n}] = [x, b_{n - 1}]$ . Explicitly, $a_{n} = x_{n}$ and $b_{n} = b_{n - 1}$ .

The values of $x_{n}$ as obtained here are the same as the values of $x_{n}$ in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.

Convergence rate

Convergence for linear functions

In the case that $f$ is linear, we reach the root in one iteration after we have found values where the function has opposite signs. In fact, even if we applied the secant method directly, we would terminate in one step.

Convergence for twice differentiable functions

Fill this in later

@@ Line 4: / Line 4: @@
 '''False position method''' is a [[root-finding algorithm]] that is qualitative similar to the [[bisection method]] in that it uses nested intervals based on opposite signs at the endpoints to converge to a root, but is computationally based on the [[secant method]].
+Unless otherwise specified, the function will be denoted <math>f</math>.
 ==Initial exploratory phase==
@@ Line 49: / Line 51: @@
 Note that since <math>f(a_{n-1})</math> and <math>f(b_{n-1})</math> have opposite signs, this is a convex combination, and therefore <math>x_n \in [a_{n-1},b_{n-1}]</math>.
+* In the case that <math>f(x_n) = 0</math> (or is numerically indistinguishable from zero), we declare that as the root and terminate the algorithm.
+* In the case that <math>f(x_n)</math> has opposite sign to <math>f(a_{n-1})</math> (and therefore the same sign as <math>f(b_{n-1})</math>), the new interval <math>[a_n,b_n] = [a_{n-1},x_n]</math>. Explicitly, <math>a_n = a_{n-1}</math> and <math>b_n = x_n</math>.
+* In the case that <math>f(x_n)</math> has opposite sign to <math>f(b_{n-1})</math> (and therefore the same sign as <math>f(a_{n-1})</math>), the new interval <math>[a_n,b_n] = [x,b_{n-1}]</math>. Explicitly, <math>a_n = x_n</math> and <math>b_n = b_{n-1}</math>.
+The values of <math>x_n</math> as obtained here are the same as the values of <math>x_n</math> in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.
-* In the case that <math>f(x)</math> has opposite sign to <math>f(a_{n-1})</math> (and therefore the same sign as <math>f(b_{n-1})</math>), the new interval <math>[a_n,b_n] = [a_{n-1},x_n]</math>. Explicitly, <math>a_n = a_{n-1}</math> and <math>b_n = x_n</math>.
+==Convergence rate==
-* In the case that <math>f(x)</math> has opposite sign to <math>f(b_{n-1})</math> (and therefore the same sign as <math>f(a_{n-1})</math>), the new interval <math>[a_n,b_n] = [x,b_{n-1}]</math>. Explicitly, <math>a_n = x_n</math> and <math>b_n = b_{n-1}</math>.
-The values of <math>x_n</math> as obtained here are the same as the values of <math>x_n</math> in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.
+===Convergence for linear functions===
+In the case that <math>f</math> is linear, we reach the root in one iteration after we have found values where the function has opposite signs. In fact, even if we applied the [[secant method]] directly, we would terminate in one step.
+===Convergence for twice differentiable functions===
+{{fillin}}