False position method: Difference between revisions

From Calculus
No edit summary
Line 4: Line 4:


'''False position method''' is a [[root-finding algorithm]] that is qualitative similar to the [[bisection method]] in that it uses nested intervals based on opposite signs at the endpoints to converge to a root, but is computationally based on the [[secant method]].
'''False position method''' is a [[root-finding algorithm]] that is qualitative similar to the [[bisection method]] in that it uses nested intervals based on opposite signs at the endpoints to converge to a root, but is computationally based on the [[secant method]].
Unless otherwise specified, the function will be denoted <math>f</math>.


==Initial exploratory phase==
==Initial exploratory phase==
Line 49: Line 51:


Note that since <math>f(a_{n-1})</math> and <math>f(b_{n-1})</math> have opposite signs, this is a convex combination, and therefore <math>x_n \in [a_{n-1},b_{n-1}]</math>.
Note that since <math>f(a_{n-1})</math> and <math>f(b_{n-1})</math> have opposite signs, this is a convex combination, and therefore <math>x_n \in [a_{n-1},b_{n-1}]</math>.
* In the case that <math>f(x_n) = 0</math> (or is numerically indistinguishable from zero), we declare that as the root and terminate the algorithm.
* In the case that <math>f(x_n)</math> has opposite sign to <math>f(a_{n-1})</math> (and therefore the same sign as <math>f(b_{n-1})</math>), the new interval <math>[a_n,b_n] = [a_{n-1},x_n]</math>. Explicitly, <math>a_n = a_{n-1}</math> and <math>b_n = x_n</math>.
* In the case that <math>f(x_n)</math> has opposite sign to <math>f(b_{n-1})</math> (and therefore the same sign as <math>f(a_{n-1})</math>), the new interval <math>[a_n,b_n] = [x,b_{n-1}]</math>. Explicitly, <math>a_n = x_n</math> and <math>b_n = b_{n-1}</math>.
The values of <math>x_n</math> as obtained here are the same as the values of <math>x_n</math> in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.


* In the case that <math>f(x)</math> has opposite sign to <math>f(a_{n-1})</math> (and therefore the same sign as <math>f(b_{n-1})</math>), the new interval <math>[a_n,b_n] = [a_{n-1},x_n]</math>. Explicitly, <math>a_n = a_{n-1}</math> and <math>b_n = x_n</math>.
==Convergence rate==
* In the case that <math>f(x)</math> has opposite sign to <math>f(b_{n-1})</math> (and therefore the same sign as <math>f(a_{n-1})</math>), the new interval <math>[a_n,b_n] = [x,b_{n-1}]</math>. Explicitly, <math>a_n = x_n</math> and <math>b_n = b_{n-1}</math>.


The values of <math>x_n</math> as obtained here are the same as the values of <math>x_n</math> in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.
===Convergence for linear functions===
 
In the case that <math>f</math> is linear, we reach the root in one iteration after we have found values where the function has opposite signs. In fact, even if we applied the [[secant method]] directly, we would terminate in one step.
 
===Convergence for twice differentiable functions===
 
{{fillin}}

Revision as of 17:15, 26 April 2014

This article is about a root-finding algorithm. See all root-finding algorithms

Definition

False position method is a root-finding algorithm that is qualitative similar to the bisection method in that it uses nested intervals based on opposite signs at the endpoints to converge to a root, but is computationally based on the secant method.

Unless otherwise specified, the function will be denoted f.

Initial exploratory phase

The exploratory phase of the false position method involves finding a pair of input values at which the function has opposite signs. This could be done by running the usual secant method and evaluating at each stage until we get to opposite signs, or by some other means. Once we have found two points in the domain at which the function value has opposite signs, we are ready to begin the false position method proper.

In the point estimate version of the iterative step, we will label these two initial guesses as x0 and x1 (so that f(x0) and f(x1) have opposite sign). In the nested interval version, we will label the smaller of them as a0 and the larger as b0.

Iterative step

At stage n, for n2, we find the largest k for which f(xk) has sign opposite to f(xn1). We then define:

xn:=xkf(xn1)xn1f(xk)f(xn1)f(xk)

More details are below.

Prior knowledge: Prior to beginning stage n:

  • We have a previous set of guesses x0,x1,,xn1.
  • We know that f(x0) and f(x1) have opposite sign to each other (this is the initial condition). Therefore, among the values f(x0),f(x1),,f(xn1), we know that there are both positive and negative values.

Iterative step:

  • We find the largest k such that f(xk) and f(xn1) have opposite sign.
  • We compute:

xn:=xkf(xn1)xn1f(xk)f(xn1)f(xk)

Iterative step in nested interval description

We start with an initial interval [a0,b0] (here a0 is the smaller of x0 and x1 from the original setup, and b0 is the larger of them).

At stage n for n a positive integer

Prior knowledge:

  • f(an1) and f(bn1) are both numerically distinguishable from zero (so they have defined signs, positive or negative) and they have opposite signs.
  • Combining that with the fact that f is a continuous function, the intermediate value theorem tells us that f has a root on [an1,bn1].

Iterative step:

  • Compute:

xn:=bn1f(an1)an1f(bn1)f(an1)f(bn1)

Note that since f(an1) and f(bn1) have opposite signs, this is a convex combination, and therefore xn[an1,bn1].

  • In the case that f(xn)=0 (or is numerically indistinguishable from zero), we declare that as the root and terminate the algorithm.
  • In the case that f(xn) has opposite sign to f(an1) (and therefore the same sign as f(bn1)), the new interval [an,bn]=[an1,xn]. Explicitly, an=an1 and bn=xn.
  • In the case that f(xn) has opposite sign to f(bn1) (and therefore the same sign as f(an1)), the new interval [an,bn]=[x,bn1]. Explicitly, an=xn and bn=bn1.

The values of xn as obtained here are the same as the values of xn in the preceding description. The main advantage of this description is that we are storing the intervals as we construct them rather than merely the point estimates, so that some aspects of how the procedure works are more transparent.

Convergence rate

Convergence for linear functions

In the case that f is linear, we reach the root in one iteration after we have found values where the function has opposite signs. In fact, even if we applied the secant method directly, we would terminate in one step.

Convergence for twice differentiable functions

Fill this in later