mirror of
				https://github.com/saitohirga/WSJT-X.git
				synced 2025-10-26 10:30:22 -04:00 
			
		
		
		
	
		
			
	
	
		
			378 lines
		
	
	
		
			17 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
		
		
			
		
	
	
			378 lines
		
	
	
		
			17 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
|  | [section:remez The Remez Method] | ||
|  | 
 | ||
|  | The [@http://en.wikipedia.org/wiki/Remez_algorithm Remez algorithm] | ||
|  | is a methodology for locating the minimax rational approximation | ||
|  | to a function.  This short article gives a brief overview of the method, but | ||
|  | it should not be regarded as a thorough theoretical treatment, for that you | ||
|  | should consult your favorite textbook. | ||
|  | 
 | ||
|  | Imagine that you want to approximate some function f(x) by way of a rational | ||
|  | function R(x), where R(x) may be either a polynomial P(x) or a ratio of two | ||
|  | polynomials P(x)/Q(x) (a rational function).  Initially we'll concentrate on the  | ||
|  | polynomial case, as it's by far the easier to deal with, later we'll extend  | ||
|  | to the full rational function case.   | ||
|  | 
 | ||
|  | We want to find the "best" rational approximation, where | ||
|  | "best" is defined to be the approximation that has the least deviation | ||
|  | from f(x).  We can measure the deviation by way of an error function: | ||
|  | 
 | ||
|  | E[sub abs](x) = f(x) - R(x) | ||
|  | 
 | ||
|  | which is expressed in terms of absolute error, but we can equally use | ||
|  | relative error: | ||
|  | 
 | ||
|  | E[sub rel](x) = (f(x) - R(x)) / |f(x)| | ||
|  | 
 | ||
|  | And indeed in general we can scale the error function in any way we want, it | ||
|  | makes no difference to the maths, although the two forms above cover almost | ||
|  | every practical case that you're likely to encounter. | ||
|  | 
 | ||
|  | The minimax rational function R(x) is then defined to be the function that | ||
|  | yields the smallest maximal value of the error function.  Chebyshev showed | ||
|  | that there is a unique minimax solution for R(x) that has the following | ||
|  | properties: | ||
|  | 
 | ||
|  | * If R(x) is a polynomial of degree N, then there are N+2 unknowns: | ||
|  | the N+1 coefficients of the polynomial, and maximal value of the error | ||
|  | function. | ||
|  | * The error function has N+1 roots, and N+2 extrema (minima and maxima). | ||
|  | * The extrema alternate in sign, and all have the same magnitude. | ||
|  | 
 | ||
|  | That means that if we know the location of the extrema of the error function | ||
|  | then we can write N+2 simultaneous equations: | ||
|  | 
 | ||
|  | R(x[sub i]) + (-1)[super i]E = f(x[sub i]) | ||
|  | 
 | ||
|  | where E is the maximal error term, and x[sub i] are the abscissa values of the | ||
|  | N+2 extrema of the error function.  It is then trivial to solve the simultaneous | ||
|  | equations to obtain the polynomial coefficients and the error term. | ||
|  | 
 | ||
|  | ['Unfortunately we don't know where the extrema of the error function are located!] | ||
|  | 
 | ||
|  | [h4 The Remez Method] | ||
|  | 
 | ||
|  | The Remez method is an iterative technique which, given a broad range of | ||
|  | assumptions, will converge on the extrema of the error function, and therefore | ||
|  | the minimax solution. | ||
|  | 
 | ||
|  | In the following discussion we'll use a concrete example to illustrate | ||
|  | the Remez method: an approximation to the function e[super x][space] over | ||
|  | the range \[-1, 1\]. | ||
|  | 
 | ||
|  | Before we can begin the Remez method, we must obtain an initial value | ||
|  | for the location of the extrema of the error function.  We could "guess" | ||
|  | these, but a much closer first approximation can be obtained by first   | ||
|  | constructing an interpolated polynomial approximation to f(x). | ||
|  | 
 | ||
|  | In order to obtain the N+1 coefficients of the interpolated polynomial | ||
|  | we need N+1 points (x[sub 0]...x[sub N]): with our interpolated form  | ||
|  | passing through each of those points | ||
|  | that yields N+1 simultaneous equations: | ||
|  | 
 | ||
|  | f(x[sub i]) = P(x[sub i]) = c[sub 0] + c[sub 1]x[sub i] ... + c[sub N]x[sub i][super N] | ||
|  | 
 | ||
|  | Which can be solved for the coefficients c[sub 0]...c[sub N] in P(x). | ||
|  | 
 | ||
|  | Obviously this is not a minimax solution, indeed our only guarantee is that f(x) and  | ||
|  | P(x) touch at N+1 locations, away from those points the error may be arbitrarily | ||
|  | large.  However, we would clearly like this initial approximation to be as close to | ||
|  | f(x) as possible, and it turns out that using the zeros of an orthogonal polynomial | ||
|  | as the initial interpolation points is a good choice.  In our example we'll use the  | ||
|  | zeros of a Chebyshev polynomial as these are particularly easy to calculate,  | ||
|  | interpolating for a polynomial of degree 4, and measuring /relative error/ | ||
|  | we get the following error function: | ||
|  | 
 | ||
|  | [$../graphs/remez-2.png] | ||
|  | 
 | ||
|  | Which has a peak relative error of 1.2x10[super -3]. | ||
|  | 
 | ||
|  | While this is a pretty good approximation already, judging by the  | ||
|  | shape of the error function we can clearly do better.  Before starting | ||
|  | on the Remez method propper, we have one more step to perform: locate | ||
|  | all the extrema of the error function, and store | ||
|  | these locations as our initial ['Chebyshev control points]. | ||
|  | 
 | ||
|  | [note | ||
|  | In the simple case of a polynomial approximation, by interpolating through | ||
|  | the roots of a Chebyshev polynomial we have in fact created a ['Chebyshev | ||
|  | approximation] to the function: in terms of /absolute error/ | ||
|  | this is the best a priori choice for the interpolated form we can | ||
|  | achieve, and typically is very close to the minimax solution. | ||
|  | 
 | ||
|  | However, if we want to optimise for /relative error/, or if the approximation | ||
|  | is a rational function, then the initial Chebyshev solution can be quite far | ||
|  | from the ideal minimax solution.   | ||
|  | 
 | ||
|  | A more technical discussion of the theory involved can be found in this | ||
|  | [@http://math.fullerton.edu/mathews/n2003/ChebyshevPolyMod.html online course].] | ||
|  | 
 | ||
|  | [h4 Remez Step 1] | ||
|  | 
 | ||
|  | The first step in the Remez method, given our current set of | ||
|  | N+2 Chebyshev control points x[sub i], is to solve the N+2 simultaneous | ||
|  | equations: | ||
|  | 
 | ||
|  | P(x[sub i]) + (-1)[super i]E = f(x[sub i]) | ||
|  | 
 | ||
|  | To obtain the error term E, and the coefficients of the polynomial P(x). | ||
|  | 
 | ||
|  | This gives us a new approximation to f(x) that has the same error /E/ at | ||
|  | each of the control points, and whose error function ['alternates in sign] | ||
|  | at the control points.  This is still not necessarily the minimax  | ||
|  | solution though: since the control points may not be at the extrema of the error | ||
|  | function.  After this first step here's what our approximation's error | ||
|  | function looks like: | ||
|  | 
 | ||
|  | [$../graphs/remez-3.png] | ||
|  | 
 | ||
|  | Clearly this is still not the minimax solution since the control points | ||
|  | are not located at the extrema, but the maximum relative error has now | ||
|  | dropped to 5.6x10[super -4]. | ||
|  | 
 | ||
|  | [h4 Remez Step 2] | ||
|  | 
 | ||
|  | The second step is to locate the extrema of the new approximation, which we do  | ||
|  | in two stages:  first, since the error function changes sign at each | ||
|  | control point, we must have N+1 roots of the error function located between | ||
|  | each pair of N+2 control points.  Once these roots are found by standard root finding  | ||
|  | techniques, we know that N extrema are bracketed between each pair of | ||
|  | roots, plus two more between the endpoints of the range and the first and last roots. | ||
|  | The N+2 extrema can then be found using standard function minimisation techniques. | ||
|  | 
 | ||
|  | We now have a choice: multi-point exchange, or single point exchange. | ||
|  | 
 | ||
|  | In single point exchange, we move the control point nearest to the largest extrema to | ||
|  | the absissa value of the extrema. | ||
|  | 
 | ||
|  | In multi-point exchange we swap all the current control points, for the locations | ||
|  | of the extrema. | ||
|  | 
 | ||
|  | In our example we perform multi-point exchange. | ||
|  | 
 | ||
|  | [h4 Iteration] | ||
|  | 
 | ||
|  | The Remez method then performs steps 1 and 2 above iteratively until the control | ||
|  | points are located at the extrema of the error function: this is then | ||
|  | the minimax solution. | ||
|  | 
 | ||
|  | For our current example, two more iterations converges on a minimax | ||
|  | solution with a peak relative error of | ||
|  | 5x10[super -4] and an error function that looks like: | ||
|  | 
 | ||
|  | [$../graphs/remez-4.png] | ||
|  | 
 | ||
|  | [h4 Rational Approximations] | ||
|  | 
 | ||
|  | If we wish to extend the Remez method to a rational approximation of the form | ||
|  | 
 | ||
|  | f(x) = R(x) = P(x) / Q(x) | ||
|  | 
 | ||
|  | where P(x) and Q(x) are polynomials, then we proceed as before, except that now | ||
|  | we have N+M+2 unknowns if P(x) is of order N and Q(x) is of order M.  This assumes | ||
|  | that Q(x) is normalised so that its leading coefficient is 1, giving | ||
|  | N+M+1 polynomial coefficients in total, plus the error term E. | ||
|  | 
 | ||
|  | The simultaneous equations to be solved are now: | ||
|  | 
 | ||
|  | P(x[sub i]) / Q(x[sub i]) + (-1)[super i]E = f(x[sub i]) | ||
|  | 
 | ||
|  | Evaluated at the N+M+2 control points x[sub i]. | ||
|  | 
 | ||
|  | Unfortunately these equations are non-linear in the error term E: we can only | ||
|  | solve them if we know E, and yet E is one of the unknowns! | ||
|  | 
 | ||
|  | The method usually adopted to solve these equations is an iterative one: we guess the | ||
|  | value of E, solve the equations to obtain a new value for E (as well as the polynomial | ||
|  | coefficients), then use the new value of E as the next guess.  The method is | ||
|  | repeated until E converges on a stable value. | ||
|  | 
 | ||
|  | These complications extend the running time required for the development | ||
|  | of rational approximations quite considerably. It is often desirable | ||
|  | to obtain a rational rather than polynomial approximation none the less: | ||
|  | rational approximations will often match more difficult to approximate | ||
|  | functions, to greater accuracy, and with greater efficiency, than their | ||
|  | polynomial alternatives.  For example, if we takes our previous example | ||
|  | of an approximation to e[super x], we obtained 5x10[super -4] accuracy | ||
|  | with an order 4 polynomial.  If we move two of the unknowns into the denominator | ||
|  | to give a pair of order 2 polynomials, and re-minimise, then the peak relative error drops | ||
|  | to 8.7x10[super -5].  That's a 5 fold increase in accuracy, for the same number  | ||
|  | of terms overall. | ||
|  | 
 | ||
|  | [h4 Practical Considerations] | ||
|  | 
 | ||
|  | Most treatises on approximation theory stop at this point.  However, from | ||
|  | a practical point of view, most of the work involves finding the right | ||
|  | approximating form, and then persuading the Remez method to converge | ||
|  | on a solution. | ||
|  | 
 | ||
|  | So far we have used a direct approximation: | ||
|  | 
 | ||
|  | f(x) = R(x) | ||
|  | 
 | ||
|  | But this will converge to a useful approximation only if f(x) is smooth.  In | ||
|  | addition round-off errors when evaluating the rational form mean that this | ||
|  | will never get closer than within a few epsilon of machine precision.   | ||
|  | Therefore this form of direct approximation is often reserved for situations | ||
|  | where we want efficiency, rather than accuracy. | ||
|  | 
 | ||
|  | The first step in improving the situation is generally to split f(x) into | ||
|  | a dominant part that we can compute accurately by another method, and a  | ||
|  | slowly changing remainder which can be approximated by a rational approximation. | ||
|  | We might be tempted to write: | ||
|  | 
 | ||
|  | f(x) = g(x) + R(x) | ||
|  | 
 | ||
|  | where g(x) is the dominant part of f(x), but if f(x)\/g(x) is approximately | ||
|  | constant over the interval of interest then: | ||
|  | 
 | ||
|  | f(x) = g(x)(c + R(x)) | ||
|  | 
 | ||
|  | Will yield a much better solution: here /c/ is a constant that is the approximate | ||
|  | value of f(x)\/g(x) and R(x) is typically tiny compared to /c/.  In this situation | ||
|  | if R(x) is optimised for absolute error, then as long as its error is small compared | ||
|  | to the constant /c/, that error will effectively get wiped out when R(x) is added to | ||
|  | /c/. | ||
|  | 
 | ||
|  | The difficult part is obviously finding the right g(x) to extract from your | ||
|  | function: often the asymptotic behaviour of the function will give a clue, so | ||
|  | for example the function __erfc becomes proportional to  | ||
|  | e[super -x[super 2]]\/x as x becomes large.  Therefore using: | ||
|  | 
 | ||
|  | erfc(z) = (C + R(x)) e[super -x[super 2]]/x | ||
|  | 
 | ||
|  | as the approximating form seems like an obvious thing to try, and does indeed | ||
|  | yield a useful approximation. | ||
|  | 
 | ||
|  | However, the difficulty then becomes one of converging the minimax solution. | ||
|  | Unfortunately, it is known that for some functions the Remez method can lead | ||
|  | to divergent behaviour, even when the initial starting approximation is quite good. | ||
|  | Furthermore, it is not uncommon for the solution obtained in the first Remez step | ||
|  | above to be a bad one: the equations to be solved are generally "stiff", often | ||
|  | very close to being singular, and assuming a solution is found at all, round-off | ||
|  | errors and a rapidly changing error function, can lead to a situation where the | ||
|  | error function does not in fact change sign at each control point as required. | ||
|  | If this occurs, it is fatal to the Remez method.  It is also possible to | ||
|  | obtain solutions that are perfectly valid mathematically, but which are | ||
|  | quite useless computationally: either because there is an unavoidable amount | ||
|  | of roundoff error in the computation of the rational function, or because | ||
|  | the denominator has one or more roots over the interval of the approximation. | ||
|  | In the latter case while the approximation may have the correct limiting value at | ||
|  | the roots, the approximation is nonetheless useless. | ||
|  | 
 | ||
|  | Assuming that the approximation does not have any fatal errors, and that the only | ||
|  | issue is converging adequately on the minimax solution, the aim is to | ||
|  | get as close as possible to the minimax solution before beginning the Remez method. | ||
|  | Using the zeros of a Chebyshev polynomial for the initial interpolation is a  | ||
|  | good start, but may not be ideal when dealing with relative errors and\/or | ||
|  | rational (rather than polynomial) approximations.  One approach is to skew | ||
|  | the initial interpolation points to one end: for example if we raise the | ||
|  | roots of the Chebyshev polynomial to a positive power greater than 1  | ||
|  | then the roots will be skewed towards the middle of the \[-1,1\] interval,  | ||
|  | while a positive power less than one | ||
|  | will skew them towards either end.  More usefully, if we initially rescale the | ||
|  | points over \[0,1\] and then raise to a positive power, we can skew them to the left  | ||
|  | or right.  Returning to our example of e[super x][space] over \[-1,1\], the initial | ||
|  | interpolated form was some way from the minimax solution: | ||
|  | 
 | ||
|  | [$../graphs/remez-2.png] | ||
|  | 
 | ||
|  | However, if we first skew the interpolation points to the left (rescale them | ||
|  | to \[0, 1\], raise to the power 1.3, and then rescale back to \[-1,1\]) we | ||
|  | reduce the error from 1.3x10[super -3][space]to 6x10[super -4]: | ||
|  | 
 | ||
|  | [$../graphs/remez-5.png] | ||
|  | 
 | ||
|  | It's clearly still not ideal, but it is only a few percent away from | ||
|  | our desired minimax solution (5x10[super -4]). | ||
|  | 
 | ||
|  | [h4 Remez Method Checklist] | ||
|  | 
 | ||
|  | The following lists some of the things to check if the Remez method goes wrong,  | ||
|  | it is by no means an exhaustive list, but is provided in the hopes that it will | ||
|  | prove useful. | ||
|  | 
 | ||
|  | * Is the function smooth enough?  Can it be better separated into | ||
|  | a rapidly changing part, and an asymptotic part? | ||
|  | * Does the function being approximated have any "blips" in it?  Check | ||
|  | for problems as the function changes computation method, or | ||
|  | if a root, or an infinity has been divided out.  The telltale | ||
|  | sign is if there is a narrow region where the Remez method will | ||
|  | not converge. | ||
|  | * Check you have enough accuracy in your calculations: remember that | ||
|  | the Remez method works on the difference between the approximation | ||
|  | and the function being approximated: so you must have more digits of | ||
|  | precision available than the precision of the approximation | ||
|  | being constructed.  So for example at double precision, you | ||
|  | shouldn't expect to be able to get better than a float precision | ||
|  | approximation. | ||
|  | * Try skewing the initial interpolated approximation to minimise the | ||
|  | error before you begin the Remez steps. | ||
|  | * If the approximation won't converge or is ill-conditioned from one starting | ||
|  | location, try starting from a different location. | ||
|  | * If a rational function won't converge, one can minimise a polynomial | ||
|  | (which presents no problems), then rotate one term from the numerator to | ||
|  | the denominator and minimise again.  In theory one can continue moving | ||
|  | terms one at a time from numerator to denominator, and then re-minimising,  | ||
|  | retaining the last set of control points at each stage. | ||
|  | * Try using a smaller interval.  It may also be possible to optimise over | ||
|  | one (small) interval, rescale the control points over a larger interval, | ||
|  | and then re-minimise. | ||
|  | * Keep absissa values small: use a change of variable to keep the abscissa | ||
|  | over, say \[0, b\], for some smallish value /b/. | ||
|  | 
 | ||
|  | [h4 References] | ||
|  | 
 | ||
|  | The original references for the Remez Method and it's extension | ||
|  | to rational functions are unfortunately in Russian: | ||
|  | 
 | ||
|  | Remez, E.Ya., ['Fundamentals of numerical methods for Chebyshev approximations],  | ||
|  | "Naukova Dumka", Kiev, 1969. | ||
|  | 
 | ||
|  | Remez, E.Ya., Gavrilyuk, V.T., ['Computer development of certain approaches  | ||
|  | to the approximate construction of solutions of Chebyshev problems  | ||
|  | nonlinearly depending on parameters], Ukr. Mat. Zh. 12 (1960), 324-338. | ||
|  | 
 | ||
|  | Gavrilyuk, V.T., ['Generalization of the first polynomial algorithm of  | ||
|  | E.Ya.Remez for the problem of constructing rational-fractional  | ||
|  | Chebyshev approximations], Ukr. Mat. Zh. 16 (1961), 575-585. | ||
|  | 
 | ||
|  | Some English language sources include: | ||
|  | 
 | ||
|  | Fraser, W., Hart, J.F., ['On the computation of rational approximations  | ||
|  | to continuous functions], Comm. of the ACM 5 (1962), 401-403, 414. | ||
|  | 
 | ||
|  | Ralston, A., ['Rational Chebyshev approximation by Remes' algorithms],  | ||
|  | Numer.Math. 7 (1965), no. 4, 322-330. | ||
|  | 
 | ||
|  | A. Ralston, ['Rational Chebyshev approximation, Mathematical  | ||
|  | Methods for Digital Computers v. 2] (Ralston A., Wilf H., eds.),  | ||
|  | Wiley, New York, 1967, pp. 264-284. | ||
|  | 
 | ||
|  | Hart, J.F. e.a., ['Computer approximations], Wiley, New York a.o., 1968. | ||
|  | 
 | ||
|  | Cody, W.J., Fraser, W., Hart, J.F., ['Rational Chebyshev approximation  | ||
|  | using linear equations], Numer.Math. 12 (1968), 242-251. | ||
|  | 
 | ||
|  | Cody, W.J., ['A survey of practical rational and polynomial  | ||
|  | approximation of functions], SIAM Review 12 (1970), no. 3, 400-423. | ||
|  | 
 | ||
|  | Barrar, R.B., Loeb, H.J., ['On the Remez algorithm for non-linear  | ||
|  | families], Numer.Math. 15 (1970), 382-391. | ||
|  | 
 | ||
|  | Dunham, Ch.B., ['Convergence of the Fraser-Hart algorithm for rational  | ||
|  | Chebyshev approximation], Math. Comp. 29 (1975), no. 132, 1078-1082. | ||
|  | 
 | ||
|  | G. L. Litvinov, ['Approximate construction of rational | ||
|  | approximations and the effect of error autocorrection], | ||
|  | Russian Journal of Mathematical Physics, vol.1, No. 3, 1994. | ||
|  | 
 | ||
|  | [endsect][/section:remez The Remez Method] | ||
|  | 
 | ||
|  | [/  | ||
|  |   Copyright 2006 John Maddock and Paul A. Bristow. | ||
|  |   Distributed under the Boost Software License, Version 1.0. | ||
|  |   (See accompanying file LICENSE_1_0.txt or copy at | ||
|  |   http://www.boost.org/LICENSE_1_0.txt). | ||
|  | ] | ||
|  | 
 |