

codegen

  

GRADIENT

  

compute the Gradient of a Maple procedure

 

Calling Sequence

Parameters

Description

Examples

Calling Sequence

GRADIENT(F)

GRADIENT(F, X)

Parameters

F - Maple procedure

X - list of symbols (parameters of F)

Description

• The first argument F is a Maple procedure which computes a function of x1, x2, ..., xn. The GRADIENT command outputs a new procedure G which, when executed at given values for x1, x2, ..., xn, returns a vector of the partial derivatives of F w.r.t. x1, ..., xn at the given values. This is known as automatic differentiation. It often leads to a more efficient computation than symbolic differentiation, that is, what you would obtain from using linalg[grad]. For example, given

F := proc(x,y) local t;

     t := exp(-x);

     y*t+t

end proc;

  

The output of G := GRADIENT(F); is the Maple procedure

G := proc(x, y) local df, t;

     t := exp(-x);

     df := array(1 .. 1);

     df[1] := y + 1;

     return -df[1]*exp(-x), t

  end proc

  

When G is called with arguments 1.0, 1.0, the output is the vector −0.7357588824, 0.3678794412. G can subsequently be optimized by optimize(G) so that the exponential is computed only once, to obtain

proc(x, y)

local df, t;

     t := exp(-x);

     df := array(1 .. 1);

     df[1] := y + 1;

     return -df[1]*t, t

end proc
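To recap these steps in one place, here is a hedged sketch (assuming F as defined above and the codegen package loaded):

G := GRADIENT(F):   # automatic differentiation, reverse mode by default
G(1.0, 1.0);        # -0.7357588824, 0.3678794412
G := optimize(G):   # recompute exp(-x) only once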

  

One can obtain derivatives of any function which can be represented by a computation constructed by composition of arithmetic operators and any mathematical functions, including procedures containing loops and making subroutine calls.
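As a hedged illustration (the procedure below, a truncated series for exp(x), and the names Fser and Gser are illustrative, not from this page), a procedure containing a loop can be differentiated directly:

Fser := proc(x) local s, t, i;
    s := 1;
    t := 1;
    for i to 5 do   # truncated series: 1 + x + x^2/2! + ... + x^5/5!
        t := t*x/i;
        s := s + t
    end do;
    s
end proc:
Gser := GRADIENT(Fser);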

  

The remaining arguments to GRADIENT are optional; they are described below. There are a number of limitations on the Maple procedures F that the GRADIENT command can handle. These are also discussed below.

• By default, GRADIENT computes the partial derivatives of F w.r.t. all formal parameters present in F. The optional argument X, a list of symbols, may be used to specify the formal parameters w.r.t. which the derivatives are taken. For example, the call GRADIENT(F,[x]); computes the partial derivative of F with respect to the formal parameter x only. The result is

proc(x, y)

local df, t;

    t := exp(-x);

    df := array(1 .. 1);

    df[1] := y + 1;

    return -df[1]*exp(-x)

end proc

  

The procedure F may not refer to its parameters using args[i]. For example, you may not define F as follows:

  

 

F := proc() local t;

     t := exp(-args[1]);

     args[2]*t+t

end proc;
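A hedged workaround is simply to name the parameters explicitly, as in the F defined at the top of this page:

F := proc(x, y) local t;
     t := exp(-x);
     y*t + t
end proc;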

  

Nor is it possible to differentiate w.r.t. an array of parameters. For example, you cannot presently define F as

  

 

F := proc(x::array(1..2)) local t;

     t := exp(-x[1]);

     x[2]*t+t

end proc;

  

and compute the gradient w.r.t. the array x.
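A hedged workaround (the scalar parameter names x1 and x2 are illustrative) is to pass the array entries as separate scalar parameters:

F := proc(x1, x2) local t;
     t := exp(-x1);
     x2*t + t
end proc:
G := GRADIENT(F);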

• Two algorithms are supported, the so-called forward and reverse modes. By default, GRADIENT tries to use the reverse mode, since it usually leads to more efficient code. If it is unable to use the reverse mode, the forward mode is used. The user may specify which algorithm is to be used by giving the optional argument mode=forward or mode=reverse, as in the sketch below.
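For example, with F as defined above (a hedged sketch; G1 and G2 are illustrative names):

G1 := GRADIENT(F, mode=forward):   # force the forward mode
G2 := GRADIENT(F, mode=reverse):   # force the reverse mode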

• The advantage of the reverse mode is that the cost of computing the gradient is usually lower than with the forward mode, especially if the number of partial derivatives n is large. It is also cheaper than symbolic differentiation (Maple's diff command) and divided differences. Specifically, if the cost of computing F is m arithmetic operations, then the cost of computing the gradient of F using the forward and reverse modes is O(m n) and O(m + n), respectively. However, the best result is usually obtained by pre-optimizing the input procedure F and post-optimizing the output procedure of GRADIENT. Also, before applying GRADIENT it is best to split up long products in F. This can be done with the codegen[split] command; a sketch of the whole pipeline follows.
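A hedged sketch of this pipeline, assuming the codegen package is loaded:

F := split(F):               # break up long products first
F := optimize(F):            # pre-optimize the input procedure
G := optimize(GRADIENT(F));  # post-optimize the generated gradient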

• The vector of partial derivatives is, by default, returned as an expression sequence. The optional argument result_type=list, result_type=array, or result_type=seq specifies that the vector of derivatives returned by G is to be a Maple list, an array, or a sequence, respectively. For example, the call GRADIENT(F,result_type=array); yields

proc(x, y)

local df, grd, t;

    t := exp(-x);

    df := array(1 .. 1);

    df[1] := y + 1;

    grd := array(1 .. 2);

    grd[1] := - df[1]*exp(-x);

    grd[2] := t;

    return grd

end proc

• The optional argument function_value=true causes GRADIENT to compute the function value F(x1, ..., xn) at the same time as the gradient. It is returned as the first value in the vector. For example, the call GRADIENT(F,function_value=true); yields

proc(x, y)

local df, t;

    t := exp(-x);

    df := array(1 .. 1);

    df[1] := y + 1;

    return y*t + t, - df[1]*exp(-x), t

end proc
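With F as at the top of this page, calling this G at 1.0, 1.0 should return the function value followed by the two partial derivatives (a hedged check based on the numbers shown earlier):

G(1.0, 1.0);   # 0.7357588824, -0.7357588824, 0.3678794412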

• See also codegen[HESSIAN] and codegen[JACOBIAN] for routines that compute Hessians and Jacobians using automatic differentiation of functions represented by Maple procedures. Also, the derivative operator D in Maple can be used to compute the derivative of a Maple procedure in one variable.
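For example, D applied to a one-variable operator (a hedged sketch; the operand is illustrative):

D(x -> x*exp(-x));   # returns x -> exp(-x) - x*exp(-x)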

• The command with(codegen,GRADIENT) allows the use of the abbreviated form of this command.

Examples

with(codegen):

F := proc(x,y) local t; t := x*y; x+t-y*t; end proc;

F := proc(x, y) local t; t := y*x; x + t - y*t end proc

(1)

F(x,y)

-y^2*x + y*x + x

(2)

G := GRADIENT(F)

G := proc(x, y)
local df, t;
    t := y*x;
    df := array(1 .. 1);
    df[1] := -y + 1;
    return y*df[1] + 1, x*df[1] - t
end proc

(3)

G(x,y)

y*(-y + 1) + 1, x*(-y + 1) - y*x

(4)

G := GRADIENT(F, mode=forward)

G := proc(x, y)
local dt, t;
    dt := array(1 .. 2);
    dt[1] := y;
    dt[2] := x;
    t := y*x;
    return dt[1]*(-y + 1) + 1, dt[2]*(-y + 1) - t
end proc

(5)

G(x,y)

y*(-y + 1) + 1, x*(-y + 1) - y*x

(6)

G := GRADIENT(F, function_value=true)

G := proc(x, y)
local df, t;
    t := y*x;
    df := array(1 .. 1);
    df[1] := -y + 1;
    return -t*y + t + x, y*df[1] + 1, x*df[1] - t
end proc

(7)

G(x,y)

-y^2*x + y*x + x, y*(-y + 1) + 1, x*(-y + 1) - y*x

(8)

G := GRADIENT(F, result_type=array)

G := proc(x, y)
local df, grd, t;
    t := y*x;
    df := array(1 .. 1);
    df[1] := -y + 1;
    grd := array(1 .. 2);
    grd[1] := y*df[1] + 1;
    grd[2] := x*df[1] - t;
    return grd
end proc

(9)

G := GRADIENT(F, result_type=list)

G := proc(x, y)
local df, t;
    t := y*x;
    df := array(1 .. 1);
    df[1] := -y + 1;
    return [y*df[1] + 1, x*df[1] - t]
end proc

(10)

H := GRADIENT(G, result_type=list)

H := proc(x, y)
local df, df1, dfr0, t;
    t := y*x;
    df1 := -y + 1;
    df := array(1 .. 2);
    dfr0 := array(1 .. 2);
    df[2] := y;
    dfr0[2] := x;
    dfr0[1] := -1;
    return [0, df1 - df[2], y*dfr0[1] + df1, x*dfr0[1] - dfr0[2]]
end proc

(11)

optimize(H)

proc(x, y)
local t1;
    t1 := -2*y + 1;
    [0, t1, t1, -2*x]
end proc

(12)

This next example illustrates pre- and post-optimization. The gradient is computed with respect to phi and omega only. Since the torus procedure returns three values, the gradient is really the Jacobian matrix of partial derivatives.

torus  := proc(phi,omega,R,r) local x,y,z;
     x := cos(phi)*(R+r*cos(omega));
     y := sin(phi)*(R+r*cos(omega));
     z := r*sin(omega);
     [x,y,z]
end proc:

torus := optimize(torus)

torus := proc(phi, omega, R, r)
local t1, t2, t4, t5, t6;
    t1 := cos(phi);
    t2 := cos(omega);
    t4 := r*t2 + R;
    t5 := sin(phi);
    t6 := sin(omega);
    [t4*t1, t5*t4, t6*r]
end proc

(13)

G := GRADIENT(torus, [phi, omega], result_type=list)

G := proc(phi, omega, R, r)
local df, dfr0, dfr1, t1, t2, t4, t5, t6;
    t1 := cos(phi);
    t2 := cos(omega);
    t4 := t2*r + R;
    t5 := sin(phi);
    t6 := sin(omega);
    df := array(1 .. 5);
    dfr0 := array(1 .. 5);
    dfr1 := array(1 .. 5);
    df[3] := t1;
    df[2] := df[3]*r;
    df[1] := t4;
    dfr0[4] := t4;
    dfr0[3] := t5;
    dfr0[2] := dfr0[3]*r;
    dfr1[5] := r;
    return [-df[1]*sin(phi), -df[2]*sin(omega), dfr0[4]*cos(phi), -dfr0[2]*sin(omega), 0, dfr1[5]*cos(omega)]
end proc

(14)

optimize(G)

proc(phi, omega, R, r)
local df, dfr0, t1, t2, t3, t4, t5, t6;
    t1 := cos(phi);
    t2 := cos(omega);
    t3 := t2*r;
    t4 := t3 + R;
    t5 := sin(phi);
    t6 := sin(omega);
    df := array(1 .. 5);
    dfr0 := array(1 .. 5);
    df[2] := t1*r;
    dfr0[2] := t5*r;
    [-t5*t4, -t6*df[2], t4*t1, -t6*dfr0[2], 0, t3]
end proc

(15)

We redo the same example, but this time we use the makeproc command to first build the torus procedure, returning an array instead of a list.

A := array([cos(phi)*(R + r*cos(omega)), sin(phi)*(R + r*cos(omega)), r*sin(omega)]):

torus := optimize( makeproc(A,[phi,omega,R,r]) );

torus := proc(phi, omega, R, r)
local A, t1, t2, t4, t5, t6;
    A := array(1 .. 3);
    t1 := cos(phi);
    t2 := cos(omega);
    t4 := t2*r + R;
    A[1] := t4*t1;
    t5 := sin(phi);
    A[2] := t5*t4;
    t6 := sin(omega);
    A[3] := t6*r;
    A
end proc

(16)

G := optimize(GRADIENT(torus, [phi, omega], result_type=array))

G := proc(phi, omega, R, r)
local df, dfr0, grd, t1, t2, t4, t5, t6;
    t1 := cos(phi);
    t2 := cos(omega);
    t4 := t2*r + R;
    t5 := sin(phi);
    t6 := sin(omega);
    df := array(1 .. 8);
    dfr0 := array(1 .. 8);
    df[2] := t1*r;
    dfr0[2] := t5*r;
    grd := array(1 .. 3, 1 .. 2);
    grd[1, 1] := -t5*t4;
    grd[1, 2] := -t6*df[2];
    grd[2, 1] := t4*t1;
    grd[2, 2] := -t6*dfr0[2];
    grd[3, 1] := 0;
    grd[3, 2] := t2*r;
    grd
end proc

(17)

This example shows that local procedures are handled.

f := proc(x) local s, t;
  s := proc(x) local e; e := exp(x); (e-1/e)/2 end proc;
  t := s(x^2);
  1-x*t+x^2*t;
end proc:

GRADIENT(f)

proc(x)
local dfr0, ds, lf1, s, t;
    ds := proc(x) local df, e;
        e := exp(x);
        df := array(1 .. 1);
        df[1] := 1/2 + 1/2/e^2;
        return df[1]*exp(x)
    end proc;
    s := proc(x) local e;
        e := exp(x);
        1/2*e - 1/2/e
    end proc;
    t := s(x^2);
    dfr0 := array(1 .. 1);
    lf1 := ds(x^2);
    dfr0[1] := x^2 - x;
    return 2*x*dfr0[1]*lf1[1] + 2*t*x - t
end proc

(18)

This final example illustrates the need for breaking up large products.

F := proc(u,v,w,x,y,z) u*v*w*x*y*z end proc;

F := proc(u, v, w, x, y, z) u*v*w*x*y*z end proc

(19)

G := GRADIENT(F)

G := proc(u, v, w, x, y, z)
    return v*w*x*y*z, u*w*x*y*z, u*v*x*y*z, u*v*w*y*z, u*v*w*x*z, u*v*w*x*y
end proc

(20)

cost(G)

24 multiplications

(21)

F := split(F)

F := proc(u, v, w, x, y, z)
local s0, s1, s2, s3;
    s0 := v*u;
    s1 := w*x;
    s2 := y*z;
    s3 := s1*s0;
    return s3*s2
end proc

(22)

G := GRADIENT(F)

G := proc(u, v, w, x, y, z)
local df, s0, s1, s2, s3;
    s0 := v*u;
    s1 := w*x;
    s2 := y*z;
    s3 := s1*s0;
    df := array(1 .. 4);
    df[4] := s2;
    df[3] := s3;
    df[2] := df[4]*s0;
    df[1] := df[4]*s1;
    return v*df[1], u*df[1], df[2]*x, df[2]*w, df[3]*z, df[3]*y
end proc

(23)

cost(G)

8 storage + 9 assignments + 12 multiplications + 12 subscripts

(24)

See Also

codegen[cost]

codegen[HESSIAN]

codegen[JACOBIAN]

codegen[makeproc]

codegen[optimize]

codegen[split]