Wikipedia:Reference desk/Archives/Mathematics/2020 August 15

Mathematics desk
< August 14 << Jul | August | Sep >> August 16 >
Welcome to the Wikipedia Mathematics Reference Desk Archives
The page you are currently viewing is a transcluded archive page. While you can leave answers for any questions shown below, please ask new questions on one of the current reference desk pages.


August 15

edit

Chain rule with multiple variables

edit

I asked the question given in the link earlier on the mathematics stack exchange site. I was wondering if anyone could help me understand the answer I got in layman's terms. I simply don't understand. The question is asking for clarity on the chain rule with multiple variables.

https://math.stackexchange.com/questions/3790900/chain-rule-with-a-function-depending-on-functions-of-different-variables/3791017?noredirect=1#comment7809514_3791017 — Preceding unsigned comment added by 69.47.2.70 (talk) 06:00, 15 August 2020 (UTC)[reply]

Let me first get something out of the way. By a (not uncommon, but possibly confusing) abuse of notation, when a mathematical text says something like "Let  ", the symbol " " is made to do dual duty. In the right-hand side of this definition – which has the form of an equation but really is a definition – it denotes a function; in this case a function of just one variable. In the left-hand side, on the other hand, the symbol is defined to do duty as standing for an expression, in this case the expression " "; it will serve as an abbreviation. To determine which of the two is intended in any further use requires a mathematical interpretation of the context, but as a rule of thumb application to an argument, as in " ", tells us it stands for the function, while the lack of an argument, as in  , tells us it that this time it abbreviates the expression, this thus being convenient shorthand notation for  . Clearly, the expression   depends on  ; this dependence is explicit. Therefore " " used as abbreviation also depends on  , but now the dependence is implicit. This can be tricky or sneaky: a symbol may implicitly depend on another symbol, which in turn implicitly depends on yet another symbol, and so on.
    The formula you give for the chain-rule derivative brings the potential confusion to an entirely new level by mixing these two roles in a single formula. For simplicity, I'll illustrate the issue with the univariate version of the chain rule. Then, what we have is something like
 
 
What is going on here can be more easily be explained by introducing some extra symbols, as follows:
 
 
So the expression   in (1) is not simply the derivative of   with respect to  , which is a function of one argument, but stands here for the result of applying that function to  . You can kind of guess that because in the context this is an operand of a multiplication, so it has to be a number, not a function. You can get a number out by applying the function to some argument. The only reasonable candidate hanging around for assuming the role of that required argument is  .
Now take the definition   and the expression  . As before, this stands for  . Unless   sneakily implicitly depends on   – but for reasons that should be obvious mathematical etiquette does not allow one to secretly introduce dependences, so in the absence of evidence to the contrary we are allowed to assume no such implicit dependences exists – the term   in that expression does not depend on  . As   varies, it remains constant. Therefore,  .
I hope this helps. I am not going to attempt to explain the elaborate answer on the stack exchange, which should take me several hours.  --Lambiam 12:41, 15 August 2020 (UTC)[reply]
For the two example questions asked in that Stack Exchange post (assuming no unstated relation between variables):
  •  :
 
 
 
  •  :
 
 
 
 
These are the elements of the matrix multiplication at the end of Philipp's first response.
-- ToE 02:10, 17 August 2020 (UTC)[reply]
Note that there is a careless error in the first matrix multiplication, midway through Philipp's first response. The matrices are set up correctly, but the product is incorrect. Where they wrote:
 ,
they should have written:
 .
-- ToE 02:33, 17 August 2020 (UTC)[reply]
Matrix multiplication error aside, I don't think Philipp did you a favor by answering your question in terms of the total derivative and Jacobian matrix. Linear algebra, of which you say you know little, offers useful tools for a terse and elegant representation of multivariate concepts, but it isn't necessary to address your question. As Lambiam explained above,
 
is the same as saying:
 
 
Simple, right? -- ToE 14:27, 17 August 2020 (UTC)[reply]