Talk:Adequality
This is the talk page for discussing improvements to the Adequality article. This is not a forum for general discussion of the article's subject. |
Article policies
|
Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
Archives: 1Auto-archiving period: 12 months ![]() |
![]() | This article is rated C-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | ||||||||||
|
bug in article history
editThere is some kind of bug in the article history that prevents one from viewing old versions of the article (and displays the current version instead). This makes it difficult to view the recent proposed additions. I therefore copied the material to talk (as was originally proposed by User:Klaus Barner). See below. Tkuvho (talk) 15:24, 4 March 2013 (UTC)
Behind the word adequality (Latin: adaequalitas) hides a long story of scholarly interpretations of a technical term used by Pierre de Fermat in his seminal work of extreme values and tangents.
The Latin noun adaequalitas and the accompanying Latin verb adaequare are used by Fermat in his treatises and letters about his method of determining maxima and minima of algebraic terms and of computing tangents to algebraic curves (like conic sections). It is a technical term within this method and does not mean the method itself.
The verb adaequare and the noun adaequalitas are used by Fermat in different idioms like adaequentur, adaequabuntur, comparare per adaequalitatem, debet adaequare, fiat per adaequalitatem, and, in his French correspondence, comparer par adéquation, however, most frequently in the form adaequabitur (third person singular passive future) which means "it will be (made) equal" (indefinite tens).
The verb adaequare is synonymous with the verb aequare (Thesaurus Linguae Latinae, Vol. I, p.562). The latter word was used by Viète nearly exclusively (with very few exceptions) to denote equality in place of the symbol =. This symbol was invented by Robert Recorde in 1557. Viète wrote most frequently aequabitur, and Fermat adopted it from him. However, Fermat used both words, aequabitur and adaequabitur, in place of the equals sign, obviously systematically and in a subtle difference of meanings.
In Fermat’s work about maxima and minima and tangents there is a place where he says that he adopted the word “adaequentur” from Diophantus. In his treatise “’’Methodus ad disquirendam maximam et minimam’’” (Fermat, Œuvres, Vol. I, pp.133-136) Fermat writes: ‘’Adaequentur, ut loquitur Diophantus, duo homogenia maxima aut minima aequalia …’’ (We make, as Diophantus says, two expressions of maximum or minimum in like manner equal.) But there are more questions than answers what that exactly means. This topic is taken up further below.
And this is the cause of a long scholarly debate which still goes on. For the purpose of this article we will denote aequabitur by and adaequabitur (and its equivalent idioms) by .
Fermat’s example Nr. 1. (Œuvres, Vol. I, p.134) To divide the line by the point this way that the product of the two segments and becomes a maximum.
Fermat’s solution: Let . Next let be one of the two segments. Then the other one will be . And the product, which shall become a maximum, then equals . However, if the first segment is then the second one will be . The product of the segments is then
- .
Next Fermat puts the two terms equal:
- .
Subtracting common terms on both sides yields
Now Fermat [tacitly assuming that is unequal zero] cancels the common factor and gets
Finally Fermat deletes and yields the solution
Fermat then remarks: “The question is answered if we take
Nec potest generalior dari methodus.” (It is impossible to give a more general method.)
There are several questions one would like to ask. What is Fermat’s idea behind his method? Did he explain how he found it? Has Fermat proven the validity of his proceeding? And, if not, is it possible to do this by means of modern analysis? And, above all: what is the precise meaning of “adaequabitur” ( )?
Let us first hear the voices of the scholarly experts! When Paul Tannery in 1896 published his French translation of Fermat’s Latin treatises on maxima and minima (Fermat, Œuvres, Vol. III, pp. 121-156) he was obviously insecure of the meaning of adaequare and aequalitas and avoided a translation of these words from Latin into modern French. He invented instead the new French verb “adégaler” and adopted Fermat’s “adéquation”. His introduction of the sign for “adaequabitur” in mathematical formulas, however, had the most serious consequences.
Heinrich Wieleitner (1929):[1] “Fermat replaces A with A+E. Then he puts the new expression roughly equal ( angenähert gleich) to the old one, cancels equal terms on both sides, and divides by the highest possible power of E. He then cancels all terms which contain E and puts that, what remains, really equal. From that A results. That E should be as small as possible is nowhere said and is at best expressed by the word "adaequalitas". (Wieleitner uses the symbol .)
Max Miller (1934):[2] "Thereupon one should put the both terms, which express the maximum and the minimum, approximately equal (näherungsweise gleich), as Diophantus says." (Miller uses the symbol .)
Jean Itard (1948):[3] "One knows that the expression <<adégaler>> is adopted by Fermat from Diophantus, translated by Xylander and by Bachet. It is about an approximate equality (égalité approximative) ". (Itard uses the symbol .)
Joseph Ehrenfried Hofmann (1963):[4] "Fermat chooses a quantity h, thought as sufficiently small, and puts f(x+h) roughly equal (ungefähr gleich) to f(x). His technical term is adaequare." (Hofmann uses the symbol .)
Peer Strømholm (1968):[5] "The basis of Fermat's approach was the comparition of two expressions which, though they had the same form, were not exactly equal. This part of the process he called "comparare par adaequalitatem" or "comparer per adaequalitatem", and it implied that the otherwise strict identity between the two sides of the "equation" was destroyed by the modification of the variable by a small amount: . This, I believe, was the real significance of his use of Diophantos' πἀρισον, stressing the smallness of the variation. The ordinary translation of 'adaequalitas' seems to be "approximate equality", but I much prefer "pseudo-equality" to present Fermat's thought at this point."
Claus Jensen (1969)[6] Moreover, in applying the notion of adégalité - which constitutes the basis of Fermat's general method of constructing tangents, and by which is meant a comparition of two magnitudes as if they were equal, although they are in fact not - I will employ the nowdays mor usual symbol .
Michael Sean Mahoney (1971)[7] "Fermat's Method of maxima and minima, which is clearly applicable to any polynomial P(x), originally rested on purely finitistic algebraic foundations. It assumed, counterfactually, the inequality of two equal roots in order to determine, by Viete's theory of equations, a relation between those roots and one of the coefficients of the polynomial, a relation that was fully general. This relation then led to an extreme-value solution when Fermat removed his counterfactual assumption and set the roots equal. Borrowing a term from Diophantus, Fermat called this counterfactual equality "adequality". (Mahoney uses the symbol .)
Charles Henry Edwards, Jr. (1979):[8] For example, in order to determine how to subdivide a segment of length into two segments and whose product is maximal, that is to find the rectangle with perimeter that has the maximal area, he [Fermat] proceeds as follows. First he substituted (he used A, E instead of x, e) for the unknown x, and then wrote down the following "pseudo-equality" to compare the resulting expression with the original one:
After canceling terms, he divided through by e to obtain
Finally he discarded the remaining term containing e, transforming the pseudo-equality into the true equality
that gives the value of x which makes maximal. Unfortunately, Fermat never explained the logical basis for this method with sufficient clarity or completeness to prevent disagreements between historical scholars as to precisely what he meant or intended.
Kirsti Anderson (1980):[9] “The two expressions of the maximum or minimum are made "adequal", which means something like as nearly equal as possible." (Mrs. Anderson uses the symbol .)
After reading these quotations one must admit that Edwards very precisely describes the state of the art of the research about the meaning of “adaequare” around 1980. No-one of the authors, including Edwards himself, has a well-founded conception what Fermat meant with “adaequare” and “adaequalitas”. And Edwards holds Fermat’s vagueness responsible for that.
The first author, who resolutely contradicts this opinion, is Herbert Breger. “Taking into account that brilliant mathematicians usually are not so very confused when talking about their own central mathematical ideas” he wrote a long paper[10] and proposed a solution of the problem: “I want to put forward my hypothesis: Fermat used the word "adaequare" in the sense of "to put equal" ... In a mathematical context, the only difference between "aequare" and "adaequare" seems to be that the latter gives more stress on the fact that the equality is achieved.” (Page 197f. of his paper.)
Breger’s central considerations are as follows. As mentioned above, Fermat wrote in his treatise “’’Methodus ad disquirendam maximam et minimam’’” (Fermat, Œuvres, Vol. I, pp.133-136): ‘’Adaequentur, ut loquitur Diophantus, duo homogenia maxima aut minima aequalia …’’ (We make, as Diophantus says, two expressions of maximum or minimum in like manner equal.) Indeed, one finds in Diophantus’ Arithmetica several places where Diophantus uses the words παρισὀτης and πἀρισον which Bachet de Méziriac translates as ‘’adaequalitas’’ respectively as ‘’adaequale’’. Both words, which occur in the 11th and 14th problems of the fifth book of the Arithmetica, are used in context of the regula falsi which Diophantus uses to solve a certain class of Diophantine equations. The word παρισὀτης is invented by Diophantus to label this method. Fermat himself does not use these words. The meaning of πἀρισον is “almost equal” and the meaning of παρισὀτης is something like “approximation of the solution” which makes sense in the context of the regula falsi. However, Breger questions the correctness of Bachet’s translation of those Greek words.[dubious – discuss] (Bachet had adopted his own translation from Wilhelm Xylander.) Paul Tannery[11] translates πἀρισον with proximum and παρισὀτης with appropinquatio, which is much better.
The verbs aequare and adaequare are synonymous and mean “to make equal”. All great Latin dictionaries say unanimously: both verbs mean “to make equal”. And Breger concludes that both words, aequabitur and adaequabitur, mean “equals”. He provides many strong arguments for this claim. And he is right, as one will see below. However, he fails to answer the question: why should Fermat use two synonymous verbs obviously and systematically in a different sense? Breger’s answer to this question is completely unsatisfactory.
The only way to resolve this paradoxical situation is to analyze the meaning and the different functions of the equals sign . There are hundreds of books about logic and/or set theory, but almost all of them say no word about the equals sign, they just use it. The notable exception is the work of Fermat’s virtual compatriot Nicolas Bourbaki. In his Elements of Mathematics he devotes a paragraph (of 3 pages) to the equals sign and its proper use. The following quotations are from Bourbaki (Theory of Sets’’. Addison Wesley, Reading Mass.1968, §5.1).
“An equalitarian theory is a theory Ƭ which has a relational sign of weight 2, written (read “equals”).
If and are terms in Ƭ, the assembly is a relation in Ƭ (called the relation of equality). - In practice it is denoted by or .”
“The negation of the relation is denoted by or (where the sign is read is different from).”
“When a relation of the form has been proved in a theory Ƭ, it is often said (by abuse of language) that and are the same or are identical.”
“Likewise, when is true in Ƭ, we say that that and are distinct in place of saying that that is different from .”
Now one should consider a set of relations of equality, one example is from Euler, one from Viète, and the rest is Fermat’s.
Relation of equality Nr.1. (Euler)
This beautiful relation of equality is due to Euler. The equals sign = stands between two constants. They look very different, but they represent the same real number. That has been proven by Euler. So, according to Bourbaki, the two constants in the relation of equality above are the same or identical. If the term is denoted by and the term is denoted by then is true.
Relation of equality Nr.2. (Fermat)
This relation of equality is considered by Fermat (Œuvres, Vol. I, p.155). The letter denotes a constant, the length of a line. This relation may be called a conditional equation. It is neither true nor false. When it has been solved, for instance by the positive solution
and is replaced by the solution the relation of equality above is proved and, according to Bourbaki, the constants on both sides of the relation are the same or identical.
Relation of equality Nr.3. (Fermat)
- .
This relation of equality is from Fermat (Œuvres, Vol. I, p.140). Again, denotes the positive length of a line. This relation is easily proved by the rules of algebra. The relation is universally valid. According to Bourbaki the terms on both sides of the equals sign are the same or identical. If the term is denoted by and the term is denoted by then is true.
Relation of equality Nr.4. (Fermat)
- .
This relation of equality is on the same page as the relation Nr. 3. However, this relation is not universally valid. It is easy to find a counterexample. For instance and yields . This proves, according to Bourbaki, that the two terms on both sides of the relation are distinct. If the term is denoted by and the term is denoted by then .
The relations of equality Nr.5 and Nr.6 follow further below.
Fermat’s example Nr. 2. (Œuvres, Vol. I, p.140) To divide the line by the point this way that the product of the square over and the line is maximal.
Fermat’s solution: Let and so that . If now we replace by then the product, which results by multiplication of the square of and , is
Now Fermat puts this last product equal to the first one:
- ,
word for word explaining: “Id comparo primo solido tanquam essent aequalia , licet revera non sint, et hujus modi comparationem vocavi adaequalitatem.” (I compare this [product] with the first product as if they were equal, even though, in reality, they are not, and I called this kind of comparison adaequalitas.) [It is exactly that case, where Bourbaki writes though is true. It is a relation of equality which describes a relation of two variables, x and e, which are not independent.] After that Fermat cancels those terms which both sides have in common, to wit , in which on one side nothing remains, however, on the other side
Fermat prefers to separate the terms with a minus sign from those with a plus sign (comparando sunt ergo homogenea notata signo + cum iis quae notantur signo - ) yielding the equation
He then cancels the factor :
Fermat deletes all terms, which still contain the factor :
Finally, tacitly assuming that is no sensible solution, Fermat cancels the factor . And he gets
Therefore the line must be divided that way that
Fermat does not always give complete proofs of his results concerning the determination of maxima and minima or of computing tangents. However, in those cases, where he does it, he follows always exactly the same scheme of his “method”. First Fermat prepares the problem for the application of his method. That depends on the specific conditions and assumptions of the problem. The outcome always consists of two different terms of similar structure, one containing only the variable and the other one containing . The terms may be polynomials, quotients of polynomials, and algebraic terms containing roots. These two different terms are than set equal ( ). If necessary, this equation is then transformed by algebraic operations until Fermat gets a polynomial equation. And, a sheer miracle, all summands of the equation contain the factor or a power of it. Then this factor (the highest common power of ) is cancelled, and finally those remaining terms, which still contain as a factor, are deleted. The result is a conditional equation, which gives, if solved, the solution of the problem. The last equation is the first one where Fermat changes back from adaequabitur to aequabitur. All operations are purely algebraic.
It would be interesting to know, how the adherents of counterfactual equality or almost equality would justify Fermat’s algebraic operations (like cross-multiplying or squaring out roots). However, these guys never give up. But there will be a lucky punch.
Relation of equality Nr.5. (Viète)
Of course, this is a special case of Viète’s more general formula for the factorization of the term . Let be the term and the term . Then the relation of equality is true. That is easily proved (on the basis of the axioms of a commutative ring) and may be noted by means of Bourbaki’s quantifiers as
We now turn to the story of Fermat and Descartes’ folium cartesii, given by the relation of equality
Relation of equality Nr.6. (Fermat)
This relation of equality is found in Fermat’s Œuvres (Vol. II, p.156). It describes a curve which was proposed to him by René Descartes. Let, again, be the term , and the term . We want to show that is true. The relation of equality above is not universally true. If one chooses and one gets the false equation . Therefore
Applying the rule twice we get
which proves that is true.
Fermat’s example Nr. 3. (The lucky punch.) In his letter to Mersenne of 18 January 1638 (Fermat, Œuvres, Vol. III, pp. 126-132) Descartes challenges Fermat to compute the tangent of a new curve, which he had detected and which is nowadays called folium cartesii. On page 129/130 Descartes writes:
“Then, apart from this, his [Fermat’s] alleged rule is not that universal as it seems to him. And it is impossible to extend it to problems which are a little more difficult, but only to the easiest. So he could test it, after having better understood it. He should use it to find, for instance the tangents to the curve which I propose to be the following. At an arbitrary place of the curve take the point . Having drawn the perpendicular the two cubes of side lengths and are, together, equal (égaux) to the rectangular solid formed of the same sides and and the length of a given side.”
If one puts , , and , then one gets the equation of the folium cartesii:
One usually chooses which makes the curve looking nice and the computations easier. Mersenne hesitated for a long time sending a copy of Descartes’ letter to Fermat. But finally he did. And Fermat answered in June 1638 (Œuvres, Vol. II, p.156) and gave the solution of Descartes’ challenge. Fermat formulates the description of the curve, which Descartes had proposed, as follows: “Soit la courbe , de laquelle la propriété est telle que, quelque point qu’on prenne sur la dite courbe, comme , tirant la perpendiculaire , les deux cubes et soient égaux au parallélépipède compris sous une ligne droite donnée, comme , et les deux lignes et .“ Fermat then introduces two variable auxiliary points and , needed for his “method”. He then starts with his “method” and writes : “Il faudra comparer, par adéquation, les deux cubes et avec le solide compris sous . (one must compare, by adequality, the two cubes … ). That means
What may this adequality mean? Approximately equal? Counterfactually equal? As nearly equal as possible? Pseudo-equal? Almost equal?[12] Or put equal? The reader should decide it himself.
Why did Fermat not just write aequabitur instead of adaequabitur? The answer should include some biographical considerations. When Fermat (born November 1607) in 1628 created his “method” he was very young, about 20 or 21 years old. He lived as an âvocat at the parlement de Bordeaux where his friend Etienne d’Espagnet was a conseiller at the same parlement. D’Espagnet possessed a very valuable scientific library which he had inherited from his father, a friend of Viète. In this library Fermat studied the writings of the ancients and Viète’s work. Viète used most frequently the verb aequare and, on some rare occasions, the verb adaequare (8 times) and the noun adaequalitas (once), instead of Recorde’s equals sign. (François Viète, Opera Mathematica. Georg Olms, New York 1970, pp. 80, 134, 135, 137, 141, and 143). In all these cases the meaning of adaequare is to put equal. At that time Fermat had still not studied Diophantus. So he borrowed the verb adaequare and the noun adaequalitas from his admired “example” Viète. Later, in his correspondence with Mersenne and Descartes, he referred to Bachet de Meziriac’s Diophantus. And that was very misleading because Bachet’s translation is false. Viète exclusively dealt with conditional equations and universally valid equations (formulas, theorems), never, however, with relations of equality which describe a relation between two (or more) variables, which are not independent and, occasionally, describe a curve. So Fermat felt that he should give these new relations of equality a different name: adaequabitur. However, when Fermat starts applying his “method” and puts two different terms equal for the first time he prefers to write adaequentur, adaequabuntur, comparare per adaequalitatem, debet adaequare, or fiat per adaequalitatem, and, in his French correspondence, comparer par adéquation.
How did Fermat hit upon his method? Fermat explained that very explicitly in his treatise Methodus de maxima et minima (Œuvres , Vol. I, pp.147-153) which has been preserved as Mersenne’s copy. The following is a concise account in modern mathematical language, nevertheless trying to repeat Fermat’s idea as anadulterated as possible. Let be a term, for instance a polynomial term as like that in the first example. The maximum of this term is to be determined, for instance in the Interval . Now Fermat argues, picking up a remark of Pappus of Alexandria, that the place , where the maximum is attained, that is , is singular (plural: unicae et singulares). That means that the value is not attained at another place of the interval again. Because, if one chooses a value somewhat smaller than then there are two different places in the interval, say and with , for which holds
If one again chooses a constant with , so to this constant will belong and of that kind, that
and and That means that the nearer comes to the closer will be to . Only for the place , where attains its maximum, there exists no other place in the interval where this value is again attained, which corresponds to
This gave Fermat the idea of the following method for computing He puts
and substracting common terms on both sides of this relation of equality all remaining terms contain (or a power of it) as factor. Tacitly assuming he cancels the highest common factor of (that may well be if contains a square root) . In the remaining equation Fermat then puts (equals nihil), which gives him a conditional equation for .
Did Fermat prove the validity of his “method”? If this question means: “gave Fermat a proof which fulfills the standard of modern analysis?” then the answer is – of course – no. Fermat did not possess an idea of the continuum which is familiar to us since Cantor, Dedekind, and Weierstraß. His concept of number could possibly be described by naïve constructivism: if one possesses, for a given quantity, an algorithm which allows him to approximate this quantity arbitrarily precisely by rational numbers then this quantity responds to a number. Fermat had no concept of function, of continuity, of convergence, and of limit. He never considers the local gradient of a curve. (When Fermat determines a tangent he always computes, as a second point of the tangent, the point of intersection of the tangent with an obviously chosen reference line, most often the axis of symmetry of the curve.) Fermat had explained how he hit upon the idea of his “method”. And this method always produced the correct solution. He had, admittedly, some trouble with his attempt to explain to his contemporaries (especially Descartes) his proceeding to compute tangents. However, this had another reason.
Is it possible to prove the validity of Fermat’s “method” by means of modern analysis? (This section is intended to the mathematically inclined reader. A one-year course in calculus will be sufficient. We need, in particular, Ulisse Dini’s implicit function theorem for two variables.) The reader should be sure about the circumstance that every attempt to justify Fermat’s method with the aid of modern real analysis must inevitably be anachronistic. However, one can try to reduce this to a minimum. We will not at all alter Fermat’s equations. We turn our attention to the last three equations in Fermat’s proof. These have always the same structure. We will explain our consideration by taking the example of Fermat’s second problem above. There the third from last equation reads
Then, tacitly assuming , Fermat cancels the factor :
Thus the resulting equation is not valid when . But then Fermat deletes all terms, which still contain the factor , that means that he puts , and the result is the conditional equation
This is the critical point of Fermat’s “method”. The most obvious question is: Is it possible to replace Fermat’s putting by an argument?
The quadratic equation has the solutions and As one easily sees by elementary arguments: is a (relative) minimum of Well, we would like to show: In a neighbourhood of the variable is locally a continuous function with
Temporarily, we write the last relation before deleting in the form
- .
If we put , then we may write
- .
Now we apply the implicit function theorem. The function is a function of and with continuous derivatives and . We note down
We have and . These are the preconditions for the application of the implicit function theorem. According Dini there exists a and a continuous function defined on the interval with the properties
and
- .
This is, written in the initial form,
- .
If we, finally, let in this equation we arrive, taking into account the continuity of all involved functions, at
That is it, what we wanted to achieve. And it justifies Fermat’s “method” in this special case. However, as the reader may check, this kind of using Dini’s theorem works in all cases of determining maxima and minima and of computing tangents which Fermat brings through to the end.
- ^ Wieleitner, H.:Bemerkungen zu Fermats Methode der Aufsuchung von Extremwerten und der Berechnung von Kurventangenten. Jahresbericht der Deutschen Mathematiker-Vereinigung 38 (1929)24-35, p.25
- ^ Miller, M.: Pierre de Fermats Abhandlungen über Maxima und Minima. Akademische Verlagsgesellschaft, Leipzig (1934), p.1
- ^ Itard, I: Fermat précurseur du calcul différentiel. Arch Int. Hist. Sci. 27 (1948), 589-610, p.597
- ^ Hofmann, J.E.: Über ein Extremwertproblem des Apollonius und seine Behandlung bei Fermat. Nova Acta Leopoldina (2) 27 (167) (1963), 105-113, p.107
- ^ Strømholm, P.: Fermat's method of maxima and minima and of tangents. A reconstruction. Arch. Hist Exact Sci. 5 (1968), 47-69, p.51
- ^ Jensen, C.: Pierre Fermat's method of determining tangents and its application to the conchoid and the quadratrix. Centaurus 14 (1969), 72-85, p.73
- ^ Mahoney, M.S.: Fermat, Pierre de. Dictionary of Scientific Biography, vol. IV, Charles Scribner's Sons, New York (1971), p.569.
- ^ Edwards, C.H., Jr.:The historical Development of the Calculus. Springer, New York 1979, p.122f
- ^ Anderson, K.: Techniques of the calculus 1630-1660. In: Grattan-Guinness, I. (ed): From the Calculus to Set Theory. An Introductory History. Duckworth, London 1980, 10-48, p.23
- ^ Breger, H.: The mysteries of adaequare: A vindication of Fermat. Arch. Hist. Exact Sci. 46 (1994), 193-219
- ^ Tannery,Paul (ed.): Diophantus Alexandrinus, Opera Omnia, Vol. I, Teubner, Stuttgart 1974
- ^ Mikhail G. Katz, David M. Schaps, Steven Shnider: Almost equal : Method of adequality from Diophantus to Fermat and beyond. Perspective on Science 21 (2013), no.3, 1-45
Etienne d’Espagnet
editThe comments on Etienne d’Espagnet are interesting. It would be good to have a source for this. Tkuvho (talk) 15:46, 4 March 2013 (UTC)
- Here I am referring to the following comment by User:Klaus Barner: D’Espagnet possessed a very valuable scientific library which he had inherited from his father, a friend of Viète. In this library Fermat studied the writings of the ancients and Viète’s work. Viète used most frequently the verb aequare and, on some rare occasions, the verb adaequare (8 times) and the noun adaequalitas (once), instead of Recorde’s equals sign. (François Viète, Opera Mathematica. Georg Olms, New York 1970, pp. 80, 134, 135, 137, 141, and 143). In all these cases the meaning of adaequare is to put equal. At that time Fermat had still not studied Diophantus. So he borrowed the verb adaequare and the noun adaequalitas from his admired “example” Viète. Later, in his correspondence with Mersenne and Descartes, he referred to Bachet de Meziriac’s Diophantus. This seems problematic for the following reason. The earliest version of Fermat's method that we have dates from around 1636, the so-called "methodus". In the methodus, Fermat already attributes the term "adequality" to Diophantus. Whatever version of the method that Fermat may have given to d'Espagnet in 1629 is not extant to the best of my knowledge. The assumption that such a text may have contained the term "adequality" seems to be pure speculation, as well as the claim that such a hypothetical use of the term may have originated from Vieta. Of course, I would be happy to see evidence to the contrary, but short of such evidence the passage above seems to be massaging the facts to fit the desired conclusions. Tkuvho (talk) 14:10, 7 March 2013 (UTC)
pseudoequality
editThe description "equating things that are different (the exact meaning is subject to controversy; see below)" is rather paradoxical not very informative, it seems to me. If "approximate equality" does not describe a majority of the opinions cited below, we should try to come up with a more descriptive term that does describe a majority of the opinions, instead of the paradoxical phrase (which is unsourced and so does not reflect any opinion given below). How about "approximate equality or pseudo-equality"? Alternatively, we could rely on Andre Weil's authority (which is beyond dispute) and simply use whichever term he used. Tkuvho (talk) 19:40, 9 March 2013 (UTC)
- I agree that, if he has written on "adequality" and Fermat's method, Weil must be cited in this article and probably in the lead. Unfortunately, I have not an easy access to Weil's work on the subject. On the other hand, I am not so proud of my formulation in the lead. However it is a tentative to solve an apparent contradiction: "Adequality" is a neologism that does not appear in any dictionary. Thus it requires some kind of definition in the lead. But the meaning of the word is controversial; in fact the controversy may be extended to usefulness of the neologism and to know if its denotes or not a mathematical topic. This makes a hard task to simultaneously introduce the context and follow WP:NPOV policy in the lead. I agree that my formulation is not satisfactory. Therefore I have wrote a new version of the lead that is, I hope, purely factual. I'll been bold. Feel free to modify it, if you find a better version.D.Lazard (talk) 15:36, 11 March 2013 (UTC)
- Indeed you made a helpful contribution to the article. I will try to look up what Weil actually wrote. If it is quotable I will replace the current formulation in the lede by his formulation, with a citation, if this is OK with other editors. Tkuvho (talk) 15:47, 11 March 2013 (UTC)
- Weil wrote as follows: "[Fermat] introduces the technical term adaequalitas, adaequare, etc., which he says he has borrowed from Diophantus. As Diophantus V.ll shows, it means an approximate equality, and this is indeed how Fermat explains the word in one of his later writings." Tkuvho (talk) 16:16, 11 March 2013 (UTC)
- Indeed you made a helpful contribution to the article. I will try to look up what Weil actually wrote. If it is quotable I will replace the current formulation in the lede by his formulation, with a citation, if this is OK with other editors. Tkuvho (talk) 15:47, 11 March 2013 (UTC)
- This appears in Weil, A., Book Review: The mathematical career of Pierre de Fermat. Bull. Amer. Math. Soc. 79 (1973), no. 6, 1138--1149. Tkuvho (talk) 16:22, 11 March 2013 (UTC)
- According to Weil, the meaning of the term derives from Diophantus's parisotes, rather than from the Latin term. If so, the focus on the latin term in the current introduction is misplaced. I suggest we cite Weil and skip the part about "controversy", which is not sufficeintly informative for the lede. Tkuvho (talk) 16:34, 11 March 2013 (UTC)
Fermat's allegedly inconsistent procedure
editThe page currently contains the following material concerning Barner's article on adequality:
- On page 36, Barner writes: "Why did Fermat continually repeat his inconsistent procedure for all his examples for the method of tangents? Why did he never mention the secant, with which he in fact operated? I do not know."
Editors are invited to comment on the utility of including this citation in the page. Tkuvho (talk) 06:57, 5 October 2014 (UTC)
I think Barner's secant idea is a misinterpretation leading to a straw-man argument, but we can't blame him because Fermat himself was not clear nor consistent. Fermat's methods worked, but he did not completely understand what he was doing, nor the underlying assumptions that made his method work. [comment by Ng F K] — Preceding unsigned comment added by 112.199.248.16 (talk) 22:40, 29 January 2015 (UTC)
- Hi Ng, thanks for your comment. Can you clarify what sources you are basing yourself on? Tkuvho (talk) 09:50, 30 January 2015 (UTC)
- I based it on a comment above from yourself(?). QUOTE The Math. Semesterber. piece suffers from serious weaknesses which make it less convincing. For example, the author's interpretation forces him to assume that Fermat made a systematic mistake in confusing secant and tangent lines in his presentation. Other interpretations do not suffer from such a weakness. Tkuvho (talk) 18:06, 20 February 2013 (UTC) UNQUOTE
Katz's article (reference #17) also describes the controversy well, although I disagree with some parts.
By the way, I am not a registered editor of Wikipedia but I came to the "adequality" article expecting a clean answer to what it meant. I didn't expect that this concept is under debate. I ended up trying to answer my own question on what "adequality" means or should mean (see my section on the link to Big O concept). What I wrote does not require Implicit Function Theorem nor Non-stardard Analysis. I don't think it is original research because I believe the ideas contained therein can be found in any analysis textbook that discusses the Big O notation. I just tried to reformulate them in terms of different degrees of equivalence classes, and to explain why the E 'vanishes' without being contradictory. I hope regular editors would take a look to see if it makes sense and give your feedback. Thanks. [Ng F K] — Preceding unsigned comment added by 220.255.1.172 (talk) 17:41, 30 January 2015 (UTC)
- That's a very long entry down there. Could you summarize your explanation please? A few lines if possible will be helpful. The WP:OR issue is not necessarily that serious because you could have your analysis published and then it could be cited here. Tkuvho (talk) 08:49, 1 February 2015 (UTC)
- The first few lines up to the line before "In mathematics, precision of meaning is important." summarises my idea. The Big O notation is not really necessary, but from what i understand, modern mathematicians and computer scientists express approximate equality by using '=' with some Big O type of notation that specifies the level of precision. The theory is (ought to be) well understood, however it is a kind of abuse of notation. Nevertheless, it presents no problem if the reader knows what it really means.
Even without the Big O notation, we can actually define different levels of "adequality" by different equivalence relations ~1 ~2 ~3 etc in increasing levels of precision which behave like the "equal" relation. In fact, "=" relation can be viewed as "~infinity", as it were, being the most precise. From this point of view, i believe the concept of "adequality" can be explained rigorously. We just need to specify the level of precision, like ~1 ~2 ~3 etc instead of ~, because I believe Fermat just conflated all these different levels of adequality treated them as the same.
In the later sections, I tried to explain the various issues and paradoxes caused by the confusion.
[Ng F K]
- Hi Ng, you are a moving target. Your edits at this page are dispersed amoung about a dozen IP addresses. Would you mind creating a wiki account so as to make it possible to view your edits more easily? This only takes a few seconds and if you don't like it you can always stop using it. Tkuvho (talk) 13:14, 1 February 2015 (UTC)
Link with Big-O Notation
edit[ modified after reading arXiv paper by Katz reference #17 ]
I believe the concept of adequality can be expressed precisely in modern terms using the Big O notation [1] Namely, that
f(x) ~ f(x + e)
means
f(x+e) = f(x) + O(e^n)
or
| f(x+e) - f(x) | ≤ k.e^n for some positive k
where n=2 for problem of tangents (linear approximation)
n is odd number ≥3 for problem of extrema (maxima & minima)
- In mathematics, precision of meaning is important. Fermat and his critics were not clear on how 'near' is near. Some of the criticisms were straw man arguments.
- Various interpretations mentioned in Katz can be shown to be related or equivalent. "Kinetic approach" is a metaphor but can be linkd to epsilon-delta [2] approach or non-standard [3] approach.
- It can be shown that functions adequately equal or "adequal" under O(e^n)
are reflexive f~f, symmetric (f~g ==> g~f) and transitive (f~g and g~h implies f~h) and therefore ~ is an equivalence relation on functions. Equivalence relations have properties similar to the exact "=" relation, which is also itself an equivalence relation. This explains why ~ can be manipulated algebraically in a way similar to =. In fact that is what some mathematical analysts and computer scientists do nowadays with the Big O.
- I think Fermat basically got the right idea, but he was not clear, nor did he explain it precisely. By being fudgey on "some power of e", he does not seem to appreciate that when terms are expanded to and the dominant power of e is odd, you can get an inflexion point. Maxima and minima can also occur if the dominant power of e is an even power ≥ 2
{{by Ng F K|29 January 2015|220.255.1.137}}
Some further thoughts to resolve the paradox of "equal" vs "pseudo-equal" If we define the relations:
adequal to 1st order i.e. f ~1 g means f - g = O(e^1) adequal to 2nd order i.e. f ~2 g means f - g = O(e^2)
...
adequal to nth order i.e. f ~n g means f - g = O(e^n)
we will notice that these pseudo-equal or approximation relations get more and more stringent as n increases. [The "error" terms get smaller with higher powers, and higher order means better precision.]
If m≥n, then f ~m g implies f ~n g (but converse may not be true)
The ultimate most stringent relation is the exact equal relation "=" which we may think of as ~infinity.
Examples
cos E ~3 1 - E^2/2 i.e. cos E = 1 - E^2/2 + O(E^3) x + E ~1 x i.e. x + E = x + O(E) x + E ~0 x i.e. x + E = x + O(1) x = x i.e. x = x + O(E^infinity, so to speak)
How to explain the "disappearance" of E without saying that E=0? We say that to the required order of approximation x + E is practically the same as x. I think Fermat's mistake was not specifying the order precision. His adequal ~ sometimes means ~2 and sometimes means ~3.
Working out Fermat's example, my way:- Let f(x) = bx - x^2
f(x+e) - f(x) = be - 2ex + e^2
This implies
f(x+e) - f(x) ~3 be - 2ex + e^2 [f(x+e) - f(x)]/e ~2 b - 2x + e
Set LHS to zero to find maximum
0 ~2 b - 2x + e
We "downgrade" the relationship to get
0 ~1 b - 2x x ~1 b/2
However, e can be made arbitrarily small (as small as we want). If x not= b/2, we can always choose e to be smaller than, say, half of their gap to contradict that. The only way to avoid contradiction is
x = b/2.
I hope this removes the paradox of e being a non-zero quantity that seems to be equated to zero. What Fermat means by "adequate" ~ is a conflation of the ~1, ~2, ~3, ... relationships, which seems to be the source of confusion. Also Fermat did not explain the "arbitrariness" of e, which may be obvious to him, but not to his readers.
[comment by Ng F K]
On the purely algebraic nature of the manipulations
Why can we manipulate adequalities algebraically just like equations?
Once the equivalence relations have been established, we can carry over the algebraic operations to the equivalence classes, the way most things are done in Number Theory, Ring Theory, ... etc. we just need to check that the new algebraic operations are "well-defined" in the sense that if we chose different representatives to the classes and carried out the operations, we still ended up with the same result. Later on, we can 'abuse notation' (as some mathematicians like to do) by replacing equivalence classes by their representatives. For example: from the relation
0 ~2 b - 2x + e
we can write
[0] = [b] - 2[x] + [e]
where [ ] denote equivalence classes under ~2 or up to order O(e^2). Now we abuse notation and write
0 = b - 2x + e
This is how we can make and adequality relation ~2 behave 'as if' it is an equality relation. Was this what Fermat had in mind? Not sure, but probably not. But this is how I would 'repair' what he wrote, and explain the equal vs unequal paradox and to explain why we can just do pure algebraic manipulations without worrying about the element of approximation (it is already taken care of by the equivalence classes).
The 'Formal' nature of E
Why does E appear to be a pseduo-variable? I believe this comes from the Principle of Comparison of Coefficients.
For example: If a + b.E + c.E^2 = p + q.E + r.E^2, then a = p, b = q and c = r.
Why? Field theory [4] says that a polynomial of degree n has at most n real roots.
(a-p) + (b-q).E + (c-r).E^2 = 0
is identically true for an infinite number of values. This is impossible, unless a-p = 0, b - q = 0 and c - r = 0. The upshot of this is, when we see an expression like a + b.E + c.E^2, the coefficients a, b and c stand out and give a vector space-like or ring structure [5]. Because of this a + b.E + c.E^2 |---> (a,b,c) isomorphism , the role of E seems to be merely formal.
Why doesn't the size of E seem to matter One can use non-standard analysis idea of infinitesimal. However, we can also explain it using epsilon-delta type of analysis.
Remember f ~ g means | f - g | ≤ k.E^n for all E>0 (*)
It is not mentioned here that E is meant to be small. However, if you think a bit further: Let's say you have some bigger number E' > E. Then this would imply that
| f - g | ≤ k.E'^n
If (*) can be satisfied by some E, it can definitely be satisfied by some bigger number. This is like if you can score a goal in soccer using narrow goal posts, you can definitely score with wider goal posts. The question is: can you score even if the the goal posts get narrower and narrower. This is implied in the universal quantifier "all". If (*) can be satisfied by _all_ E, it can be satisfied even with the small Es. — Preceding unsigned comment added by 112.199.248.16 (talk) 23:19, 29 January 2015 (UTC)
- I am not sure I understand why you say that n has to be an odd number for the problem of maxima and minima. Isn't it n=2 also for this problem, similarly to the problem of tangency? Tkuvho (talk) 15:00, 2 February 2015 (UTC)
- Referring to the Taylor series [6], solving for the tangent is equivalent to expanding up to an error to the first power, and all the second and higher powers are brought under K.e^2 i.e. O(e^2) using the big O notation. Finding maxima and minima (in non-degenerate cases) is equivalent to expanding up to e^2 and leaving higher powers under K.e^3, and we set the 1st derivative equal to zero (same as setting the coefficient of the e^1 to zero), and then the + or - sign of the coefficient of e^2 (same as the sign of the second derivative) tells us whether it's a maximum or minimum. I guess we can all stick to n=2 always, but we have to infer the maximality or minimality from the context somehow. There is also this problem of degenerate cases e.g. (sin x)^1000 which Fermat probably didn't deal with — Preceding unsigned comment added by Mathducator (talk • contribs) 16:34, 10 February 2015 (UTC)
References