Functional Logic : Inquiry and Analogy
Author: Jon Awbrey
This report discusses C.S. Peirce's treatment of analogy, placing it in relation to his overall theory of inquiry. The first order of business is to introduce the three fundamental types of reasoning that Peirce adopted from classical logic. In Peirce's analysis both inquiry and analogy are complex programs of reasoning that develop through stages of these three types, although normally in different orders.
Note on notation. The discussion that follows uses minimal negations, expressed as bracketed tuples of the form (e_1, …, e_k), and logical conjunctions, expressed as concatenated tuples of the form e_1 … e_k, as the sole expression-forming operations of a calculus for boolean-valued functions or propositions. The expressions of this calculus parse into data structures whose underlying graphs are called cacti by graph theorists. Hence the name cactus language for this dialect of propositional calculus.
1. Three Types of Reasoning
1.1. Types of Reasoning in Aristotle
1.2. Types of Reasoning in C.S. Peirce
Here we present one of Peirce's earliest treatments of the three types of reasoning, from his Harvard Lectures of 1865 “On the Logic of Science”. It illustrates how one and the same proposition might be reached from three different directions, as the end result of an inference in each of the three modes.
|We have then three different kinds of inference:|
|Deduction or inference à priori,|
|Induction or inference à particularis,|
|Hypothesis or inference à posteriori.|
|(Peirce, CE 1, p. 267).|
|If I reason that certain conduct is wise because it has a character which belongs only to wise things, I reason à priori.|
|If I think it is wise because it once turned out to be wise, that is, if I infer that it is wise on this occasion because it was wise on that occasion, I reason inductively [à particularis].|
|But if I think it is wise because a wise man does it, I then make the pure hypothesis that he does it because he is wise, and I reason à posteriori.|
|(Peirce, CE 1, p. 180).|
Suppose we make the following assignments:
|A||=||“Wisdom”,|
|B||=||“a certain character”,|
|C||=||“a certain conduct”,|
|D||=||“done by a wise man”,|
|E||=||“a certain occasion”.|
Recognizing that a little more concreteness will aid the understanding, let us make the following substitutions in Peirce's example:
|B||=||“Benevolence”,||a certain character,|
|C||=||“Contributes to Charity”,||a certain conduct,|
|E||=||“Earlier today”,||a certain occasion.|
The converging operation of all three reasonings is shown in Figure 2.
The common proposition that concludes each argument is AC, to wit, “contributing to charity is wise”.
Deduction could have obtained the Fact AC from the Rule AB, “benevolence is wisdom”, along with the Case BC, “contributing to charity is benevolent”.
Induction could have gathered the Rule AC, after a manner of saying that “contributing to charity is exemplary of wisdom”, from the Fact AE, “the act of earlier today is wise”, along with the Case CE, “the act of earlier today was an instance of contributing to charity”.
Abduction could have guessed the Case AC, in a style of expression stating that “contributing to charity is explained by wisdom”, from the Fact DC, “contributing to charity is done by this wise man”, and the Rule DA, “everything that is wise is done by this wise man”. Thus, a wise man, who happens to do all of the wise things that there are to do, may nevertheless contribute to charity for no good reason, and even be known to be charitable to a fault. But all of this notwithstanding, on seeing the wise man contribute to charity we may find it natural to conjecture, in effect, to consider it as a possibility worth examining further, that charity is indeed a mark of his wisdom, and not just the accidental trait or the immaterial peculiarity of his character — in essence, that wisdom is the reason he contributes to charity.
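The three routes to AC differ only in which term the two premises share. A minimal Python sketch of that pattern follows, assuming each proposition "XY" is coded as a pair (X, Y) read "X is predicated of Y". The function names are illustrative and are not Peirce's. Only the deductive step is truth-preserving; the other two merely propose a candidate conclusion.

```python
# Each proposition "XY" is coded as the pair (X, Y), read "X is predicated of Y".

def deduction(rule, case):
    """From Rule XY and Case YZ derive Fact XZ (shared middle term Y)."""
    (x, y_rule), (y_case, z) = rule, case
    if y_rule != y_case:
        raise ValueError("rule and case must share a middle term")
    return (x, z)

def induction(fact, case):
    """From Fact XZ and Case YZ propose Rule XY (shared subject Z)."""
    (x, z_fact), (y, z_case) = fact, case
    if z_fact != z_case:
        raise ValueError("fact and case must share a subject")
    return (x, y)

def abduction(fact, rule):
    """From Fact XZ and Rule XY propose Case YZ (shared predicate X)."""
    (x_fact, z), (x_rule, y) = fact, rule
    if x_fact != x_rule:
        raise ValueError("fact and rule must share a predicate")
    return (y, z)

# A = wisdom, B = benevolence, C = contributing to charity,
# D = done by a wise man, E = earlier today.
by_deduction = deduction(("A", "B"), ("B", "C"))   # Rule AB, Case BC
by_induction = induction(("A", "E"), ("C", "E"))   # Fact AE, Case CE
by_abduction = abduction(("D", "C"), ("D", "A"))   # Fact DC, Rule DA
print(by_deduction, by_induction, by_abduction)
```

All three calls return the pair ("A", "C"), the common proposition AC.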
1.3. Comparison of the Analyses
1.4. Aristotle's “Apagogy” : Abductive Reasoning as Problem Reduction
Peirce's notion of abductive reasoning was derived from Aristotle's treatment of it in the Prior Analytics. Aristotle's discussion begins with an example that may appear incidental, but the question and its analysis are echoes of an important investigation that was pursued in one of Plato's Dialogues, the Meno. This inquiry is concerned with the possibility of knowledge and the relationship between knowledge and virtue, or between their objects, the true and the good. It is not just because it forms a recurring question in philosophy, but because it preserves a certain correspondence between its form and its content, that we shall find this example increasingly relevant to our study.
A couple of notes on the reading may be helpful. The Greek text seems to imply a geometric diagram, in which directed line segments AB, BC, AC are used to indicate logical relations between pairs of the terms in A, B, C. We have two options for reading these line labels, either as implications or as subsumptions, as in the following two paradigms for interpretation.
|Read as Implications:|
|AB||=||A ⇐ B,|
|BC||=||B ⇐ C,|
|AC||=||A ⇐ C.|
|Read as Subsumptions:|
|AB||=||A subsumes B,|
|BC||=||B subsumes C,|
|AC||=||A subsumes C.|
Here, “X subsumes Y” means that “X applies to all Y”, or that “X is predicated of all Y”. When there is no danger of confusion we may write this as “X ≥ Y”.
We have Reduction (απαγωγη, abduction): (1) when it is obvious that the first term applies to the middle, but that the middle applies to the last term is not obvious, yet nevertheless is more probable or not less probable than the conclusion; or (2) if there are not many intermediate terms between the last and the middle; for in all such cases the effect is to bring us nearer to knowledge.
(1) E.g., let A stand for “that which can be taught”, B for “knowledge”, and C for “morality”. Then that knowledge can be taught is evident; but whether virtue is knowledge is not clear. Then if BC is not less probable or is more probable than AC, we have reduction; for we are nearer to knowledge for having introduced an additional term, whereas before we had no knowledge that AC is true.
(2) Or again we have reduction if there are not many intermediate terms between B and C; for in this case too we are brought nearer to knowledge. E.g., suppose that D is “to square”, E “rectilinear figure”, and F “circle”. Assuming that between E and F there is only one intermediate term — that the circle becomes equal to a rectilinear figure by means of lunules — we should approximate to knowledge.
|(Aristotle, “Prior Analytics” 2.25)|
The method of abductive reasoning bears a close relation to the sense of reduction in which we speak of one question reducing to another. The question being asked is “Can virtue be taught?” The type of answer which develops is the following.
If virtue is a form of understanding, and if we are willing to grant that understanding can be taught, then virtue can be taught. In this way of approaching the problem, by detour and indirection, the form of abductive reasoning is used to shift the attack from the original question, whether virtue can be taught, to the hopefully easier question, whether virtue is a form of understanding.
The logical structure of the process of hypothesis formation in the first example follows the pattern of “abduction to a case”, whose abstract form is diagrammed and schematized in Figure 5.
o-------------------------------------------------o
|                                                 |
|   T = Teachable                                 |
|     o                                           |
|     ^^                                          |
|     | \                                         |
|     |  \                                        |
|     |   \                                       |
|     |    \                                      |
|     |     \  R U L E                            |
|     |      \                                    |
|     |       \                                   |
|   F |        \                                  |
|     |         \                                 |
|   A |          \                                |
|     |           o U = Understanding             |
|   C |          ^                                |
|     |         /                                 |
|   T |        /                                  |
|     |       /                                   |
|     |      /                                    |
|     |     /  C A S E                            |
|     |    /                                      |
|     |   /                                       |
|     |  /                                        |
|     | /                                         |
|     |/                                          |
|     o                                           |
|   V = Virtue                                    |
|                                                 |
| T = Teachable     (didacton)                    |
| U = Understanding (epistemé)                    |
| V = Virtue        (areté)                       |
|                                                 |
| T is the Major term                             |
| U is the Middle term                            |
| V is the Minor term                             |
|                                                 |
| TV = "T of V" = Fact in Question                |
| TU = "T of U" = Rule in Evidence                |
| UV = "U of V" = Case in Question                |
|                                                 |
| Schema for Abduction to a Case:                 |
|                                                 |
| Fact: V => T?                                   |
| Rule: U => T.                                   |
| ----------------                                |
| Case: V => U?                                   |
o-------------------------------------------------o
Figure 5. Teachability, Understanding, Virtue
1.5. Aristotle's “Paradigm” : Reasoning by Analogy or Example
Here we present Aristotle's treatment of analogical inference or “reasoning by example”. The Greek word for this is παραδειγμα, from which we derive the English word “paradigm”, and it suggests a kind of “side-show”, or a parallel comparison of cases.
We have an Example (παραδειγμα, or analogy) when the major extreme is shown to be applicable to the middle term by means of a term similar to the third. It must be known both that the middle applies to the third term and that the first applies to the term similar to the third.
E.g., let A be “bad”, B “to make war on neighbors”, C “Athens against Thebes”, and D “Thebes against Phocis”. Then if we require to prove that war against Thebes is bad, we must be satisfied that war against neighbors is bad. Evidence of this can be drawn from similar examples, e.g., that war by Thebes against Phocis is bad. Then since war against neighbors is bad, and war against Thebes is against neighbors, it is evident that war against Thebes is bad.
|(Aristotle, “Prior Analytics” 2.24)|
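Aristotle's example can be read as two steps: an inductive step that draws the rule "war against neighbors is bad" from the similar case, then a deductive step that applies the rule to the case in question. The Python sketch below assumes that reading and codes each premise as a (predicate, subject) pair. The function name and argument names are hypothetical, and the sketch records only the form of the argument, not its strength.

```python
def analogy(example_major, example_middle, target_middle):
    """Reasoning by example as an inductive step followed by a deductive step.

    Each premise is a (predicate, subject) pair:
      example_major  = (A, D)  the major term holds of the similar case
      example_middle = (B, D)  the middle term holds of the similar case
      target_middle  = (B, C)  the middle term holds of the case in question
    """
    (a, d1), (b1, d2), (b2, c) = example_major, example_middle, target_middle
    if d1 != d2 or b1 != b2:
        raise ValueError("premises do not share the required terms")
    rule = (a, b1)        # inductive step: A holds of B
    conclusion = (a, c)   # deductive step: A holds of C
    return rule, conclusion

rule, conclusion = analogy(
    ("bad", "Thebes against Phocis"),
    ("war on neighbors", "Thebes against Phocis"),
    ("war on neighbors", "Athens against Thebes"),
)
print(rule)
print(conclusion)
```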
1.6. Peirce's Formulation of Analogy
Note. A few changes in Peirce's notation have been made to facilitate comparison between the two versions.
Version 1. “On the Natural Classification of Arguments” (1867)
The formula of analogy is as follows: are taken at random from such a class that their characters at random are such as
Such an argument is double. It combines the two following:
Owing to its double character, analogy is very strong with only a moderate number of instances.
|(Peirce, CP 2.513; CE 2, 46–47)|
The form of this analysis is illustrated in Figure 7.
Version 2. “A Theory of Probable Inference” (1883)
The formula of the analogical inference presents, therefore, three premisses, thus: are a random sample of some undefined class, of whose characters are samples,
We have evidently here an induction and an hypothesis followed by a deduction; thus:
|(Peirce, CP 2.733)|
The form of this analysis is illustrated in Figure 8.
1.7. Dewey's “Sign of Rain” : An Example of Inquiry
To illustrate the place of the sign relation in inquiry we begin with Dewey's elegant and simple example of reflective thinking in everyday life.
A man is walking on a warm day. The sky was clear the last time he observed it; but presently he notes, while occupied primarily with other things, that the air is cooler. It occurs to him that it is probably going to rain; looking up, he sees a dark cloud between him and the sun, and he then quickens his steps. What, if anything, in such a situation can be called thought? Neither the act of walking nor the noting of the cold is a thought. Walking is one direction of activity; looking and noting are other modes of activity. The likelihood that it will rain is, however, something suggested. The pedestrian feels the cold; he thinks of clouds and a coming shower.
|(Dewey 1991, 6–7)|
In this narrative we can identify the characters of the sign relation as follows: coolness is a Sign of the Object rain, and the Interpretant is the thought of the rain's likelihood. In his 1910 description of reflective thinking Dewey distinguishes two phases, “a state of perplexity, hesitation, doubt” and “an act of search or investigation” (Dewey 1991, 9), comprehensive stages which are further refined in his later model of inquiry. In this example, reflection is the act of the interpreter which establishes a fund of connections between the sensory shock of coolness and the objective danger of rain, by way of his impression that rain is likely. But reflection is more than irresponsible speculation. In reflection the interpreter acts to charge or defuse the thought of rain (the probability of rain in thought) by seeking other signs which this thought implies and evaluating the thought according to the results of this search.
Figure 9 illustrates Dewey's “Sign of Rain” example, tracing the structure and function of the sign relation as it informs the activity of inquiry, including both the movements of surprise explanation and intentional action. The dyadic faces of the sign relation are labeled with just a few of the loosest terms that apply, indicating the “significance” of signs for eventual occurrences and the “correspondence” of ideas with external orientations. Nothing essential is meant by these dyadic role distinctions, since it is only in special or degenerate cases that their shadowy projections can maintain enough information to determine the original sign relation.
If we follow this example far enough to consider the import of thought for action, we realize that the subsequent conduct of the interpreter, progressing up through the natural conclusion of the episode — the quickening steps, seeking shelter in time to escape the rain — all of these acts form a series of further interpretants, contingent on the active causes of the individual, for the originally recognized signs of rain and for the first impressions of the actual case. Just as critical reflection develops the associated and alternative signs which gather about an idea, pragmatic interpretation explores the consequential and contrasting actions which give effective and testable meaning to a person's belief in it.
Figure 10 charts the progress of inquiry in this example according to the three stages of reasoning identified by Peirce.
- Abduction. The first, faltering step into the cycle of inquiry is taken through the flexion of abductive reasoning. The fact C ⇒ A, the coolness of the air in the pedestrian's current situation, brings into play from his worldly experience (or from other kinds of background knowledge) the rule B ⇒ A, that a chill in the air is a feature of situations that betoken rain. This fact and this rule, working in tandem, precipitate a plausible explanation for the observed phenomena. The pedestrian abduces the case C ⇒ B, that the current situation is one that bodes rain.
- Deduction. …
- Induction. …
In this analysis of the first steps of Inquiry, we have a complex or a mixed form of inference that can be seen as taking place in two steps:
1. The first step is an Abduction that abstracts a Case from the consideration of a Fact and a Rule.
|Fact:||C ⇒ A,||In the Current situation the Air is cool.|
|Rule:||B ⇒ A,||Just Before it rains, the Air is cool.|
|Case:||C ⇒ B,||The Current situation is just Before it rains.|
2. The final step is a Deduction that admits this Case to another Rule and so arrives at a novel Fact.
|Case:||C ⇒ B,||The Current situation is just Before it rains.|
|Rule:||B ⇒ D,||Just Before it rains, a Dark cloud will appear.|
|Fact:||C ⇒ D,||In the Current situation, a Dark cloud will appear.|
This is nowhere near a complete analysis of the Rainy Day inquiry, even insofar as it might be carried out within the constraints of the syllogistic framework, and it covers only the first two steps of the relevant inquiry process, but maybe it will do for a start.
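The two steps differ in logical force: the abductive step offers a hypothesis that its premises do not guarantee, while the deductive step is valid. A small truth-table check in Python makes the difference concrete. It is a sketch under the simplifying assumption that A, B, C, D are treated as four independent boolean features of a single arbitrary situation; the helper names are illustrative.

```python
from itertools import product

def implies(p, q):
    """Material implication on booleans."""
    return (not p) or q

def entails(premises, conclusion):
    """True if every assignment to A, B, C, D that satisfies all the
    premises also satisfies the conclusion."""
    for a, b, c, d in product([False, True], repeat=4):
        env = {"A": a, "B": b, "C": c, "D": d}
        if all(p(env) for p in premises) and not conclusion(env):
            return False
    return True

fact_CA = lambda e: implies(e["C"], e["A"])   # in the Current situation the Air is cool
rule_BA = lambda e: implies(e["B"], e["A"])   # just Before it rains, the Air is cool
case_CB = lambda e: implies(e["C"], e["B"])   # the Current situation is just Before it rains
rule_BD = lambda e: implies(e["B"], e["D"])   # just Before it rains, a Dark cloud will appear
fact_CD = lambda e: implies(e["C"], e["D"])   # in the Current situation, a Dark cloud will appear

abduction_valid = entails([fact_CA, rule_BA], case_CB)
deduction_valid = entails([case_CB, rule_BD], fact_CD)
print(abduction_valid, deduction_valid)
```

The abductive check fails because a situation can be cool without being just before rain, which is why the abduced case still has to be tested by its deduced consequences.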
2. Functional Conception of Quantification Theory
Up till now quantification theory has been based on the assumption of individual variables ranging over universal collections of perfectly determinate elements. Merely to write down quantified formulas like ∀x∈X f(x) and ∃x∈X f(x) involves a subscription to such notions, as shown by the membership relations invoked in their indices. Reflected on pragmatic and constructive principles, however, these ideas begin to appear as problematic hypotheses whose warrants are not beyond question, projects of exhaustive determination that overreach the powers of finite information and control to manage. Therefore, it is worth considering how we might shift the scene of quantification theory closer to familiar ground, toward the predicates themselves that represent our continuing acquaintance with phenomena.
Higher Order Propositional Expressions
By way of equipping this inquiry with a bit of concrete material, I begin with a consideration of higher order propositional expressions, in particular, those that stem from the propositions on 1 and 2 variables.
Higher Order Propositions and Logical Operators (n = 1)
A higher order proposition is, very roughly speaking, a proposition about propositions. If the original order of propositions is a class of indicator functions f : X → B, where B = {0, 1}, then the next higher order of propositions consists of maps of the type m : (X → B) → B.
For example, consider the case where X = B. Then there are exactly four propositions f : B → B and exactly sixteen higher order propositions that are based on this set, all bearing the type m : (B → B) → B.
Table 11 lists the sixteen higher order propositions about propositions on one boolean variable, organized in the following fashion: Columns 1 and 2 form a truth table for the four f : B → B, turned on its side from the way that one is most likely accustomed to see truth tables, with the row leaders in Column 1 displaying the names of the functions f_i, for i = 1 to 4, while the entries in Column 2 give the values of each function for the argument values that are listed in the corresponding column head. Column 3 displays one of the more usual expressions for the proposition in question. The last sixteen columns are topped by a collection of conventional names for the higher order propositions, also known as the measures m_j, for j = 0 to 15, where the entries in the body of the Table record the values that each m_j assigns to each f_i.
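The counts of four propositions and sixteen measures can be checked by direct enumeration. The Python sketch below is one way to do it, assuming propositions on one boolean variable are stored as value pairs and measures as dictionaries. The enumeration order is whatever `itertools.product` yields and is not claimed to match the ordering of Table 11; the name `just_true` is illustrative.

```python
from itertools import product

B = (0, 1)

# A proposition f : B -> B is stored as its value pair (f(0), f(1)).
propositions = list(product(B, repeat=2))

# A measure m : (B -> B) -> B is stored as a dict from propositions to values.
measures = [dict(zip(propositions, values))
            for values in product(B, repeat=len(propositions))]

# Readable labels for the four propositions on the variable x.
names = {(0, 0): "constant 0", (0, 1): "x", (1, 0): "not x", (1, 1): "constant 1"}

# One of the sixteen: the measure that accepts only the constantly true proposition.
just_true = {f: int(f == (1, 1)) for f in propositions}

print(len(propositions), len(measures), just_true in measures)
```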
I am going to put off explaining Table 12, which presents a sample of what I call interpretive categories for higher order propositions, until after we get beyond the 1-dimensional case, since these lower dimensional cases tend to be a bit condensed or degenerate in their structures, and a lot of what is going on here will almost automatically become clearer as soon as we get even two logical variables into the mix.
|Just false||Nothing exists|
|Everything is||is linear|
|is not uniform||is informed|
|Not just true|
|is uniform||is not informed|
|Something is not||is not linear|
|Not just not|
|Not just false||Something exists|
Higher Order Propositions and Logical Operators (n = 2)
By way of reviewing notation and preparing to extend it to higher order universes of discourse, let us first consider the universe of discourse based on two logical features or boolean variables.
The universe of discourse consists of two parts, a set of points and a set of propositions.
The points of form the space:
Each point in may be indicated by means of a singular proposition, that is, a proposition that describes it uniquely. This form of representation leads to the following enumeration of points:
Each point in may also be described by means of its coordinates, that is, by the ordered pair of values in that the coordinate propositions and take on that point. This form of representation leads to the following enumeration of points:
The propositions of form the space:
As always, it is frequently convenient to omit a few of the finer markings of distinctions among isomorphic structures, so long as one is aware of their presence and knows when it is crucial to call upon them again.
The next higher order universe of discourse that is built on is which may be developed in the following way. The propositions of become the points of and the mappings of the type become the propositions of In addition, it is convenient to equip the discussion with a selected set of higher order operators on propositions, all of which have the form
To save a few words in the remainder of this discussion, I will use the terms measure and qualifier to refer to all types of higher order propositions and operators. To describe the present setting in picturesque terms, the propositions of the universe of discourse may be regarded as a gallery of sixteen venn diagrams, while the measures are analogous to a body of judges or a panel of critical viewers, each of whom evaluates each of the pictures as a whole and reports the ones that find favor or not. In this way, each judge partitions the gallery of pictures into two aesthetic portions, the pictures that the judge likes and the pictures that the judge dislikes.
There are 2^16 = 65536 measures of the type m : (B × B → B) → B, where B = {0, 1}. Table 13 introduces the first 24 of these measures in the fashion of the higher order truth table that I used before. The column headed m_j shows the values of the measure m_j on each of the propositions f_i, for j = 0 to 23, with blank entries in the Table being optional for values of zero. The arrangement of measures that continues according to the plan indicated here is referred to as the standard ordering of these measures. In this scheme of things, the index j of the measure m_j is the decimal equivalent of the bit string that is associated with m_j's functional values, which can be obtained in turn by reading the j-th column of binary digits in the Table as the corresponding range of boolean values, taking them up in the order from bottom to top.
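The standard ordering amounts to a binary encoding of each measure's column of values. The following Python sketch shows the conversion in both directions, assuming the rows of the table run from the proposition of index 0 at the top to the proposition of index 15 at the bottom. That row order and the function names are assumptions made for illustration.

```python
def measure_index(column):
    """Index of a measure in the standard ordering.

    column[i] is the measure's value (0 or 1) on the proposition of index i,
    listed from the top of the table to the bottom.
    """
    bits = "".join(str(b) for b in reversed(column))  # read bottom to top
    return int(bits, 2)

def measure_column(index, n_props=16):
    """Inverse of measure_index: the column of values for a given index."""
    return [(index >> i) & 1 for i in range(n_props)]

number_of_measures = 2 ** 16

column_23 = measure_column(23)
print(number_of_measures, column_23[:5], measure_index(column_23))
```

Under this reading the measure of index 1 accepts only the proposition of index 0, and the measure of index 23 (binary 10111) accepts the propositions of index 0, 1, 2, and 4.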
We now examine measures at the high end of the standard ordering. Instrumental to this purpose we define a couple of higher order operators, both symbolized by cursive upsilon characters and referred to as the absolute and relative umpire operators, respectively. If either one of these operators is defined in terms of more primitive notions then the remaining operator can be defined in terms of the one first established.
Given an ordered pair of propositions as arguments, the relative operator reports the value 1 if the first implies the second, otherwise 0.
To express it another way:
In writing this, however, it is important to notice that the appearing on the left side and the appearing on the right side of the logical equivalence have different meanings. Filling in the details, we have:
Writing types as subscripts and using the fact that it is possible to express this a little more succinctly as follows:
Finally, it is often convenient to write the first argument as a subscript, hence
As a special application of this operator, we next define the absolute umpire operator, also called the umpire measure. This is a higher order proposition obtained from the relative operator by fixing its first argument at the constant proposition 1. In the defining relation, the subscript 1 on the left and the argument 1 on the right both refer to the constant proposition 1. In most contexts where the operator is actually applied the subscript 1 is safely omitted, since the number of arguments indicates which type of operator is intended. Thus, we have the following identities and equivalents:
The umpire measure is defined at the level of truth functions, but can also be understood in terms of its implied judgments at the syntactic level. Interpreted this way, it recognizes theorems of the propositional calculus over the universe of discourse, giving a score of 1 to tautologies and a score of 0 to everything else, regarding all contingent statements as no better than falsehoods.
One remark in passing for those who might prefer an alternative definition. If we had originally taken to mean the absolute measure, then the relative version could have been defined as
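The interdefinability of the two umpire operators can be checked by brute force on the two-variable universe. The Python sketch below assumes propositions are ordinary functions of two boolean arguments. The names `relative_umpire`, `absolute_umpire`, and `relative_from_absolute` are illustrative stand-ins for the upsilon notation, which did not survive in this copy of the text.

```python
from itertools import product

POINTS = list(product((0, 1), repeat=2))   # the four points (u, v)

def relative_umpire(e, f):
    """1 if e implies f at every point of the universe, else 0."""
    return int(all((not e(u, v)) or f(u, v) for u, v in POINTS))

def absolute_umpire(f):
    """1 if f is true at every point, else 0: the relative operator
    with its first argument fixed at the constant proposition 1."""
    return relative_umpire(lambda u, v: 1, f)

def relative_from_absolute(e, f):
    """Alternative route: apply the absolute operator to 'e implies f'."""
    return absolute_umpire(lambda u, v: int((not e(u, v)) or f(u, v)))

def from_table(bits):
    """Build a proposition from its four values on POINTS."""
    table = dict(zip(POINTS, bits))
    return lambda u, v: table[(u, v)]

ALL_PROPOSITIONS = [from_table(bits) for bits in product((0, 1), repeat=4)]

definitions_agree = all(
    relative_umpire(e, f) == relative_from_absolute(e, f)
    for e in ALL_PROPOSITIONS for f in ALL_PROPOSITIONS
)
print(definitions_agree)
```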
Measure for Measure
Define two families of measures:
by means of the following formulas:
The values of the sixteen on each of the sixteen boolean functions are shown in Table 14. Expressed in terms of the implication ordering on the sixteen functions, says that is above or identical to in the implication lattice, that is, in the implication ordering.
The values of the sixteen on each of the sixteen boolean functions are shown in Table 15. Expressed in terms of the implication ordering on the sixteen functions, says that is below or identical to in the implication lattice, that is, in the implication ordering.
Applied to a given proposition the qualifiers and tell whether rests or respectively, in the implication ordering. By way of example, let us trace the effects of several such measures, namely, those that occupy the limiting positions of the Tables.
Thus, is a totally indiscriminate measure, one that accepts all propositions whereas and are measures that value the constant propositions and respectively, above all others.
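The limiting cases just described can be reproduced numerically. Since the symbols for the two families are missing from this copy, the Python sketch below calls them `alpha` and `beta` and assumes the definitions suggested by the surrounding prose: `alpha(i, f)` is 1 when the proposition of index i implies f, and `beta(i, f)` is 1 when f implies the proposition of index i. It further assumes that each proposition is coded as a 4-bit truth table, so that index 0 is the constant proposition 0 and index 15 is the constant proposition 1.

```python
N = 16          # propositions on two variables, coded as truth tables 0..15
MASK = 0b1111   # the four points of the universe

def implies_everywhere(e, f):
    """Return 1 if truth table e implies truth table f at every point, else 0."""
    return int((e & ~f & MASK) == 0)

def alpha(i, f):
    """Assumed definition: 1 if f_i implies f, so f sits at or above f_i."""
    return implies_everywhere(i, f)

def beta(i, f):
    """Assumed definition: 1 if f implies f_i, so f sits at or below f_i."""
    return implies_everywhere(f, i)

alpha_table = [[alpha(i, f) for f in range(N)] for i in range(N)]
beta_table = [[beta(i, f) for f in range(N)] for i in range(N)]

accepted_by_alpha_15 = [f for f in range(N) if alpha(15, f)]
accepted_by_beta_0 = [f for f in range(N) if beta(0, f)]
print(sum(alpha_table[0]), accepted_by_alpha_15, accepted_by_beta_0)
```

With these assumptions the measure `alpha(0, ·)` accepts all sixteen propositions, `alpha(15, ·)` accepts only the constant proposition 1, and `beta(0, ·)` accepts only the constant proposition 0.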
Finally, in conformity with the use of the fiber notation to indicate sets of models, it is natural to use notations like:
to denote sets of propositions that satisfy the umpires in question.
Extending the Existential Interpretation to Quantificational Logic
Previously I introduced a calculus for propositional logic, fixing its meaning according to what C.S. Peirce called the existential interpretation. As far as it concerns propositional calculus this interpretation settles the meanings that are associated with merely the most basic symbols and logical connectives. Now we must extend and refine the existential interpretation to comprehend the analysis of quantifications, that is, quantified propositions. In doing so we recognize two additional aspects of logic that need to be developed, over and above the material of propositional logic. At the formal extreme there is the aspect of higher order functional types, into which we have already ventured a little above. At the level of the fundamental content of the available propositions we have to introduce a different interpretation for what we may call elemental or singular propositions.
Let us return to the 2-dimensional case. In order to provide a bridge between propositions and quantifications it serves to define a set of qualifiers that have the following characters:
Intuitively, these operators may be thought of as qualifying propositions according to the elements of the universe of discourse that each proposition positively values. Taken together, these measures provide us with the means to express many useful observations about the propositions of that universe, and so they mediate a subtext that takes place within the higher order universe of discourse. Figure 16 summarizes the action of these operators on the propositions within it.
Application of Higher Order Propositions to Quantification Theory
Our excursion into the vastening landscape of higher order propositions has finally come round to the stage where we can bring its returns to bear on opening up new perspectives for quantificational logic.
There is a question arising next that is still experimental in my mind. Whether it makes much difference from a purely formal point of view is not a question I can answer yet, but it does seem to aid the intuition to invent a slightly different interpretation for the two-valued space that we use as the target of our basic indicator functions. Therefore, let us declare a type of existential-valued functions whose target is a couple of values that we interpret as indicating whether or not anything exists in the cells of the underlying universe of discourse, venn diagram, or other domain. As usual, let us not be too strict about the coding of these functions, reverting to binary codes whenever the interpretation is clear enough.
With this interpretation in mind we note the following correspondences between classical quantifications and higher order indicator functions:
The following Tables develop these ideas in more detail.
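One way to picture such correspondences is to treat a proposition on two variables u and v as a record of which cells of the venn diagram are occupied, and to read the classical forms off qualifiers that test single cells. The Python sketch below follows that idea. The variable names u and v, the helper `likes`, and the particular reading of the four categorical forms (universal forms taken without existential import) are assumptions made for illustration, not a transcription of the missing tables.

```python
from itertools import product

def likes(i, j):
    """Qualifier reporting whether a proposition f values the point (u, v) = (i, j)."""
    return lambda f: int(f(i, j))

l00, l01, l10, l11 = likes(0, 0), likes(0, 1), likes(1, 0), likes(1, 1)

def all_u_is_v(f):
    """Nothing exists in the cell where u holds and v fails."""
    return 1 - l10(f)

def no_u_is_v(f):
    """Nothing exists in the cell where u and v both hold."""
    return 1 - l11(f)

def some_u_is_v(f):
    """Something exists in the cell where u and v both hold."""
    return l11(f)

def some_u_is_not_v(f):
    """Something exists in the cell where u holds and v fails."""
    return l10(f)

# Example: cells are occupied exactly where u implies v.
occupied = lambda u, v: int((not u) or v)
report = (all_u_is_v(occupied), no_u_is_v(occupied),
          some_u_is_v(occupied), some_u_is_not_v(occupied))
print(report)
```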
Appendix : Generalized Umpire Operators
In order to get a handle on the space of higher order propositions and eventually to carry out a functional approach to quantification theory, it serves to construct some specialized tools. Specifically, I define a higher order operator called the umpire operator, which takes up to three propositions as arguments and returns a single truth value as the result. Formally, this so-called multi-grade property of the umpire operator can be expressed as a union of function types, in the following manner:
In contexts of application the intended sense can be discerned by the number of arguments that actually appear in the argument list. Often, the first and last arguments appear as indices, the one in the middle being treated as the main argument while the other two arguments serve to modify the sense of the operation in question. Thus, we have the following forms:
The intention of this operator is that we evaluate the main proposition on each model of the proposition in the lower index and combine the results according to the method indicated by the connective parameter in the upper index. In principle, the upper index might specify any connective on as many arguments as there are points in the universe of discourse, but usually we have in mind a much simpler form of combination, most often either collective products or collective sums. By convention, each of the accessory indices is assigned a default value that is understood to be in force when the corresponding argument place is left blank, specifically, the constant proposition 1 for the lower index and the continued conjunction or continued product operation for the upper index. Taking the upper default value gives license to the following readings:
This means that the operator returns 1 if and only if the main proposition holds for all models of the lower-index proposition. In propositional terms, this is tantamount to the assertion that the lower-index proposition implies the main proposition.
Throwing in the lower default value permits the following abbreviations:
This means that the operator returns 1 if and only if the main proposition holds for the whole universe of discourse in question, that is, if and only if it is the constantly true proposition. The ambiguities of this usage are not a problem so long as we distinguish the context of definition from the context of application and restrict all shorthand notations to the latter.
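The multi-grade operator with its two defaults can be sketched as a single Python function with optional arguments. The parameter names `q`, `p`, `combine`, and `k` are hypothetical, chosen here for the main proposition, the lower index, the upper index, and the number of boolean variables. Python's built-in `all` plays the part of the collective product and `any` the part of the collective sum.

```python
from itertools import product

def umpire(q, p=None, combine=all, k=2):
    """Evaluate q on each model of p and combine the results.

    q       : the main proposition, a function of k boolean arguments
    p       : the lower index; None stands for the constant proposition 1
    combine : the upper index; all (conjunction) by default, any for sums
    """
    points = product((0, 1), repeat=k)
    models = [x for x in points if p is None or p(*x)]
    return int(combine(bool(q(*x)) for x in models))

is_tautology = umpire(lambda u, v: u or not u)                      # both defaults
is_implied = umpire(lambda u, v: u, p=lambda u, v: u and v)         # lower index given
holds_on_all = umpire(lambda u, v: v, p=lambda u, v: u)             # v on every model of u
holds_on_some = umpire(lambda u, v: v, p=lambda u, v: u, combine=any)
print(is_tautology, is_implied, holds_on_all, holds_on_some)
```

When the lower-index proposition has no models, `all` over the empty collection yields 1, which agrees with the convention that a false proposition implies every proposition.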
- Aristotle, “Prior Analytics”, Hugh Tredennick (trans.), in Aristotle, Volume 1, Loeb Classical Library, William Heinemann, London, UK, 1938.
Inquiry and Analogy
|Author:||Jon Awbrey||November 1, 1995|
|Course:||Engineering 690, Graduate Project||Winter Term, January 1995|
|Supervisors:||F. Mili & M.A. Zohdy||Oakland University|
| Version: Draft 3.25 | Created: 01 Jan 1995 | Relayed: 01 Nov 1995 | Revised: 24 Dec 2001 | Revised: 12 Mar 2004
Inquiry Driven Systems