DRDODDS Web Page in M/X

Fuzzy and Crisp Sets, Usuality


In most everyday kinds of thinking, and linguistic reflections of that, people use crisp sets to categorize things. Being a member of a crisp set is an all or nothing affair. A woman is pregnant or she isn't, she's never "partly pregnant" or " a little pregnant".

Thinking with crisp sets makes everything simpler, because something either is or is not a member of a crisp set. They can be used to represent black and white conceptual thinking. Oftentimes too, when something is a member of a given crisp set it is then (at the same time) not a member of any other crisp set. Again this simplifies the logic used with this kind of thinking process. Linguistic constructions which reflect this kind of thinking can be quite useful. Especially when crisp categories are used.

Fuzziness, or fuzzy sets are used frequently also by most people in everyday thinking and their linguistic constructs reflect them as well.

In a society (like that of North America) where science and technology are raised nearly to the level of a religion, it is an implicit heinous crime to use terms like "fuzzy" because it seems to run counter to the cherished principle of exactitude with which engineering has made such great strides in the last two or three centuries.

In this page the term fuzzy is not used pejoratively, rather, it is used as the "father of fuzzy sets" (Lotfi Zadeh) himself has described in many technical papers and presentations, as have a multitude of authors since his initial development of the domain.

You wont have to be a rocket scientist to follow the math involved (there has to be math and algorithms and such, we are dealing with computers after all). While entire books filled with esoteric looking mathematics have been written about the topic of fuzzy sets we will simply refer you to them, rather than subject you to the lashings they present to the more casual reader.



Fuzzy sets are very relevant to ordinary English speakers

Perhaps a collection of examples of fuzzy sets as they appear in everyday language would be helpful.

Just as we dont have to be able to write down the grammar of English to speak it we also do not have to be mathematicians to use fuzzy terms in our everyday English. In fact almost no one, who speaks English fairly fluently, is able to write down the entire grammar of the English language, or even one fully describing his own subset of it.

Birds manage to fly without 4 years of university aeronautics training.

Before we launch into how to convey fuzzy set usage in natural linguistics to the computer, let's look at a few linguistic examples of fuzzy sets. The casual reader will be able to relate to these more easily than the math.


These were some examples of relatively everyday linguistic expressions which couched non-crisp set membership concepts into easy to understand forms.

Let us now examine the fuzziness contained in each sentence in turn.

The examination will be conducted in both a linguistic and a mathematical manner.

Keep in mind that categorization is a very important functionality of human thinking and language usage, and that an awareness of Rosch' Theory of Prototypes stands one in good stead for the appreciation of what cognitive underpinnings we humans draw upon in that thinking and natural language use.



In the case of the first statement

In the case of the first statement

  • "How close is it to dinnertime, I'm hungry?"
  • we see that the word "close" is the fuzzy component. Close is a member of the category "spatiality".

    Now spatiality is nothing more than a collection of linguistic representations which refer to the various concepts to do with existential (or virtual) three dimensionality.

    Three dimensionality refers to volumes, rather than planes (surfaces) or lines. As a conscious adult part of our perception of ourselves is that "I" am constituted by a continuous surface which is the outside of an enclosed volume. I call this volume "me". Everything on one side of this two dimensional continuous surface is "me" ( usually something "fleshy, boney, or sinuey"), everything on the other side of this "me" surface is "not me". There is a subjective sense of "space" largely dependent on how we perceive our"height" vis-a-vis our surroundings, and also the length of our limbs, our arms and our legs.

    Objects, and people, which (who) reach up to "my waist" are "short" or "small", while objects which tower over my head are "tall" or "big". Objects which are within immediate reach of my outstretched arm are "close" (to me), and objects which are progressively multiple arm-lengths distant from "me" are "far away". An object which is but a few steps with my feet away from where "I" am standing is "close" while an object which is many steps is "far".

    This subjectivity is called situated, and is an important concept for you to remember. Natural language usage is from/between situated, conscious sources, not unconnected ethereal capabilities such as standard computer programs.

    Notice how wonderfully well this measurement system is universal for all people. It doesnt matter if the absolute length of my arm is x centimetres and that of yours is y centimetres because these linguistic spatial terms are all relative or subjective (self-referential in a way).

    Thus "near" to a baby means the same thing to him as "near" does to you, even though your arm is likely much longer than his.

    Once a language-using system (whether a person or a computer) has been able to create the "thinking" facility called projection, the person or system is able to make use of a very powerful tool. Projection, sometimes called "extension", is the "mental" capability whereby the thinker creates, in his mind's eye, a small "theatre in the round" where he mentally maps his perceptions of the world into a time varying subjective three dimensional space.

    [In some (admittedly crude) ways computers are already able to compute the translation of sensor data, such as a tv camera image, into an eidetic image within the confines of a three dimensional coordinate matrix. There are industrial robots,equipped with a tv camera, that are able to locate simple objects on a simple surface solely by "vision" through the camera. They have been programmed such that they can examine the picture with a computer program and extract some information from it, things like the edges of the objects in the picture. The program is able to "map" these edges into a three dimensional co-ordinate space in a mathematical "volume" inside the computer (in it's memory). It is able to locate other objects there, according to their visual edges and the various angles and rotations of the camera. All this information is put together to allow the system to "place" "seen" objects into the "visualization space". By examining the visualization space, rather than laboriously rescanning half the real world with its camera, the robot system is able to use the visualization space for planning actions, in particular, seeing where various objects "are" viv-a-vis their mapping.]

    In humans, as we progress from infancy to toddler, we develop the mental facility to use our "situated" or personally subjective "me" space to relate to a bigger world. In the "theatre of the round" of our mind's eye we place a "me", but we knowingly map some real world thing into the place where "me" is. Now we can use all our skills of relating to (spatialized) "me" on the rest of the universe! Metaphor. Analogy.

    Further down in this section we'll look at the (simple) mathematics of these linguistic terms.



    In the case of the second statement


    In the case of the second statement

  • "We told them that we'd meet them around 9 o'clock."
  • In this sentence "around" is the fuzzy component.Around is also a member of the category "spatiality". It is in fact related to "close" that we saw in the first sentence. It shares the feature of relating a particular subset of "all spatiality" (such as the English word "everywhere"), with "close" in the first sentence. What is different about "around", from "close", is that "around" is used to signify a different particular subset than "close" does. The particular subset that "around" implies is that of proximity but in (possibly) two or three dimensional axes. "Around" carries with its meaning the feature of centrality coupled with proximity. "Close" is intended to denote proximity but there is no intention to convey any sense of dimensionality, whether it be 2d or 3d. "Around" is used with the intention of suggesting a "set" which might be represented visually by a volume (or a plane) within which at some location is a point, which is used as the metaphorical "center" or "zero axis" for representing the centrality.

    As a simple first approximation, one might use a spheroid (sphere) to represent the edge or border of the set (for "around"). This notion actually corresponds well with how most people describe their use of "around". Well, this simple circle/sphere represents "inside" and "outside" the set. Or, "contained in" or not "contained in", the set.This is like the use of the Venn diagram.

    Well, of course, few people have such a black and white concept of "around" as this. First order predicate logic loves it, and that's why fopl does NOT adequately describe how people USE natural language.

    We natural language users have a more sophisticated usage of "around". Let's add some sophistication to our previous model. Let us say that the notion of around can be visually represented by some sphere, centered on our previous point of centrality AND which has a gradient, running from the point to the outer shell. Perhaps if you think of a series of concentric spheres radiating out from the center point. Each sphere is infinitely thin and is coloured some colour of grey. Let's say that the center point itself is the brightest possible grey (which some folks call "white"). Each successive infinitely thin sphere away from the center is slightly darker than the one it contains. At the place where the shell exists, the outer part of our original sphere, is darkest in (grey) colour.

    Now we see that by refering to a particular level of grey (or "brightness") we can signify a distance from the center point. If it helps you visualize more easily, think of a ruler placed against the center point, with the zero location of the ruler at the center point.Now we can say that a distance from the center point is at so many inches or millimetres on the ruler. We thus have the possibility of saying that some spot is,say, 12.57 inches away from the center point. If the ruler is a yardstick then we see that 12 inches is 1/3 of the distance from the center point along the ruler towards the end of the ruler. For most of us 1/3 is "small", 1/2 is "even" and 2/3 is "large". Since we are talking about "around" we translate "small", and "large" into distance terms, and we get "near" and "far", respectively.



    In the case of the third statement


    In the case of the third statement

  • "Just before you start that let's see what almost happened."
  • In this statement we see "just, before" and "almost" as the fuzzy components. "Before" and "almost" are explained to the computer (mathematically) in a way which is very similar to "close" in statement one above. That's because these two terms, which are fuzzy, are used by natural language speakers to convey yet further spatial metaphors (just as "close" does). English usage is ripe with spatial metaphors because they relate our common or shared conceptualizations of space (the three dimensions x,y,z) and time. The conceptualizations of space and time are common to all societies who are language using. The details of their linguistic constructs of these shared concepts varies.

    The individuals in these societies are all (normally) equipped with arms, legs, heads and bodies pretty much like ours. Since their thinking and language usage is also situated they too relate to the universe in terms of the length of their arms, their height, the length of their stride and so on. It is not surprising that the language they invent and use reflects the same subjective realities, much as English does.

    "Just" may be explained to a computer in the following way (via spatial metaphor, using mathematical representation to convey the spatiality.) In the context of the third statement, "just" may be equitably represented by "delta", a mathematical construct, which means an unstated very small magnitude value. In English, some people might use the expression "a little bit" to similar advantage as "delta".

    The statement then translates as: (using the real number line visualization), that range of number values falling within the "width" of the delta value, which is itself located leftward from the foot of the function graphed by "before". "Before" is a graph (function) which maps the degree of membership of values along the abscissa (x-line). By definition, "before" 's "rightmost extent" occurs at the point (x value) on the real number line where centrality is located. (If the centrality point is at, say, x=500, then "before" is a range of x values less than 500, extending "leftward" (aka toward zero) to a value of x such that the membership function for "before" reaches zero. Then "just" is a small range of x values less than this location (where (the "before") membership is zero).



    In the case of the fourth statement


    In the case of the fourth statement

  • "When I was small he seemed nearly as tall as a giant."
  • This might be the recollection of someone seeing the relative, or subjective, height of his father, when he (the recollector) was a very young person. "Small" and "nearly as tall as" are both fuzzy components. That the recollector perceived himself/herself as "small" was relative to the size of his/her father and mother. But not the cat. Even babies are not "small" relative to (house) cats.

    The perception of smallness was based upon the "mentalism" (operation of the mind) which recognized that the general volume filled by himself/herself was clearly much less in magnitude than the volume filled by "dad" or "mom". The same argument goes for the kid's height versus the height of "dad" and "mom". We adults take for granted that there is some "device" (ie mind) which can apprehend that "people" "have" "volumes" (or "height"), and that some magnitudes are "less" than others. Perhaps at that age he/she perceived himself/herself to be "the same size as Ralph" (the family's red and white handkerchief wearing labrador dog).

    Likewise, a "giant" would be several to many times his/her height (at that age), and since "dad" was several times his/her height it wasnt too much of a stretch to perceive him as being only a few kid heights shorter than a giant, or "nearly(tall as(x))". This could be translated as "height not quite = x". Alternatively, "as tall as" means"height =", x.
    nearly(height = x) could be restated as height = x - delta. another restatement, or equivalency, is height = just (less than (x)). but notice that this last equivalency says something slightly different. This is the power of language, and sometimes its drawback.



    In the case of the fifth statement


    In the case of the fifth statement

  • "A big dog is smaller than a small house."
  • You must remember that computers do not have "experiences" like we have had. They have not experienced dogs or houses. A mindless parsing of the sentence (which is what most computers do after all) provides us with: indefinite count of big(x) = less than indefinite count of small(x).

    Isn't that meaningful to you? No? Well that is how computers "see" natural language. And natural language is just as unclear to them as that statement conveys. They have no "feel" for language because they are not "situated", they do not "experience". They are like ghost minds with no instrumentality to sense from. They are less than that because (in my opinion) they don't (yet) have minds. They are less organized than that. (in the sophistication/complexity of their processing).

    Now we'll return to something that is meaningful in regular English. The key to comprehending the fifth statement is that the receiver of the statement "knows" about dogs, and houses. That knowledge typically is like the following. For dogs, we know their various shapes and sizes, their colours, the fur texture and length (approximately of course), their vocalizations, and their favourite fire hydrants. For houses we know their various shapes and sizes, their colours, their material components (brick, wood, plastic, etc) and many other "typical descriptive details".

    So! Since "everybody knows .." then a non-brain damaged speaker of english knows that any size dog is always smaller than any size house.(We're talking human-habitation houses; not dog, doll, or model train houses.) This is what some people call common-sense. It is only commonsense if you have a normal brain, didnt grow up in a cave, or a society that doesnt have houses. (or dogs).

    Can you see that the typical computer is such a disadvantaged "person"? The mathematical section next up will show you how the computer can be provided with an appreciation similar to what you have just read that we humans have, and take for granted, never think about.



    In the case of the sixth statement


    In the case of the sixth statement

  • "Sometimes the weekend seems very far away."
  • In this final sentence of this example set we see that "far" is the fuzzy component. "Very" is what linguists call a hedge. Both of these words are discussed next.

    For most speakers of English "far" is just the simple opposite of "close" (or "near"). In fact in the mathematical representation we show in the next section we can see that one is treated as being the complement of the other. It should be pointed out that some English speakers use these words in such a way that they are NOT simply mirror reflections of each other. In that case one must endeavour to discover in what way their usage differs from the regular usage. (That is covered elsewhere.)

    "Very", as we said earlier, is a hedge. A hedge is a term or word that is used to modify the meaning of a fuzzy term. "very far" modifies the function "far" such that the grade of membership function amplifies membership inside of "far" considerably more than plain "far" would for the same range of x.

    Perhaps you thought that I was going to ignore "sometimes". The most likely usage of "sometimes" is to appeal to "some" and "times" as semantic items. Another way of saying this is the construction "sometimes" states "some(x)" where x is "times" in this case. Depending on the context it is used in, "times" means instances of datevalues (July 1, etc), or instances of timevalues (3pm, etc), or relative times (before, then, now, soon, etc.)

    "Some" means, make a new set or collection, populate this new set with members from the set x ("times", in this case). The total count of members used from the source set is fewer than the total population of the entire source set. (The "definition" of "some".) In a way it means randomly select a subset from the source set and use them. If, for example, the source contains the digits 1 through 9, then "some" could mean 1,3,7. It could also mean 3,5,9. It could also mean ... You get the idea.

    The mathematics of these linguistic terms.

    Now we'll take a look at the (simple) mathematics of these linguistic terms.

    First we'll get three equations out of the way, and explain them in "plain English".
    The first two of these equations may be called "near" and "far", ("close" and "distant") while the third could reasonably be called "around" (or "about").

    The function (ie equation) for "far" is : (1.1)

    g(x) = 1/2 + 1/PI*tan-1 [(x-k1)/k2]

    The function (ie equation) for "near" is : (1.2)

    g'(x) = 1 - g(x)


    The function (ie equation) for "around" is : (1.3)

    f(x) = exp - [(x-k1)/k2]2


    In (1.1), the (arctan) function maps out a sigmoidal, or "S" shaped curve (when plotted), having an intermediate K1 gradient and increases in a fashion contextually dependent on object under reference.

    (1.2), is simply the complementary function of (1.1). This is often used to obtain a mirror reflection of a curve in fuzzy set systems.

    In (1.3), the function is symetrical about K1, where it peaks, and has a 'slope' of K2.

    (1.1) maps a value of x into a grade of membership value g(x). For values on the x-line (the abscissa) which are typically average or middle values, dependent on the context, k1 has the value of x at that point. Thus, x values less than then "middle" value of x (ie less than k1) are lower in grade of membership (because they fall on the lower tail of the function's curve). The "Q" or "width" of the curve is controlled by k2. It is influenced strongly by the nature of the context of the text using the term "far". The function "records" values of x found in that contextual environment, thus a "middle" value is known, and upper and lower occuring values as well (which help define the value of K2).

    If the context is referenced to "me" then the range of x might typically be based on either "my" height, or the length of "my" outstretched arm. The same process occurs if "me" is an infant, or an elephant, having a different height and arm length than the author .



    Usuality?! What the heck is Usuality?

    "He's dead, Jim."*

    Is this usuality? well, uh, not exactly. Let's look at what Lotfi Zadeh meant by his term, usuality.

    First of all, it is NOT the same as probability.

    Read the extensive usuality feature, press the button. view the usuality file







    (* International viewers of this web page may not have been watchers of the 1960's TV program, Star Trek, from which this phrase was taken. Trekkie fans around the world will recognize this oft said, and punned, statement. If you don't see the relevancy or humour in this reference please just bear with us anyway. (For the record, nobody thinks dead people are funny.)




















    This page generated with Dreamweaver 1.0 and Homesite 3.0