Hjørland’s Semantics and Knowledge Organization, pt. 2

Hjørland, Birger. “Semantics and Knowledge Organization.” ARIST 41 (2007): 367-405.

Originally read 18 June 2007 because it was cited by Zhang, J. (2007). Ontology and the Semantic Web. Proceedings of the North American Symposium on Knowledge Organization. Vol. 1. Available: http://dlist.sir.arizona.edu/1897

Re-read 28-29 Sep 2007 for two reasons: (1) Seems vastly relevant to my CAS project and (2) it is one of two articles referenced for Dr. Hjørland’s Research Fellow lecture [9 Oct 4-5 PM, Rm 126 GSLIS].

I will not be explicating this article as such here. I am going to use this post to note some of the points of contact that I noticed between Hjørland’s thoughts and Integrationism, to record and ask questions that I had and need to find an answer for, etc.


Semantics and Its “Warrant”

Theories of semantics should be formulated in ways that provide methodological implications for determining meanings and relations in semantic tools such as thesauri and semantic networks. Often such theories are not clear; this renders the theories vague and unhelpful (377).

What does i.v. say on this?

Frohmann (1983) has discussed the semantic bases and theoretical principles of some classification system. His is one of the few papers in IS to recognize that problems in classification should be seen as problems related to semantic theories (378).

Read this 19 June 2007; re-read this for an i.v. angle?

Frohmann presents two semantic theories. … According to the second, the categories to which a concept belongs must be found in the specific literature or discourse of which the associated term is a part. Consequently, the semantic relations are not given a priori, but are formulated a posteriori. This distinction has implications for classification theory (378).

Oh boy, does it ever?

Thus, a basic problem in KO is whether semantic relations are a priori or a posteriori; … (378).

This question is also related to one about the possibility of universal solutions to KO because a posteriori relations are unlikely to be universal (379).

Is there a way to incorporate both? How would be go about truly trying to incorporate a posteriori relationships?

However, it is well known that, for example, synonyms are seldom synonyms in all contexts. It thus becomes important not to think of semantic relations as simply “given,” but to ask: When are two concepts A and B to be considered synonyms ( or homonyms or otherwise semantically related?) When is a semantic relation? We should again ask the pragmatist question: What difference does it make whether, in a given situation, we choose to consider A and B as semantically related in a specific way? (379, emphases mine).

This certainly made me think of Harris (1973). What is the i.v. on “When is a semantic relation?”?

Short discussion of Ogden and Richard’s (1923) triangle of meaning/semiotic triangle (379-380). Where did I see Harris’ take on this?

Hjørland then goes on to discuss “some theoretical possibilities about the nature of concepts and semantic relations: (379):

  • Query/situation specific or idiosyncratic
  • Universal, Platonic entities/relations
  • “Deep semantics” common to all languages (or inherent in cognitive structures)
  • Specific to specific empirical languages (e.g., Swedish)
  • Domain- or discourse-specific
  • Other (e.g., determined by a company or workgroup, “user-oriented”)

Concerning Query/Situation-Specific or Idiosyncratic Semantics

In a way, it is the specific “information need” that determines which relations are fruitful and which are not in a given search session. A semantic relation that increases recall and precision in a given search [is a mighty powerful relationship!] is relevant in that situation (380-381, plus my commentary).

The pragmatic fallback is well represented in this quote:

This pragmatist point of departure is important to keep in mind in developing a theory of concepts and semantics. Semantic relations relate to a given task or situation and not all users of a given set of semantic relations will share the same view of which terms are equivalent. On the other hand, it is clear that if we base a semantic theory on an individual/idiosyncratic view of concepts and semantics, it is not possible to design systems for more than one user or situation—an absurd conclusion. We need more stable principles on which to determine semantic relations. We need a semantic theory about the meaning of words as forms of typified practices. Knowledge about semantics in typified practices may then be used by information searchers in order to include or exclude certain documents (381).

Concerning Universal, Platonic Entities/Relations

Not much to say here. Is a very short section. I will be looking at the following articles, both of which are in AKO 8:

Green, Rebecca. “Conceptual Universals in Knowledge Organization and Representation” (15-27) and Green, Rebecca, Carol A. Bean and Michèle Hudon, “Universality and Basic Level Concepts” (311-317).

I’ll also be looking at both Green, et. al. books on relationships for a refresh. You all didn’t think I had forgotten about Dr. Green, did you?

Concerning “Deep Semantics” Common to All Languages or Inherent in Cognitive Structures (A Priori Relations)

Semantic primitives in concept theory and in IS. Innate ideas (rationalistic) in semantics, facet-analytic tradition (Ranganathan) and formal concept analysis (Priss).

Although this rationalist theory dominates the literature (and is associated with the cognitive view), I do not find it fruitful for KO (384).

More talk about science, what is his view on KO in non-science areas?

Concerning Semantics Specific to Given Empirical Languages

Natural languages are structures in which the words classify the world differently (384).

Hjelmslev’s “tree” chart.

Concerning Domain- or Discourse-Specific Semantics

Although objects have objective properties, representation of those properties in languages and concepts is always more or less “subjective” or “biased” by individuals, social groups, or different cultures (385).

Objects may well have subjective properties also.

The implication is that semantic relations reflect human interests. … This does not imply that all semantic relations are domain-specific (385).

Certainly does not.

Goes on to show that we need to evaluate the literatures of specific domains or discourses to identify and analyze the different methodologies and assumptions made as an aid to determining meaning.

In this way, meanings are linked to different views, interests, and goals; accordingly, terms can be generally considered polysemous. [en 7] Attempts to standardize terminology may unwittingly suppress certain views (387).

Or wittingly suppress. See early Harris on standardization. Is also a comment on definitions and definitional change. Endnote 7 is a comment on the German tradition of Begriffsgeschichte, discussed in the section on semantic relations (en7, 396). [Need to look at this.]

Aspergum vs. Ecotrin vs. aspirin = i.v., circumstantial.

The implication of different paradigms for KO and semantics is that any bibliography of a certain size must confront conflicting ways of defining concepts and determining semantic relations (388).

There is a trade-off between being an optimal tool for the information seeker and a practical tool for the library manager. For the theory of IS, it is nonetheless important to describe the principles of designing optimal search tools (388-389). [the pragmatic fallback]

The point is that the kind of information presented here is necessary for any informed decision about classification practice. Exactly the same kind of information would be helpful for the information seeker … (389). [the macrosocial feeding the circumstantial]

Perhaps the most important task of the information professional is to make the different interests and paradigms visible so that the user can make an informed choice (390). [How does this fit within an i.v.?]

Other Kinds of Warrant

Discusses Beghtol’s (1986) article on warrant. But what about “user warrant” (390)? [Have another read of Beghtol]

Mentions oral and written sources.

Semantic Relations

Relations between concepts. senses, or meanings should not be confused with relations between the terms, words, expressions, or signs that are used to express the concepts. It is, however, common to mix both of these kinds of relations under the heading “semantic relations” (see references omitted). For this reason, synonyms, homonyms, and so forth, are considered under the label “semantic relations” in this chapter (391).

Amen! But much harder in practice to keep these straight or even to see the difference. [See preceding paragraphs to the above quote for some explication.]

On the call for richer sets of relationships in our tools and a a critique of the recall/precision view of IR:

What information searchers need are maps that inform them about the world (and the literature about that world) in which they live and act (393).

Begriffsgeschichte (is this idea of use to me?) = conceptual history.

Historians and other humanistic researchers have realized that in order to use sources from a given period, one must know what the terms meant at the time. Therefore, they have developed impressive historical dictionaries that provide detailed information about conceptual developments within different domains, … (393).

Implication of broadening the view within IS to use important work on semantic relations is that “different domains need different kinds of semantic tools displaying different kinds of semantic relations” (393). Well, this actually follows from much of the previous discussion, but this view implies that we need to look more broadly.

The “Intellectual” Versus the Social Organization of Knowledge

On citations are semantic relations:

I hold that the citing relation is in itself a kind of semantic relation. In support of this claim, I distinguish between “ontological” and social semantic relations and argue that citing relations belong to the latter (394).

Discusses further the difference between and uses of these.

Conclusion

The pragmatist view of semantics suggests that words and expressions are tools for interaction and their meanings are their functions within the interaction, constituting their capacities to serve it in their distinctive ways. [Integrationist] When information professionals classify documents or informational objects, the relevant meanings and properties are available only on the basis of some descriptions. This important consideration, … , stands in opposition to the prevailing implicit assumption that all relevant properties are obvious to the information specialists and that the latter follow certain given principles providing an optimal classification that is objective, neutral, and universal—hence, technically efficient (395, emphases mine).

I am not going to argue that no one thinks that way—some do—but I sure would like to put them to work on some real world projects so they can quickly learn the folly of their blindered thinking.

Traditional approaches to KO have a tighter affiliation with positivism than with the pragmatist view of semantics. … The implication is that traditional views have provided solutions that are, at best, statistical averages and thus sub-optimal (396).

No disagreement from me on this one. In fact, one could say that first sentence is what is driving me to this topic in the first place, urgently prodded along by the works of Roy Harris. And while I agree with the second sentence, what corners will need to be cut due to the pragmatic fallback? Hjørland has pointed to this himself several times in this paper; see above in a couple of places.

This is a very good paper, despite all my questioning of it. I will be spending more time with it I can assure you as it will most likely serve as a cornerstone of my CAS project. I agree with the vast majority of it, and several months back, before I had read so much Harris and related integrationist critiques, I accepted even more of it.

Citations from within this Hjørland paper:

Beghtol, C. (1986). Semantic validity: Concepts of warrant in bibliographic classification systems. Library Resources & Technical Services, 30 109-125.

Frohmann, B. P. (1983). An investigation of the semantic bases of some theoretical principles of classification proposed by Austin and the CRG. Cataloging & Classification Quarterly 4: 11-27.

External citations:

Harris, Roy. Synonymy and Linguistic Analysis. Toronto: University of Toronto Press, 1973.

López-Huertas, Mariá, and International Society for Knowledge Organization. Challenges in knowledge representation and organization for the 21st century : integration of knowledge across boundaries : proceedings of the seventh international ISKO conference, 10-13 July 2002,. Würzburg: Ergon-Verlag, 2002 [Advances in Knowledge Organization v. 8].

Hjørland’s Semantics and Knowledge Organization, pt. 1

Hjørland, Birger. “Semantics and Knowledge Organization.” ARIST 41 (2007): 367-405.

Originally read 18 June 2007 because it was cited by Zhang, J. (2007). Ontology and the Semantic Web. Proceedings of the North American Symposium on Knowledge Organization. Vol. 1. Available: http://dlist.sir.arizona.edu/1897

Re-read 28-29 Sep 2007 for two reasons: (1) Seems vastly relevant to my CAS project and (2) it is one of two articles referenced for Dr. Hjørland’s Research Fellow lecture [9 Oct 4-5 PM, Rm 126 GSLIS].

I will not be explicating this article as such here. I am going to use this post to note some of the points of contact that I noticed between Hjørland’s thoughts and Integrationism, to record and ask questions that I had and need to find an answer for, etc.

While some of my questions will be more directly to myself, please feel free to chime in on any of them. I am certainly willing to entertain authors other than Hjørland and Harris, but unless the question is specifically other-directed I will be looking primarily for an answer from within Integrationism and/or the “(the ’sociological-epistemological paradigm’ or the ‘domain analytic approach’)” (Hjørland, 1998, 611).

I think my primary complaint about this article revolves around just what does Hjørland mean by “semantics?” He never really provides a fully formed definition. At the bottom of the 3rd page (369) we find, “Because concepts provide the meaning behind words and semantics is the study of meaning, the study of concepts, meaning, and semantics should form one interdisciplinary field.” On the next page we find, “Semantics, by the way, is not concerned solely with word meaning. Pictures as well as other signs are also the objects of semantics” (370).

So, semantics is broader than just the word. OK, fully accepted. And it is the study of meaning. Probably OK. But then what is meant by “meaning?” Ah. Here’s the rub. The meaning of “meaning” is even slipperier than “truth” or “intelligence” or most any other hard-to-define word. Still. It would be useful to have some explicit idea of what Hjørland intends it to mean. I think I have a pretty clear idea, but then once in a while when I’m reading some line or paragraph I go, “Huh, so what is “semantics” or “meaning” in this case?”

I will add section headings along with page numbers to help give a better idea of where I am in the text.

Introduction: The Importance of Semantics for Information Science

The difference between “a word” and “a concept” is that different words may have the same meaning and similar words may have different meanings, whereas one concept expresses one meaning (367, emphasis mine).

Just a few months ago I would have taken this to be completely acceptable. But now? I don’t know. I was literally stopped in my tracks when I read this for the 2nd time. Assuming one is slightly familiar with various critiques of word meanings, how do we stipulate concepts as unitary? Is this an ontological definition or merely a stipulative one? How can it be anything other than stipulative, at least empirically? If I disagree and claim that concepts are not univocal then does this mean that no one shares any concepts? And then do we need another level above concepts? This quickly becomes an infinite regress as in various epistemological views of truth. Can we just declare a stopping point? Is this simply foundationalism re concepts?

Intergrationism has a way to deal with this regarding word meanings in language use so that we do not end up with individually personal languages while maintaining that there is no such thing as the language. Have a look and see if this can apply to concepts. And what is the integrational view (i.v.) of concepts anyway?

I see no way out of saying that this is a stipulative definition. Does that matter? Are concepts only a theoretical fiction? If so, is the work they do for us enough to offset this? [Note to others: I swear I'll ask one or two small questions. At least I hope I will!]

Quote from Soergel on thesauri (368). What is the i.v. of thesauri?

“It is thus important to include bibliometrics within the concept of KOS for both theoretical and practical reasons” (369). Handled correctly, can this be a reflection of integration? That is, reflections of the macrosocial and circumstantial parameters [see the section "Three integrational parameters." Will be using these a lot.] [see 2002d, 432-436]

On KOS:

Indeed, KOS in a narrow, IS-oriented sense are those systems related specifically to organizing bibliographical records (in databases), whereas KOS in a wide, general sense are related to the organization of literatures, traditions, disciplines, and people in different cultures (369).

Yes, context is very important. While I see myself as primarily working within the IS tradition, I want to generally be thinking in the broader sense, and working to import more of that sense into IS.

On semantic tools:

The term “semantic tool” should be reserved for systems that provide selections of concepts more or less enriched with information about semantic relations; KOS should be used as a broader term including, but not limited to, semantic tools.

The field of KO within IS is thus concerned with the construction, use, and evaluation of semantic tools for IR (369).

While I tend to agree with this it does seem a bit arbitrary or, at least, preliminary as “semantic” has never truly been defined. Nonetheless, I am probably in agreement.

Relation of concepts, words and meaning:

Because concepts provide the meaning behind words … (369).

Do they? Again, what is the i.v. of concepts?

The Status of Semantic Research in Information Science

Van Rijsbergen (1986, p. 194) has pointed out that the concept of meaning has been overlooked in IS and discussed why the whole area is in crisis. The fundamental basis of all the previous work—including his own—is wrong, he claims, because it has been based on the assumption that a formal notion of meaning is not required to solve IR problems (370).

Is there a formal notion of “meaning” in the i.v.? Probably not. Is one required for IS? I do believe so, tentatively anyway. Get and read van Rijsbergen. OK, read van Rijsbergen. Formal definition in this sense means formally specific and thus machine implementable.

Outline of traditions within KO:

Broughton, Hansson, Hjørland, and López-Huertas (2005) have suggested the following traditions are the most important ones for KO: … (371).

Need to read Broughton, et. al. (already have it). Will be useful for one of the overview sections where I am setting up the problem.

On the domain-analytic tradition:

The domain-analytic approach is rather traditional in its identification of semantic relations based on literary warrant. However, it is not positivist, for it regards semantic relations as determined by theories and epistemologies, which more or less influence all fields of knowledge (372).

OK, but what about communicational context? The macrosocial other than epist. or theoretical, and the circumstantial?

Semantics and the Philosophy of Science

Justification for taking a “radical” view (and so much more):

The different theories and epistemologies that are in competition with one another may be more or less fruitful (or harmful) for information science. It is important to realize this and to take the risk of defending a particular theory. If this is not done, other views will never be sufficiently falsified, confirmed, or clarified. In the process of defending a particular view, one learns what other views it is necessary to reject. As pragmatist philosophers have long suggested, in order to make our thoughts clear, we have to ask what practical consequences follow from taking one or another view (or meaning) as true. If our theory (or meaning) does not have any practical implications, then it is of no consequence (372, emphasis mine).

Amen!

Peregrin suggests two dominant paradigms in semantics: positivist and pragmatic. (Read Peregrin 30 Sep)

Pragmatist semantics suggests that expressions are tools for interaction and their meanings are their functions within the interaction, giving them the capacity to support it in their distinctive way (Hjørland, 373)

Hjørland cites Harris’ (2005) The semantics of science in an endnote at this point. So he is clearly aware of Harris.

Hjørland and Nissen Perdersen (2005) on the foundations of a theory of classification for IR:

4. Selected “properties of the objects to be classified must reflect the purpose of the classification” (373). This is contextual and should be an aid to integration.

6. To serve their differing purposes, different domains may well “need different descriptions and classifications of objects…” (373). This is also contextual and macrosocial.

10. “The criteria for classification should be based on an understanding of the specific goals, values, and interests at play” (374). Also possibly integrational at the macrosocial and/or circumstantial levels.

Several of the other points are also somewhat integrational.

“I agree that meanings and classification criteria are implicit in the literature to be retrieved, as outlined here” (375, emphasis mine). Hmmm? i.v.?

This section ends with Hjørland’s take on Putnam and a listing of three points of departure for both, which sound fairly integrational. Maybe not the 1st, though.

  • A focus on the relation between meaning and the real world (realism)
  • A focus on the functional/pragmatic nature of meaning (pragmatism)
  • A focus on the development of meaning is a social context (historicism and meaning collectivism/holism) (375)

Semantics and Subject Knowledge

End of quote from Bada, et. al. on domain models requiring expertise from within the community to construct them reflects the macrosocial (376).

Ideas on the education of LIS professionals and subject knowledge (377). Can this be given an i.v. spin?

Stay tuned. Or not. There’s a lot more of this paper….

Citations from within this Hjørland paper:

Broughton, V., et. al. (2005). Knowledge organization: Report of working group 7. In L. Kajberg & L. Lørring (Eds.), European Curriculum Reflections on Education in Library and Information Science. Copenhagen: Royal School of Library and Information Science. Retrieved December 15, 2005, from www.db.dk/LIS-EU/workshop.asp

Hjørland , B. & Nissen Perdersen, K. (2005). A substantive theory of classification for information retrieval. Journal of Documentation, 61, 582-597. Retrieved December 15, 2005, from www.db.dk/bh/Core%20Concepts%20in%20LIS/Hjorland%20&%20Nissen.pdf [Verified October 1, 2007]

van Rijsbergen, C. J. (1986). A new theoretical framework for information retrieval. Proceedings of the Annual International ACM SIGIR Conference on Research and Developments in Information Retrieval, 194-200.

External citations:

Hjørland, Birger. “Theory and Metatheory of Information Science: A New Interpretation.” Journal of Documentation. 54.5 (1998): 606-621.

Some things read this week, 30 September – 6 October 2007

Sunday, 30 Sep

van Rijsbergen, C. J. (1986). A new theoretical framework for information retrieval. Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 194-200. Retrieved via ACM Portal.

Cited by Hjørland (2007). Semantics and knowledge organization. ARIST 41: 370.

A useful paper in that the author declares:

I have reluctantly concluded that the fundamental basis of all previous work is wrong. Almost all of the previous work in Information Retrieval (including my own) has been based on the assumption that a formal notion of meaning is not required to solve the information retrieval problem (194).

In discussing the need for a formal semantics:

That is, a document is retrieved if it logically implies the request. However, as we all know, documents rarely imply requests; there is always a measure of uncertainty associated with such an implication. And so, a notion of probable, or approximate, implication is needed …. Modelling the information retrieval process in this way goes beyond the keyword approach, and specifies, once and for all, what relationship between a document and a request is to hold to compute probable relevance (195, emphasis mine).

This is (one big) reason why computer-based IR, as good as it may become, is doomed to incompleteness. There is simply no way, no freaking way, in which anyone could ever specify, once and for all, all of the relevance relationships between documents and a request, much less specify those formally. [But, then, human-based IR faces the same problem for but for somewhat different reasons.]

He does go on to show that he does knows a bit about relevance, such as documents themselves are not, in fact, relevant to requests. And one must love the wonderfully named Logical Uncertainty Principal, which is the main product of this paper.

Peregrin, J. (2004). Pragmatism & semantics. English version of Pragmatism und Semantik. In A. Fuhrmann & E. J. Olsson (Eds.), Pragmatisch denken (pp. 89-108). Frankfurt am Main: Ontos. English version retrieved 30 Sep 2007, from http://jarda.peregrin.cz/mybibl/PDFTxt/482.pdf.

Cited by Hjørland (2007). Semantics and knowledge organization. ARIST 41: 372.

Discusses what he calls the Carnapian and Deweyan paradigms in language. The intent is to show how “the technical apparatus engendered by the Carnapian approach, with is wealth of results, can be put into the service of the Deweyan paradigm – if we liberate it from the Carnapian representationalist ideology” (3).

On Wittgenstein’s analogy to chess:

Thus the meaning of an expression can be compared to the role of a chess piece, which acquires its role of, say, a ‘knight’ by being handled according to the rules of chess (4).

But meanings and rules can be played upon; are these just alternate rules, or mis-use of the rules to another end?

Makes us of Sellars’ rules of semantics as rules of inference, which relies heavily on the primacy of sentences and on locating sentences in a logical space as propositions. But it simply is not the case that any of the bits below the sentence level have no meaning, nor that communication can not occur with sentence fragments or single words.

And the whole logical space/proposition issue is heavily positivistic! Clearly not all communication is propositional.

Such objections point out that if we start to treat formal semantics as the basis for a philosophy of language, we are likely to run into a vicious circle: we reduce philosophically problematical concepts to the seemingly perspicuous formal semantic concepts, which, however, ultimately rest on the obscure concepts to be explicated (10-11).

Amen to that!

But to place the Carnapian approach in the service of the Deweyan he falls back on possible world semantics. Gah! Can we please do away with the so-called possible worlds? Possible worlds are an supra-metalinguistic way of talking about our already common-sense, lay, metalinguistic way of discussing alternative scenarios and logical possibility and necessity. To formalize this way of talking into possible world semantics leads one easily down the path from a linguistic way of knowing (epistemology) to postulating actually existent possible worlds (metaphysics).

On the pragmatic fallback, as I am tentatively calling it (them?):

And I think that the inferentialist should realize that modeling is a very useful thing. Thus I think that although language is not literally a nomenclature or a code (as the Carnapian paradigm has it [orthodox linguistics]) it remains useful, at times, to see it as a code, just as it is often useful to see atoms as cores orbited by electrons (12).

This is a very interesting paper, but I do not think it has won me over to its way of thinking. I am concerned that we will, especially in IR, have to resort to the pragmatic fallback. But Sellars’ view is still far too positivistic and thus rules out much of what we would call communication. Perhaps this view was acceptable when libraries were the gatekeepers and we dealt only in “serious” reading material. But this is, in some respects, a new age and the past age is long past. Perhaps libraries need not worry about some of this when one considers the sorts of material that they deal with (but I doubt that!). But KO and IR is much broader than libraries. And even if KO and IR uses a sub-set of our theories of language and communication (assuming we separate them; perhaps not), we should have theories that cover all of communication and language and then explicitly pull out the bits we need. We should not be starting from a limited theory to begin with.

Harris, Roy, and George Wolf, eds. Integrational Linguistics: A First Reader. 1st ed, Kidlington, Oxford, UK: Pergamon, 1998.

Re-read:

  • Introduction
  • Ch. 1: Harris, R. “Language as Social Interaction: Integrationalism versus Segregationalism.”
  • Ch. 2: Harris, R. “The Integrationist Critique of Orthodox Linguistics.”

Read:

  • Ch. 8: Harris, R. Three Models of Signification.”

I skipped ahead to chap. 8 as I want to get a handle on the integrationist view (i.v.) of meaning.

Discussion of these is going to have to wait.

Monday, 1 Oct

American Society for Information Science and Technology. Theories of Information Behavior. Medford, N.J: Published for the American Society for Information Science and Technology by Information Today, 2005.

Preface

  • Ch. 1: Bates, Marcia J. “An Introduction to Metatheories, Theories, and Models.”
  • Ch. 2: Dervin, Brenda. “What Methodology Does to Theory: Sense-Making Methodology as Exemplar.”
  • Ch. 3: Wilson, T. D. “Evolution in Information Behavior Modeling: Wilson’s Model.”
  • Theory 60: Hjørland, Birger. “The Socio-Cognitive Theory of Users Situated in Specific Contexts and Domains.”
  • Theory 2: Belkin, Nicholas J. “Anomalous State of Knowledge.”
  • Theory 5: Bates, Marcia J. “Berrypicking.”

This book looks useful enough that I ordered my own copy, with my ASIST discount of course. If you have the slightest aversion with authors referring to themselves in the third-person or heavily self-citing then you may want to skip it or take it in small doses. But the self-citation in many cases makes perfect sense as many of the authors are writing about their own theories. But the third-person stuff, especially the “Article x is clearly a most influential paper in LIS having been cited 642 times” [made up example], is simply past precious.

The book as a physical item seems to be of fairly good quality, although I do have a few gripes. Page margins are far too limited, especially the outer margins. The type face is generally readable, although a tad too small for some, but it has two features I do not like. First, and only minimally pain-inducing is the hyphen, which slants upward from left to right at about a 40 degree angle. Far worse, and especially grating since it occurs extremely frequently due to citation style and time period of most citations, is that the numeral 1 is a capital I. WTF is that? I realize that some old typewriters and perhaps early computer printers used either an “l” or an “I” for a “1″. But this book was published in 2005! Why would anyone use a type face that uses a capital I for a 1 in 2005? Information Today should be ashamed. [it also has a ridiculously long "/".]

I primarily checked this book out to get a copy of Hjørland’s “The Socio-Cognitive Theory of Users Situated in Specific Contexts and Domains.” It will also be of immense value in the section of my paper where I critique various aspects of our field. By providing a brief overview of 72 theories in a lit review format, along with highlighting applicable research projects, the book will prove exceptionally useful.

I read the above theories to try and get a handle on how they might or might not fit in with Integrationism.

Hjørland’s use of the socio-cognitive view and domain analytic theory can, I believe, easily be given an integrationist reading. Within integrationism, the “three parameters relevant to the identification of signs within the temporal continuum” are biomechanical, macrosocial and circumstantial [Harris, see previous link]. The biomechanical and macrosocial parameters are clearly shown in Hjørland and, I believe, the circumstantial can be pulled out of the “socially constructed” easily enough.

Belkin’s anomalous state of knowledge (ASK) is explicitly cognitivist and, thus, may not translate as well. It most certainly will not fall under Hjørland’s views easily. What is his view of ASK? [Note to self to ask him; noted.]

Bates’ Berrypicking; hard to say from this article. Seems as if it could fit in many other views and theories. Unfortunately, the assumptions and epistemologies underlying her model are almost completely opaque in this article. Will need to check the original articles themselves.

Schneider, K. G. “Range of Desire: In the military, I learned to love women and guns.” nerve.com

Very enjoyable read. Parts of this resonated deeply with me, some parts not so much, and some seemed very different than my experience. But this is Karen’s story so that last clause in the previous sentence isn’t too relevant.

Hjørland, Birger. (2002). “Epistemology and the Socio-Cognitive Perspective in Information Science.” JASIST 53 (4): 257-270.

Through the lens of psychology literature demonstrates the differences between the cognitive and socio-cognitive views, discusses domain analysis, shows that knowledge of subject literature(s) is required for effective info retrieval, demonstrates that different paradigms and epistemologies imply different information needs and relevance criteria.

Some of these points ought to be blatantly self-evident but they generally ignored in our literature. These points can fit within an integrationist view most likely.

Hjørland, Birger. (2004). “Domain Analysis: A Socio-Cognitive Orientation for Information Science Research.” Bulletin of the ASIST, Feb/March 2004: 17-21.

This is good, but short, overview of domain analysis based on the author’s talk at the ASIS&T 2003 Annual Meeting. For anyone looking for a short intro to domain analysis and several other of the author’s views (socio-cognitive view, pragmatic realism) this is a great place to start.

For some reason the close juxtaposition of IS & IT in the 1st several paragraphs of this article made me make an odd sort of observation:

IS and/vs. IT

“is” and/vs. “it”

being and/vs. thing

So tell me about relevance again, will you? Relationships are defined by what?

Tuesday, 2 Oct

Davis, Hayley G. Words: An Integrational Approach. Richmond, Surrey: Curzon, 2001.

  • Ch. 2: Methodology: The Word of the Layperson
  • Ch. 3: What Do Lay Speakers Say About Words?

Wednesday, 3 oct

Hayley (above).

  • Ch. 4: Words and Linguistic Meaning

Hjørland, B. “Domain analysis in information science: Eleven approaches – traditional as well as innovative.” Journal of Documentation 58 (4), 2002: 422-462. doi: 10.1108/00220410210431136

This is a long but useful article about the uses of domain analysis in information science. It pointed me to several resources of which probably ought to play a role in my critique of language theorizing and use in LIS.

I loved this quote, under the head of Indexing and retrieving specialties, as it serves to justify my extending stay at GSLIS:

Too often library and information specialists feel they lack adequate subject knowledge. In order to claim the existence of the field as a serious field of study one has, however, to develop sufficient subject knowledge in at least one field (e.g. LIS itself). The application of LIS principles to a specific task may make research in information science more relevant and realistic (429, emphasis mine).

The following is a claim made in many places by Hjørland which I am going to need more time to formulate an adequate response to, but I want to note it here:

The tendency to try to measure users’ information needs by questioning them or by studying their behavior seems to me to be mistaken. What information is needed to solve a given problem is not primarily a psychological question, but a theoretical/philosophical one (431).

While I tend to agree with this, at least in restricted domains, I do not think it is so applicable in, say, general culture. Certainly there are assumptions I am making if I want to do a Google search on Britney’s custody woes as reported in the popular press, but I do not think theory and philosophy are going to be of much use and certainly will not be dominant in my “need.”

Thus, I am led to think that this is going to be more of a continuum, and perhaps/more likely multi-dimensionally continuous. I think Hjørland’s view on this is a bit too influenced by scientific-type knowledge, “serious research” and the academic environment. But if IS and KO only focus on these limited areas of knowledge then the game is already up. We must have a wider influence or the Googles and Microsofts of the corporate world will quickly eat us up. [Noted to ask him about this.]

His spin on bibliometrics, here and elsewhere, makes it seem like they can possibly be given a integrationist spin (e.g., p. 433).

On taking the easy way out citationally (underrepresentation and overrepresentation):

In LIS there may be a corresponding tendency to overcite easy theories and methods at the expense of more difficult but also more important papers (435). [Oh, like Bush, perhaps.]

Under Document and genre studies:

These important concepts need, however, to be based on more general theories of documents, their communicative purposes and functions, their elements and composition and their potential values in information retrieval. Different disciplines or discourse communities develop special kinds of documents as adaptations to their specific needs (437).

Seems pretty integrative and reflective of the macrosocial, and perhaps of the circumstantial as well.

Terminological studies, language for special purpose (LSP), database semantics and discourse studies was the most productive citationally for me. LSPs and sublanguages will be critical to my critique of language in LIS. Can we legitimately speak of sublanguages within Integrationism, or must they be given a different spin? LSPs seem to reflect the macrosocial at first blush.

Ammon’s sociolinguistic theory of LSPs seems useful in cross- and interdisciplinary information seeking (444-445).

Spells out Hjørland’s approach (so far) to LSPs and database semantics (4 main assumptions) (445-446):

  1. “Signs and their meaning are formed by social groups primarily as part of the social division of labour in society.”
  2. “Different communities develop specific document types of more or less different compositions.”
  3. “The above mentioned discursive or epistemic communities are always influenced by various epistemological norms and trends, which also influence the social construction of symbolic systems, media, knowledge, meaning and semantic distances.”
  4. “When documents are merged in databases information about implicit meanings from the prior contexts are lost.”

Is the concept of semantic distance tenable in Integrationism?

Under Structures and institutions in scientific communication we get an explicit comment on the “narrow” view taken by Hjørland (at least in this arena) that I critiqued above:

They do not, however, cover mass media, organisational communications, and broader communications connected to the public sphere (447).

Another comment with which I basically agree but also find somewhat narrow [although he does say "a"]:

In LIS a central goal is to provide users with information which can help evaluate the validity of different knowledge claims. To help the user establish his own views on some issue based on studies of all available arguments is extremely important in LIS (450).

What can I say, except “Read it!”

Thursday, 4 Oct

Walrod, Michael E. “Language: object or event? The integration of language and life.” In Nigel Love, Ed. Language and History: Integrationist Perspectives. London: Routledge, 2006: 71-78.

Need to copy this and re-read it as it is the selection for Metadata Roundtable Wednesday. Am I the so-called discussion leader for this one?

Thursday – Friday, 4 – 5 Oct

Hayley, (above).

  • Ch. 5: Parts of Speech and Grammar
  • Ch. 6: Folk Characteristics of Words (split over T/F)
  • Ch. 7: Reorientation: The Integration of Speech and Writing

This was actually a quite entertaining book using an “ask-the-speaker methodology, using fieldwork and interview techniques” (ix-x) to focus “on the uses to which English speakers on the one hand, and linguistic theorists on the other, out the word word” (ix). It is also a fast read.

In fact, it was downright hilarious at points. My only complaints are that: (1) it, although very relaxed, if you will, for an academic book, is still very British in style and, (2) some of the author’s conclusions did not seem to follow from the way they were phrased in summary, although they did from the evidence. Thus, I was a tad confused at points. Well worth a read if you can get it from a library. Just don’t make it an even faster read by skipping what the informants say; that will be important to coming to the correct conclusions and are, of course, the actual funny parts.

Some of things they “blame” on Americans are downright hilarious. This is not the funniest one but one I can find at the moment:

G: . . . shit as far as I understand it is being used more and more by American young girls as an expression of disgust (152)

On epistemological and ontogenetically we get:

Other guesses were that epistemological was possibly a ‘religious’ word (B) because of the word epistle, and W thought they both sounded ‘like words the Americans have made up . . . funny words” (171).

The take home message is not, of course, the humor [perhaps I ought to write humour?] but the variability of users experience with and use of metalinguistic thinking and talk contra the linguistic theorists who think we all have the same ideas innately. Well, we clearly do not nor should it take a research project and book to demonstrate that.

Friday, 5 Oct

Hjørland, B. “Arguments for Philosophical Realism in Library and Information Science.” Library Trends 52 (3), Winter 2004: 488-506. Available in IDEALS at http://hdl.handle.net/2142/1685 [pdf]

Title reflects the paper quite well.

Empiricism is a problematic philosophy, but this does not, of course, imply that empirical research is mistaken (493).

Much more interdisciplinary work needs to be done in the philosophy of science (494).

Being a subscriber to Philosophy of Science I’d say that it is beginning to be done, and I have no doubt much more is being reported in other venues. But the point is well taken and supported by me.

… the socio-cognitive and domain analytic view assumes that “in the beginning there is a community” as well as a body of more or less substantiated knowledge claims; its distinguishing charge is to locate interactional processes in their social structural context as well as in their theoretical-substantial context (496, emphasis mine).

Sounds pretty integrational to me.

Related to an above critique of relevance:

The validity—and thus the relevance—of a document claiming that a certain substance is relevant as a cure for cancer is also ultimately decided in medical research, not by asking users of information services. [en 17, 18] Thu we have a central realist claim: A given document may be relevant to a given purpose, whether or not the user believes this to be so. [en 19] (497).

Sorry but I am not reproducing the endnotes here. While I want to concur with these statements I cannot without qualification. The ultimate question whether a specific substance is a possible cure for cancer is certainly an empirical one, but assuming that our “users of information services” are cancer researchers there is a definite sense in which the relevance of that particular document to their research program is theirs to make. They may lose a Nobel over their relevance decision if it is the wrong one, but the fact that epistemologies and assumptions imply relevance also implies that the decision of relevance is somewhat in the hand of users. But the point which I fully support is that one cannot reduce relevance entirely to what the user says is relevant. In some cases there will be an objective matter of fact of some thing’s relevance to a specific question.

Further:

It is rather a claim that relevance is not a subjective phenomenon but rather an objective one. To be engaged in how to identify what is relevant is to be engaged in scientific arguments, ultimately in epistemology (for a more detailed discussion on the realist position in relevance research, see Hjørland, 2000a and Hjørland & Sejer Christensen, 2002) (497).

Yes, perhaps it is an idealist position that some part of relevance is subjective. Nonetheless, this is the case. The first sentence in the above quote is a non-starter in that it is an either/or when it needs to be an and both. The and both will differ along a continuum depending upon the domain under investigation, but it is not one or the other. What about pop culture? Again, why with such a narrow view of KO and IR?

The field of information-seeking behavior has in a similar way been dominated by antirealist tendencies. When people seek information, they have given systems of information resources with given potentialities at their disposal (497).

OK. This is objectively the case on one description. But these given potentialities are rapidly changing, and many are not so “given” anymore. There is also the matter of knowing, and even being able to know, the given of some of these systems today. This ties directly into my stated intention to hire several librarians to help me manage all of my “systems of information resources” when I win the lottery.

Anyway, I do agree with much of what Hjørland says in this article and elsewhere. I just see some things that to me seem to be based on a narrower view than I feel we can afford to take or which need a bit of nuance as I see it.

Perhaps my views are different and perhaps seem muddled to some because I am a realist about much of the external world, but I am not a realist about much of modern science. Atoms and beyond? Not so much. Useful theoretical entities they be, but just as “wrong” as Newton’s mechanics. Who’s to say our current sub-atomic particles are truly existing entities? See, there‘s the rub. I am an ontological realist (generally), but I am most certainly not an epistemological realist. In fact, my dislike of epistemological realism runs much deeper than disavowing “the view that science provides a true or realistic picture of the world” (490), especially since some would say the only true or realistic picture of the world. Nope, call me an epistemological agnostic, if you like. I think epistemology is an important subject and I fully agree with Hjørland in his claim that it is central to LIS. I just don’t think we really have much that amounts to Truth or Knowledge or, more accurately, that we can ever know if we do.

It seems my views are pretty much in accord with Hjørland’s based on endnote 24 to this article (no idea what his views on particle physics is, though). And while I do agree that our subjective knowledge can be objective, in the sense that it is “in accordance with its object” (504), I do not believe that we can ever know that is is. All we have to go on is the use that that knowledge makes for pragmatically.

I have a definite post in me about science as a belief system right now but I doubt I’ll have time to get to it. I promised a friend of mine the other day who shocked me by claiming that it was not (and says she did before) that I would write it. But, alas, probably not. Trying to claim otherwise via dictionary definitions, statements by scientists, lay views of “systems of belief,” etc. simply cannot get you out of your dilemma of belief. I read a good article somewhere in the last day or so that I wanted to ask her to read. Damn it! What was it? Was it this article or something online?

Theories of Information Behavior [see above].

  • Theory 10: Rieh, Soo Young. “Cognitive Authority.”

Cognitive authority theory was developed by Patrick Wilson in his book, Second-hand Knowledge: An Inquiry into Cognitive Authority. It appears that people of many epistemological persuasions have made use of Wilson’s theory. I think cognitive authority can easily be given an integrationist reading as I can see it being definitely influenced by biomechanical, macrosocial and circumstantial parameters.

Browne, Glenda. “The Definite Article: Acknowledging ‘The’ in Index Entries,” The Indexer, vol. 22, no. 3 April 2001, pp. 119-22.

This article won the 2007 Ig Nobel Award for Literature. I saw this 1st a few days ago at 3 Quarks Daily and then a few other places. When I saw the Thingology post on it this morning I finally read it.

The Ig Nobel is given “For achievements that first make people LAUGH, then make them THINK.”

Ig Nobels at 3 Quarks Daily and at Thingology. 2007 Award Winners at the Annals of Improbable Research site.

As Tim says, “Hey, it’s a problem” and the author makes some good points.

Initial articles are the focus of my Python programming so far in LIS452. My 1st program took an internal list of mixed case titles and put them in lower case, stripped leading articles (English only) and then alphabetized them. My 2nd program which is currently beta and due Thursday does pretty much the same thing except it is written using functional vs. procedural style and it reads the titles in from a file and writes them out to a 2nd file. I hope to “fix” it to capitalize the 1st letter of each title, and if I have time to use regular expressions to do the stripping. Regex will be overkill for this program but I see them as probably the most important thing I can learn from this class (at the moment anyway).

Not sure how far I’ll get with this, though, as. must. prepare. for Dr. Hjørland’s visit this coming week!

Not going to claim that I won’t be reading or re-reading anything else today but I am going to cut this off and get back to my commentary o Hjørland’s “Semantics and Knowledge Organization” which is a much bigger job than I was thinking. It is about to become a multi-post job.

Gulp. I have 3 Downey chapters and 2 Zelle chapters to read for 452, which is LEEP on-campus this week. Luckily I have an extra day to get to those since class is Friday this week. Thank the LEEP gods for that one!

The Hollywood Librarian




Ticket stub

Originally uploaded by broken thoughts

Last night I attended The Hollywood Librarian at GSLIS. I was going to boycott it until I found out it was free for 90-some-odd percent of those who were going to go see it anyway (library staff & library studennts). Fine, call me cheap. Whatever.

It was a real mixed bag. It doesn’t seem to know what its subject, or at least its message really is. Perhaps it has several messages. But, if so, that just makes the problem worse.

Is it about librarians? Is it about libraries? Is it about the media’s and in particular Hollywood’s portrayal of librarians? Is it about the (current) funding crisis in libraries? Is it about the Salinas Public Library?

I think it was K. G. Schneider who had a review of it from ALA that I basically agree with: hone your message, re-edit the thing, make it a bit shorter, and back off a tad on the sermonizing about Salinas.

And some of those librarians…

I almost walked out about 3 times in the 1st 5 minutes! Almost every bit of dialogue at the beginning was just a complete non-starter! I really wish I could have captured/remembered it all, but alas, no.

I know it’s a movie about (well, I don’t know what exactly it’s about) librari*, but that crap about how without libraries we’d be completely in the dark, we’d have no cultural memory, blah, blah is just downright offensive to oral cultures and thus to most individual homo sapiens who ever lived.

There were a few other howlers interspersed throughout, too. E.g., “… the soul of information ….”

Anyway, if you have 90 minutes to spare and can see it for free then perhaps you should. Otherwise, can’t say what you should do.

The reel highlight (hehe) of the evening was an old silent movie shot on campus in 1936 by the Class on Administration. It is about 15-minutes long and is on 2 reels–an actual “movie.” I believe it is entitled “Making the Most of Your Library.” It was hard to tell, though, as I was trying to record the metadata on a small piece of paper with a lot of print on it, in the dark.

It was found by students in our Society of American Archivists student chapter. Really nice to see the Main Reference Room and Circulation from “back in the day.”

It was nice to see how ol’ boy who went to the library finished his essay and got a date … but, uh, what about his grade? I guess in 1936 dates were far more important than grades. Hmmm? Maybe they still are and I’ve just had my priorities reversed. ;)