Quality of Information, GopherCon94 (April 22, 1994) Anders Gillner, KTHnoc/SUNET "The mind is built on an enormous selection of information and it does not get more knowledgeable because of the information it contains, but because of the information it does NOT contain....." Tor N=F6rretranders "Information should not be mixed up with meaning..." Warren Weaver Background Making things clear from the beginning, this talk is NOT going to be tecnical stuff... After having dealt with information in the networked world for a couple of years now, I have found that we pay a lot of attention to technicalities, not only when we are dealing with networking, but when we are dealing with information and ordering information as well. I t seems like some people are satisfied with the mere situation that the information is there, and is ordered and available, and in my opinion, it is not. It just might be a good idea to stop for a while and think about why we need all the information out there on the net, and what kind of results we want from people working with that information. I spend a lot of time, travelling inside and outside my country, giving talks to people about the Internet, and what impact using the net can have on their business or their academic professional lives, and I get all kinds of questions. A typical question is; "But isn`t it real cheap to manufacture a CD-ROM today ?" The obvious answer is "Yes, it is very cheap, but what do you want to do with it ?". At that time the person asking me the question, looks at me as if I am some kind of geek from outer space, because for them everything is set and ready when you got your data down on the CD. Again, obviously that is not so, to make a CD and fill it with pictures of a certain kind, or text information covering a certain subject, will cost about a hundred dollars, to turn the CD into some kind of learning tool will probably cost you around =A4150.000, and I think that the difference between the stuffed CD, and the learning tool is an equivalent to the network information services we have today, and what we could have if we really used the network in the ways it could be used. The situation today In an article in a Swedish newspaper recently, a Swedish physicist said " The global "village"is a big first time experiment". What she meant was that the we don`t have a "global mind". Since beginning of history, man has been a more or less geographically static animal. A hundred years ago, most people did not travel more than a few kilometers a day, and their lives would have seem utterly boring to us. Today "Netlanders" travel all over the world in a virtual sense of the word. That article as a whole deals with the human helplessness in a situation where she has no overview and where no respect whatsoever is payed to the human way of subconsciously sorting out information, a situation which I think describes the situation for a lot of the new users we have , or are about to have, on the networks today. I sometimes have the feeling that we put users on the top of a slalom pist, telling them to learn the skill of turning, and the rules of the game on their way down ! We, "the Netlanders", know the rules, we (occasionally), have the subconscious ability to sort out the information we want, because this is where we "live". They don't. We have, in a way passed the threshold where we do not use only our concious methods to work on the net, we have started to develop an instinct for solving problems on the net, and what is an instinct if not just an unconcious feeling for what is right to do in a certain situation ? The concept of the "Global Village" is not really something that netpeople use, it is an expression that they use as a description of the net, in trying to describe their world to the outsiders. And I am quite sure that a lot of the people on the net really think that that concept only covers the half of it. New "Netlanders" have to be helped out, and what we should discuss more is how to do that, or how we could disseminate the knowledge we have to them. The Network Hardware Store So the today 's situation on the net is quite confusing for the unexperienced user, and we have already seen a couple of solutions which tries to solve that problem. Almost since the first gophers came upon the net, we have had "Subject Tree:s" trying to order information in categories taken from one or another Library Classification Scheme, after the model "L2=3DLong Nails, L23=3DLong Nails, Blue, etc...". This is of course better than chaos, but do we really use this new toy of ours in the best possible way? The normal way for humans entering something new, is to make it look like something that has been around for a long time , our first cars looked like horse powered chariots for example. Ok, it is hard for us to take advantage of computers and networks, but is the solution really to make computerprograms and network resources look as familiar as possible ? I mean the kind of thinking that makes a designer design a program logo and interface for a register program so it looks like a card register ? I personally sometimes have the feeling that some people thinks "Hey, this new thing is scary, let`s make it look familiar so people don`t go off their rocker when they see it !" But the real problem is not what it looks like, but how it is percieved. If you make people use a digitized card register, there is a risk that you lock their conceptualisation of the tool they are using, setting up limitations, not only about possible ways of using that new tool and the network as a whole, but in human perception of theses new possibilities, so we are really dealing with psychology here, and we might really need totally new interfaces to information than the ones we have today, not just to be able to use the information in the best possible way, but also so that we communicate to the users that this is something entirely new I think that what we need is research in the area. How do humans scan information ? How do we store information ? We know that the brain does a primary selection of information more or less on receptor level, and that information is sorted out very early in the perception process. Let me give an example, A playground with fifty kids playing, mine gets hurt and start crying. For me the background noise more or less disappears, because all I hear is my kid crying. The stronger the feeling is that is associated with a sound, the stronger is my reaction is when it comes to filter out the information that is crucial to me, or maybe to my survival. These functions are old ones, because man is an old animal, and we are in no way designed to work in an environment like the one we have on the network today. The information flow today is more likely to bring out another old mechanism in us, habituation. How many of you have not had, what I should like to call the "Bosnia experience" ? Day after day, they send out pictures of houses getting shot and bombed to pieces, people crying, thousands and thousands of refugees etc. and after a couple of weeks your feelings about the whole thing is more like boredom than something else, you have had too much similar information, and another crying child (which is not your own) does not bring up the same feelings in you that the first one you saw did. You have gotten yourself habituated... So, we tend to fall back on our old knowledge of the world when we sort out information, and we also tend to sort out information that fits into the the knowledge or information that we have before, we , sort of, need a hanger to hang our coat on, and if that is not there, it takes a great deal of learning and restructuring of the mind to "manufacture" a new one. Sorting out Information, How to improve on quality ? Craig Summerhill from CNI, sometimes refers to the present level of networked information as the Paleolitic level, and I think that it is important to bear in mind that we have just started. It willtake us a long time to get where we want, and the goal will shift during our travel there. Probably even our way to percieve information as such will change, because it is quite traditional in it's nature. A lot of the quality information flow has traditionally been handled by librarians, and the information experts will probably be with us in the years to come, even though their roles will change a lot. Instead of sorting electronic books into electronic shelves, their role might be to determine which expert system that might benefit from being exposed to a certain piece of information, and even though the system itself determines if the information really fits, the percental weights of the information quality has to be determined by the librarian. In the future we will probably have networking tools that do that primary sorting for us. We are talking about years of experience here, maybe it is so that "growing up" in a networking sense means that you use years instead of hours to set up the filtering mechanisms that you need to be an experienced network information user. We have to use computers and networking as extensions to our primitive selves, not as tools as primitive as we are . When I worked as a psychologist during the 70's, someone at psychology department at the University of Gothenburg came up with a tachistoscopic test that was said to test man's psychological defense mechanisms. That might be dubious to some of you, but the fact is that when they tested combat pilots in the Swedish Air Force, they found that the result predicted with 90% accuracy which of these who was going to have a crash during the next couple of years. What they tested was a persons ability to percieve a picture without shunting out some of the information in it. It seemed like some persons consistently threw away information that was threatening to them, and obviously, these did not grow personalities that made them good pilots... Humans tend to act both psychologically and physiologically during stress ,which this is a good example of, and we are not only shunting away information that we don't like, we also tend to, like in the crying child example, to focus on information that we like, our pupils dilate when we look at somebody we like, our pules rate goes up when we fall in love or see something that we really like, and we also get GSR reactions when we hear something that is good or bad, and these reactions does not neccesarily, have causes consciously known to us. What I want to say with this is that we really don't know much about our "own" idea of the world, the unconscious sometimes makes choices for us that we are not aware of, and sometimes the choice is made on a very primitive level, fooling us to think that we act optimal in a situation when we don't do that at all, but using more than one of our senses while sorting information seems to be a good idea Making it better in a short perspective This might be the future, but how to improve the quality in a short perspective ? Let us make good use of the librarians we have, and of the systems we have. Putting librarians to work with roaming the network for items to sort will of course not scale in the long run, this will have to be done in a distributed way. The local librarian should only be responsible for the classification of the material that her/his site put onto the network. Use any classification scheme, it does not really matter, because the good thing is not the scheme, but that the resource is stamped as a quality resource. If we will be able to make out the difference between quality checked resources and the rest, we have improved on the information quality on the network in a terrific way, compared to the situation today, and if we are able to do searching with a parameter that tells the search program to look for quality resources only, we have improved it even more. We will still have the Network Hardware Store, but we will at least be sure that some of the boxes will contain nails with a certain reliability. There is, of course, also the possibility to make more than two categories of information, separating out for example news, university courses, K12 information and adminastrative information will make it even better. So we need a lot of librarians out there, and who will pay them ? Information is costly, and knowledge will cost a lot more still..... Information reduction is always costly, and as in science, data reduction will cost you some of the valuable content as well while reducing the noise that you really wanted to get rid of The part you really want to get rid of is what sometimes is called "exformation", and to get rid of that will always cost you, knowledge, time, money, you name it. Somebody will always pay! There has been a tendency on the net, to argue for a situation where all the information should be free, and that is possible when you put your own free time onto deriving information out of data for the benefit of the net population. "How to make a japanese kite", or "The Complete List of Beatles Songs" might be free on the net in the future, but I do not think that the Reuter statistics, or the Jane Fonda Workout program will ever be free. Making this information (or whatever you like to call it:-) )has cost some commercial company big bucks, and they are not likely to give it away for free. We are about to get commercial information on the Net on a big scale, and we better be prepared for .... Coca-Cola Information..... Netland has it's advantages compared to other countries. The average level of intelligence , for example, must be very high, and that situation will not change as long as the net is, more or less, academic. Net society today share a lot of views about the world, we are all more or less , well eduacated, which among other things, means that we share a certain view about our environment, about the neccesity of research, about the neccesity of education, about the neccesity of beeing aware of the bias of our own perception, etc, but what will happen when systems like Minitel brings other groups onto the net ? The Net will have some inpact on these people, but they will have some inpact on the net community as well. As soon as we open Pandora's box, we will have all the problems that other media have and have had during the past 400 years on our back, and if we don't think about policy before that, everybody that thinks that their ideas are the right ideas and that nobody else's ideas have the right to live, will try to control what others do and have available. So we are in for a change, and what can we do, to make that change an easier one ? Not very much that I could think of right now, but being aware of the storm at least give you the possibility to prepare for it. The market could be both a problem, and an advantage. The market will sell us information, but we should also be concerned about too much concentration of power around any special player on the information market, just as such concerns has risen when a similar situation has become reality in on other market. He who controls information controls the world is a saying that I don't think that anyone of us wants to see become a reality, although we want it to come true in a broader perspective. The authors of an article about what they call "cyberwar " say that information will be the next crucial point when winning the next war, as it had not always been like that.... The filtering of information has always been something that censors and others have used as a tool in warfare, and I really think that they have a harder time to filter today, than they had in old times. Using intelligence information has become easier though. What we don't want to have on the Net is a totally controlled information market where dictators, companies or others that like to think that they are controlling what we shall know or not know, is ruling. Communicating Information, not as easy as it might seem Now if I want to communicate something to someone else, how do I do that ? I speak to him or write it down for him to read. Which means that the information just has made it through the first filter Speech is information ordered already, because putting words on something is a filter in itself, compare somebody's notion of a horse, and the word horse, which contains only a fraction of what the person saying it, means by "horse", which can be totally different from what the person listening means by "horse". Human communication really has its drawbacks, and the big question is if the network , and the new technology could make it any better. Man has always tried to write laws that were meant to regulate how man should live together with his peers in society, and just see were we have ended up. The language that we use is really an extremely inexact representation of what we really mean, in spite of body language and other non-verbal ques. Using other types of communication on the network, like we do in real life but in an extended way, and using all our senses, might give us an opportunity to improve on information quality in centuries to come. What do we want the information to do for us ? What we really need is not information, but knowledge. We need people to expand their knowledge of the world, and we want our kids to be able to be better citizens in the future. So how do we get knowledge out of a system that only offers information ? Well, to get knowledge you need knowledge, so we are heading into psychology again, but learning psychology this time. There is really no learning psychology designed for network use, although there are quite a few attempts, mostly associated with new CD-ROM learning tools, and some of these will of course work on the network as well, but to be able to find your way through an encyclopedia, you need to know how to do that , so we probably have to improve the basic skills of reading if we want this to expand outside the educational and commercial sector. If you take a look at a network map today, you will find that networking is an industrial world phenomenon, and that there must be a marvelous market for distance education in the developing world. I can see networking and distance education as one of the main streams of foreign aid in the coming years if the institutions giving that aid are smart enough to see the potential it. Helping the developing countries out in this respect should be one of our prime concerns for the future, if we don't want to carry on in a world of have's and have not's. But we do need research in the learning psychology field to make the most out of networked information, and turning that information into valuable knowledge, might be the next great challenge for us.