Although so far I’ve talked about how attention acts as a filtering and boosting mechanism, I haven’t yet shown just how aggressively the brain can filter its input, or how intensely it can boost certain input signals. In fact, routinely, attention filters the billions of pieces of information streaming into our senses, or bouncing around our unconscious minds, into a maximum of three or four conscious items. So the filtering process is about as aggressive as one could imagine. But the boosting process can compensate for this limitation just as aggressively: Each of the mere handful of items can be an immensely complex mental object, and although their number is painfully finite, these conscious objects can be assessed, compared, and manipulated in virtually any way imaginable.
This tiny, yet ever so powerful output store of attention is our “working memory.” Working memory is an inherently conscious short-term memory container where we can remember, rearrange, and evaluate whatever is in this group of items, even if it comes from different senses or categories.
Over the past twenty years, the most prevalent psychological theory of consciousness has been the “global workspace theory” proposed by Bernard Baars. In many ways, Baars’ ideas resemble mainstream views on the psychology of attention. In the global workspace theory, there is again an unconscious fight for dominance between low-level coalitions of neurons, with a winner-takes-all attitude. The winner filters into consciousness, and here Baars draws a further parallel with attention by talking of a spotlight directed onto only a small portion of a theater stage. This spotlight is the subset of our world that we are actually conscious of, and it broadcasts itself to the whole audience; in other words, it makes just a small number of items available to much of the brain, potentially for further information combination and comparison. But Baars’ boldest and most interesting claim is that, more or less, consciousness boils down to the information sitting right now in our working memory. He views working memory as existing for a second or two, available to almost every corner of the brain, and there to guide unconscious specialized knowledge regions to help us carry out our most complex tasks, such as language and planning.
Although there are ambiguities in the definition of “working memory,” in the main I firmly agree with Baars that consciousness and working memory are largely synonymous processes, and that attention is the critical means by which items enter into consciousness. But the next key step, from the point of view of consciousness, is to fill in the details, to describe exactly how working memory functions, both psychologically and in the brain. Twenty years after Baars first formulated his global workspace theory, our understanding of working memory and attention is now far more comprehensive. And, with these advances, many mysteries of consciousness are being solved.
The first feature of working memory is its surprisingly limited capacity: it comprises a mere handful of conscious objects. Many different experiments have confirmed this constraint on our conscious space, though each study has had to take careful precautions to counteract our prodigious ability to develop strategies to cheat, that is, to enhance our capacity, usually by linking current items to our long-term memory store. The standard methods for removing the opportunity for such strategies are either to present stimuli so briefly that our myriad workaround tricks don’t have a chance to form, or to present more abstract items that have absolutely no relation to our preexisting memory.
For instance, in one landmark early study, George Sperling presented subjects with a grid of 12 letters, in 3 rows of 4, but only for about 50 milliseconds. Subjects then had to report as many of the letters as possible. They would get, on average, about 1.3 letters per row correct, or about 4 items in total (1.3 multiplied by the 3 rows is 3.9). In a fascinating twist, in some trials, Sperling also immediately followed the flash of letters with a cue to tell the subjects to give their answers from just a single particular row. Now, very surprisingly, they would generally correctly recall all 4 letters from one row instead of the 1.3 letters per row that they could previously manage, presumably because the immediate instruction enabled their attentional system to focus fully in on this one row before the fresh visual information faded. If instead the cue to center on a single row came a second or more after the grid had disappeared, then subjects returned to their previous performance, as if no cue had occurred, and could only answer about 1.3 items from this cued row. Within this single second, their attentional system, not knowing which row to focus on, had applied equal importance to all 12 items as they all faded from their initial fresh visual state, and only the letters from 4 random locations in the entire grid could be preserved in their limited short-term memory store.
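The arithmetic behind these two conditions can be laid out in a few lines. This is only a sketch of the reasoning in the paragraph above, not Sperling's own analysis: the function names are hypothetical, and the step from "all 4 letters of a cued row" to "all 12 letters briefly available" is an inference that rests on the subject not knowing in advance which row would be cued.

```python
ROWS = 3              # rows in the letter grid described above
LETTERS_PER_ROW = 4   # letters in each row

def whole_report_total(per_row_correct, rows=ROWS):
    """Total letters reported when no row is cued."""
    return per_row_correct * rows

def implied_available(cued_row_correct, rows=ROWS):
    """Letters that must have been briefly available across the whole grid,
    assuming any row could have been cued with the same result (an inference)."""
    return cued_row_correct * rows

# No cue, or a cue that arrives too late: about 1.3 letters per row.
print(round(whole_report_total(1.3), 1))   # 3.9

# Immediate cue: all 4 letters of whichever row is cued.
print(implied_available(LETTERS_PER_ROW))  # 12
```

On these figures, roughly three times more letters are momentarily available than can survive into the short-term store.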
A conscious limit of 4 objects turns up faithfully in almost any kind of experiment one tries. But in real life we do not usually need to remember letters in a grid, so I’ll share another example that will seem more natural. We commonly track multiple moving objects: maybe a group of people on the street that we walk past, or a set of players on a soccer pitch. Animals in the wild may also need to analyze where a group of other objects are moving. For instance, the members of a chimpanzee tribe may need to monitor the location of each member of a competing tribe that is encroaching on their territory. In an experiment that mirrors these everyday skills, Steven Yantis presented subjects with a set of 10 crosses on the computer screen. A subset of these initially flashed, and subjects had to keep track of them as they moved randomly around the screen and ignore the moving crosses that previously hadn’t flashed. At some point, the moving crosses would become stationary, and subjects had to say which of the crosses were the initially flashing ones. If there were only 3 crosses to keep track of, then subjects found this task relatively easy. When volunteers had to simultaneously track 4 objects, they were somewhat less accurate, but still performed the task competently. But when Yantis increased the number of crosses to track by just 1, to 5, most subjects found the task virtually impossible, because this number exceeded their working-memory capacity by a single item. This experiment is a striking demonstration of how sharp a barrier this capacity of 4 conscious items is.
Surprisingly, our working memory limit of a handful of items is basically the same as the monkey’s, even though a monkey brain is about one-fifteenth the size of ours. And our closely related skill of being able to recognize the number of items briefly presented to us—about 3 or 4 again, before we need to start approximating—is the same capacity limit that newborns have. In fact, many other species have the same upper bound to immediately counting the number of objects, including the lowly honeybee, which can differentiate patterns containing 2 from 3 items, or 3 from 4, but not 4 from 5 or above. So there may be something fundamentally limited about just how many items all animals can store in short-term memory.
. . . BUT EACH CONSCIOUS COMPARTMENT CAN HOLD OBJECTS OF GREAT COMPLEXITY
That the contents of consciousness, if you discount compensating strategies, are fixed at about four items seems to be a tremendous handicap. But in humans, especially, one should never discount strategies. We use built-in attentional mechanisms as well as the heavy ammunition of our conscious powers of analysis to regularly load huge quantities of data into each conscious compartment, shamelessly cheating our apparent working memory boundaries.
Turning first to the role that attention plays in boosting our capacity per working memory holder: Once attention has decided to prioritize a given object, whatever it may be, the neuronal war has been won. Activity in much of the brain is then shaped according to this current object and how it relates to us. For basic objects or features in the world, such as the color red as painted on a plain wall, attention boosts the signal by enhancing the readiness to fire of our visual regions, especially those for red. Non-red color-coding neurons may be suppressed, not only in our color-processing centers, but everywhere else as well. Our hearing and taste centers, for instance, may be inhibited. At the same time, all general-purpose regions, especially the prefrontal and parietal cortices, which are closely connected to consciousness, have activity that homes in on this current feature. All of this works well, and does help us spot red in the world, but the effects are not nearly as striking as when the brain has some internal hook to latch onto, so as to enhance the incoming signal.
If, instead of the red wall, the current object of attention is Angelina Jolie on the big screen in front of me wearing a red dress, then anything around me that’s not Angelina Jolie gets suppressed, and any corner of my brain with any relevant information about Angelina Jolie becomes activated. As soon as I see her, I recognize the features of her face, I know her name, recall how she speaks, have knowledge of her famous husband that I can easily retrieve, remember the other films she’s been in, and so on. And, of course, I can also see that she’s wearing red. These aren’t sets of unrelated facts; they are all bound together as a single, unified, complex object. The previous example of the plain wall as an attended single object effectively had red as the only feature. When I attend to Angelina Jolie, the same piece of information, red, is attached to my conscious representation of her, but this time red is only one of dozens of features connected to this single mental object. This is a fantastic system to have—attention takes this raw input and seamlessly transforms it into a panoply of interconnected facts by the time it reaches consciousness. And yet, because attention has activated and drawn together all the components of this one object, Angelina Jolie, it takes up the same single compartment in my working memory as does the plain red wall.
In other words, we may only have a few conscious compartments, but each holder can cope equally well with the simplest of objects or the most complex. And the term “working memory objects” in this context generally means just some bound collection of information. It could be a physical object, like Angelina Jolie. But it could equally mean one strand of the plan I devised for this current chapter as I was walking to Grantchester.
Just how much information can one working memory object support? This is where the concept of “chunking” returns in force. In terms of grand purpose, chunking can be seen as a similar mechanism to attention: Both processes are concerned with compressing an unwieldy dataset into those small nuggets of meaning that are particularly salient. But while chunking is a marvelous complement to attention, chunking diverges from its counterpart in focusing on the compression of conscious data according to its inherent structure or the way it relates to our preexisting memories.
One of the most dramatic experiments to demonstrate how chunking can expand what we store in working memory was published in 1980 by K. Anders Ericsson and colleagues. The experiment is beautifully simple: The scientists took one normal undergraduate, with an average memory capacity and IQ for a student, and gave him a basic task. The experimenter read him a sequence of random digits, and he then had to try to say back the digits he’d heard, in the order he’d heard them, just like trying to remember a phone number someone has just said to you. If he recalled the digit sequence correctly, the next trial would be one number longer. If he said it back with any mistakes, the next trial would be one number shorter. This is a very standard test for verbal working memory. However, in this case, there was a big twist: he did this task for an hour a day, for roughly 4 days a week, for nearly two years!
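The one-longer, one-shorter rule described above is a simple adaptive procedure, and it can be sketched in a few lines. This is a minimal illustration, not the study's actual protocol: the function names are hypothetical, and the simulated participant is an idealized assumption who recalls any sequence up to a fixed span perfectly and fails on anything longer.

```python
def next_length(current, correct, minimum=1):
    """One digit longer after a correct recall, one digit shorter after a mistake."""
    return current + 1 if correct else max(minimum, current - 1)

def run_session(span, trials, start):
    """Return the sequence length used on each trial.

    `span` is the longest sequence the idealized participant can recall
    (an assumption made purely for illustration).
    """
    length = start
    history = []
    for _ in range(trials):
        history.append(length)
        correct = length <= span
        length = next_length(length, correct)
    return history

# A participant with a span of 7, starting at 5 digits:
print(run_session(span=7, trials=6, start=5))  # [5, 6, 7, 8, 7, 8]
```

Under this rule the sequence length climbs to the participant's limit and then hovers just around it, so the procedure tracks any improvement in capacity as it happens.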
At the start, he was able to remember about 7 numbers in a sequence, which is indeed about average (almost everyone improves on their initial verbal working memory limit of 4 through various rehearsal strategies). But as psychology experiments go, this one could well have won a prize for the most boring in the world, being the same day in, day out, for months on end. Perhaps in order to spice things up for himself, the participant seemed determined to improve his performance. And improve he did, until, by the end of the experiment, 20 months later, he could successfully say back a novel sequence that was 80 digits long! In other words, if 7 friends in turn rapidly told him their phone numbers, he could calmly wait until the last digit was spoken and then, from memory, key all 7 friends’ numbers into his phone’s contact list without error.
On occasion, he was tested after a session to see if he could still recall any of the sequences from earlier on in that session. At the start of the experiment, he was understandably useless, hardly remembering anything of the digit sequences, even though they were only 7 digits long. However, toward the end of the experiment nearly two years later, despite the sequences now being over 10 times longer than when he began the experiment, he could remember the vast majority of the sequences perfectly. So not only could he have immediately recalled 7 combined phone numbers, just after hearing them, but he could also have typed them in without error an hour later! How did he achieve this seemingly superhuman improvement in performance?
This volunteer happened to be a keen track runner, and so his first thought was to see certain number groups as running times. For instance, 3492 would be transformed into 3 minutes and 49.2 seconds, around the world-record time for running the mile. In other words, he was using his memory of well-known number sequences in athletics to prop up his working memory. This strategy worked very well, and he rapidly more than doubled his working memory capacity to nearly 20 digits. The next breakthrough, some months later, occurred when he realized he could combine each running time into a superstructure of 3 or 4 running times, and then group these superstructures together again. Interestingly, the number of holders he used never went above his initial capacity of just a handful of items. He just learned to cram more and more into each item in a pyramidal way, with digits linked together in 3s or 4s, and then those triplets or quadruplets of digits linked together as well in groups of 3, and so on. One item-space, one object in working memory, started out holding a single digit, but after 20 months of practice, could contain as many as 24 digits.
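The pyramid described above lends itself to a quick back-of-the-envelope check. The sketch below is my own arithmetic under stated assumptions, not the structure reported in the study: the function names are hypothetical, and the particular grouping sizes (4 digits per running time, 3 running times per group, 2 groups per item) are just one illustrative way of arriving at 24 digits in a single item.

```python
from math import ceil, prod

def digits_per_item(group_sizes):
    """Digits held in one working memory item, given the group size at each
    level of the pyramid (bottom level first)."""
    return prod(group_sizes)

def items_needed(total_digits, group_sizes):
    """Working memory items required to hold a sequence of `total_digits`."""
    return ceil(total_digits / digits_per_item(group_sizes))

# Before practice: one digit per item.
print(digits_per_item([1]))         # 1

# Illustrative pyramid: 4 digits per time, 3 times per group, 2 groups per item.
print(digits_per_item([4, 3, 2]))   # 24

# An 80-digit sequence then fits in a handful of items.
print(items_needed(80, [4, 3, 2]))  # 4
```

With items this large, an 80-digit sequence needs only about 4 of them, which is consistent with the point that the number of holders never rose above the usual handful.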