Editor's Notes
In our eleventh issue of Usability News:
Per the request of several subscribers, we have added short summaries at the beginning of each article in this issue. This will allow you to get the "gist" of each study before reading the details.
One of our SURL members, Bonnie Rogers, is investigating a better measure of online experience. Please help us out by taking her 20-minute survey. Results will be published in our next issue.
Also, we invite you (especially international subscribers) to participate in a new online Web Objects study. This study will help us to understand where individuals worldwide expect certain web objects to be located on a typical ecommerce website.
Usability News is distributed to over 5000 usability professionals, developers, managers, and researchers in over 60 countries. We welcome your feedback and comments. Contributions, suggestions, and submissions for future issues should be directed to barbara.chaparro@wichita.edu.
SURL Home Page: www.surl.org
Usability News: www.usabilitynews.org
Designing for usability: www.optimalweb.org
Internet Experience Survey
We all know that Internet "experience" is more than just the number of hours or minutes we use the Internet each week1. However, we believe there is no validated instrument available to help us truly measure online experience. We have developed the Internet Experience Survey to more fully investigate the dimensions of experience.
As subscribers to Usability News, we thought it would be interesting to include your experience data in our analyses and development of this new scale. We are eager to include diverse populations to better generalize our findings to those who are designing, developing, and applying web applications.
Please take the time (20 minutes) to complete our online version of the Internet Experience Survey by clicking the following link (this link will open a new browser window):
Click on this link to take the survey at http://spss.wichita.edu/internetsurvey
We thank you for your time and your interest. Results of the survey will be posted in our next issue of Usability News.
1See ../usabilitynews/52/experience.htm for more on our quest to measure online Experience.
Optimal Web Study
Michael Bernard & Ashwin Sheshadri
You are invited to participate in a short study that seeks to understand where individuals expect certain web objects to be located on a typical ecommerce website. Web objects include such things as navigational items, advertisements, and text headers. We are interested in the expected web object locations for both usability and non-usability professionals from around the world. This information will allow us to know:
Where do typical, non-usability web users generally expect certain web objects to be located on ecommerce websites and how does their expectation correlate with usability professionals’ expectations?
Are there age and/or regional differences (such as French compared to German) in the expectation for the location of certain web objects? If so, what are these differences?
Do users have a certain preference for the location of specific web objects on a typical ecommerce website? If so, is it different from where they expect them to be located?
In order to know the answers to these questions, we are asking you to answer a short survey -which typically takes no longer than five minutes. The results will be published in an upcoming edition of the Usability News newsletter.
Please click on this link to start the survey: http://www.webobject.org
Note: This survey is compatible with Microsoft Internet Explorer ONLY.
What’s the Skinny on Weight Loss Websites?
As 2004 begins, millions of Americans have resolved (once again) to lose weight. MyGoals.com listed “to lose weight” as the most popular resolution for 2004 (2004 New Year’s, n.d.). In a meta-analysis of dieting programs and long-term weight loss success stories, reviewers for the International Journal of Obesity reported that a correspondence program was the most effective way to assist in long-term weight loss and improved fitness. A study conducted by Tate, Wing, and Winett (2001) found that the Internet and email were reliable methods for assisting in weight loss programs. At the 2003 annual meeting of the American Dietetic Association, experts supported the value of online resources for complementing dieting efforts. A panel of experts at the meeting unanimously agreed that online programs should be used as tools to help dieters “achieve permanent weight loss and improved fitness” (Burke, 2003).
Advertising in popular media suggests there are currently many popular weight-loss programs—Jenny Craig, Weight Watchers, South Beach Diet, Atkins Diet, Zone Diet, among others. In this study, we chose to evaluate the websites promoting three of these popular weight loss programs: Jenny Craig, Weight Watchers, and the Atkins Diet (see Figures 1-3 for images of these sites’ homepages). This usability test evaluated first-time users’ satisfaction, navigational efficiency, and general preferences for the three sites.
We also began exploring the usefulness of gathering user eye-movement data in conjunction with web usability testing. On virtually any given e-commerce website there is a war being waged for user attention. The collection and analysis of eye-tracking data would logically benefit web designers and advertisers who need to know where users look and why. What draws attention and what inhibits it? What aspects of a site are looked at most often and the longest? Can reading behavior be influenced by the page design? These and many other questions still remain to be answered as we continue to investigate the difference between eye-movement, attention, and comprehension.

Figure 1. JennyCraig.com homepage

Figure 2. WeightWatchers.com homepage

Figure 3. Atkins.com homepage
METHOD
Participants
Nine participants (5 female, 4 male) with an average age of 24.6 volunteered for this domain study. All participants were familiar with the Internet (67% reported using it daily); none reported having visited the online diet sites before. Five participants reported purchasing products online 1-5 times in the last year, 2 reported zero online purchases in the past year, and the remaining reported buying more than six times. Five of the participants indicated they were somewhat interested in diet information, while 3 reported they were not interested at all, and 1 participant was neutral in opinion.
Materials
A Pentium IV-based PC computer with a 75 Hz, 96 dpi, 17” monitor with a resolution setting of 1024 x 768 pixels was used. In addition, the monitor used during testing was an integrated component of the Tobii ET-17 eye-tracking system. The Tobii ET-17 was used to detect and collect participant eye-gaze data prior to usability testing. This system is characterized by the unobtrusive addition to the eye-tracking hardware (e.g., high resolution camera and near infra-red light-emitting diodes) to the monitor. This aspect helps promote more natural user behavior by not placing unnatural restrictions on participants (e.g., helmets, head-rests, etc.). Participants answered the System Usability Scale (SUS) instrument (Digital Equipment Corporation Limited 1986), which was slightly adapted for web usage and consisted of 10 satisfaction questions using a 1-5 Likert scale. A final questionnaire was used to solicit the participants’ overall preference for one of the three websites.
Procedure
Participants completed a background questionnaire about their typical Internet usage. Participants were seated approximately 60 cm away from the monitor. The eye tracker was calibrated for each participant. Once the eye-tracking apparatus was calibrated, participants were asked to view an image of each website home page and to report their impressions as to the site’s purpose and the types of information likely to be available on the site. During this time (20 seconds per image), participants’ eye-movements were recorded as they visually inspected the image and gave verbal reports. After this initial task, participants were given a total of seven tasks for each of the three sites (site order and task presentation were counterbalanced across participants). The tasks included finding details of the diet plan, recommended types of exercise to accompany the plan, success stories, and advice when eating out at restaurants.
Participants’ search efficiency, or ‘lostness’ was measured by the number of pages traversed beyond the optimal number of pages to complete a task as measured by the tracking program Ergobrowser™. In order to better analyze the efficiency and intuitiveness of the site structure, participants were instructed to find the answers to the tasks without the assistance of the site search engine. After completing the tasks on each site, participants answered the SUS. After tasks were completed on all three sites, participants were asked to indicate their favorite website of the three.
RESULTS
Time
Average time taken to complete the seven tasks ranged from 12.24 minutes on the Weight Watchers site to 12.90 minutes on the Jenny Craig site. The time taken on the Atkins site (M = 12.87 minutes) was similar to that taken on Jenny Craig. Results of a repeated measures one-way ANOVA revealed no significant differences across the three sites for time taken to complete the tasks.
Search Efficiency
Significant results were found for overall search efficiency or “lostness” on the seven tasks using a repeated measures one-way ANOVA, [adjusted F (1.347, 10.779) = 4.718, p < .05]. Paired samples t-tests indicated the Atkins website (M = 21) was significantly more efficient than both Jenny Craig (M = 30) and Weight Watchers (M = 31.44). Figure 4 illustrates the search efficiency for the three sites.

Figure 4. Mean search efficiency for all seven tasks
Success
Figure 5 shows participants’ rate of successful task completion across all three sites. Participants were most successful in completing the tasks on the Atkins site, and least successful on the Jenny Craig site. No significant differences were found using a repeated measures one-way ANOVA.

Figure 5. Successful task completion on diet sites
Satisfaction
After completing each task, participants indicated their subjective satisfaction with each website. Atkins received a mean score of 30 (maximum possible = 50) which was slightly higher than Jenny Craig (M = 29.22) and Weight Watchers (M = 28). Results of a repeated measures one-way ANOVA revealed no significant differences across the three sites for satisfaction.
Preference
After all testing was completed, participants were asked to indicate which site they preferred overall. Figure 6 shows more participants chose the Atkins website as their favorite site.

Figure 6. Participant preference for diet websites (# participants choosing site as first choice)
Eye-Tracking Data
“Hot spot” maps summarizing the gaze position for each website home page were generated across users (though only 7 of the 9 participants were included due to calibration difficulty). The maps are color-coded similar to what you might expect from thermal imaging, with red being the most viewed area, graduating down to light yellow to indicate less fixation time. These hot spot maps are useful for a quick assessment of the areas of the page most frequently fixated on by participants. We also studied the user visual fixation data for experimenter-defined areas of interest (AOI’s) on each image. This enabled the collection of multiple visual fixations in a large area to be combined, so that a more precise estimation of user attention could be obtained.
Participants were asked to view the image of each site and then verbally report “what information they could expect to find on this website.” According to this data, the hot spot map of the Atkins.com site (Figure 7) indicates which specific areas received the most visual attention by participants. The most frequently and longest fixated upon AOI (M=2748ms, SD=1366ms) on this page was the rectangular area in the center of the page with the title “4 steps to a healthy new lifestyle.” The attention afforded this AOI is reasonable as it contained a block of text that explained the philosophy of the Atkins diet plan (and was targeted information). Despite the presence of the color red on the hot spot map, the ‘My Atkins’ AOI was actually fixated upon for less time (M=1365ms, SD=1412) than the navigation bar (M=2474ms, SD=2129ms) and the Atkins logo (M=1717ms, SD=1162ms). This was most likely because the other AOI’s were larger and fixation times collected in those areas exceed that for the smaller areas.

Figure 7. ‘Hot Spot’ map of the Atkins homepage
The hot spot map and AOI dwell time data for the Jenny Craig home page (Figure 8) indicate the area most fixated on to be the ‘Internet-only’ advertisement (M=2346ms, SD=1235ms), followed by the ‘Joy lost 23 lbs’ text (M=1544ms, SD=1102ms). It is interesting to note that details about the Jenny Craig diet plan and clues as to what information is available on this website located in the lower left portion of the screen was fixated upon an average of 1,000ms less than the ‘Internet-only’ ad (M=1306, SD=1108). The results for the Jenny Craig page are very similar to those of the Atkins page; participants apparently focused primarily on very specific areas of the screen for the majority of the 20 seconds. However, in both cases, it does not appear as though the areas of the page that actually contained the most information about the website and/or diet plan received the most fixation time.

Figure 8. ‘Hot Spot’ map of Jenny Craig homepage
The hot spot map for the Weight Watchers home page (Figure 9) was somewhat different. The most popular areas were the ‘Flex Points’ caption (M=2459ms, SD=1424ms), the ‘Meetings’ text directly under it (M=1985ms, SD=1229ms), and the information for online meetings below that (M=1925ms, SD=874ms). It is interesting to note, however, that on this site participants apparently fixated on areas on the left side of the screen considerably more than the right, and their hot spot areas cover more area on the screen than the other two diet sites. This suggests that no specific areas of the screen (other than the entire left side of the page) attracted the users’ attention which may have resulted in a less efficient visual search.

Figure 9. ‘Hot Spot’ map of Weight Watchers homepage
It is difficult to draw any specific conclusions from the preliminary eye-tracking data. A general observation would be that participants focused more on very specific areas of the Atkins and Jenny Craig home pages. However, upon further inspection, these areas are not necessarily those that would assist the participant in objectively determining what kinds of information might be found on the sites. Data from the Weight Watchers home page was somewhat different, in that participants seemed to both favor the left side of the screen over the right, yet did not have a specific set of targets on that side that held their attention.
DISCUSSION
Results from this study showed that users were able to complete the tasks successfully and were equally satisfied by all three sites. However, participants searched for the information more efficiently on the Atkins site and preferred this site overall.
Participants found some tasks more challenging than others across the three sites. Five users failed to correctly explain the purpose of the Jenny Craig program. This finding is complimented by the eye-tracking data, which showed participants (at least initially) fixated on graphics or large text on the home page more than the main navigation area or information near the ‘fold’. The home page was not the only problem, however, as participants did not look deep enough into the site to locate the fact that you must buy your food from Jenny Craig centers.
Many participants commented that the Atkins site was easier to use. The sitemap was a benefit noted by one user. The site was labeled as “well planned out” and “more professional” than the other sites. The titles and links were viewed as more specific resulting in an easier navigation experience. Participants commented negatively about the many areas of the Weight Watchers site that require a paid membership. The participants did not feel they could get the full scope of the site from the limited areas made available to all users.
Areas noted by participants to impact overall preference included the following:
Navigation Location: Locating the navigation at the top
of the screen was seen as beneficial by many participants. However, on the
Weight Watchers site, clicking the top navigation resulted in a sublevel
menu directly below the clicked tab. Many users commented on the small size
of the font and most missed the sublevel navigation completely on the first
few tasks. Jenny Craig uses a similar structure after the home page, but the
sublevel navigation is further below the clicked top level navigation. On
the Atkins site, the home page has the main navigation links located at the
top of the screen as well. On subsequent pages, the sublevel navigation is
located to the left of the main information area. Users repeatedly ignored
the left navigation. Possible explanations for this include the fact that on
the home page this space is occupied by a picture; this area is also a
different color and may have appeared to be an advertisement; or the user
may have expected the sublevel navigation to be closer to the top level
navigation.
Navigation Wording: Users complained about the wording used for the links on
all three sites. The task asking about exercise was particularly troublesome
because there was no consistent location for it on the sites. Several
participants were looking for the key word “Exercise” which was not on any
of the sites. Weight Watchers used the term “Healthy and Fit” which was more
intuitive to the users than the wording used on the other sites. Wording
also affected results on the task about eating out; again users expected
this to be worded as “eating out” or “dining out.”
Membership Requirements: The membership requirement was a major factor affecting the opinion of the Weight Watchers’ site. The site uses a different model than the other two and requires a paid membership to access many features. The users did not feel they could get adequate information about the weight loss program without paying for a membership. Some users commented they thought they would have to go to a meeting to get information.
Page Layout: Many users preferred the home page of the Atkins site over the other two sites. The initial page of the Jenny Craig site contained little information and had two large buttons reading “Client” and “Guest.” Almost all of the participants started their activities by clicking the guest button. For tasks such as locating a recipe, they completely ignored the link labeled “Recipes” and clicked the Guest button. The Flash features on the Jenny Craig home page were also described as distracting and meaningless. The favorite feature on the Jenny Craig site was the search functions within sections of the website. The search boxes were located in a highly visible area to the right of the main text and were easy to use when locating recipes and success stories. The Weight Watchers site was reported to be “cluttered and unorganized.” Most users liked the large blue buttons located at the top of the screen on the Atkins site. As mentioned, most users completely disregarded the sublevel navigation to the left of the main text area. Other complaints on the Atkins page layout regarded the amount of space used immediately below the navigation bar. Participants did not understand the purpose of this portion of the page and often found it led them to information they were not seeking.
REFERENCES
2004 New Year’s resolutions bode well for economy. (n.d.). Retrieved December 21, 2003, from MyGoals.com Web site: http://www.mygoals.com/about/HF
Digital Equipment Corporation Limited (1986). System Usability Scale. Reading, United Kingdom: Digital Equipment Co. Ltd. ©.
Burke, Susan L. (2003). Losing weight online: The new diet evolution. Retrieved December 21, 2003, from http://www.ediets.com/news/article.cfm?article_id=8675&pg=1
Tate D.F., Wing, R.R., Winett, R.A. (2001). Using Internet technology to deliver a behavioral weight loss program. Journal of the American Medical Association, 285(9), 1172-1177.
Summary: Recent studies have shown that while the use
of breadcrumb trails to navigate a website can be helpful, few users choose
to utilize this method of navigation. This study investigates the effects of
"mere exposure" and training on breadcrumb usage and satisfaction. Findings
indicate that brief training on the benefits of breadcrumb usage resulted in
more efficient search behavior.
A
breadcrumb trail is a textual representation of a site’s structure, e.g.,
Home >
Furniture > Chairs > Leather Chairs. This
representation of information allows users to link to major categories of
information along a continuum of sequential order. Research has reported
that breadcrumb navigation improves site efficiency (Maldonado & Resnick, 2002; Bowler, Ng & Schwartz, 2001).
Results of previous studies
exploring breadcrumb usage have shown that, when
given a choice, users choose the Back button or navigation bar to
navigate rather than the breadcrumb trail (Lida,
Hull, Pilcher, 2003). Shneiderman (1998) considers
breadcrumb navigation to be a tool for frequent users. He states novice
users who do not realize the advantage of the tools would simply ignore
them. Lazar and Eisenbrey (2000)
report that most people do not actually look at the breadcrumb trail.
A study by Lida-Rogers & Chaparro (2003) revealed
that participants who
did use the breadcrumb
trail were found to use the Back button less often than users who did not
use the breadcrumb; however, no differences were found in the efficiency
measures of total pages visited, navigation bar clicks, embedded link
clicks, or time to complete the search tasks. It was suggested that perhaps
participants may not understand the function of the breadcrumb as a
navigational tool and that possibly understanding the purpose of
the breadcrumb trail or minimal training might impact usage and/or
efficiency. According to Lazar & Eisenbrey (2000), the first step in making
breadcrumb trails useful in practice is to teach users that they do exist
and are a valuable resource.
In this study we considered the theory of mere exposure as a means of introducing participants to breadcrumb trails. According to Zajonc (1968), if the function of orienting behavior is eventually to change a novel stimulus into a familiar one, it is also its consequence to render the stimulus object eventually more attractive. Zajonc (1968) states that the mere repeated exposure of the individual to a stimulus object enhances his attitude toward it. We questioned if merely exposing participants to usage of breadcrumb trails was enough to enhance participants’ frequency of breadcrumb usage.
METHOD
Participants
Thirty-nine college students (12 males and 27 females) with an average age of 23 (range 18 to 48) volunteered for this study. All participants reported being familiar with the Internet, with 59% of participants reporting they had used the Internet for at least the past five years and 82% estimating that they spend time on the Internet at least once a day.
Materials/Procedure
The Wal-Mart® website was chosen for examination as a breadcrumb trail is one method of navigation (Figure 1).

Figure 1. Breadcrumb trail used on Wal-Mart.com
Participants were randomly assigned to one of three conditions. Condition I participants were shown the use of the breadcrumb trail twice by the experimenter prior to testing. After this exposure, participants in this condition were then given an explanation of the breadcrumb trail by the experimenter and were requested to use this method of navigation “whenever it is possible and most efficient to do so.” Condition II participants served as the mere exposure group and were shown the same usage of the breadcrumb trail as participants in Condition I. However, this group was given no explanation or instruction to use the breadcrumb trail. Condition III participants received no exposure, explanation, or instruction on the breadcrumb trail.
Participants were provided with a scenario indicating that they would be going on a camping trip and needed to purchase some items from the Wal-Mart® website for the trip (e.g., tent, sleeping bag, fishing pole, camera).
RESULTS
Participants in Condition I (Exposure & Instruction) generally used the breadcrumb trail more, the Back button less, and traversed fewer pages to purchase the items than participants in the other two conditions. However, no significant difference was found between the Condition II (Mere Exposure) and III (No Exposure or Instruction) participants in terms of the number of breadcrumb clicks, Back clicks, or total pages traversed.
Table 1. Comparison of navigation data across conditions (n=13 per group)
|
|
Condition I Exposure & Instruction |
Condition II Exposure |
Condition III
No Exposure or Instruction |
|
Breadcrumb clicks |
9.92(1.66) |
6.07(4.56) |
5.62(4.75) |
|
Back clicks |
.31(.631) |
5.31(6.97) |
5.38(6.55) |
|
Total pages |
58.62(3.99) |
65.46 (5.97) |
68.77(5.78) |
|
Total Time (in sec.) |
560.46(93.01) |
724.08(264.69) |
832.54(170.37) |
A significant difference was found for total time to complete search tasks. Post hoc analysis revealed that participants in Condition I completed all tasks in significantly less time than both Condition II and Condition III (see Figure 2, below). Total number of pages visited was also significantly different across groups. Participants in Condition I traversed fewer pages than participants in the other two conditions.

Figure 2. Total time for participants to complete all tasks for each condition.
DISCUSSION
In this study, participants were tested in one of three conditions to determine the impact minimal training and exposure had on influencing usage of breadcrumb navigation. It was hypothesized that merely exposing participants to usage of breadcrumb trails would prompt participants to use the shortcut tool for navigation. We found that mere exposure alone was not enough to significantly influence their usage more than the participants receiving no exposure to the breadcrumb trail. However, we did find that minimal training did affect participants’ usage of the breadcrumb trails and resulted in quicker completion times, visited fewer pages, and minimal use of the Back button.
On average, those participants who received both exposure and instruction completed all tasks much faster than participants receiving only exposure (3 min) and those receiving no exposure or instruction (4 min). This time savings could result in increased productivity for users that search websites on a daily basis. Future research should investigate the development of navigational training, especially in corporate environments, where time savings translate directly into corporate savings.
REFERENCES
Bowler, D., Ng, W., & Schwartz, P. (2001). Navigation bars for hierarchical websites. Retrieved 01/20/03 from University of Maryland, Student HCI Online Research http://www.otal.umd.edu/SHORE2001/navBar/index.html
Lazar, N., & Eisenbrey, M. (2000). Website structural navigation. Retrieved April 29, 2003, from University of Maryland, Student HCI Online Research http://www.otal.umd.edu/SHORE2000/webnav/index.html.
Lida, B., Hull, S., & Pilcher, K. (2003). Breadcrumb navigation: An exploratory study of usage. ../usabilitynews/51/breadcrumb.htm
Lida Rogers, B., & Chaparro, B. (2003). Breadcrumb
navigation: Further investigation of usage.
../usabilitynews/52 /breadcrumb.htm
Maldonado, C. A., & Resnick, M.L. (2002). Do common user interface design patterns improve navigation? Proceedings of the Human Factors and Ergonomics Society 46th Annual Meeting, 1315-1319.
Shneiderman, B. (1998). Designing the User Interface: Strategies for Effective Human-Computer Interaction. (3rd ed.). Reading, MA: Addison-Wesley.
Zajonc, R. B. (1968). Attitudinal effects of mere
exposure. Journal of Personality and Social Psychology Monographs, 9(2, Pt.
2).
Does Background Music Impact Computer Task Performance?
Summary: The effects of music on performance on a computer-mediated problem-solving task were examined. Participants completed the task in anonymous dyads as they were exposed to either Classical music, Punk music, or No Music. Results indicate that those in the Classical music condition performed better on the problem solving-task than those in the Punk music or No Music conditions. However, those listening to the Classical music offered more off-task comments during the task than those listening to No Music. Implications for website designers are discussed.
As more customers begin to access the Internet with high-speed connections, website designers take advantage of this speed
by using more sophisticated multimedia with music audio files to hopefully attract a greater number of users to their sites. Yet, little research has been done to determine whether the use of music actually improves user perception and/or performance on the Web.Many studies have looked at non-computer task completion while listening to background music in a real-world setting. According to a study completed by Hallman, Price, and Katsarou (2002), calming music led to better participant performance on an arithmetic task and a memory task than no music. It was also found that background music on cognitive test performance led to improved performance when compared with a control condition (Cockerton, Moore, & Norman, 1997).
Arkes, Rettig & Scougale (1986) found that as a task increased in complexity, listeners preferred music that was simple. The Mozart Effect (Rauscher, Shaw, & Ky, 1993) is a well-known phenomenon that showed that participants who listened to Mozart performed better on spatial tasks. Nantias and Shellenberg (1999) replicated this study and found participants performed better on spatial tasks than those listening to nothing or easy listening music. However, another study involving computer-generated music found that it was the pairing of music that was preferred over the other selections and the task, and not specifically Mozart. This finding strengthens the argument that a pairing of a positive stimulus (music that is preferred by the listener) with a less engaging stimulus may increase performance (Nantias & Shellenberg, 1999).
This study investigated the influence of two different music genres on performance on a computer-mediated task. Half of the participants were given a time limit per task (10 minutes), and half were in an untimed condition. It was hypothesized that communication of participants would be more task-oriented when the music was perceived as conducive to the task and that those in the timed condition would report more feelings of frustration on tasks and websites than those in the untimed condition.
METHOD
Participants
Seventy-two undergraduate and graduate students enrolled in undergraduate psychology courses and graduate methodology courses at Wichita State University participated in this study. Average age of the participants was 23.82. Participants were assigned to either a Classical, Punk, or No Music group based on their familiarity with the music type. Assignments were made so that all groups were balanced according to familiarity. In addition, participants were randomly assigned to either a timed or untimed condition.
Materials
A communication program was developed which allowed users to communicate with one other participant anonymously. Participants were given the task to assume they had been stranded on the moon and had to rank a list of 15 items in terms of importance to their survival with 1 being the most important and 15 being the least important. Scores were then compared to a recommended solution (Hall and Watson, 1970).
The computer program also
randomized a music playlist for each participant. Each participant received
a completely randomized version of the playlist. Participants listened to
the background music through headphones.
Procedure
The experiment was conducted in a 16-computer classroom. Each participant communicated anonymously with one other participant in the room to complete the task. Participants communicated with each other until they came to agreement on the ranking of the 15 items, or until they ran out of time. All pairs of participants listened to the same type of music.
RESULTS
A 2 X 3 between-subjects ANOVA was computed to examine the effect of time and music type on the ranking of the 15-items. The difference between the participants ranking for each item and the NASA ranking for each item was totaled for a final score. The lower the score, the more accurate the ranking (i.e., the closer it was to the NASA solution). Figure 1 shows that those listening to classical music had a better score than those in the punk music and no music conditions. There was no significant effect of time or interaction.

Figure 1. Performance Scores on the Moon Task Across Conditions (the lower the score the more accurate)
Communication Content
A content analysis was completed to identify all comments not relating directly to the problem-solving task. Those scores were then analyzed using a 2 x 3 between-subjects ANOVA with time and music type. There was a significant main effect of music type. Post hoc tests show the significant difference in the no music condition having significantly fewer off-task comments than those in the other conditions (Figure 2). There was no significant main effect of time or interaction.

Figure 2. Number of Off-Task Comments Across Conditions
DISCUSSION
Results from this study showed that those in the classical music condition performed better on the problem-solving task than those in the Punk music or No Music conditions, regardless of time constraints. Interestingly, participants in the Classical music condition performed better despite the fact that they contributed more off-task comments to one another than the No Music condition and about the same number of off-task comments as the Punk music group. Most of the off-task comments were personal in nature (i.e., emoticons, jokes). These results are supportive of research by Jensen (2001), which states that participants listening to classical music are more likely to disclose personal information than those listening to no music.
Those in the classical music condition appeared to be more involved in the task, leading to better scores on the problem-solving task and an increase in the use of hyper-personal communication. It is possible that classical music helped the participants relax creating an opportunity for them to not only succeed at the problem-solving task, but also have a more interactive communication experience. Future studies should examine the effects of different types of Classical music on computer-mediated tasks as well as other types of computer-based tasks, such as Web browsing and playing online games.
Acknowledgement: We'd like to acknowledge Jeff Howard for the creation of the Visual Basic program used in this study.
REFERENCES
Arkes, H. R., Rettig, L. E., & Scougale, J. D. (1986). The effect of concurrent task complexity and music experience on performance on preference for simple and complex music. Psychomusicology, 6, 51-60.
Cockerton, T., Moore, S., & Norman, D. (1997). Cognitive test performance and background music. Perceptual and Motor Skills, 85, 1435-1438.
Hall, J. & Watson, W. H. (1970). The effects of a normative intervention on group decision-making performance. Human Relations, 23, 299-371.
Hallman, S., Price, J. & Katsarou, G. (2002). The effects of background music on primary school pupil’ task performance. Educational Studies, 28, 111-122.
Jensen, K. (2001). The effects of selected classical music on self-disclosure. Journal of Music Therapy, 38, 2-27.
Nantais, K. M., & Schellenberg, E. G. (1999). The Mozart effect: an artifact of preference. Psychological Science, 10, 370-372.
Rauscher, F. H., Shaw, G. L., & Ky, K. N. (1993). Music and spatial task performance. Nature, 365, 611.
Evaluation of Websites for Older Adults:
How "Senior-Friendly" Are They?
By Traci A. Hart
Summary: Thirty-six websites designed for older adults were evaluated as to how well they complied to 25 "senior-friendly" guidelines recommended by the National Institute of Aging. Results indicate that a majority of the sites complied to guidelines related to basic navigation and content style but not for text size, text weight, or site map availability. Implications of compliance to these guidelines on user satisfaction and performance are discussed.
In 2003, a study recorded that those aged 55+ years, comprising 29% of the total population, accounted for 17.2% of the online users; an increase from 5% in 1997 (Cortese, 1997; Mediamark Research Inc., 2003). Although Internet usage is increasing overall, there remain millions of elderly that have yet to acknowledge the technology age is upon them. There may be a broad spectrum of reasons technology is not quickly adopted by older adults, such as computer anxiety (Ellis & Allaire, 1999), having little exposure or training to technology (Echt, Morrell & Park, 1998; Rogers, Meyer, Walker, & Fisk, 1998), or simply not understanding the possible benefits of use (Czaja & Sharit, 1998; Melenhorst, Rogers & Caylor, 2001).
Unfortunately, designers often fail to recognize older adults as a potential user group for their technology. Industry has only recently begun designing software and hardware that make accommodations for the needs of the aging user. Margaret Wylde (1995) has written that, “many manufacturers believe their idea is an ingenious product for ‘the elderly,’ but they rarely understand what the customer wants or the problems that must be solved” (p.15). There are specific abilities and performance attributes of the older users that need to be addressed. In general, older adults are slower (Czaja & Sharit, 1998), travel to fewer pages (Liao, Groff, Chaparro, Chaparro & Stumpfhauser, 2000), are less likely to leave a website when experiencing long delay times (Selvidge, 2003), and spend more time selecting targets for tasks than those in younger generations (Chaparro, Bohan, Fernandez, Choi, & Kattel, 1999).
Regardless of age, however, users are not always considered in website design. Thus, many groups and researchers have developed comprehensive sets of guidelines to improve accessibility such as the Web Content Accessibility Guidelines (W3/WAI) and the government instituted U.S. Section 508 Guidelines. The National Institute on Aging and the National Library of Medicine advanced the above guidelines one step further by developing “senior friendly” guidelines that are even more specific to the older adult Internet user. In 2002, these two groups published Making Your Web Site Senior Friendly: A Checklist consisting of twenty-five empirically-based guidelines for those websites targeting users 60+ (National Institute on Aging). Research in aging, cognition, human factors, and print materials lead to the development of the guidelines which
cover three areas of design; 1) designing readable text, 2) increasing memory and comprehension of web content, and 3) increasing the ease of navigation. The authors suggest the “implementation of these guidelines in web site construction will result in greater accessibility to online information for the elderly” and influence future willingness to explore the Web and increase enthusiasm toward technology.The purpose of the current study was to determine how websites designed for older adults currently adhere to the senior-friendly guidelines. Using heuristic evaluation techniques, raters identified whether each website conformed to the basic principles of usability following the NIA guidelines.
Method
Participants
Four members of the Software Usability Research Lab (SURL) at Wichita State University volunteered to participate in the research project. The evaluators had practical and research knowledge with website design and usability.
Procedures
Each of the heuristic evaluators were given a packet of information containing the following: a consent form, instructions, a list of websites to visit, the “Senior-Friendly” guidelines, and a blank guideline rating checklist for each website. The assessments were done over a two-week period to allow the evaluators ample time to familiarize themselves with each website. The websites chosen for guideline evaluation were obtained from common search engines (i.e., MSN, Google, etc.) using keywords such as “seniors,” “elderly” and “older adults.” The websites selected were commonly listed within the first twenty search results and were required to have a statement about the site “targeting” or “designed for” those 50 years or older. Websites were excluded from the sample if they appeared to be search engines or portal sites that directed users to other sites.
Results
A total of 36 websites were included in the analysis. A two-way mixed effect model for consistency was used to determine if there were differences between the four individual raters, which needed to be addressed before interpreting the other results, as suggested by Yaffee (1998). The average measured intraclass correlation (ICC) for the four raters was 0.8117, F (881, 2643) = 5.31, p < .0001.
Each of the twenty-five checklist items were rated on a 4-point scale to determine if the “Senior-Friendly” guidelines were followed or used by the particular site; rated as 1= Never, 2= Sometimes, 3= Frequently, and 4= Always. The scores from all four raters were combined to form a “Total Score” comprised of the number of guidelines rated as frequently or always present on the senior websites. After the total points were calculated, the websites were classified into three groups; Most (greater than 70% guideline adherence), Medium (between 51-69% adherence), and Least compliant to the guidelines (less than 50% adherence). The 36 websites’ total scores ranged from 13 to 23 points (M= 18.7, SD= 2.68); 22% of the websites had 13 -16 points (Least), 47% had 17 to 20 points (Medium), and 30.5% of the senior websites scored 21 to 24 points (Most). The ratings for the evaluated websites are listed in Table 1.
Table 1
| MOST COMPLIANT |
LEAST COMPLIANT |
| SeniorNet.com (23) | PaceSaver.com(16) |
| NCOA.org (22) | Go60.com (16) |
| Questonline.org (22) | Grandtimes.com (15) |
| Elderweb.com (22) | Seniors-Place.com (15) |
| Medicare.gov (22) |
SeniorSurfers.org (15) |
| SeniorSafety.com (21) | Geriquest.com (14) |
| Silverts.com (21) |
KCElderlaw.com (14) |
| SpinLife.com (21) | Retired.com (13) |
|
AARP.org (21) |
|
| SeniorCorps.org (21) | |
| Seniors.gov (21) | |
| MEDIUM COMPLIANT | |
| SeniorsGoTravel.com (20) | |
| MISeniors.net (20) | |
|
RetiredAmericans.org (20) |
|
|
SeniorsMatch.com (20) |
|
| VA.gov (20) | |
|
Friendly4Seniors.com (20) |
|
|
FDA.gov/oc/olderpersons (20) |
|
|
GrandparentWorld.com (19) |
|
| NewLifestyles.com (19) | |
|
Eldercare.gov (19) |
|
| Living-Trustforms.com (18) | |
|
SeniorsWW.com (18) |
|
| MyNursingHomes.com (18) | |
|
ElderHostel.org (17) |
|
|
Overfifties.com (17) |
|
|
SeniorCitizens.com (17) |
|
| SeniorResource.com (17) |
Seven guidelines were scored as “frequently” or “always” present at 95% of the sites and another four guidelines were only followed by 25% or less of the selected senior websites. The other guidelines did not appear to be consistently present (or lacking) across the different website designs. The guidelines and percentage of adherence are listed in Table 2.
Table 2. Guideline Adherence (% of sites adhering to guideline)
| Guideline Description | % |
|
Phrasing: uses the active voice |
100.0 |
| Scrolling: avoids automatically scrolling text and provides scrolling icon | 100.0 |
| Mouse: uses single clicks to access information | 100.0 |
| Lettering: uses upper and lower case for body text and reserves all capitals for headlines | 97.2 |
| Justification: uses left justified text | 97.2 |
| Style: uses positive phrasing and presents info. in clear manner without need for inferences | 97.2 |
| Menus: uses pull down and cascading menus sparingly | 97.2 |
| Simplicity: uses simple language for text; glossary provided for technical terms | 91.7 |
| Typeface: uses san serif typeface that is not condensed | 91.4 |
| Color: avoids using yellow, blue, and green in proximity | 90.6 |
| Backgrounds: uses light text on dark backgrounds or visa versa; avoids patterns | 88.9 |
| Consistent Layout: uses standard page design and navigation is same on each page | 77.8 |
| Organization: uses a standard format; lengthy documents broken into short sections | 75.0 |
| Navigation: uses explicit step-by-step navigation procedures; simple and straightforward | 72.2 |
| Help & Information: offers a tutorial on web site or offers contact information | 78.1 |
| Icons & Buttons: uses large buttons; text is incorporated with icon when possible | 69.4 |
| Text Alternatives: provides text alternatives for all other media types | 67.6 |
| Illustrations & Photos: uses text relevant images only | 63.9 |
| Type Weight: uses medium or bold face type | 50.0 |
| Type Size: uses 12 or 14 point for body text | 44.4 |
| Site Maps: uses a site map to show how site is organized | 38.9 |
| Hyperlinks: uses icons with text as hyperlinks | 25.0 |
| Animation, Video & Audio: uses short segments to reduce download time | 9.5 |
| Back/Forward Navigation: uses buttons such a “previous” and “next” for reviewing text | 5.8 |
| Physical Spacing: uses double spacing in body text | 2.8 |
Discussion
Results from this analysis show a wide range in the level of compliance across sites. In general, the majority of the sites complied to guidelines related to basic navigation and content phrasing and style but not to guidelines specifying text size, text weight, line spacing, textual links with graphics, or site map availability. The lack of consistency with regard to the formatting of the textual content is surprising, given the fact that one of the most fundamental guidelines for developing reading materials for older adults is to provide enlarged and highly contrasted text.
The impact of compliance to these guidelines on actual performance and satisfaction with a site is not clear. One may hypothesize that the more compliant a site is, the more usable it is. However, preliminary results from our research suggest that the sites most compliant are not always the most efficient or preferred. Specific results of this research will be available in the next issue of Usability News.
REFERENCES
Chaparro, A., Bohan, M., Fernandez, J. E., Choi, S. D., & Kattel, B. (1999). The impact of age on computer input device use: psychophysical and physiological measures. International Journal of Industrial Ergonomics, 24, 503-513.
Cortese, A. (1997, May 5). A census in cyberspace. Business Week, 84-85.
Czaja, S. J., & Sharit, J. (1998). Age differences in attitudes toward computers. Journal of Gerontology, 53B (5), 329-340.
Echt, K. V., Morrell, R. W., & Park, D. C. (1998). Effects of age and training formats on basic computer skill acquisition in older adults. Educational Gerontology, 24, 3-25.
Ellis, R. D., & Allaire, J. C. (1999). Modeling computer interest in older adults: The role of age, education, computer knowledge, and computer anxiety. Human Factors, 41(3), 345- 356.
Liao, C., Groff, L., Chaparro, A., Chaparro, B. S., & Stumpfhauser, L. (2000). A comparison of web site usage between young adults and the elderly. Proceedings of the IEA 2000/HGES 2000 Congress. San Diego, CA: Human Factors and Ergonomics Society.
Mediamark Research Inc. (September 2002 – October 2003). MRI Cyberstats, Fall 2003. Retrieved on November 18, 2003, from www.mediamark.com
Melenhorst, A. S., Rogers, W. A., & Caylor, E. C. (2001). The use of communication technologies by older adults: exploring the benefits from the user’s perspective. Proceedings of the Human Factors and Ergonomics Society 45th Annual Meeting. Santa Monica, CA: Human Factors and Ergonomics Society.
National Institute on Aging (2002). Older adults and information technology: A compendium of scientific research and web site accessibility guidelines. Washington, DC: U.S. Government Printing Office.
Rogers, W. A., Meyer, B., Walker, N., & Fisk, A. D. (1998). Functional limitations to daily living tasks in the aged: A focus group analysis. Human Factors, 40(1), 111-125.
Selvidge, P. R. (2003). The effects of end-user attributes on tolerance for world wide web delays. Unpublished doctoral dissertation, Wichita State University, Wichita, Kansas.
Wylde, M. A. (1995). How to size up the current and future markets: technologies and the older adult. Generations, Spring, 15-19.
The Effects of Perceptual Grouping on Text Entry Performance
Summary: One of the primary challenges confronting designers of mobile computing devices is the issue of efficient text entry. One potential solution is to group multiple letters onto single keys, similar to the T9 keyboard currently used on telephones. Two experiments examined the effects of perceptual grouping on soft keyboard transcription rates. Results from Experiment 1 showed significantly slower transcription rates for QWERTY keyboards with grouped keys. Results from Experiment 2 showed various levels of perceptual interference due to the different Gestalt grouping effects. These results indicate that perceptual grouping can negatively affect text entry performance, and placing multiple letters onto single keys reduces the speed at which users can transcribe words.
INTRODUCTION
Technological advances in the computing industry have lead to a dramatic increase in the accessibility of mobile computing devices such as pocket PCs, personal digital assistants, pen tablets, web-enabled cellular phones, and two-way pagers (Mobile.com, 2002; Legard, 2000). As these devices continue to provide increased functionality while at the same time becoming progressively smaller, data entry for these devices becomes a critical issue.
Previous research has shown that novice users are typically faster and more accurate using a QWERTY keyboard than with alternative keyboard design (e.g. Bohan, Phipps, Chaparro, & Halcomb, 1999; McQueen, MacKenzie, Nonnecke, & Riddersma 1994; MacKenzie, Zhang, & Souckoreff, 1999). Thus, shrinking the QWERTY keyboard to accommodate mobile devices provides familiarity at the expense of speed.
Altering keyboards to optimize data entry often results in a loss of immediate usability. Several “high-performance” keyboard designs such as the OPTI (MacKenzie & Zhang, 1999) and the Fitaly (Broida, 1998) have been proposed to minimize travel distance between keys. MacKenzie and Zhang demonstrated that users were able to achieve tapping rates upwards of 44 wpm on the OPTI keyboard after approximately 15 hours of use compared to 40 wpm for the standard QWERTY keyboard with the same amount of practice. However, initial performance of novice typers was considerably faster with the more familiar QWERTY (28 wpm) than the OPTI keyboard (17 wpm).
Additional keyboards have been proposed that attempt to
achieve a balance between more efficient input and ease of use. Hashimoto
and Togasi (1995) attempted to maximize the performance of a virtual QWERTY
keyboard by adjusting its shape to better accommodate the biomechanical
attributes of the user’s wrist and fingers. The authors demonstrated that
participants could point to the designated keys significantly faster on the
oval keyboard than on the standard rectangular keyboard layout.
The T9 keyboard (Tegic communication, 1998) incorporates the familiar
alphanumeric layout of touchtone telephones where 3-4 letters are grouped
onto a single button in alphabetical order. Applying Fitts’ Law (Fitts &
Peterson 1964; Fitts, 1954), this keyboard is potentially better suited for
touch tapping in small spaces because it reduces the number of keys in
addition to reducing travel time between keys. In this case, tapping
sequential letters that are grouped on the same key would minimize travel
time. Bohan, et al. (1999) investigated the immediate usability of the T9
keyboard relative to the standard QWERTY keyboard of the same overall size.
It was expected that participant’s performance would be faster and more
accurate using the T9 keyboard due to the larger blocked keys and the
ability to make multiple taps on a single key when selecting letters within
the same block; however, transcription rates were actually higher for the
QWERTY keyboard (23 wpm) than for the T9 keyboard (17 wpm).
There are at least two interpretations of Bohan, et al.(1999) results. One interpretation is that the slower performance of the T9 keyboard relative to the QWERTY keyboard reflects differences in the participant’s familiarity with the respective keyboard. Alternatively, performance may have been hindered due to perceptual interference in accordance with the Gestalt principle of common region (Palmer, 1992) which states that elements will be perceived as a group together if they are located within a common region of space or an enclosed boundary. This grouping effect may in turn make it more difficult to process the individual elements within a group. The letters on the telephone keypad are arranged on the buttons in groups of three or four, which will cause them to be perceptually grouped. The resulting group would then have to be decomposed into its requisite component letters before a target letter could be identified. If perceptual grouping does negatively impact transcription performance in this way, the effects may be strong enough to counteract the motor advantages supported by larger keys (Bohan, et al. 1999). The following experiments address both explanations.
EXPERIMENT 1
The first experiment compared the transcription rates of participants using a single key (Qs) keyboard and a modified keyboard (Qm) in which the arrangement of the letters were not altered; however, keys were merged so that the width of the keys were increased by approximately four fold with up to three letters on a single key (see Figure 1). Participants also transcribed words using single- and multiple-key versions of the ABC keyboard layout (As and Am, respectively). It was reasoned that perceptual grouping would have a greater impact on transcription performance when participants are less familiar with the spatial arrangement of a particular keyboard thus having to rely a greater extent on visual scanning to find the target letter. Whereas with experience the participants may rely more on remembered spatial locations and kinesthetic positional cues rather than visual scanning to find the target letter.
Participants transcribed words presented at systematically varying rates using soft keyboards graphically displayed on a touch-screen. The average maximum transcription rate for each keyboard was calculated and used to compare text entry performance. According to the perceptual grouping hypothesis, transcription rates for the Qs should be as high or higher than those for the Qm due to the additional visual processing. The alternative hypothesis is that grouping letters on a single key does not introduce additional visual processing resulting in significantly faster transcription rates for the Qm due to the key’s larger target size and the ability to make multiple taps on a single key. In addition, transcription rates for the QWERTY keyboards would be higher than those of the ABC keyboards due to difference in familiarity. In addition, transcription rates for the As keyboard should be higher than those for the Am keyboard.

Figure 1. Qm keyboard with multiple letters grouped onto single keys.
METHOD
Calculation of Transcription Rate
To measure participant performance, a mathematical formula originally developed to measure reading rates was used (Legge, Pelli, Rubin, & Schleske, 1985). Transcription rates were calculated by multiplying the percentage of correctly transcribed words by the text presentation rate (wpm). Maximum average transcription rates were estimated by fitting each participant’s performance curve with a second order polynomial function (i.e. parabola). The peak of the function was estimated by solving for the first-order derivative of the function. This method minimizes the potential impact of experimenter bias in estimating the peak when the function was relatively flat. Maximum average transcription rates were used to measure the mean differences between each of the keyboard formats.
Participants
Ten right-handed participants, ages 28-45 (M = 34) participated in the study. All participants reported having 20/20 corrected/uncorrected vision. All participants had extensive computer experience and four had used a PDA.
Apparatus
A custom computer program written in Microsoft Visual Basic 6.0 was created to display the various keyboards and collect the data. A land-on touch selection strategy was used to activate the “buttons” of the soft keyboard. For these experiments the actual size of the keys were not a concern as the purpose of the experiments was to determine if there was an effect for grouping; as such the size of the keys were four times larger than a traditional keyboard in an effort to minimize inadvertent errors. With participants seated approximately 24 cm from the screen single key targets subtended 2.5˚ visual angle with alphanumeric characters within the keys subtending 1.67˚ visual angle. Grouped keys employed the same size characters; however, the key target width subtended to 6.95˚ visual angle to include the additional letters. The program ran on a 233 MHz IBM-compatible computer and stimuli displayed on a 15-inch diagonal 3M Microtouch CRT Touch Monitor (1024 x 768 pixel resolution) with a resistive touch-sensitive screen.
Procedure
The height of the monitor was adjusted to center the screen approximately at eye level. Participants started each trial by tapping a “START” button displayed on the screen. Upon starting the trial ten four-letter words, randomly selected from a database, appeared individually at the top of the screen. Participants were required to transcribe the word using the soft keyboard displayed on the touch-sensitive screen. The participant’s transcriptions were displayed in a window directly below the target word. In trials involving grouped keys the software automatically determined the correct letter to display in the transcription window by comparing the sequencing of the tapped keys to the letters in each word. For example, if the word “BIRD” was displayed at the top of the screen, the participant would correctly tap the “VBN” key for the letter B, the “UIO” key for the letter "I" and so on. If the tapped key contained the required letter the program tallied a correct response and displayed the letter in the transcription window. If the tapped key did not contain the required letter the program tallied a miss and a hyphen was displayed in the transcription window. Each trial presented the words at different rates (10, 15, 20, 25, and 30 words per minute). Participants performed 5 trials at each presentation rate for each keyboard resulting in 25 trials per keyboard. The order of presentation rate and order of keyboard layout was randomized for each participant. Each experiment took approximately 25 minutes.
RESULTS AND DISCUSSION
Maximum average transcription rates were subjected to a 2 (grouping) X 2 (keyboard layout) within-subjects ANOVA. Analysis showed a significant main effect for keyboard layout, F (1, 9) = 437.81, p < .001, as transcription rates were significantly faster for the QWERTY keyboards than for the ABC keyboards. There was not a significant main effect of grouping, F (1, 9) = 1.23, p = .285; however, there was a significant interaction between keyboard layout and grouping F (1, 9) = 14.122, p = .004. Paired sample t-tests comparing transcription rates within keyboard layouts showed a significant difference between Qs and Qm keyboards, t (9) = 3.29, p = .009 (see Table 1).
Table1. p-values for pairwise comparisons of transcription rates in Experiment 1.
|
|
As |
Am |
Qs |
|
Am |
.299 |
|
|
|
Qs |
<.001 |
<.001 |
|
|
Qm |
<.001 |
<.001 |
.009 |
The results of this experiment showed significant differences in transcription rates. In the case of the ABC keyboard, the results were likely indicative of a floor effect. That is, the novelty of the ABC keyboard caused search times long enough that any additional searching induced by the grouping effect would not have a significant affect. The results support the hypothesis that perceptual grouping significantly affects transcription rates. The results also support Bohan, et al.’s (1999) previous findings that transcription rates for the QWERTY keyboard were faster than the T9 keyboard despite the larger blocked keys.
EXPERIMENT 2
As a follow-up study, a second experiment was designed to measure the relative effects of different types of Gestalt grouping strategies (Palmer, 1992; Wertheimer, 1923) on text entry performance using soft keyboards. Specifically, participants transcribed words using QWERTY keyboards with letters that were not grouped, grouped by common region, or grouped by proximity. In the case of proximity grouping, letters were segregated by spatial distance between the letters. The purpose of this experiment was to explore the effects of perceptual grouping on text entry performance on soft keyboards.
Participants
Ten right-handed participants ages 29-49 (M = 36) participated in the study. All participants reported having 20/20 corrected/uncorrected vision. All participants had extensive computer experience and three had used a PDA.
Apparatus
The task and apparatus were identical to those used in Experiment 1 with the exception of the keyboards used. Like the first experiment, the single-key and multiple-key QWERTY keyboards were used in addition to a keyboard grouped by proximity. The proximity keyboard (Qp) consisted of letters grouped together in groups of 1 to 3 by spatial distances between keys; that is, there were not box outlines to contain the letters (see Figure 2).

Figure 2. Screen shot of the Qp keyboard with multiple letters keys grouped by spatial proximity.
RESULTS
A significant main effect was found for grouping type, F (1, 9) = 5.87, p = .006. Pairwise comparison t-tests showed significant differences between the Qs keyboard and the grouped keyboards: Qm (p = .05), Qp (p = .043), Qc (p = .021). Significant differences between the grouped keyboards are shown in Table 2. Interestingly, transcription rates for the Qc condition was significantly different from all the other conditions suggesting that the Qc condition has a much stronger perceptual grouping effect that grouping by common border or proximity. The reason for this effect is unknown but will be the focus of continuing research.
Table 2. p-values for pairwise comparisons of transcription rates in Experiment 2.
|
|
Qs |
Qm |
Qp |
|
Qm |
.051 |
|
|
|
Qp |
.043 |
.813 |
|
|
Qc |
.021 |
.076 |
.031 |
DISCUSSION
Tapping on a miniaturized virtual QWERTY keyboard has been shown to be the optimal solution for data entry because of its widespread familiarity (MacKenzie, Zhang & Soukoreff, 1999). Thus, a logical approach to finding a balance between these tradeoffs is to maximize the performance of a familiar text entry method like the QWERTY keyboard (Hashimoto & Togasi, 1995).
Given what is known about speed/accuracy tradeoffs in human performance, one way to improve performance on soft keyboards is to increase the size of the keys. However, size constraints of mobile computing devices prevent this option. Alternatively, placing multiple letters on a single key reduces the number of keys required and minimizes the space required. However, this study suggests that grouping letters onto a single-key may decrease transcription performance due to additional perceptual processing steps despite a four-fold increase in target size.
The results of these experiments illustrate that the need to visually parse groups of letters into their constituents before a target letter can be identified will result in a significant decrease in text entry speed. In addition, comparison of mean transcription rates in the second experiment suggest that strength of the perceptual interference on text input is consistent between the types of grouping effects tested. Ongoing research is currently investigating the effects of other grouping effects.
REFERENCES
Bohan, M., Phipps, C.A., Chaparro, A., & Halcomb, C. (1999). A psychophysical comparison of two stylus-driven soft keyboards. Proceedings of Graphics Interface ’99, 92-97.
Broida, R. (1998). Input Alternatives - Fitaly vs. T9.
Tap Magazine, Issue 1.6, November 1998
http://www.fitaly.com/
Fitts, P.M. (1954). The information capacity of the human motor system in controlling amplitude of movement. Journal of Experimental Psychology, 47, 381-391.
Fitts, P.M., & Peterson, J.R. (1964). Information capacity of discrete motor responses. Journal of Experimental Psychology, 67, 103-112.
Hashimoto, M., & Togasi, M. (1995). A virtual oval
keyboard and a vector input method for pen-based character input.
Proceedings of the ACM Conference on Human Factors in Computing Systems.
Retrieved September 11, 2001 from:
http://www.acm.org/sigchi/chi95/proceedings/shortppr/mh_bdy.htm.
Legard, David (2000). PDA Sales to Double in 2000. PCWorld, August 28, 2000.
Legge, G.E., Pelli, D.G., Rubin, G.S., & Schleske, M.M. (1985). Psychophysics of reading – I: Normal vision. Vision Research, 25, 239-252.
MacKenzie, I. S., & Zhang, S. X. (1999). The design and evaluation of a high-performance soft keyboard. Proceedings of the ACM Conference on Human Factors in Computing Systems - CHI '99, pp. 25-31.
MacKenzie, I. S., Zhang, S. X., & Soukoreff, R. W.
(1999). Text entry using soft keyboards. Behaviour & Information Technology,
18, 235-244.
McQueen, C., MacKenzie, I. S., Nonnecke, B., & Riddersma, S. (1994). A
comparision of four methods of numeric entry on pen-based computers.
Proceedings of Graphics Interface '94, pp. 75-82. Toronto: Canadian
Information Processing Society.
Mobile.com (2002, November 5). Gartner: PDA Sales Up or
Down. News issue 42. Retrieved December 4, 2002, from:
http://www.mobileinfo.com/News_2002/Issue42/PDA_Dataquest.htm
Palmer, S.E. (1992). Common Region: A new principle of perceptual grouping. Cognitive Psychology, 24, 436-447.
Tegic Communication (1998). http://www.t9.com/t9_faqs.html
Wertheimer, M. (1923). Untersuchengen zur Leher von der
Gestalt. Psychologische Forshung, 4, 301-350. [Reprinted in part in W.D.
Ellis (Ed.), A sourcebook of Gestalt Psychology (pp. 71-88). New York: The
Humanities Press, 1950].
Evaluating the CombiMouse™:
A New Input Device
for Personal Computers
Summary: The CombiMouse™ is a new input device for personal computers that combines the functionality of a keyboard and a mouse into one device. Preliminary results of first-time usage indicate the device to be very promising. CombiMouse performance was shown to be just as efficient as the traditional mouse and significantly more efficient than the track ball and another novel input device. Typing performance was not as fast as the traditional keyboard, though users reported they would improve with practice.
INTRODUCTION
Intensive mouse and keyboarding use has been associated with increased risk of upper extremity musculoskeletal disorders (see Fagarasanu & Kumar, 2003 for a review). As a result, input devices (e.g. ergonomic keyboards, trackball or joystick) are often recommended as a therapeutic intervention. However, use of these alternative devices may result in a trade-off between comfort and performance.
The CombiMouse™ is an alternative input device
1 that combines the functionality of a keyboard and a mouse into one device. The CombiMouse consists of two units, much like a split-keyboard, in which the left hand unit (LHU) is a stationary device with keys that are typically used by the left hand, while the right hand unit (RHU) contains keys typically used by the right hand but also is mobile and serves as a mouse. To change the RHU to mouse mode, the user grips the RHU touching a proximity circuit that detects finger contact. This cancels out the keyboard functions and allows it to act as a mouse.The purpose of the current pilot study was to evaluate the CombiMouse as a pointing device and as a typing device. User performance with the CombiMouse was compared to a traditional mouse and two other alternative pointing devices (trackball and RollerMouse™) (Slocum, Bohan, & Cha