User perspectives on the usability of a regional health information exchange
- Cynthia S Gadd1,
- Yun-Xian Ho1,
- Cather Marie Cala1,
- Dana Blakemore2,
- Qingxia Chen2,
- Mark E Frisse1,
- Kevin B Johnson1
- 1Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA
- 2Department of Biostatistics, Vanderbilt University, Nashville, Tennessee, USA
- Correspondence to Dr Cynthia S Gadd, Vanderbilt University, Department of Biomedical Informatics, Eskind Library, Room 444, Nashville, Tennessee, TN 37203, USA;
- Received 25 March 2011
- Accepted 29 April 2011
- Published Online First 27 May 2011
Objective We assessed the usability of a health information exchange (HIE) in a densely populated metropolitan region. This grant-funded HIE had been deployed rapidly to address the imminent needs of the patient population and the need to draw wider participation from regional entities.
Design We conducted a cross-sectional survey of individuals given access to the HIE at participating organizations and examined some of the usability and usage factors related to the technology acceptance model.
Measurements We probed user perceptions using the Questionnaire for User Interaction Satisfaction, an author-generated Trust scale, and user characteristic questions (eg, age, weekly system usage time).
Results Overall, users viewed the system favorably (ratings for all usability items were greater than neutral (one-sample Wilcoxon test, p<0.0014, Bonferroni-corrected for 35 tests). System usage was regressed on usability, trust, and demographic and user characteristic factors. Three usability factors were positively predictive of system usage: overall reactions (p<0 0.01), learning (p<0.05), and system functionality (p<0.01). Although trust is an important component in collaborative relationships, we did not find that user trust of other participating healthcare entities was significantly predictive of usage. An analysis of respondents' comments revealed ways to improve the HIE.
Conclusion We used a rapid deployment model to develop an HIE and found that perceptions of system usability were positive. We also found that system usage was predicted well by some aspects of usability. Results from this study suggest that a rapid development approach may serve as a viable model for developing usable HIEs serving communities with limited resources.
- Information dissemination
- medical records system
- regional health planning
- biomedical informatics
- Qualitative/ethnographic field study
- Improving the education and skills training of health professionals
- System implementation and management issues
- Social/organizational study
- Collaborative technologies
- Methods for integration of information from disparate sources
- Demonstrating return on IT investment
- distributed systems
- Software engineering: architecture
- Supporting practice at a distance (telehealth)
- Data exchange
- and integration across care settings (inter- and intra-enterprise)
- Visualization of data and knowledge
- ethical study methods
- clinical informatics
- biomedical informatics
- human factors
Introduction and background
The need for electronic health records with information from multiple sites has steadily grown since 1990, but there have been and continue to be many challenges to the implementation of information technology in healthcare. Early attempts to develop health information exchange (HIE) systems were plagued by problems that temporarily halted progress.1 However, a federal report published in 2004 calling for a national health information network prompted renewed interest and catalyzed efforts to establish regional health information organizations.2
Most HIE systems are in the early stages of development; therefore, not surprisingly, studies have focused on anticipated healthcare outcomes,3 4 and few have directly addressed perceptions of usability.5 As defined by the ISO 9241-11 (1998) standard, usability is the ‘extent to which a product can be used by specified users to achieve specified goals with effectiveness, efficiency, and satisfaction in a specified context of use.’ It is an important component of any study evaluating outcomes since such studies presume a high level of use. System adoption and use is generally tied to usability: if a system is difficult to use, people will be less likely to use it.6–10 Moreover, because HIE use implies a willingness to trust data from remote facilities when making decisions, it is possible that its adoption depends on trusting remotely-entered data. Therefore, usage may also be related to attributes of the data, such as what types of data are available and user perceptions of their reliability.11–14
In 2004, the MidSouth eHealth Alliance (MSeHA) introduced an HIE to three counties surrounding Memphis, Tennessee. MSeHA is an initiative funded by the Agency for Healthcare Research and Quality, the State of Tennessee, and Vanderbilt University.15 Its goal was to create an HIE that meets the basic needs of an underserved metropolitan region through exchange of clinical data among hospital emergency departments and community-based ambulatory clinics. To achieve this goal, MSeHA implemented a rapid deployment model that consolidated data from multiple facilities, while preserving the autonomy of participating entities.16 We adopted this approach to achieve a goal of having an operational system within 2 years of receiving grant funding, and because we believed that achieving operational status quickly would lead to wider participation among regional sites.
As a part of a larger study,17 and to better understand how the system was perceived by end users, we assessed user impressions of the HIE's usability. We hypothesized that system usage is linked to usability, an idea consistent with the widely-accepted technology acceptance model (TAM) which describes usage as a combination of factors related to user perceptions of its usability and its usefulness.7 18 Although we did not test the TAM directly, we included some of the factors and relationships modeled by the TAM such as usability, trust, and socio-demographic variables.11 19 Multiple dimensions of usability were tested by the Questionnaire for User Interaction Satisfaction (QUIS 7.0), a validated instrument that assesses perceptions of technological tools.20 21 Variables of interest included user impressions of the following system attributes: the system, screen attributes, terminology and system information, learnability, system capabilities, system functionality, and trust of data sources. The influence of socio-demographic variables (eg, age, gender, professional role) was also examined for potential influence on usage. We defined usage as the self-reported average weekly time spent using the system. Ordinal response categories were pre-defined according to QUIS as follows: <1 h, 1 to <4 h, 4 to <10 h, and over 10 h. Figure 1 shows how we adapted the TAM to illustrate the relationship between usage and usability (using QUIS variables), trust, and socio-demographic variables.
The HIE design was based on four prominent models of interoperability used in four states.22 Vanderbilt University led the HIE design and development using a model consisting of the following primary components: (1) a patient-centered framework, a ‘decentralized’ system architecture with secure ‘vaults’ of patient information designated for and managed by each participating organization; (2) the ability for participating organizations to maintain current representations from each data input that are viewable by users via high-level identifiers; (3) minimal cost because organizations do not have to map their data and their systems can therefore evolve; (4) and a strong emphasis on security and privacy with two-factor authentication coupled with binding data sharing and participation agreements.16
Early in the HIE's design phase, the MSeHA Board approved a platform and a plan for rapid data exchange and system dissemination. This decision was made because the urgent needs of the community precluded a standardized product development lifecycle, as had been executed with other HIE systems. As a result, we used an existing electronic health record platform and made only minimal changes to its interface. They included modifying the patient selection screen; displaying a facility name next to each test result, procedure note, or encounter summary; creating an aggregated view of laboratory data across facilities; and increasing the security of the login process by adding random number tokens. We also made minor changes to simplify navigation and remove functionality not required for the HIE system. Figure 2 shows examples of HIE interfaces. For more details, see Frisse et al.16
Our system serves 1.25 million patients in southwestern Tennessee. It shares patient data (eg, demographics, ICD-9 discharge codes, laboratory results, encounter data, and dictated reports) among nine hospital emergency departments, 15 ambulatory clinics, and one university medical group. A patient record-locator matches records across all vaults and displays an aggregate view of data in response to a user's query. Patients may opt out and keep their records local. The system records all logons and activity. Johnson and colleagues reported that the HIE was used for 3% of all encounters after its first year.16 Usage was steady at 7% at the time of this study, despite some changes in staff and faculty turnover which is high relative to some other recently reported usage rates.23
Sampling and subject recruitment
An email requesting participation in our study was sent to 345 healthcare professionals at participating MSeHA organizations in May 2009. This group included physicians, nurse practitioners, registered nurses, physician's assistants, and other medical staff. Candidates who agreed to participate were asked to indicate their preferred method of survey delivery. In June, we distributed a link to the electronic version of the survey, or sent it directly by email, postal mail or fax copy. Persons with invalid or missing email addresses were contacted by telephone or via the best contact method provided by the site supervisor. We informed prospective participants that those who completed the survey had up to a 1% chance of winning an iPod Touch. Seventy-six candidates could not be located (eg, workplace relocation). One of our co-authors (CMC) disseminated periodic emails to the remaining 269 candidates from June to November 2009, to remind them about the survey. Recruitment ceased when our sample was large enough to achieve sufficient statistical power.
We evaluated user perceptions of the system with selected items from QUIS 7.0.20 21 Our survey had three sections: (1) demographics (age, job, gender) and system usage characteristics; (2) familiarity with technology (source: QUIS); and (3) user perceptions across seven scales: Overall Reactions, Screen, Terminology and System Information, Learning, System Capabilities, System Functionality (source: QUIS), and an author-generated Trust scale. The Trust scale probed beliefs about other participating organizations and the integrity of their data. Members of our evaluation team reviewed items for face validity. We collected responses for all items on a scale of 1 to 9, except for Trust, which was scaled from strongly disagree to strongly agree. Participants could also leave comments in each section. Online and paper versions of the survey were as similar as possible; however, there were some notable differences (see Data analysis section for details).
We used REDCap (Research Electronic Data Capture, https://redcap.vanderbilt.edu/), a secure web-based tool to create the electronic version of the survey and administer it.24 In addition to delivery methods noted above, we hand-delivered surveys to providers who were unable to receive them any other form. The survey was distributed to health professionals with access to the HIE, including a subset of participants we refer to as ‘non-users’ who despite having access to the HIE subsequently reported that they did not use it. Data were collected from June to November 2009.
We analyzed data from the online and paper versions of the survey together; however, there were some notable differences between the two versions. The paper version of QUIS was scaled from 1 to 9, whereas the online version was scaled from 0 to 9. We compensated by transforming online scores so that both scales ranged from 1 to 9. In addition, QUIS items in the paper version included a Not Applicable (NA) option. The online version did not. Because the online survey did not disambiguate between missing and NA responses, we treated NA responses as missing. This approach is addressed in the Discussion section. Finally, the Terminology and System Information scale in the online version included ‘exploration of features by trial and error,’ which is typically considered a Learning item in QUIS. We analyzed it as part of the Learning scale to be consistent with QUIS. We calculated the internal reliability of each QUIS scale using Cronbach's α.
We used the statistical package Stata/SE 10.1 for quantitative analyses. A Wilcoxon rank sum test was used to investigate survey method effects on QUIS responses. We also compared descriptive statistics for both methods across demographics and usage characteristics. Data were summarized using mean, median, and SD, and we used a sign test to determine if individual subscale items were significantly different from neutral. To determine the effects of our independent variables (age, training role, system access time, computer literacy, trust, and QUIS items) on our dependent outcome, average system use per week, we used the ordinal logistic regression model shown below:where ‘Qi’ is one of the six QUIS scale scores averaged across subscales, ‘system access time’ is an ordinal variable (<1 year, 1 year–<2 years, 2 years–<3 years, ≥3 years), ‘training role’ is a nominal category (Registered Nurse (RN)/Physician's Assistant (PA)/Licensed Practical Nurse (LPN) vs Nurse Practitioner (NP)/Physician (MD)), ‘age’ is a continuous variable, ‘computer literacy’ is scored from 0 to 25, and the trust index is the summed Trust subscale scores, yielding a value from 5 to 25. On average, 10% all responses per subscale item were not interpretable (NA) or missing. We used the multiple imputation approach with five imputations via the STATA ice package.25 26 This model included our outcome, average system use per week, and 40 independent variables: age, training role, system access time, computer literacy, trust, and 35 QUIS items. Although the missing at random assumption used in the multiple imputations is not directly testable, we evaluated the assumption by calculating the C-statistics (ROC value) from a logistic regression model in which the binary missing data indicator was regressed on variables used to impute missing values. Averaged imputed QUIS scales and summed Trust scores were included in the regression as single terms. A test of the proportional odds assumption for each of the six models (one for each QUIS item) verified the use of an ordinal logistic regression (p values range from 0.080 to 0.154).
User comments were analyzed qualitatively by three coauthors using a thematic coding approach whereby emergent themes in perceptions of usability were identified via inductive analysis and a consensus process.27
Out of 237 distributed surveys, we received 165 responses (70% response rate). A total of 105 responses were submitted via REDCap and 60 were completed on paper. Three were excluded from analysis because there were no responses in any of the QUIS sections and the respondent did not self-identify as a non-user, which would have permitted the survey to be exited before the QUIS sections were reached. Of the remaining 162 completed surveys, 151 respondents identified themselves as system users and 11 identified themselves as non-users.
The Wilcoxon rank sum test showed no significant difference in QUIS scores between the responses to the online and paper versions (p<0.0014, Bonferroni-corrected for 35 tests), except for three subscale items: ‘I need to scroll to review information’ in Screen, and ‘Terminology relates well to the work you are doing: Never–Always’ and ‘Messages which appear on the screen are: Inconsistent–Consistent’ in Terminology and System Information. We also found no remarkable differences in the demographics or usage characteristics between online and paper survey respondents. For details about our respondents by survey method, see Table S1 of our online supplement at www.jamia.org.
The mean respondent age was 43.8 years (SD 10.8; range: 23–76); 83 females (51.2%) and 76 males (46.9%) participated. Respondents identified themselves as follows (number per role is in parentheses): LPN (10), MD (85), NP (14), PA (3), RN (36), and Other (10).
We asked the 11 respondents who identified themselves as non-users to indicate a reason for non-use. Reasons included the following (frequency of responses is indicated in parentheses): Never been trained to use it (2); I do not find it useful (1); I do not have access to it (1); It is too difficult to use during my work (1); and Other (6). We thematically organized respondents' comments into the following categories of non-use: Someone else accesses the system when I need it (4); Do not know about the system (1); and Do not need the system (3). Comments included: ‘Don't know what it's about nor what to use it for’; ‘I have not worked in the emergency department since getting access’; ‘Had someone else get me information from it when I needed it (rarely)’; and ‘Info had already been pulled up before I saw patient needing this info, found it very helpful but haven't had a need/chance to use it myself.’
Table 1 shows users' average weekly time spent using the HIE. Overall, 43% of users reported using the system for <1 h a week. Users had personally used or were familiar with 15/25 technological devices listed, on average. Items ranged from ‘email’ to ‘CAD computer-aided design.’
There was no significant difference between the ages of users and non-users (p=0.86) or their genders (p=0.52).
Perceived system usability
A QUIS score of 5 was neutral, a score of >5 was favorable, and a score of <5 was unfavorable. The mean score of all 35 items of the six QUIS scales was 6.5 (SD 1.4) with high intrascale agreement (Cronbach's α scores ranged from 0.74 to 0.91), indicating a high degree of measurement reliability within and across scales.28 29 For all items, over 50% of scores were above neutral (>5), suggesting that the system was perceived positively. A one-sample Wilcoxon test revealed that all QUIS subscale item scores were significantly greater than 5 (p<0.0014, Bonferroni-corrected for 35 tests). For details about the distribution of all user QUIS scale scores, see Table S2 of our online supplement at www.jamia.org.
There were a relatively small number of NA values in the paper version of the survey received from users (117 or 5% of total potential NA values generated by the paper version) that were treated as missing data in our analysis. The following scales contained items that contained 10 or more NA responses: Terminology and System Information (ie, ‘help messages appear on screen,’ ‘messages which appear on screen,’ and ‘error messages’) and System Capabilities (ie, ‘correcting your mistakes’). AUC values ranged from 0.67 to 0.88, suggesting that missing data were likely to be missing at random. Therefore, we assume that there is likely to be no systematic pattern of missing data and report the following results from ordinal logistic regression models with a multiple imputation method.
In multivariate analyses, higher average weekly system use was associated with higher scores in Overall Reactions (OR 1.50, p<0.01), Learning (OR 1.32, p<0.05), and System Functionality (OR 1.34, p<0.01). Trust was not significantly predictive of system usage. Table 2 shows details of our regression model.
A total of 98 comments were collected for the QUIS scales and the Trust scale (number per scale is in parentheses): Overall Reactions (29), Screen (13), Terminology and System Information (6), Learning (10), System Capabilities (7), System Functionality (21), and Trust (12). Table 3 shows themes that emerged and their frequencies. Representative quotes are also shown.
We conducted an exploratory analysis of the Trust variable and found an interscale reliability of 0.62 (Cronbach's α for five items). The mean Trust scale score for all respondents was 3.6 on a scale of 1 to 5 where 3 was neutral and 5 was very favorable (median 3.8, SD 0.56). Table 4 shows each trust item with its percentage of total responses.
Health information exchange systems are a cutting-edge component of healthcare. Recent legislation has catalyzed state-level collaborations to implement this technology. This study is one of the first to formally assess the usability of a regional HIE in emergency department and ambulatory clinic settings, and expose opportunities for enhancing HIEs.
Overall, we found an encouraging level of usability across all users. Their reactions to system usability in general and to specific aspects of usability (effort required to learn the system, system functionality) were good predictors of the average weekly time that they engaged with the system. The relationship between learning effort and system usage was also supported by qualitative comments about training and the need for ‘just in time’ online or in-person tutorials.
A previous study of this HIE noted its potential to reduce testing and admissions16; here, survey respondents believed that additional data could further increase the system's utility. Data types include discharge summaries and information from additional medical facilities. One user wished that ‘more of the state of Tennessee was included in the database… also Arkansas and Mississippi.’
Importantly, our results strongly suggest that an HIE may achieve a high level of usability with relatively primitive groupings of data, that is, a rapid development approach,16 which allows relatively quick implementation of a usable system in clinical settings.30 31 We believe that using an information model that consolidates data—rather than aggregating it—may allow an HIE to be implemented quickly with reasonably high usability.
We hypothesized that usage would be partially driven by user trust of different organizations. Surprisingly, this hypothesis was not confirmed—trust was not an important determinant of usage, although most users reported they could trust the HIE. Results from a validated trust scale may have been different. We also did not find any significant difference between the ages or genders of users and non-users, in contrast to previous studies.19 32
There were limitations to our findings. First, more than half of the respondents had used the system for ≤2 years, and 43% used it for <1 h per week. Although the model we have chosen to examine the relationship between usability and usage assumes usability predicts usage, the relationship is likely not unidirectional. Usability may have no impact on measurable use in settings where use is a job requirement or where a system under study is the only solution to achieve a user's need (i.e., ‘captive use’).33 It is also possible that respondent usage patterns reported in this survey may not accurately represent those of non-respondents. For example, a leadership change during the study period caused high physician turnover in at least two sites which affected our ability to obtain responses from experienced users at those sites. However, the number of affected physicians was relatively small compared to our sample size, therefore our findings were likely not adversely affected. Regarding the method of survey distribution, we chose to maximize our response rate by disseminating the survey according to respondent preferences. To ameliorate any differences that might arise between online and paper versions of the survey, we created the two versions to be as similar as possible and rectified any remaining differences before analysis. For example, we transformed the online 10-point scale to the paper version's 9-point scale to maintain consistency with the standard QUIS scale. However, it is unlikely that this adjustment affected the validity or reliability of our results.34 We also evaluated NA responses on paper as missing responses in our analyses, which may have inflated the prevalence of missing responses to some questions. However, the number of NA responses was small, and therefore it is unlikely that our treatment of these responses significantly affected our results.
The HIE reported here is one of an increasing number of regional data-sharing initiatives that have been deployed since 2001.35 Unlike failed attempts at building similar networks in the 1990s, our HIE was built on a relatively low-cost data consolidation model designed to evolve with changes in patient data management.16 We learned that if an HIE system meets basic expectations shared among members of its user base, an acceptable level of usability is achievable. This fact is encouraging for developers with limited budgets who need to create HIEs. While evaluating an HIE from the standpoint of its users is only one measure of a system's success, it is also a critical one for determining how to improve and promote continued adoption.
Health information exchange systems are an emergent component of the healthcare landscape. This formal usability assessment of a regional HIE has documented overall encouraging levels of usability that, perhaps not surprisingly, were associated with the weekly exposure of the user to the system. This study also uncovered additional data that could enhance the meaningful use of the exchange, and some alternative instructional approaches to help maintain comfort with the system in the face of infrequent use. Overall, this study supports the rapid development approach to implementing an HIE.
Funding This project was financially supported by the Agency for Healthcare Research and Quality contract 290-04-0006, the State of Tennessee, and Vanderbilt University. Funding for REDCap is provided by 1 UL1 RR024975 from the National Center for Research Resources/National Institutes of Health.
Competing interests None.
Ethics approval This study was conducted with the approval of the ethics committee of Vanderbilt University IRB.
Provenance and peer review Commissioned; externally peer reviewed.