Introduction

Over the past few years the International Test Commission (ITC) has adopted a policy of promoting good practice in testing issues where international coordination of effort is most important. For example, the ITC has devised guidelines to promote good practice in test adaptations (Hambleton, 1994; Van de Vijver & Hambleton, 1996) and good practice in test use (ITC, 2001). In recent years substantial and rapid developments have occurred in the provision of stand-alone and Internet-delivered computer-based testing. These developments raise a number of issues in relation to standards of administration, security of the tests and test results, and control over the testing process. As the market for such testing grows and the technological sophistication of the products increases, it will become increasingly important to ensure that those developing, distributing, using and taking such tests and assessment tools follow good practice. In response, the ITC Council decided to invest in a program of research, consultation, and conferences designed to develop internationally agreed guidelines specifically aimed at computer/Internet based testing.

Aims and Objectives

The ultimate aims of this project were

The aim was not to ‘invent’ new guidelines but to draw together common themes that run through existing guidelines, codes of practice, standards, research papers and other sources, and to create a coherent structure within which these guidelines can be used and understood. Contributions to the guidelines have been made by psychological and educational testing specialists, including test designers, test developers, test publishers and test users drawn from a number of countries.

Further, the aim is to focus on the development of guidelines specific to CBT/Internet based testing, not to reiterate good practice issues in testing in general. Clearly, any form of testing and assessment should conform to good practice issues regardless of the method of presentation. These guidelines are intended to complement the ITC Guidelines on Test Use (2001), with a specific focus on CBT/Internet testing.

Development of the Guidelines

As with previous ITC guidelines, the present guidelines can be seen as a benchmark against which existing local standards can be compared or as a basis for the development of locally applicable standards or codes of practice. Comparing local standards against these guidelines for coverage helps to promote consistency across national boundaries.

The project commenced with an initial literature search and review of existing references and guidelines on computer-based testing and Internet testing from a number of different countries (see Appendix). A number of these sources were particularly influential in the development of the guidelines:

The next stage involved a small-scale survey of United Kingdom test publishers, examining good practice issues in Internet-delivered personality tests in the UK. Further examples of good practice were highlighted from this survey.

As a third method of obtaining relevant information, the ITC organised a conference in Winchester, England in June 2002 on Computer-based Testing and the Internet. The goal of this conference was to bring together people working in the field of computer/Internet testing (e.g., practitioners, scholars, industry leaders and others) from around the world and to extract common issues and themes that would inform the guidelines. In total 254 delegates from 21 countries attended the conference. The conference was composed of workshops, keynote presentations and themed papers, posters and symposia on a number of topics concerning computer/Internet testing. A review of the material from this conference coupled with the small survey data and literature review provided the basis for the development of the draft guidelines for initial consultation (version 0.3).

Four general issues emerged from the information gathering process and these formed the basis of the development of an initial draft version. The four issues were:

These four issues were considered high level issues and were further broken down into second-level specific guidelines. A third-level set of accompanying examples is provided to the relevant stakeholder. The guidelines are primarily written to provide advice to test developers, test publishers and test users; however, these guidelines also provide a useful source of reference for test-takers. Given these intended applications, the guidelines are structured in a three (main stakeholders) by three (level of guideline) matrix.

Another cycle of consultation was implemented, including those people previously contacted in the first consultation process. The revisions and edits from this process were completed and version 0.6 of the draft guidelines was produced. Final revisions were made and the final draft version (1.0) was produced. The current guidelines (version 2005) were officially launched in July 2005 after approval by the ITC Council.

Timeline

The following shows the timeline in the design and development of the guidelines.

  • Completion of first draft and first consultation initiated: March 2003
  • End of first consultation period: June 2003
  • Revisions completed and second consultation initiated: February 2004
  • End of second consultation period: April 2004
  • Symposium on CBT and Internet testing at the International Congress of Psychology in Beijing: August 2004
  • Final version for approval: January 2005
  • Development of final version and design of web-based version: March 2005
  • Approval by ITC Council and formal launch: July 2005

    Scope

    As with the International Guidelines for Test Use (2001), the current guidelines use the terms ‘test’ and ‘testing’ in their broadest sense and include psychological and educational tests used in clinical, health, educational, and work and organisational assessment settings. CBT/Internet tests should be supported by evidence of their technical adequacy for their intended purpose. These guidelines are aimed at tests conducted both online and onscreen (offline), which can include testing via a CD-ROM or a downloadable executable. The document includes guidance for both fully computerised and part-computerised testing, and the reader can refer to the most appropriate elements. For example, only the sending and scoring of assessment papers may be computerised, with the rest of the process conducted by paper and pencil. Even in such cases, the guidelines dealing with security and confidentiality of data are important.

    In general, the guidelines can apply to both high stakes and low stakes assessment. High stakes assessments are, for example, those where a third party requires the results of the test for use in making an important decision about a test-taker; they may also include tests used to make decisions about groups of test-takers, such as a school class. By contrast, an example of low stakes assessment would be where the test-taker obtains the information for his or her own interest. Where a guideline applies only to high stakes testing environments, this is made clear within the text itself.

    Again, unless otherwise specified in the text, the guidelines presented here should be considered as applying to a number of modes of supervision and across a number of testing scenarios. Four modes of test administration are considered:

    Application of these guidelines needs to be considered in terms of their relevance for a range of different testing scenarios (e.g., some guidelines are more appropriate for the higher stakes scenarios). For example, in relation to testing in work and organisational settings, four main scenarios can be identified:

    Additionally, in clinical/counseling settings, four scenarios could be:

    Each of these raises different issues regarding control and security.

     

    1 Standardisation of the testing environment is not possible with open mode testing, and often not possible in the controlled mode of testing.
    2 Standardisation is possible with supervised mode and managed mode.

    Who are the Guidelines for?

    The guidelines apply to the use of CBT and Internet tests in professional practice. Thus they are directed towards test users who:

    These guidelines also specifically address three other main stakeholders in the testing process:

    The guidelines are relevant to others involved in the use of CBT and Internet tests. These include:

     

    Contextual factors

    The guidelines are intended to be applicable internationally. Many factors may affect how standards may be managed and realised in practice. These contextual factors have to be considered at the local level when interpreting these guidelines and defining what they would mean in practice within any particular setting.

    The factors that need to be considered for turning the guidelines into specific standards include: