The accuracy and efficacy of screening tests for Chlamydia trachomatis: a systematic review

Abstract

Screening women for lower genital tract infection with Chlamydia trachomatis is important in the prevention of pelvic inflammatory disease, ectopic pregnancy and infertility. This systematic review aims to state clearly which of the available diagnostic tests for the detection of C. trachomatis would be most effective in terms of clinical effectiveness. The review included all studies published from 1990 onward that evaluated diagnostic tests in asymptomatic, young, sexually active populations. Medline and Embase were searched electronically and key journals were hand-searched. Further studies were identified through the Internet and contact with experts in the field. All studies were reviewed by two reviewers and were scored by Irwig's assessment criteria. Additional quality assessment criteria included a documented sexual history and recording of previous chlamydial infection. The reviews were subjected to meta-analysis and meta-regression. The 30 studies that were included examined three types of DNA-based test – ligase chain reaction (LCR), PCR and gene probe – as well as enzyme immuno-assay (EIA). The results showed that while specificities were high, sensitivities varied widely across the tests and were also dependent on the specimen tested. Pooled sensitivities for LCR, PCR, gene probe and EIA on urine were 96.5%, 85.6%, 92% and 38%, respectively, while on cervical swabs the corresponding sensitivities of PCR, gene probe and EIA were 88.6%, 84% and 65%. Meta-analysis demonstrated that DNA amplification techniques performed best for both urine and swabs in low prevalence populations. We conclude that nucleic acid amplification tests used on non-invasive samples such as urine are more effective at detecting asymptomatic chlamydial infection than conventional tests, but there are few data to relate a positive result with clinical outcome.

Received 19 Nov. 2001; revised version accepted 8 Aug. 2002.

Chlamydia trachomatis is the most common bacterial sexually transmitted infection in Western Europe and women carry the main burden of this disease. The case for national screening programmes has been made but there is a need for more data to show how this can be done most effectively. This review focuses on the best test for the detection of C. trachomatis when used in a screening context.

C. trachomatis is an obligate intracellular gram- negative bacterium. Infection with this agent can be asymptomatic in up to 80% of women [1], which can make diagnosis and detection difficult. Chlamydia has its highest prevalence amongst young men and women. More than 13.5% of women <25 years old have lower genital tract infection, reducing to <4.9% in women over 25 [2].

Left undetected and untreated chlamydia can ascend the upper genital tract, causing inflammation and scarring in both the female and the male reproductive tract [3]. Many reports now indicate that it is the major causative agent in the development of pelvic inflammatory disease (PID) in women. The major sequelae of PID include ectopic pregnancy, tubal factor infertility and chronic pelvic pain. In many countries the incidence of ectopic pregnancy is increasing and it remains the principal cause of maternal death in the first trimester of pregnancy [4]. In addition, chlamydia can be transmitted to the neonate at birth, causing conjunctivitis and pneumonia [5].

The asymptomatic nature of chlamydial infection makes screening essential if control of this infection is to be achieved. In Sweden, policies to reduce the prevalence of infection have been in place since the 1980s and rates of chlamydial infection and its complications have fallen. Because of the severity of the complications of infection with chlamydia and their implications in health economic terms, several other countries including the UK, France, Holland and Finland have now taken action to reduce the prevalence of this infection.

To be effective, a national screening programme must use the most accurate diagnostic test available. Currently there is little or no consensus on which diagnostic tool to use as a screening device or which sampling method to use [6]. The ‘gold standard’ for detection of chlamydia is still considered by many to be cell culture [7]. Culture is 100% specific, but estimates of sensitivity are as low as 50%. The majority of laboratories have moved away from culture, as it is expensive, time-consuming and technically difficult. The use of an expanded gold standard, commonly consistent results with two non-culture techniques, is considered to be more useful as a research tool, but most laboratories use only one non-culture method as their routine test for detection of chlamydia [8].

There is considerable variation in health-care professionals’ knowledge of chlamydial infection. Furthermore, there is a degree of confusion as to which diagnostic tests and sampling methods should be employed [9]. Within the last decade, tests that are based on nucleic acid amplification have become available. They appear to be highly sensitive and specific and these tests have the added advantage that they are effective for use with non-invasive specimens such as urine and vulval swabs [10].

It is important that any test adopted in a national screening programme can be used in the primary care setting by practitioners without the need for expensive training. Furthermore, the ideal test for a screening programme should have the capacity to be used in both sexes. The reason for this is that any screening programme, while initially being directed at young women, should have the potential to involve men, both for contact tracing and for possible expansion of the programme. The reasons for eventual expansion may be three-fold. Firstly, so as not to stigmatise women's sexuality [11], secondly to involve men in the health-care system and not to exclude them, and finally to screen only half the population affected by a condition would be ineffective. Clad et al. [12] demonstrated that screening women detected only 54% of infected couples, but screening men alone detected 81% of infected couples.

The objective of this systematic review is to state clearly which available diagnostic test is the most accurate and effective when used in young, asymptomatic, sexually active populations. For the results to be valid and transferable it is important that the populations examined in the review are similar to the population in which the test would eventually be used in a screening programme. Diagnostic tests perform differently in high and low risk populations. As prevalence varies, so does the positive predictive value [13]. Irwig et al. [14] have set out guidelines to assess studies that examine the usefulness of diagnostic tests. These guidelines have been used to evaluate the screening tests included in this systematic review (Table 1).

Table 1. Irwig's [14] criteria for assessing studies examining diagnostic tests

Methods

Search strategy

Studies from 1990 onwards that assessed the effectiveness of tests used to diagnose C. trachomatis infection were located on the electronic databases Medline, CINAHL and Embase. Relevant journals were hand-searched. The Internet was explored with Lycos, Alta Vista and Excite as search engines and Medscape was also used to detect information and conference proceedings. The bibliographies of included studies were also searched for relevant articles. Experts in the field were contacted by electronic mail for study information. The search included a filter and headings such as chlamydia, exp. diagnosis and mass screening. One reviewer (E.J.W.) examined the titles and abstracts on three occasions. The subset of articles that focused on asymptomatic populations and therefore, was, relevant to the meta-analysis was further evaluated by two reviewers (E.J.W. and J.S.W.) with the inclusion and exclusion criteria. Thirty-two studies that evaluated diagnostic tests for C. trachomatis infection were included in the systematic review.

Selection criteria

Any trial from 1990 onwards that evaluated methods of detecting urogenital infection with chlamydia was included. The lack of randomised controlled studies available in this field meant that only comparative studies were examined.

Population

The patients included in the study had to be sexually active young men or women with no symptoms of chlamydia infection. It was important that they were asymptomatic, as there is evidence that diagnostic tests for chlamydia perform better among patients who are symptomatic, perhaps because of an increased elementary body load. The age range of the populations in the included studies was 14–40 years.

Setting

Papers that described populations with a low prevalence, taken as ≤5% [15], were included regardless of the setting and studies that were set in primary care or a family planning clinic were included regardless of the prevalence of chlamydia infection.

Intervention

The diagnostic tests examined for detection of C. trachomatis were nucleic acid amplification techniques (PCR and LCR), gene probes (GP), enzyme immuno-assay (EIA) and direct immunofluorescence (DFA). The leucocyte esterase test (LET) was also examined to determine if it would be a useful screening tool. All were compared to culture or an expanded gold standard. The sensitivity of culture was calculated by comparing it with two non-culture techniques. All methods of sample collection were reviewed.

Outcome

Detection of chlamydia in the lower genital tract of men and women.

Study quality

The review process was not blind to study authorship, as there is no proof that this adds any quality measure to the review. The quality of the studies will affect the validity of the result and, therefore, study quality was assessed by the criteria suggested by Irwig et al. [14]. Studies were excluded if study design was considered to be poor, as judged by an Irwig score of <5 out of 10.

Statistical methods

The statistical packages used to evaluate the diagnostic tests included meta-test software kindly supplied by J. Lau (JLau1@Lifespan.org) and meta-analysis software in Rev man 4.01 (Update Software, Oxford, UK). SPSS (Chicago, IL, USA) was also used.

Results

This systematic review and the subsequent meta-analysis included all methods for the diagnosis of urogenital chlamydia shown in Table 2, compared to a gold standard. The gold standard is culture for chlamydia performed as described by Mardh et al. [16] or chlamydia diagnosed by two non-culture tests, now known as the expanded gold standard [17]. The focus of the meta-analysis was to provide an overall summary of diagnostic test accuracy for detection of asymptomatic chlamydia infection. The leucocyte esterase test (LET), while not a diagnostic test for C. trachomatis, was included in this review as it has been evaluated for screening purposes. Several hundred study abstracts were examined for potential inclusion; 74 studies were identified for further evaluation.

Table 2. Description of methods available for the detection of C. trachomatis

Study characteristics

Each study was evaluated by the guidelines set out by Irwig et al. [14] as indicated previously and the results are shown in Table 3. Thirty-two studies [17–48] were identified for possible inclusion in the meta-analysis, but two were excluded because of study quality [47, 48]. Forty-two studies did not meet the inclusion criteria when the papers were examined (Table 4) [49–90]. The gold standard was tissue culture of a cervical or urethral swab in 28 of the studies and 2 non-culture techniques used either on urine or cervical swabs in the remainder. When culture was the modality under examination, the sensitivity was calculated by comparing the results with two non-culture techniques. Six (19%) of the studies indicated that results were read blind, i.e., without knowledge of the result of the gold standard. However, many of the tests evaluated were assessed as positive or negative with automated equipment, which by its own nature is blinded. Unless it was stated in the methodology that the results were read blind this was recorded as unknown. One-third of the studies did not recruit consecutive patients. This may be due in part to the nature of the disease being tested, as it is often difficult to gain consent from patients for an invasive test for a sexually transmitted disease. Thirty of the studies performed verification of negative results in addition to the positive results. In all these studies the technique and study methodology were well described.

Table 3. Included studies and Irwig score [14]

Table 4. Excluded studies

The investigators had taken and documented a sexual history in only seven of the studies. The validity of a sexual history has been called into question and patients will often say what they feel is expected, so while this element is interesting it was not a basis for exclusion.

Study outcomes

Studies that reported test accuracy commonly did so in terms of sensitivity and specificity. Sensitivity is the ability of the test to correctly identify those with the disease; the specificity is the ability of the test to identify those who do not have the disease. Some of the tests examined performed better than the gold standard and this makes evaluation difficult. In this situation, tests are compared to an expanded gold standard. Data from each study were re-examined to establish sensitivity and specificity, but missing data in some studies made re-analysis difficult. A scatter plot of the tests’ sensitivity to inverse specificity was plotted for each test to illustrate the test's accuracy and is a useful visual guide to the variability of performance (Fig. 1). An ideal test would have a sensitivity of 100% and an inverse specificity of zero. The number of false-negative results in each study was summed to calculate a pooled sensitivity for the test types by specimen type. The studies were analysed in subgroups according to specimen type and test, as shown in Fig. 2. Only studies involving asymptomatic populations in a primary care or low prevalence setting were examined. This enabled evaluation of the test under more searching conditions. Certain tests for the detection of chlamydia may perform best when the elementary body load is high or if, as with all tests, the prevalence of the condition is high. Studies were performed in a wide range of locations in different types of population. The mean prevalence of chlamydia infection among the populations studied was 4.5%. This reflects the expected prevalence in all age groups tested in primary care. However, the range was from <1% to 15%. This variation illustrates the difference in prevalence among primary care settings depending on risk factors and location. It may also represent differences in prevalence between countries that offer screening compared with those that do not.