Citation: Sandgren A, Strong M, Muthukrishnan P, Weiner BK, Church GM, et al. (2009) Tuberculosis Drug Resistance Mutation Database. PLoS Med 6(2): e1000002. doi:10.1371/journal.pmed.1000002
Published: February 10, 2009
Copyright: © 2009 Sandgren et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: AS and MBM were funded through the Ellison Foundation. AS was also funded through a postdoctoral stipend from the Swedish Society for Medical Research/Stiftelsen för Medicinsk Forskning and Stiftelsen Olle Engqvist Byggmästare. The funding sources had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript or the Web site adjoining the publication.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: MDR, multidrug resistant; TB, tuberculosis; TBDReaMDB, TB Drug Resistance Mutation Database; XDR, extensively drug resistant
Provenance: Not commissioned, externally peer reviewed
Resistant TB Is a Serious Threat to Global Health
Tuberculosis (TB) remains the leading cause of death from a largely preventable and curable infectious disease, with an estimated 1.7 million deaths in 2006 . Global prospects for TB control are challenged by the emergence of drug-resistant strains, especially those that are multidrug resistant (MDR) and extensively drug resistant (XDR) .
Soon after anti-TB drugs became available in the 1940s came reports of drug resistance among patients undergoing treatment . With the advent of “short course chemotherapy” in the 1980s, the duration of treatment fell from 24 to six months, but even then full adherence to treatment regimens has been difficult to accomplish, due to the extensive length of therapy necessary to achieve cure. The prevalence of TB resistant to a single drug was continuously on the rise in several parts of the world, and eventually in the early 1990s, multiple converging factors led to an explosive emergence of MDR-TB, defined as resistance to the two most effective first-line anti-TB agents, isoniazid and rifampicin.
The most recent estimates on the prevalence of anti-TB drug resistance come from surveys conducted by the World Health Organization and the International Union Against Tuberculosis and Lung Disease. These organizations investigated both new and previously treated TB cases in 93 geographical settings between 2002–2006 . In these surveys, the prevalence of MDR-TB ranged from 0% to 22% among newly diagnosed cases and from 0% to 60% among previously treated cases. In addition, since 2002, 45 countries have reported cases of XDR-TB, i.e., TB that is resistant not only to isoniazid and rifampicin but also to at least one fluoroquinolone and to any of the following injectable second-line drugs: kanamycin, amikacin, or capreomycin. Of the MDR isolates tested for second-line drugs, 0%–30 % were found to be XDR .
- The emergence and spread of drug-resistant TB threatens global TB control efforts.
- There is an urgent need for new diagnostic tests that rapidly identify drug sensitivity profiles of M. tuberculosis strains and for new drugs to treat resistant disease.
- We have established a comprehensive database listing mutations associated with TB drug resistance and the frequency of the most common mutations associated with resistance to specific drugs.
- We have made this interactive database publicly available on the Web site http://www.tbdreamdb.com/ so that it can serve as a resource for the development of molecular diagnostics for TB.
- Other uses of the database include structural mapping of mutations to study mechanisms of resistance for drug discovery purposes.
Identification of Drug Resistance Mutations in TB
Drug resistance in TB is believed to be mediated exclusively by chromosomal mutations, which affect either the drug target itself or bacterial enzymes that activate prodrugs. Since the early 1990s, numerous studies have described the genetic mechanisms of drug resistance in Mycobacterium tuberculosis, and a wealth of data has accumulated on the mutations found in isolates resistant to specific drugs . For some drugs, like isoniazid and rifampicin, a large number of mutations have been identified that confer resistance, and these mutations account for most of the resistance found among clinical isolates. For other drugs, such as streptomycin and many of the second-line drugs, known drug resistance mutations occur in only a small proportion of resistant isolates . As resistance has become even more prevalent, and high-throughput sequencing methods and genotyping strategies have been developed and refined, large-scale studies to identify the mutations associated with resistance have been undertaken throughout the world. Until now, there has been no single resource where such drug resistance mutation data are readily available to use in research and development. In order to propel the use of the data that are scattered over a large number of publications, many of which are not readily accessible to researchers around the globe, we have compiled the data available on drug resistance mutations in M. tuberculosis in an open-access database that can be used for a diverse array of purposes.
Applications of the Database
Centralized databases, curated from the literature or high-throughput experiments [6,7], have helped usher in the era of systems biology , and have helped researchers identify global trends and important biological features that would have only been apparent through a comprehensive analysis of the combined data . Examples include protein–protein interaction databases [10,11], genomic databases [12,13], and microarray databases [14–16]. Through this project we present a new database devoted to drug resistance mutations in TB, called the TB Drug Resistance Mutation Database (TBDReaMDB). By providing a comprehensive, single resource of drug resistance mutations in TB, we hope to accelerate and encourage new discoveries that will have applications ranging from diagnostics to drug discovery. We envision that this database will expand as additional mutations are identified in the coming years and will serve as a platform for diverse analyses and projects. TBDReaMDB also complements other TB-related databases such as TBDB (http://www.tbdb.org/), a recently developed Web-based resource housing both annotated genome sequence data and expression data from M. tuberculosis and related species .
Resource for sequencing projects.
The creation of this database was originally precipitated by our in-house need for information on known mutations found in drug-resistant TB, to be used in a large-scale whole-genome sequencing project of MDR- and XDR-TB strains . Whole-genome sequencing identifies a large number of mutations among isolates from diverse geographic regions that exhibit varying degrees of drug resistance. The data in the mutation database can be used to screen the mutations discovered by sequencing, thereby allowing the user to distinguish mutations and genes that previously have been associated with resistance from newly identified resistance-associated mutations, or random mutations that serve as markers of the genetic lineage of the particular strain.
Development of diagnostics.
New tools for rapid and accurate diagnosis of drug-resistant TB are urgently needed. Sequence-based diagnostic methods have been developed that detect specific mutations associated with drug resistance. These tools have the advantage of being rapid, high-throughput, and easily compared between laboratories. However, the development of such diagnostic tools relies on detailed information about the mutations that lead to drug resistance and their relative frequency. Currently the two main diagnostic tests available commercially are the INNO-LiPA TB test (Innogenetics)  and the GenoType MTBDRplus kit (Hain Lifescience) . These assays have recently been approved by the World Health Organization as a tool for rapid MDR-TB diagnosis . These “line probe assays” are based on the detection of a set number of mutations in a few genes associated with resistance against isoniazid and rifampicin. These assays may therefore be unable to detect all isoniazid and rifampicin resistance and do not detect resistance to other first-line or second-line drugs. This database project establishes a foundation that we hope will spur future development of assays with improved sensitivity and specificity and the expansion of these tools to detect resistance to other first- and second-line drugs. In addition, the data may also be used in future research to understand the clinical importance of rare mutations.
Geographical distribution and surveillance of mutations.
Differences in the type and frequency of mutations in drug-resistant isolates in diverse geographical regions can pose challenges for the development of a sequence-based diagnostic, especially if the test relies on detecting a limited number of mutations. Different drugs of the same drug class may also interact differently with the molecular drug targets and thus affect the mutations selected for [22,23]. In addition, some specific mutations have been shown to lead to high-level resistance, while others confer low-level resistance to a specific drug . Others may lead to cross-resistance to drug analogs . Regional differences in drug analogs used have been shown to select for different drug resistance mutations; for example, mutations found in regions that use specific rifamycin analogs differ from those in which other analogs are used . The mutations that arise from these different drug analog exposures may help explain some of the reported geographical differences in drug efficacy . We have therefore included the geographic sites of resistance studies in our database. As more data on the frequency of mutations become available through large-scale sequencing projects, we will garner an even more comprehensive understanding of the global distribution of specific mutations and a better perception of the role of geographical differences.
The database can also be used for novel computational technologies in drug discovery. Mutations may be mapped onto protein structures in order to better understand the mechanisms of resistance at the molecular and atomic level. Such analyses may help us to better understand how particular mutations lead to resistance, how mutations impact the affinity or activation of corresponding drugs, and how mutations may impact the fitness of the bacteria. Trends, such as correlated or synergistic mutations, may also become apparent through the comprehensive analysis of the data on mutations associated with resistance. In addition to revealing the mechanisms of resistance on a global scale, the resulting mapping of mutations may also suggest drug modifications or alternative strategies to counter emerging drug resistance in TB.
A New Comprehensive Database
TBDReaMDB is a comprehensive resource on drug resistance mutations in M. tuberculosis. We conducted a systematic review (see Text S1 for a brief description of the systematic review strategy) to identify drug resistance mutations from the existing literature to include in the database. We chose not to make any a priori decisions as to whether the mutations described in the literature actually confer drug resistance or are possible secondary compensatory mutations, but instead included all mutations that have been described in drug-resistant strains more often than drug-sensitive isolates. For each mutation, the database provides complete codon changes for each mutation at both the nucleotide and amino acid level.
Systematic compilation of mutations found in clinical isolates of drug-resistant TB.
The database is divided into two parts. The first part lists all the unique mutations reported in drug-resistant TB isolates, as well as information on the time period of isolate collection, country of origin, molecular detection method, resistance pattern, and susceptibility testing method. Since many of the mutations are reported in multiple publications, we included only the first report that identified a specific mutation. As of September 1, 2008, TBDReaMDB contains 946 unique mutations associated with seven different drug classes and spread over 36 genes, two intergenic/promoter regions, and one ribosomal RNA coding region.
Frequency of high-confidence mutations associated with drug resistance in TB.
The second part of the database provides data on the relative frequency of the most common mutations associated with resistance to specific drugs, as reported in surveys from diverse geographical sites. For each drug, we extracted data from ten high-quality publications that reported the frequency of mutations associated with resistance to that drug. In addition to the information about the frequency of specific mutations among resistant and sensitive isolates, we have also extracted the type of article-specific data, including time period of isolate collection and country of origin. Such frequency data might be useful for the design and validation of sequence-based diagnostic tools to estimate the number of genes or mutations needed to develop a sensitive and specific assay. These data may also serve as a baseline for surveillance of geographical and temporal trends in the prevalence of specific mutations, as well as for research on the implications of geographical differences in mutation frequencies for development of new diagnostic tools.
Open Access through an Interactive Web Site
We have created an interactive Web site that allows users to visualize all the specific mutations associated with resistance to each drug (http://www.tbdreamdb.com/). For more information on the TBDReaMDB Web site, please see Box 1. We envision that this Web site, in addition to presenting the literature in a manually curated database, also will serve as a gateway to post data from future research and development undertaken by its users. In keeping with the scientific philosophy of open access to the public, the database is open to the public and free under the Creative Commons Attribution 3.0 Unported License (http://creativecommons.org/licenses/by/3.0/). We hope that the Web site will be a portal to communicate future research on drug resistance mechanisms, and to spur development of novel rapid diagnostic tools and drug discovery.
Box 1. The TBDReaMDB Web Site
The TBDReaMDB Web site (http://www.tbdreamdb.com/) was launched on September 1, 2008. All the data collected as part of this study are presented on the Web site and accessible to the public. The data are free of usage but recognition of the data source is required; this article should be referred to as well. The Web site is organized by drug and presents all the information about the mutations described in the literature. The mutations presented have been color coded to highlight the high-confidence mutations for which frequency data are available. For an example of the interface of the Web site, see Figure 1. The data are also accessible in the form of global Excel and tab-delimited spreadsheets for all mutations. The database developed through this project has been constructed to allow continuous updates as additional mutations in clinical TB isolates are reported. To submit new mutations or to give us feedback you may e-mail us at the address found on the homepage of the Web site. On the Web site you can also find all the criteria that must be fulfilled for the mutations to be qualified for inclusion in the database.
The current version of TBDReaMDB, as of September 1, 2008, is based on literature available up to January 2008. The database will be kept updated and manually curated to provide an accurate picture of the current distribution of mutations associated with drug resistance in TB. To maintain the database up to date, we will include new information submitted to the mutations lists and mutation prevalence data sets.
Submission of new mutations or frequency data.
We will include novel mutations found in clinical isolates of drug-resistant TB if they (1) are from published studies of clinical M. tuberculosis isolates, (2) occur in isolates that have been characterized by phenotypic drug sensitivity testing, and (3) are identified by specification of the gene, nucleotide position, and the nucleotide and/or amino acid change. Mutations that meet these criteria may be submitted through the Web site and will be added continuously after review.
For studies to be included in the database describing the frequency of common mutations associated with drug resistance in M. tuberculosis, they must (1) be studies of clinical M. tuberculosis isolates; (2) have large sample sizes (a minimum of 100 resistant isolates are required for isoniazid and rifampicin; for the other drugs sample sizes have been determined empirically depending on the ten largest studies published so far—see Web site); (3) report on phenotypic drug sensitivity testing for all isolates; (4) use validated methods to identify drug resistance mutations; (5) identify the nucleotide position and the nucleotide change; and (6) specify the number of resistant and sensitive isolates carrying a specific mutation. The frequency database will be updated regularly with the most recent data.
Posting of results from future applications for open access.
We will customize the Web site to house results and references from projects using the information available through our resource. Scientists who wish to post data on the Web site may submit an inquiry for space to present results from data analysis, graphics, or links to their own Web site or published articles.
This project is meant to equip the efforts on research and development of novel technologies and approaches needed for the global surveillance and control of drug-resistant M. tuberculosis, as well as for the diagnosis and clinical care of individual patients with this disease. We have assembled a comprehensive database of putative and well-established resistance mutations and presented the relative frequency and geographical distribution of the most common high-confidence mutations. TBDReaMDB is currently the largest and only open-access database for mutations associated with drug-resistant TB. The most important aspect of this project is that all the data are readily available for open access through a publicly accessible Web site. These data constitute an important resource for the TB research community and others to spur further research on drug resistance mechanisms that eventually may lead to the development of novel diagnostic tools and therapeutics to combat resistant TB.
Text S1. Systematic Literature Review: Methods
(48 KB DOC).
AS had full access to all of the data in the study and takes responsibility for the integrity and accuracy of the data. MS and GMC participated in project concept and drafting of the manuscript. PM participated in data acquisition and final revision of the manuscript. BKW built and programmed the Web site and participated in final revision of the manuscript. AS and MBM were responsible for project concept and design, acquisition of data, analysis and interpretation of data, drafting of the manuscript, and critical revision of the manuscript for important intellectual content. MBM supervised the study.
- 1. World Health Organization (2008) Global tuberculosis control: Surveillance, planning, financing. Available: http://www.who.int/tb/publications/global_report/en/. Accessed 5 January 2009.
- 2. Jain A, Mondal R (2008) Extensively drug-resistant tuberculosis: Current challenges and threats. FEMS Immunol Med Microbiol 53: 145–150.
- 3. Iseman MD (1994) Evolution of drug-resistant tuberculosis: A tale of two species. Proc Natl Acad Sci U S A 91: 2428–2429.
- 4. World Health Organization (2008) Anti-tuberculosis drug resistance in the world. Fourth global report. Available: http://www.who.int/tb/publications/2008/drs_report4_26feb08.pdf. Accessed 5 January 2009.
- 5. Ramaswamy S, Musser JM (1998) Molecular genetic basis of antimicrobial agent resistance in Mycobacterium tuberculosis: 1998 update. Tuber Lung Dis 79: 3–29.
- 6. Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, et al. (2005) Towards a proteome-scale map of the human protein-protein interaction network. Nature 437: 1173–1178.
- 7. Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, et al. (2000) A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403: 623–627.
- 8. Hood L, Heath JR, Phelps ME, Lin B (2004) Systems biology and new technologies enable predictive and preventative medicine. Science 306: 640–643.
- 9. Rives AW, Galitski T (2003) Modular organization of cellular networks. Proc Natl Acad Sci U S A 100: 1128–1133.
- 10. Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, et al. (2007) IntAct—Open source resource for molecular interaction data. Nucleic Acids Res 35: D561–D565.
- 11. Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, et al. (2002) DIP, the Database of Interacting Proteins: A research tool for studying cellular networks of protein interactions. Nucleic Acids Res 30: 303–305.
- 12. Liolios K, Mavromatis K, Tavernarakis N, Kyrpides NC (2008) The Genomes On Line Database (GOLD) in 2007: Status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 36: D475–D479.
- 13. Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, et al. (2008) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 36: D13–D21.
- 14. Demeter J, Beauheim C, Gollub J, Hernandez-Boussard T, Jin H, et al. (2007) The Stanford Microarray Database: Implementation of new analysis tools and open source release of software. Nucleic Acids Res 35: D766–D770.
- 15. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, et al. (2007) NCBI GEO: Mining tens of millions of expression profiles—Database and tools update. Nucl Acids Res 35: D760–D765.
- 16. Parkinson H, Kapushesky M, Shojatalab M, Abeygunawardena N, Coulson R, et al. (2007) ArrayExpress—A public database of microarray experiments and gene expression profiles. Nucl Acids Res 35: D747–D750.
- 17. Reddy TB, Riley R, Wymore F, Montgomery P, Decaprio D, et al. (2008) TB database: An integrated platform for tuberculosis research. Nucleic Acids Res 37: D499–D508. doi:10.1093/nar/gkn652.
- 18. Koenig R (2007) Tuberculosis. Few mutations divide some drug-resistant TB strains. Science 318: 901–902.
- 19. Morgan M, Kalantri S, Flores L, Pai M (2005) A commercial line probe assay for the rapid detection of rifampicin resistance in Mycobacterium tuberculosis: A systematic review and meta-analysis. BMC Infect Dis 5: 62.
- 20. Ling D, Zwerling AA, Pai M (2008) GenoType MTBDR assays for diagnosis of multidrug-resistant tuberculosis: A meta-analysis. Eur Respir J 32: 1165–1174.
- 21. World Health Organization (2008) Molecular line probe assays for rapid screening of patients at risk of multidrug-resistant tuberculosis (MDR-TB). Policy statement. Available: http://www.who.int/tb/dots/laboratory/lpa_policy.pdf. Accessed 5 January 2009.
- 22. Anthony R, Schuitema A, Bergval I, Brown T, Oskam L, et al. (2005) Acquisition of rifabutin resistance by a rifampicin resistant mutant of Mycobacterium tuberculosis involves an unusual spectrum of mutations and elevated frequency. Ann Clin Microbiol Antimicrob 4: 9.
- 23. Zhou J, Dong Y, Zhao X, Lee S, Amin A, et al. (2000) Selection of antibiotic-resistant bacterial mutants: Allelic diversity among fluoroquinolone-resistant mutations. JInfect Dis 182: 517–525.
- 24. Moghazeh SL, Pan X, Arain T, Stover CK, Musser JM, et al. (1996) Comparative antimycobacterial activities of rifampin, rifapentine, and KRM-1648 against a collection of rifampin-resistant Mycobacterium tuberculosis isolates with known rpoB mutations. Antimicrob Agents Chemother 40: 2655–2657.
- 25. Williams DL, Spring L, Collins L, Miller LP, Heifets LB, et al. (1998) Contribution of rpoB mutations to development of rifamycin cross-resistance in Mycobacterium tuberculosis. Antimicrob Agents Chemother 42: 1853–1857.
- 26. Qian L, Abe C, Lin TP, Yu MC, Cho SN, et al. (2002) rpoB genotypes of Mycobacterium tuberculosis Beijing family isolates from East Asian countries. J Clin Microbiol 40: 1091–1094.