- DoTS (EPConDB)
- We provide links from MGI markers to computationally derived transcript assemblies from DoTS, the Database Of Transcribed Sequences, developed at the Computational Biology and Informatics Laboratory (CBIL) at the University of Pennsylvania. See the EPConDB query page for a specifics). Associations between MGI markers and assemblies are based on shared GenBank sequence accession identifiers. Only associations between MGI genes and DTs that are supported by non-conflicting evidence are accessible for these resources. Links are updated after every major change in MGI databases.
- DFCI - The Gene Index Project at Dana Farber Cancer Institute
- We provide links from MGI markers to computationally derived transcript assemblies from DFCI Mouse, Human, and Rat Gene Indices (developed originally at TIGR). The associations between MGI markers and assemblies are based on shared GenBank sequence accession identifiers. Only associations between MGI genes and TCs supported by non-conflicting evidence are accessible for these resources. Links are updated weekly as well as after every major change in MGI databases.
- We collaborate with the Ensembl Genome Database Project to provide links between Ensembl mouse gene models and MGI markers. Ensembl is a joint project between the
European Bioinformatics Institute/European Molecular Biology Laboratory (EMBL-EBI) and the Wellcome Trust Sanger Institute. We provide links from our marker detail page to the Mouse GeneView of the Ensembl Genome Browser. Marker detail pages also link to Ensembl phylogenetic Gene Trees.
We also obtain chromosome map locations for MGI genes associated with Ensembl gene models.
- Generic Model Organism Project
We collaborate with the Generic Model Organism Project (GMOD), a joint effort by the model organism system databases WormBase, FlyBase, MGI, SGD, Gramene, Rat Genome Database, EcoCyc, and TAIR to develop reusable components suitable for creating new community databases of biology.
- We based the MGI Mouse Genome Browser, a tool for manipulating and displaying genome annotations, on GBrowse, a generic genome browser.
- Gene Ontology Consortium (GO)
- We are part of the Gene Ontology Consortium that also currently includes Flybase and SGD (Saccharomyces Genome Database). We are developing extensive, structured vocabularies for molecular function, biological process, cellular component and annotating genes/gene products within each of our species' databases using these vocabularies. These ontologies and annotations will be accessible from our species' database web sites and from a jointly developed web site.
- IGTC - International Gene Trap Consortium
- We collaborate with phenotyping/mutagenesis/targeting programs such as the International Mouse Mutagenesis Consortium, the Complex Trait Consortium, the Phenome Project, the International Gene Trap Consortium, and the Knockout Mouse Project (KOMP).
- IMAGE Consortium (Integrated Molecular Analysis of Genomes and their Expression) and
WashU (Washington University)
- We work with the IMAGE consortium to pre-assign MGI accession numbers to IMAGE clones, which are, in turn, sequenced by Washington University. The IMAGE and MGI accession identifiers travel in pairs and are submitted to dbEST (database of Expressed Sequence Tags) directly with the sequencing data. Thus the connection between an EST, the clone it was derived from, and the MGI accession identifier is generated.
- International Committee on Standardized Genetic Nomenclature for Mice,
HUGO Gene Nomenclature Committee, and Rat Genome and Nomenclature Committee Board.
- We work closely with the human and rat international nomenclature groups to assign gene symbols wherever feasible, to evaluate gene family nomenclature, and to maintain consistent use of nomenclature for human, mouse and rat species.
- We provide links from MGI microRNA Gene Detail pages to miRBase, home of microRNA data, which incorporates database and gene naming roles previously provided by the miRNA Registry. The miRBase Sequence Database contains all published miRNA sequences, genomic locations, and associated annotation.
- Mouse Atlas and 3D Graphical Gene Expression Database
- We collaborate with the Edinburgh Mouse Atlas and Gene Expression Database project to develop and extend an anatomical dictionary of mouse development and to build the Mouse Gene Expression Information Resource that will combine the use of controlled vocabularies and image-based methods for the representation and analysis of expression patterns. The standard anatomical terms are, or will be, used throughout our database wherever anatomical data are found (e.g., expression data, clone library information, phenotypic descriptions).
- The MouseCyc database is part of the BioCyc collection of biological pathway databases created using the Pathway Tools Software Development kit created by Dr. Peter Karp and his research group at SRI, International in Palo Alto, California.
- Mouse Genome Sequencing Consortium
- We collaborate with members of the MGSC, a public-private partnership of institutes involved in sequencing and genomics. The MGSC aims to accelerate, facilitate and coordinate global mouse genomic sequencing efforts. Funding for is provided by the National Institutes of Health, the Wellcome Trust, GlaxoSmithKline, the Merck Genome Research Institute and Affymetrix Inc. Sequencing partners are the Sanger Institute, the Whitehead Institute for Biomedical Research and Washington University School of Medicine.
- NCBI (National Center for Biotechnology Information)
- Working with Entrez, we provide official nomenclature for mouse genes, our curated links between genes and sequence identifiers in GenBank, chromosomal, cytogenetic and centimorgan map positions, and MGI accession numbers. Entrez uses these curated data to develop reference sequences for mouse and notifies us of new GenBank mouse sequences submitted. Working with UniGene, we provide gene-oriented clusters of transcript sequences.
- OMIM (Online Mendelian Inheritance in Man)
- We collaborate with the editors of the Online Mendelian Inheritance in Man (OMIM) database at Johns Hopkins). OMIM is a catalog of human genes and genetic disorders, containing textual information, pictures, and reference information as well as links to NCBI's data resources.
- We collaborate with the Protein Information Resource (PIR), a public resource of protein informatics based at George Washington University, which hosts the PIRSF database.
- RIKEN Genomic Sciences Center
- We collaborate with the Genome Exploration Research Group of the RIKEN Institute in Japan to provide public access to RIKEN cDNA clone information through MGI. These clones were isolated and sequenced at the RIKEN Institute as part of a series of public cDNA clone releases that constitute the Mouse Genome Encyclopedia, a genomics project centered on the production of full-length mouse cDNAs. The RIKEN Institute hosted a meeting for first-pass functional annotation of the first set of 21,076 cDNAs (Functional Annotation of Mouse, FANTOM), attended by an international assembly of mammalian biologists and bioinformaticians. Links are provided to DNA Database of Japan (DDBJ) records and to analysis results and summary annotations from the FANTOM meeting for these RIKEN cDNA clones.
- We collaborate with UniProt, (Universal Protein Resource), a central repository of protein sequence and function created by joining the information contained in SWISS-PROT, TrEMBL, and PIR.
- UCSC Genome Browser
- We provide links from MGI markers to the UCSC Genome Browser, a central repository for known genes from UniProt, RefSeq, and GenBank mRNA, created by the Genome Bioinformatics Group of UC Santa Cruz.
- We collaborate with Vertebrate Genome Annotation (VEGA), a central repository for high quality, frequently updated, manual annotation of vertebrate finished genome sequence. Vega is funded by the Wellcome Trust Sanger Institute. From our marker detail pages, we link to sequence maps in Vega Mouse ContigView and to curated locus reports of representative sequences in Vega Mouse GeneView.
- MGI provides links to an increasing number of databases on the WWW and many of these sites provide links to MGI as well. We wish to acknowledge these valuable scientific information resources.
- ArkDB (Roslin Institute, Edinburgh, Scotland)
- ARK is currently used by the Pig, Chicken, Sheep, Cattle, Cat, and Horse genome mapping communities. The ARK system is currently installed at the Roslin Institute where it was developed. There are ARK mirror nodes in various locations around the world.
- ATCC (American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD)
- The mission of the ATCC is to acquire, authenticate, and maintain reference cultures, related biological materials, and associated data, and to distribute these to qualified scientists in government, industry, and education.
- BovMap (Institut National de Recherche Agronomique, Laboratoire de genetique biochimique - Jouy-en-Josas)
- The BovMap database offers to the scientific community working on the bovine genome a useful and practical
tool to follow the rapid progress of the mapping effort.
- dbEST (NCBI - Database for Expressed Sequences Tags)
- dbEST is a division of GenBank containing sequence data and other information on "single-pass" cDNA sequences, or Expressed Sequence Tags, from a number of organisms.
- dbSNP serves as a central repository for single nucleotide polymorphisms (SNPs), multiple nucleotide polymorphisms (MNPs), and short insertion/deletion polymorphisms (IN-DELs). dbSNP is the official source of SNP-based mouse genomic variation in MGI.
- DDBJ (DNA Data Bank of Japan (DDBJ), National Institute of Genetics, Mishima, Japan)
- DDBJ is the only DNA data bank based in Japan. Sequence information is received from researchers from Japan and from other countries, and from other sequence databases, including EMBL and GenBank.
- EMBL-EBI (Databases at the European Bioinformatics Institute Cambridge, United Kingdom)
- The European Bioinformatics Institute (EBI) of the European Molecular Biology Laboratory (EMBL) maintains a number of databases including the EMBL Nucleotide Sequence Database and the SWISS-PROT Protein Sequence Database.
- ENZYME (Enzyme nomenclature database, maintained at the
ExPASy molecular biology server of the Geneva University Hospital and the University of Geneva, Switzerland)
- ENZYME is a repository of information relative to the nomenclature of enzymes. It is primarily based on the recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) and it describes each type of characterized enzyme for which an EC (Enzyme Commission) number has been provided.
- FlyBase (FlyBase is available on the web from various sites)
- FlyBase is a comprehensive database for information on the genetics and molecular biology of Drosophila. It includes data from the Drosophila Genome Projects and data curated from the literature. FlyBase is a joint project with the Berkeley and European Drosophila Genome Projects.
- GenBank (Produced and maintained by NCBI, NLM, and NIH, Bethesda, Maryland)
- GenBank is an interface providing access to three databases: a subset of the National Library of Medicine's PubMed database, the NCBI protein database, and the NCBI nucleotide database. GenBank is produced and maintained by the National Center for Biotechnology Information (NCBI). NCBI is responsible for building, maintaining, and distributing GenBank, the NIH genetic sequence database that collects all known DNA sequences from scientists worldwide. The NCBI is a division of the National Library of Medicine (NLM) and is located on the campus of the National Institutes of Health(NIH)in Bethesda, Maryland.
- NCBI (National Center for Biotechnology Information)
- NCBI is a division of the National Library of Medicine (NLM) and is located on the campus of the National Institutes of Health (NIH) in Bethesda, Maryland. NCBI is responsible for building, maintaining, and distributing GenBank, the NIH genetic sequence database for all known DNA sequences from scientists worldwide; Entrez, a database of information on official nomenclature, aliases, sequence accessions, phenotypes, EC numbers, MIM numbers, Unigene clusters, orthology, and map locations; and PubMed, a bibliographic database.
- PiGBASE (Genome database of the pig maintained at The Roslin Institute, Edinburgh, Scotland)(1995)
- A North American node of PiGBASE is maintained at the Iowa State University.
- WashU (Genome Sequencing Center, Washington University School of Medicine St. Louis, MO)
- The Genome Sequencing Center two-year project will produce expressed tag sequences (ESTs) from approximately 400,000 mouse cDNAs.