- Alliance of Genome Resources (AGR)
- We are a founding member of the Alliance of Genome Resources, whose goal is to provide a consistent user interface with an integrated view of data from 6 Model Organism Databases and the GO Consortium.
- We collaborate with the Ensembl Genome Database Project to provide links between Ensembl mouse gene models and MGI markers. Ensembl is a joint project between the
European Bioinformatics Institute/European Molecular Biology Laboratory (EMBL-EBI) and the Wellcome Trust Sanger Institute. We provide links from our marker detail page to the Mouse GeneView of the Ensembl Genome Browser. Marker detail pages also link to Ensembl phylogenetic Gene Trees. We also obtain chromosome map locations for MGI genes associated with Ensembl gene models.
- Generic Model Organism Project
We collaborate with the Generic Model Organism Project (GMOD), a joint effort by the model organism system databases WormBase, FlyBase, MGI, SGD, Gramene, Rat Genome Database, EcoCyc, and TAIR to develop reusable components suitable for creating new community databases of biology.
- We based the MGI Mouse Genome Browser, a tool for manipulating and displaying genome annotations, on JBrowse, a generic genome browser.
- Gene Ontology Consortium (GO)
- We are part of the Gene Ontology Consortium that also currently includes Flybase and SGD (Saccharomyces Genome Database). We are developing extensive, structured vocabularies for molecular function, biological process, cellular component and annotating genes/gene products within each of our species' databases using these vocabularies. These ontologies and annotations will be accessible from our species' database web sites and from a jointly developed web site.
- IGTC - International Gene Trap Consortium
- We collaborate with phenotyping/mutagenesis/targeting programs such as the International Mouse Mutagenesis Consortium, the Complex Trait Consortium, the Phenome Project, the International Gene Trap Consortium, and the Knockout Mouse Project (KOMP).
- IMAGE Consortium (Integrated Molecular Analysis of Genomes and their Expression) and WashU (Washington University)
- We work with the IMAGE consortium to pre-assign MGI accession numbers to IMAGE clones, which are, in turn, sequenced by Washington University. The IMAGE and MGI accession identifiers travel in pairs and are submitted to dbEST (database of Expressed Sequence Tags) directly with the sequencing data. Thus the connection between an EST, the clone it was derived from, and the MGI accession identifier is generated.
- MouseMine is based on the open source InterMine data warehouse system developed by the Micklem Lab at the University of Cambridge. InterMine instances offer flexible and iterative batch querying, built-in enrichment analysis, and web service access.
- International Committee on Standardized Genetic Nomenclature for Mice, HUGO Gene Nomenclature Committee, and Rat Genome and Nomenclature Committee Board.
- We work closely with the human and rat international nomenclature groups to assign gene symbols wherever feasible, to evaluate gene family nomenclature, and to maintain consistent use of nomenclature for human, mouse and rat species.
- We provide links from MGI microRNA Gene Detail pages to miRBase, home of microRNA data, which incorporates database and gene naming roles previously provided by the miRNA Registry. The miRBase Sequence Database contains all published miRNA sequences, genomic locations, and associated annotation.
- Mouse Genome Sequencing Consortium
- We collaborate with members of the MGSC, a public-private partnership of institutes involved in sequencing and genomics. The MGSC aims to accelerate, facilitate and coordinate global mouse genomic sequencing efforts. Funding for is provided by the National Institutes of Health, the Wellcome Trust, GlaxoSmithKline, the Merck Genome Research Institute and Affymetrix Inc. Sequencing partners are the Sanger Institute, the Whitehead Institute for Biomedical Research and Washington University School of Medicine.
- NCBI (National Center for Biotechnology Information)
- Working with Entrez, we provide official nomenclature for mouse genes, our curated links between genes and sequence identifiers in GenBank, chromosomal, cytogenetic and centimorgan map positions, and MGI accession numbers. Entrez uses these curated data to develop reference sequences for mouse and notifies us of new GenBank mouse sequences submitted. Working with UniGene, we provide gene-oriented clusters of transcript sequences.
- OMIM (Online Mendelian Inheritance in Man)
- We collaborate with the editors of the Online Mendelian Inheritance in Man (OMIM) database. OMIM is a catalog of human genes and genetic disorders, containing textual information, pictures, and reference information as well as links to NCBI's data resources.
- We collaborate with the Protein Information Resource (PIR), a public resource of protein informatics based at George Washington University, which hosts the PIRSF database.
- RIKEN Genomic Sciences Center
- We collaborate with the Genome Exploration Research Group of the RIKEN Institute in Japan to provide public access to RIKEN cDNA clone information through MGI. These clones were isolated and sequenced at the RIKEN Institute as part of a series of public cDNA clone releases that constitute the Mouse Genome Encyclopedia, a genomics project centered on the production of full-length mouse cDNAs. The RIKEN Institute hosted a meeting for first-pass functional annotation of the first set of 21,076 cDNAs (Functional Annotation of Mouse, FANTOM), attended by an international assembly of mammalian biologists and bioinformaticians. Links are provided to DNA Database of Japan (DDBJ) records and to analysis results and summary annotations from the FANTOM meeting for these RIKEN cDNA clones.
- We collaborate with UniProt, (Universal Protein Resource), a central repository of protein sequence and function created by joining the information contained in SWISS-PROT, TrEMBL, and PIR.
- UCSC Genome Browser
- We provide links from MGI markers to the UCSC Genome Browser, a central repository for known genes from UniProt, RefSeq, and GenBank mRNA, created by the Genome Bioinformatics Group of UC Santa Cruz.
- MGI provides links to an increasing number of databases on the WWW and many of these sites provide links to MGI as well. We wish to acknowledge these valuable scientific information resources.
- ATCC (American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD)
- The mission of the ATCC is to acquire, authenticate, and maintain reference cultures, related biological materials, and associated data, and to distribute these to qualified scientists in government, industry, and education.
- dbEST (NCBI - Database for Expressed Sequences Tags)
- dbEST is a division of GenBank containing sequence data and other information on "single-pass" cDNA sequences, or Expressed Sequence Tags, from a number of organisms.
- dbSNP serves as a central repository for single nucleotide polymorphisms (SNPs), multiple nucleotide polymorphisms (MNPs), and short insertion/deletion polymorphisms (IN-DELs). dbSNP is the official source of SNP-based mouse genomic variation in MGI.
- DDBJ (DNA Data Bank of Japan (DDBJ), National Institute of Genetics, Mishima, Japan)
- DDBJ is the only DNA data bank based in Japan. Sequence information is received from researchers from Japan and from other countries, and from other sequence databases, including EMBL and GenBank.
- EMBL-EBI (Databases at the European Bioinformatics Institute Cambridge, United Kingdom)
- The European Bioinformatics Institute (EBI) of the European Molecular Biology Laboratory (EMBL) maintains a number of databases including the EMBL Nucleotide Sequence Database and the SWISS-PROT Protein Sequence Database.
- ENZYME (Enzyme nomenclature database, maintained at the ExPASy molecular biology server of the Geneva University Hospital and the University of Geneva, Switzerland)
- ENZYME is a repository of information relative to the nomenclature of enzymes. It is primarily based on the recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) and it describes each type of characterized enzyme for which an EC (Enzyme Commission) number has been provided.
- FlyBase is a comprehensive database for information on the genetics and molecular biology of Drosophila. It includes data from the Drosophila Genome Projects and data curated from the literature. FlyBase is a joint project with the Berkeley and European Drosophila Genome Projects.
- GenBank (Produced and maintained by NCBI, NLM, and NIH, Bethesda, Maryland)
- GenBank is an interface providing access to three databases: a subset of the National Library of Medicine's PubMed database, the NCBI protein database, and the NCBI nucleotide database. GenBank is produced and maintained by the National Center for Biotechnology Information (NCBI). NCBI is responsible for building, maintaining, and distributing GenBank, the NIH genetic sequence database that collects all known DNA sequences from scientists worldwide. The NCBI is a division of the National Library of Medicine (NLM) and is located on the campus of the National Institutes of Health(NIH)in Bethesda, Maryland.
- NCBI (National Center for Biotechnology Information)
- NCBI is a division of the National Library of Medicine (NLM) and is located on the campus of the National Institutes of Health (NIH) in Bethesda, Maryland. NCBI is responsible for building, maintaining, and distributing GenBank, the NIH genetic sequence database for all known DNA sequences from scientists worldwide; Entrez, a database of information on official nomenclature, aliases, sequence accessions, phenotypes, EC numbers, MIM numbers, Unigene clusters, orthology, and map locations; and PubMed, a bibliographic database.
- WashU (Genome Sequencing Center, Washington University School of Medicine St. Louis, MO)
- The Genome Sequencing Center two-year project will produce expressed tag sequences (ESTs) from approximately 400,000 mouse cDNAs.