About   Help   FAQ
MGI News
June 7, 2021
MGI is updated to the reference genome Build 39
MGI has integrated the latest C57BL/6J assembly, GRCm39. This includes annotations from NCBI and Ensembl, and updated coordinates for genome features, transcription start sites, and gene traps. SNP coordinates in MGI were not updated to GRCm39 for this release, but will be in a future release.
An updated Multiple Genome Viewer now includes human and rat genomes and transcripts.
The Multiple Genome Viewer (MGV) has been rewritten to be species agnostic and now includes human, rat and 4 other model organism reference genomes, as well as the C57BL/6J reference genome (both Build 38 and 39) and 18 other mouse inbred strains. The MGV allows you to explore and compare chromosomal regions between multiple selected species and mouse strains, and to view orthology and inferred paralogy relationships across genomes. This update also includes transcripts and CDSs, the ability to zoom down to the DNA basepair level, and easily download FASTA files of selected sequences and regions. The Viewer permits searches by genes and other genome features, genome coordinates, expression location, and function, phenotype, and disease terms. All searches access mouse gene annotations from MGI via API calls to MouseMine.
For a selected gene, you can download genomic DNA, composite transcripts, all transcripts, and CDS sequences for your selected homologs.
You can access the viewer from the Analysis Tools menu in the dark blue banner at the top of MGI web pages and from the Strain Comparison section on MGI Gene Detail pages.
The MGV software is freely available in GitHub
MGI now loads vertebrate homology data from the Alliance of Genome Resources
MGI now loads vertebrate orthology data based on the "stringent" set of homology class data from the Alliance of Genome Resources. The orthology inferences from different methods have been integrated using the DRSC Integrative Ortholog Prediction Tool (DIOPT). DIOPT integrates a number of existing methods including those used by the Alliance: Ensembl Compara, HGNC, Hieranoid, InParanoid, OMA, OrthoFinder, OrthoInspector, PANTHER, PhylomeDB, Roundup, TreeFam, and ZFIN. These data include human, rat, and zebrafish homologs.
April 26, 2021
Additional lacZ knock in reporter data from the IMPC
Additional lacZ knock in reporter data from the International Mouse Phenotyping Consortium (IMPC) have been imported into GXD. This set now comprises data for 2,373 targeted mutants, an increase of 1,261 mutants. It includes 81,268 images and their 92,198 annotated results, obtained from embryonic day 12.5 and postnatal adult specimens. These data are fully integrated into GXD, allowing them to be searched in context of all other gene expression data in GXD. The entire set can be viewed at J:228563.
March 9, 2020
GXD now includes RNA-Seq data
GXD has been expanded to include RNA-Seq data. In keeping with GXD's scope, these data are from experiments that examine endogenous gene expression in wild-type and mutant mice during the embryonic stages and/or postnatal life. The data sets have been imported from the EMBL-EBI's Expression Atlas. We have integrated these data with the other types of expression data in GXD and with the genetic, functional, phenotypic and disease-related information in MGI and thus made them accessible to many new search capabilities.
We chose the Expression Atlas as our data source because their team selects high quality data sets from the public repositories (ArrayExpress and NCBI's GEO) and then uses a standardized pipeline to re-analyze the data, generating consistently processed TPM values. To effectively integrate these data into GXD, we processed these files further to compute averaged quantile normalized TPM values per gene per biological replicate set. Using the Expression Atlas thresholds as a guide, the TPM values are assigned to expression bins of high, medium, low, and below cutoff.
This binning of TPM values allowed us to assign a detected/not detected value to these data, as is done for all the other expression data in GXD. We also annotated metadata for the RNA-Seq samples using the same controlled vocabularies and ontologies we use for all other expression data in GXD. These two steps enable the full integration of these data into GXD/MGI and makes them accessible via existing search tools.
New data filters on GXD's search summaries
GXD has developed new filters that take advantage of the genetic, functional, phenotypic and disease-related information in MGI. These filters have been added to the gene expression data search summaries. They enable users to use gene function, phenotype and disease ontology annotations, as well as marker type, to filter expression assay results. Filters for individual RNA-Seq data sets and TPM expression bins have also been developed. When combined with the pre-existing filters for anatomical system, developmental stage, assay type, detected/not detected and wild-type and mutant specimens, users have powerful tools to quickly and efficiently extract the expression data of interest to them.
Direct access to Morpheus heat map visualization and analysis tools
GXD users can use our search tools and filters to create RNA-Seq data sets containing the expression data of interest to them. Then, by merely clicking a button on the gene expression data search summary, these data, including the curated sample metadata, will be rendered into an expression heat map via Morpheus, a heat map visualization and analysis tool created at the Broad Institute. Morpheus offers a myriad of tools for further display and analysis, including sorting, filtering, hierarchical clustering, nearest neighbors analysis, and visual enrichment.
June 10, 2019
GXD has developed an RNA-Seq and Microarray Experiment Search to quickly and reliably find specific mouse expression studies in ArrayExpress and GEO. Based on GXD's standardized metadata annotations, users can specify the age, anatomical structure, mutant alleles, strain and sex of samples of interest as well as experiment study type and key parameters. These searches, powered by controlled vocabularies and ontologies, can be combined with free text searching of experiment titles and descriptions. Search result summaries include linkouts to ArrayExpress and GEO, providing easy access to the expression data itself. Links to the PubMed entries for corresponding publications are also included.
GXD's annotation efforts focus on experiments that examine endogenous gene expression in wild-type and mutant mice. Studies from all developmental stages, including adult, are included. In addition, studies examining expression differences within and between species is available (although the metadata for the non-mouse samples is not). As is consistent with GXD's scope, studies using transgenic mice (i.e., animals containing randomly inserted transgenes); animals that have been manipulated, either by substances, surgical procedures, or diet regimens; cultured tissues; or cell lines are not included.
The RNA-Seq and microarray metadata index developed by GXD is comprised of all experiments currently available in ArrayExpress that fit the criteria described above. This includes thousands of experiments that were originally submitted to GEO. Prior to August 2016 ArrayExpress imported all experiments stored at GEO. Future work will incorporate the more recently submitted GEO experiments into the GXD RNA-Seq and Microarray Experiment index.
May 6, 2019
For this release:
  • We retired the MouseCyc Pathways resource. We recommend the BioCyc resource for mouse pathway data.
  • We improved the way variants are represented in our non-public-facing database. This is our first step in several planned enhancements related to variants in MGI and The Alliance of Genome Resources.

Contributing Projects:
Mouse Genome Database (MGD), Gene Expression Database (GXD), Mouse Models of Human Cancer database (MMHCdb) (formerly Mouse Tumor Biology (MTB), Gene Ontology (GO)
Citing These Resources
Funding Information
Warranty Disclaimer & Copyright Notice
Send questions and comments to User Support.
last database update
09/07/2021
MGI 6.17
The Jackson Laboratory