About   Help   FAQ
News and Announcements for 2021/2020
September 20, 2021
Community Action Alert for Model Organism Databases
Please respond to NIH request for information (RFI)
NIH has issued this request for information (RFI) on the use of data resources. The deadline to respond to this RFI is October 15, 2021. The RFI is very broad and mentions Model Organism Databases only in passing (only FlyBase is mentioned by name in the RFI). Continued adequate funding for community centric model organism data resources like MGI is in danger. It is, therefore, very important that NIH receives a robust response from the biomedical community on how Model Organism data resources are used and their value to biomedical research.
Thank you for your support!
June 7, 2021
MGI is updated to the reference genome Build 39
MGI has integrated the latest C57BL/6J assembly, GRCm39. This includes annotations from NCBI and Ensembl, and updated coordinates for genome features, transcription start sites, and gene traps. SNP coordinates in MGI were not updated to GRCm39 for this release, but will be in a future release.
An updated Multiple Genome Viewer now includes human and rat genomes and transcripts.
The Multiple Genome Viewer (MGV) has been rewritten to be species agnostic and now includes human, rat and 4 other model organism reference genomes, as well as the C57BL/6J reference genome (both Build 38 and 39) and 18 other mouse inbred strains. The MGV allows you to explore and compare chromosomal regions between multiple selected species and mouse strains, and to view orthology and inferred paralogy relationships across genomes. This update also includes transcripts and CDSs, the ability to zoom down to the DNA basepair level, and easily download FASTA files of selected sequences and regions. The Viewer permits searches by genes and other genome features, genome coordinates, expression location, and function, phenotype, and disease terms. All searches access mouse gene annotations from MGI via API calls to MouseMine.
For a selected gene, you can download genomic DNA, composite transcripts, all transcripts, and CDS sequences for your selected homologs.
You can access the viewer from the Analysis Tools menu in the dark blue banner at the top of MGI web pages and from the Strain Comparison section on MGI Gene Detail pages.
The MGV software is freely available in GitHub
MGI now loads vertebrate homology data from the Alliance of Genome Resources
MGI now loads vertebrate orthology data based on the "stringent" set of homology class data from the Alliance of Genome Resources. The orthology inferences from different methods have been integrated using the DRSC Integrative Ortholog Prediction Tool (DIOPT). DIOPT integrates a number of existing methods including those used by the Alliance: Ensembl Compara, HGNC, Hieranoid, InParanoid, OMA, OrthoFinder, OrthoInspector, PANTHER, PhylomeDB, Roundup, TreeFam, and ZFIN. These data include human, rat, and zebrafish homologs.
April 26, 2021
Additional lacZ knock in reporter data from the IMPC
Additional lacZ knock in reporter data from the International Mouse Phenotyping Consortium (IMPC) have been imported into GXD. This set now comprises data for 2,373 targeted mutants, an increase of 1,261 mutants. It includes 81,268 images and their 92,198 annotated results, obtained from embryonic day 12.5 and postnatal adult specimens. These data are fully integrated into GXD, allowing them to be searched in context of all other gene expression data in GXD. The entire set can be viewed at J:228563.
March 9, 2020
GXD now includes RNA-Seq data
GXD has been expanded to include RNA-Seq data. In keeping with GXD's scope, these data are from experiments that examine endogenous gene expression in wild-type and mutant mice during the embryonic stages and/or postnatal life. The data sets have been imported from the EMBL-EBI's Expression Atlas. We have integrated these data with the other types of expression data in GXD and with the genetic, functional, phenotypic and disease-related information in MGI and thus made them accessible to many new search capabilities.
We chose the Expression Atlas as our data source because their team selects high quality data sets from the public repositories (ArrayExpress and NCBI's GEO) and then uses a standardized pipeline to re-analyze the data, generating consistently processed TPM values. To effectively integrate these data into GXD, we processed these files further to compute averaged quantile normalized TPM values per gene per biological replicate set. Using the Expression Atlas thresholds as a guide, the TPM values are assigned to expression bins of high, medium, low, and below cutoff.
This binning of TPM values allowed us to assign a detected/not detected value to these data, as is done for all the other expression data in GXD. We also annotated metadata for the RNA-Seq samples using the same controlled vocabularies and ontologies we use for all other expression data in GXD. These two steps enable the full integration of these data into GXD/MGI and makes them accessible via existing search tools.
New data filters on GXD's search summaries
GXD has developed new filters that take advantage of the genetic, functional, phenotypic and disease-related information in MGI. These filters have been added to the gene expression data search summaries. They enable users to use gene function, phenotype and disease ontology annotations, as well as marker type, to filter expression assay results. Filters for individual RNA-Seq data sets and TPM expression bins have also been developed. When combined with the pre-existing filters for anatomical system, developmental stage, assay type, detected/not detected and wild-type and mutant specimens, users have powerful tools to quickly and efficiently extract the expression data of interest to them.
Direct access to Morpheus heat map visualization and analysis tools
GXD users can use our search tools and filters to create RNA-Seq data sets containing the expression data of interest to them. Then, by merely clicking a button on the gene expression data search summary, these data, including the curated sample metadata, will be rendered into an expression heat map via Morpheus, a heat map visualization and analysis tool created at the Broad Institute. Morpheus offers a myriad of tools for further display and analysis, including sorting, filtering, hierarchical clustering, nearest neighbors analysis, and visual enrichment.

Contributing Projects:
Mouse Genome Database (MGD), Gene Expression Database (GXD), Mouse Models of Human Cancer database (MMHCdb) (formerly Mouse Tumor Biology (MTB)), Gene Ontology (GO)
Citing These Resources
Funding Information
Warranty Disclaimer, Privacy Notice, Licensing, & Copyright
Send questions and comments to User Support.
last database update
04/16/2024
MGI 6.23
The Jackson Laboratory