New Cell Ontology Browser and Filtering by Cell TypeThe Cell Ontology Browser can be used to find and view descriptions of cell types from the Cell Ontology (CL), and to retrieve GXD expression data associated with specific cell types. Additionally, we have developed a set of high-level cell type terms that can be used to filter search results from GXD expression queries and Quick Search results.
RNA-seq experiment type querying now available in the RNA-Seq and Microarray Experiment SearchThe RNA-Seq and Microarray Experiment Search uses GXD's standardized metadata annotations to allow users to quickly and reliably find studies of interest that examine expression in adult and embryonic mice. Bulk, single-cell, and spatial RNA-Seq data provide different types of expression information. Although these differences are of interest to users, they cannot be searched at GEO, the major public functional genomics data repository, and data sets in GEO can include a mix of these methods. Therefore, GXD curators have annotated all RNA-seq samples in its RNA-Seq and Microarray Experiment index to one of these three methods (bulk, single-cell, and spatial). This allows users to specify the method of interest in the initial search or, alternatively, use the Method filter found on the search summary to refine the return. Currently there are ~7,000 RNA-seq experiments, 160,000 RNA-Seq samples, and 5,300 mutant alleles included in the search index. Expression Profile Search redesigned and enhancedThe GXD Expression Profile Search is a tool to search for genes by their expression profile. We have redesigned and enhanced this utility to make it more versatile, improve its search speed, and include access to RNA-seq data. More versatile searches: Users have always been able to define the expression profile of interest by specifying up to 10 anatomical structures and whether expression is present or absent in these structures. Now users can also specify the developmental (Theiler) stage(s) of interest. They can define search profiles based on developmental stages alone or they can use combinations of anatomical structures and developmental stages. The addition of this functionality to the profile search allowed us to retire the Differential Expression search as all its utilities (and more) are now available in the redesigned Expression Profile search. Inclusion of RNA-seq data: Previously the Expression Profile Search was limited to classical expression data. The indices and code powering profile searches have been redesigned, resulting in significantly improved search speeds. This allowed us to expand the profile search to interrogate RNA-seq data. Two different search modes: Due to the different nature of the data, the profile search has a Classical Expression Assays mode and an RNA-Seq mode. For instance, due to the need to undertake serial sectioning of the anatomical structure to confirm the absence of expression in it, when describing the results from in situ experiments, authors tend to underreport the absence of gene expression. Thus, in Classical Expression Assays mode the "not detected" search returns genes recorded as not detected, as well as genes for which expression in the specified anatomical structures and / or developmental stages is not recorded in the database (not analyzed). In contrast, absence of expression information is complete for the anatomical structures studied using RNA-seq. Accordingly, the "not detected" search in RNA-Seq mode only returns genes for which gene expression was reported to be absent (below cutoff). New autocomplete functionality: The anatomical structure field has always been supported by an autocomplete function to help users find terms included in our anatomy ontology. Now, as a further aid, the autocomplete is limited to anatomical structures and stages for which GXD has expression data, ensuring users will choose terms that will result in fruitful searches. Improved Matrix Displays: The Matrix displays in the search summary (Tissue x Gene for classical data searches, Tissue x Stage for both search modes) now display each profile structure on its own row, making it easier to interpret returned results. More visible access to Morpheus heat map visualization and analysis tools: Expression results obtained from RNA-seq searches can be forwarded Morpheus, a tool created at the Broad Institute, for display and analysis, including sorting, filtering, hierarchical clustering, nearest neighbor analysis, and visual enrichment. Although we have long supported exports to Morpheus, this functionality has been made more prominent by adding a tab, entitled "Heat Map," to search summaries. The Global Biodata Coalition (GBC) designated GXD a Global Core Biodata ResourceGlobal Core Biodata Resources are biodata resources that are of fundamental importance to the wider biological and life sciences community and the long term preservation of biological data. They:
|