Using MouseBLAST - Sequence Databases
This help document describes the mouse, rat, rodent, or human sequence databases you can select from the MouseBLAST query form when searching for sequences that may be associated with mouse genes.
Back to
Using MouseBLAST - Overview
Nucleotide Sequence Databases
- GenBank Mouse - Subset of GenBank containing mouse sequences from rodent, high-throughput cDNAs (HTC), and patent divisions. Mouse sequences are parsed from these divisions by examining the phylogeny lines for entries containing "mus." Does not include ESTs, genome survey sequences (GSS), and high-throughput genomic sequences (HTGS).
- GenBank Mouse associated with genes in MGI - Subset of GenBank containing mouse sequences from all divisions (including EST) associated with MGI mouse genes. Mouse sequences are parsed from these divisions by examining the phylogeny lines for entries containing "mus."
- GenBank GSS Mouse - Subset of GenBank containing mouse genome survey sequences (GSS). All mouse sequences are parsed from the GSS division by examining the phylogeny lines for those entries containing "mus."
- GenBank HTG Mouse - Subset of GenBank containing mouse high-throughput genomic sequences (HTG). All mouse sequences are parsed from the GSS division by examining the phylogeny lines for those entries containing "mus."
- GenBank Human - Subset of GenBank containing human sequences from primate, high-throughput cDNAs (HTC), and patent divisions. Human sequences are parsed from these divisions by examining the phylogeny lines for entries containing "sapiens." Does not include ESTs, genome survey sequences (GSS), and high-throughput genomic sequences (HTGS).
- GenBank Rat - Subset of GenBank containing rat sequences from rodent, high-throughput cDNAs (HTC), and patent divisions. Rat sequences are parsed from these divisions by examining the phylogeny lines for entries containing "rattus." Does not include ESTs, genome survey sequences (GSS), and high-throughput genomic sequences (HTGS).
- GenBank Rodent - Subset of GenBank containing rodent sequences from rodent, high-throughput cDNAs (HTC), and patent divisions. Rodent sequences are parsed from these divisions by examining the phylogeny lines for entries containing "rodentia." Does not include ESTs, genome survey sequences (GSS), and high-throughput genomic sequences (HTGS).
- RefSeq Mouse Transcripts - All NCBI RefSeq Mouse transcripts (Mouse RefSeqs of type NM_, NR_, XM_, and XR_).
- NCBI Mouse Genome Assembly Build 37 - NCBI Build 37.1 represents over 90% of the mouse genome in finished form. Click to see the data or the statistics.
- NCBI Mouse Genome Assembly Build 36 - The Build 36 assembly from the Mouse Genome Sequencing Consortium (MGSC) is composed largely of finished sequence and includes approximately 2.6 Gb of sequences on chromosomes 1-19, X, Y, M (mitochondrial DNA) and Un (unmapped clone contigs). See NCBI's Assembling Genomic Sequences for complete details.
- NCBI Mouse Genome Assembly Build 34 - NCBI Mouse Build 34 represents a fourth generation composite assembly. In this build, chromosomes 1,3,5,6,7,8,9,10, and 12-19 were automatically assembled. See NCBI's Assembling Genomic Sequences for complete details.
- DFCI Mouse Gene Index - Tentative consensus transcript sequences from DFCI's Mouse Gene Index (MGI).
- DFCI Human Gene Index - Tentative consensus transcript sequences from DFCI's Human Gene Index (HGI).
- DFCI Rat Gene Index - Tentative consensus transcript sequences from DFCI's Rat Gene Index (RGI).
- RIKEN FANTOM cDNAs - Mouse cDNAs from the RIKEN FANTOM project.
Back to top
Protein Sequence Databases
- UniProt - All protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI).
- UniProt Mouse - All mouse protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Mouse sequences are defined as those entries containing "mus" in the phylogeny lines or "mouse" in the species lines.
- UniProt Human - All human protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Human sequences are defined as those entries containing "sapiens" in the phylogeny lines or "human" in the species lines.
- UniProt Rat - All rat protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Rat sequences are defined as those entries containing "rattus" in the phylogeny lines or "rat" in the species lines.
- UniProt Rodent - All rodent protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Rodent sequences are parsed from these sequences as entries containing "rodentia" in the phylogeny lines or "mus" or "rattus" in the species lines.
Back to top