Domains within Mus musculus protein GSC_MOUSE (Q02591)

Homeobox protein goosecoid

Alternative representations: 1 /

Protein length256 aa
Source databaseUniProt
Identifiers GSC_MOUSE, Q02591, ENSMUSP00000021513.3, ENSMUSP00000021513
Source gene ENSMUSG00000021095

Domain architecture analysis

Display all proteins with similar:

Domain organisationProteins having all the domains as the query in the same order. Additional domains are allowed.
Domain compositionProteins with the same domain composition have at least one copy of each of the domains of the query.

This domain architecture was probably invented with the emergence of cellular organisms

Predicted functional partners

GSC_MOUSE is shown as Gsc in the network

Click and drag to pan the network, and zoom by using your mouse wheel. Click the protein nodes for additional options.

The network on the left comes from STRING, a database of known and predicted protein interactions. Displayed here is the evidence view, where different line colors represent the types of evidence for the association.

Open the STRING annotation page for Gsc

Protein GSC_MOUSE is possibly involved in these pathways, based on its similarity to the listed KEGG orthologous groups:

KEGG pathways

PathwayDescription
map04550Signaling pathways regulating pluripotency of stem cells

KEGG orthologous groups

KONameDescription
K09338HD-ZIPhomeobox-leucine zipper protein

Post-translational modifications

PTM annotation is taken from PTMcode, a resource of known and predicted functional associations between protein post-translational modifications (PTMs). There are 6 PTMs annotated in this protein:

PTMCount
SUMOylation6

To see the full details, including possible functional associations between the PTMs, please visit the PTMcode annotation page for protein Gsc.

Orthologous groups

Orthology information is taken from eggNOG, a database of orthologous groups of genes. Orthologous groups containing this protein are listed below. This protein is named 10090.ENSMUSP00000021513 in eggNOG.

OGTaxonomic classDescription
LKOG0490All organisms (root)homeobox-leucine zipper protein,LIM homeobox transcription factor 1,LIM homeobox protein 2/9
KOG0490Eukaryota (superkingdom)homeobox-leucine zipper protein,LIM homeobox transcription factor 1,LIM homeobox protein 2/9
HTPF7Metazoa (kingdom)homeobox protein goosecoid,homeobox protein goosecoid-like
93XK5Chordata (phylum)homeobox protein goosecoid,homeobox protein goosecoid-like
5R1DZSarcopterygii (superclass)homeobox protein goosecoid
8Z9I5Mammalia (class)homeobox protein goosecoid
4RA0PEuarchontoglires (superorder)homeobox protein goosecoid
AI641Rodentia (order)homeobox protein goosecoid
8DGD1Myomorpha (suborder)homeobox protein goosecoid
9FY8WVertebrata (clade)homeobox protein goosecoid,homeobox protein goosecoid-like
H6R7XBilateria (clade)homeobox protein goosecoid,homeobox protein goosecoid-like
7MDXCOpisthokonta (clade)homeobox protein goosecoid,homeobox protein goosecoid-like
CQDVFMuridae (family)homeobox protein goosecoid
AE4VDMurinae (subfamily)homeobox protein goosecoid
5PSZRMus (genus)homeobox protein goosecoid
HE8ZXMus (subgenus)homeobox protein goosecoid

The SMART diagram above represents a summary of the results shown below. Domains with scores less significant than established cutoffs are not shown in the diagram. Features are also not shown when two or more occupy the same piece of sequence; the priority for display is given by SMART > PFAM > PROSPERO repeats > Signal peptide > Transmembrane > Coiled coil > Low complexity. In either case, features not shown in the above diagram are listed in the right side table below, and the reason for their omission is shown in the 'Reason' column.

Confidently predicted domains, repeats, motifs and features:

Outlier homologues and homologues of known structure:

Features NOT shown in the diagram: