The domain within your query sequence starts at position 1592 and ends at position 1706; the E-value for the C4 domain shown below is 7.65e-71.

AVAIAVHSQDTSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFI
ECNGGRGTCHYFANKYSFWLTTIPEQNFQSTPSADTLKAGLIRTHISRCQVCMKN

C4

C-terminal tandem repeated domain in type 4 procollagens
C4
SMART accession number:SM00111
Description: Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.
Interpro abstract (IPR001442):

This duplicated domain is present at the C-terminal of type 4 collagen, the major structural component of glomerular basement membranes (GMB) forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.

Collagens are major components of the extracellular matrices of all metazoan life and play crucial roles in developmental processes and tissue homeostasis. Collagens are composed of three polypeptide chains (alpha chains) that fold together to form the characteristic triple helical collagenous domain. Some types of triple helical protomers contain genetically identical alpha chains forming homotrimers, whereas others contain two or three different alpha chains forming heterotrimers. The sequences required to form a collagenous domain are Gly-X-Y repeats in which the X and Y positions are frequently proline and hydroxyproline. Glycine is required every third residue as it is the only amino acid small enough to pack into the central core of the triple helix. The triple helix-forming parts are surrounded by non-collagenous (NC) domains of variable sequence, size, and shape. Even if the triple helical parts represent the most striking feature of collagens, tissue specificity as well as defined binding of non-collagens seem to be encoded in the NC domains. The terminal NC domains are excised, modified, or incorporated directly into the final suprastructure, depending on protomer type and function [ (PUBMED:12539240) (PUBMED:1639194) ].

Type IV collagen is one of the major constituents of basement membranes, a specialised form of extracellular matrix underlying epithelia that compartmentalises tissues and provides molecular signals for influencing cell behaviour. Each type IV chain contains a long triple-helical collagenous domain flanked by a short 7S domain of 25 residues and a globular non-collagenous NC1 domain of ~230 residues at the N and C terminus, respectively. In protomer assembly, the NC1 domains (monomers) of three chains interact, forming an NC1 trimer, to select and register chains for triple helix formation. In network assembly, the NC1 trimers of two protomers interact, forming a NC1 hexamer structure, to select and connect protomers [ (PUBMED:12011424) (PUBMED:11970952) (PUBMED:15299013) ].

The collagen IV NC1 domain contains 12 cysteines, and all of them are involved in disulphide bonds. It folds into a tertiary structure with predominantly beta-strands. The collagen IV NC1 domain is composed of two similarly folded subdomains stabilised by 3 intrachain dissulphide bonds involving the following pairs: C1-C6, C2-C5, and C3-C4. Each subdomain represents a compact disulphide-stabilised triangular structure, from which a finger-like hairpin loop projects into an incompletely formed six-stranded beta-sheet of an adjacent subdomain of the same or of an adjacent chain clamping the subdomains tightly together [ (PUBMED:12011424) (PUBMED:11970952) (PUBMED:15299013) ].

GO component:collagen trimer (GO:0005581)
GO function:extracellular matrix structural constituent (GO:0005201)
Family alignment:
View or

There are 5781 C4 domains in 2900 proteins in SMART's nrdb database.

Click on the following links for more information.