In molecular biology, a GC box, also known as a GSG box,[1] is a distinct pattern of nucleotides found in the promoter region of some eukaryotic genes. The GC box is upstream of the TATA box, and approximately 110 bases upstream from the transcription initiation site. It has a consensus sequence GGGCGG which is position-dependent and orientation-independent. The GC elements are bound by transcription factors and have similar functions to enhancers.[2] Some known GC box-binding proteins include Sp1, Krox/Egr, Wilms' tumor, MIGI, and CREA.[1]
The GC box is commonly the binding site for zinc finger proteins. An alpha helix section of the protein corresponds with a major groove in the DNA. Zinc-fingers bind to triplet base pair sequences, with residue 21 binding to the first base pair, residue 18 binding to the second base pair, and residue 15 binding to the third base pair. The triplet base pairs can either be a GGG or a GCG. If residue 18 is a histidine, it will bind to a G, and if residue 18 is a glutamate, it will bind to a C. GC box-binding zinc fingers have between 2 and 4 fingers, making them interact with base pair sequences that are 6 to 8 base pairs in length.[1]
References
- 1 2 3 Lundin, M.; Nehlin, J. O.; Ronne, H. (1994-03-01). "Importance of a flanking AT-rich region in target site recognition by the GC box-binding zinc finger protein MIG1". Molecular and Cellular Biology. 14 (3): 1979โ1985. doi:10.1128/MCB.14.3.1979. ISSN 0270-7306. PMC 358557. PMID 8114729.
- โ Klug WS, Cummings MR, Spencer CA, Palladina MA (2009). Concepts of Genetics (Ninth ed.). San Francisco: Pearson Benjamin Cummings. pp. 463โ464. ISBN 978-0-321-54098-0.