Share to: share facebook share twitter share wa share telegram print page

GC-content

Nucleotide bonds showing AT and GC pairs. Arrows point to the hydrogen bonds.

In molecular biology and genetics, GC-content (or guanine-cytosine content) is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G) or cytosine (C).[1] This measure indicates the proportion of G and C bases out of an implied four total bases, also including adenine and thymine in DNA and adenine and uracil in RNA.

GC-content may be given for a certain fragment of DNA or RNA or for an entire genome. When it refers to a fragment, it may denote the GC-content of an individual gene or section of a gene (domain), a group of genes or gene clusters, a non-coding region, or a synthetic oligonucleotide such as a primer.

Structure

Qualitatively, guanine (G) and cytosine (C) undergo a specific hydrogen bonding with each other, whereas adenine (A) bonds specifically with thymine (T) in DNA and with uracil (U) in RNA. Quantitatively, each GC base pair is held together by three hydrogen bonds, while AT and AU base pairs are held together by two hydrogen bonds. To emphasize this difference, the base pairings are often represented as "G≡C" versus "A=T" or "A=U".

DNA with low GC-content is less stable than DNA with high GC-content; however, the hydrogen bonds themselves do not have a particularly significant impact on molecular stability, which is instead caused mainly by molecular interactions of base stacking.[2] In spite of the higher thermostability conferred to a nucleic acid with high GC-content, it has been observed that at least some species of bacteria with DNA of high GC-content undergo autolysis more readily, thereby reducing the longevity of the cell per se.[3] Because of the thermostability of GC pairs, it was once presumed that high GC-content was a necessary adaptation to high temperatures, but this hypothesis was refuted in 2001.[4] Even so, it has been shown that there is a strong correlation between the optimal growth of prokaryotes at higher temperatures and the GC-content of structural RNAs such as ribosomal RNA, transfer RNA, and many other non-coding RNAs.[4][5] The AU base pairs are less stable than the GC base pairs, making high-GC-content RNA structures more resistant to the effects of high temperatures.

More recently, it has been demonstrated that the most important factor contributing to the thermal stability of double-stranded nucleic acids is actually due to the base stackings of adjacent bases rather than the number of hydrogen bonds between the bases. There is more favorable stacking energy for GC pairs than for AT or AU pairs because of the relative positions of exocyclic groups. Additionally, there is a correlation between the order in which the bases stack and the thermal stability of the molecule as a whole.[6]

Determination

Schematic karyogram of a human, showing an overview of the human genome on G banding (which includes Giemsa-staining), wherein GC rich regions are lighter and GC poor regions are darker.

GC-content is usually expressed as a percentage value, but sometimes as a ratio (called G+C ratio or GC-ratio). GC-content percentage is calculated as[7]

whereas the AT/GC ratio is calculated as[8]

.

The GC-content percentages as well as GC-ratio can be measured by several means, but one of the simplest methods is to measure the melting temperature of the DNA double helix using spectrophotometry. The absorbance of DNA at a wavelength of 260 nm increases fairly sharply when the double-stranded DNA molecule separates into two single strands when sufficiently heated.[9] The most commonly used protocol for determining GC-ratios uses flow cytometry for large numbers of samples.[10]

In an alternative manner, if the DNA or RNA molecule under investigation has been reliably sequenced, then GC-content can be accurately calculated by simple arithmetic or by using a variety of publicly available software tools, such as the free online GC calculator.

Genomic content

Within-genome variation

The GC-ratio within a genome is found to be markedly variable. These variations in GC-ratio within the genomes of more complex organisms result in a mosaic-like formation with islet regions called isochores.[11] This results in the variations in staining intensity in chromosomes.[12] GC-rich isochores typically include many protein-coding genes within them, and thus determination of GC-ratios of these specific regions contributes to mapping gene-rich regions of the genome.[13][14]

Coding sequences

Within a long region of genomic sequence, genes are often characterised by having a higher GC-content in contrast to the background GC-content for the entire genome.[15] There is evidence that the length of the coding region of a gene is directly proportional to higher G+C content.[16] This has been pointed to the fact that the stop codon has a bias towards A and T nucleotides, and, thus, the shorter the sequence the higher the AT bias.[17]

Comparison of more than 1,000 orthologous genes in mammals showed marked within-genome variations of the third-codon position GC content, with a range from less than 30% to more than 80%.[18]

Among-genome variation

GC content is found to be variable with different organisms, the process of which is envisaged to be contributed to by variation in selection, mutational bias, and biased recombination-associated DNA repair.[19]

The average GC-content in human genomes ranges from 35% to 60% across 100-Kb fragments, with a mean of 41%.[20] The GC-content of Yeast (Saccharomyces cerevisiae) is 38%,[21] and that of another common model organism, thale cress (Arabidopsis thaliana), is 36%.[22] Because of the nature of the genetic code, it is virtually impossible for an organism to have a genome with a GC-content approaching either 0% or 100%. However, a species with an extremely low GC-content is Plasmodium falciparum (GC% = ~20%),[23] and it is usually common to refer to such examples as being AT-rich instead of GC-poor.[24]

Several mammalian species (e.g., shrew, microbat, tenrec, rabbit) have independently undergone a marked increase in the GC-content of their genes. These GC-content changes are correlated with species life-history traits (e.g., body mass or longevity) and genome size,[18] and might be linked to a molecular phenomenon called the GC-biased gene conversion.[25]

Applications

Molecular biology

In polymerase chain reaction (PCR) experiments, the GC-content of short oligonucleotides known as primers is often used to predict their annealing temperature to the template DNA. A higher GC-content level indicates a relatively higher melting temperature.

Many sequencing technologies, such as Illumina sequencing, have trouble reading high-GC-content sequences. Bird genomes are known to have many such parts, causing the problem of "missing genes" expected to be present from evolution and phenotype but never sequenced — until improved methods were used.[26]

Systematics

The species problem in non-eukaryotic taxonomy has led to various suggestions in classifying bacteria, and the ad hoc committee on reconciliation of approaches to bacterial systematics of 1987 has recommended use of GC-ratios in higher-level hierarchical classification.[27] For example, the Actinomycetota are characterised as "high GC-content bacteria".[28] In Streptomyces coelicolor A3(2), GC-content is 72%.[29] With the use of more reliable, modern methods of molecular systematics, the GC-content definition of Actinomycetota has been abolished and low-GC bacteria of this clade have been found.[30]

Software tools

GCSpeciesSorter[31] and TopSort[32] are software tools for classifying species based on their GC-contents.

See also

References

  1. ^ Definition of GC – content on CancerWeb of Newcastle University, UK
  2. ^ Yakovchuk P, Protozanova E, Frank-Kamenetskii MD (2006). "Base-stacking and base-pairing contributions into thermal stability of the DNA double helix". Nucleic Acids Res. 34 (2): 564–74. doi:10.1093/nar/gkj454. PMC 1360284. PMID 16449200.
  3. ^ Levin RE, Van Sickle C (1976). "Autolysis of high-GC isolates of Pseudomonas putrefaciens". Antonie van Leeuwenhoek. 42 (1–2): 145–55. doi:10.1007/BF00399459. PMID 7999. S2CID 9960732.
  4. ^ a b Hurst LD, Merchant AR (March 2001). "High guanine-cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes". Proc. Biol. Sci. 268 (1466): 493–7. doi:10.1098/rspb.2000.1397. PMC 1088632. PMID 11296861.
  5. ^ Galtier, N.; Lobry, J.R. (1997). "Relationships between genomic G+C content, RNA secondary structures, and optimal growth temperature in Prokaryotes". Journal of Molecular Evolution. 44 (6): 632–636. Bibcode:1997JMolE..44..632G. doi:10.1007/PL00006186. PMID 9169555. S2CID 19054315.
  6. ^ Yakovchuk, Peter; Protozanova, Ekaterina; Frank-Kamenetskii, Maxim D. (2006). "Base-stacking and base-pairing contributions into thermal stability of the DNA double helix". Nucleic Acids Research. 34 (2): 564–574. doi:10.1093/nar/gkj454. ISSN 0305-1048. PMC 1360284. PMID 16449200.
  7. ^ Madigan, MT. and Martinko JM. (2003). Brock biology of microorganisms (10th ed.). Pearson-Prentice Hall. ISBN 978-84-205-3679-8.
  8. ^ "Definition of GC-ratio on Northwestern University, IL, USA". Archived from the original on 20 June 2010. Retrieved 11 June 2007.
  9. ^ Wilhelm J, Pingoud A, Hahn M (May 2003). "Real-time PCR-based method for the estimation of genome sizes". Nucleic Acids Res. 31 (10): e56. doi:10.1093/nar/gng056. PMC 156059. PMID 12736322.
  10. ^ Vinogradov AE (May 1994). "Measurement by flow cytometry of genomic AT/GC ratio and genome size". Cytometry. 16 (1): 34–40. doi:10.1002/cyto.990160106. PMID 7518377.
  11. ^ Bernardi G (January 2000). "Isochores and the evolutionary genomics of vertebrates". Gene. 241 (1): 3–17. doi:10.1016/S0378-1119(99)00485-0. PMID 10607893.
  12. ^ Furey TS, Haussler D (May 2003). "Integration of the cytogenetic map with the draft human genome sequence". Hum. Mol. Genet. 12 (9): 1037–44. doi:10.1093/hmg/ddg113. PMID 12700172.
  13. ^ Sumner AT, de la Torre J, Stuppia L (August 1993). "The distribution of genes on chromosomes: a cytological approach". J. Mol. Evol. 37 (2): 117–22. Bibcode:1993JMolE..37..117S. doi:10.1007/BF02407346. PMID 8411200. S2CID 24677431.
  14. ^ Aïssani B, Bernardi G (October 1991). "CpG islands, genes and isochores in the genomes of vertebrates". Gene. 106 (2): 185–95. doi:10.1016/0378-1119(91)90198-K. PMID 1937049.
  15. ^ Romiguier J, Roux C (2017). "Analytical Biases Associated with GC-Content in Molecular Evolution". Front Genet. 8: 16. doi:10.3389/fgene.2017.00016. PMC 5309256. PMID 28261263.
  16. ^ Pozzoli U, Menozzi G, Fumagalli M, et al. (2008). "Both selective and neutral processes drive GC content evolution in the human genome". BMC Evol. Biol. 8 (1): 99. Bibcode:2008BMCEE...8...99P. doi:10.1186/1471-2148-8-99. PMC 2292697. PMID 18371205.
  17. ^ Wuitschick JD, Karrer KM (1999). "Analysis of genomic G + C content, codon usage, initiator codon context and translation termination sites in Tetrahymena thermophila". J. Eukaryot. Microbiol. 46 (3): 239–47. doi:10.1111/j.1550-7408.1999.tb05120.x. PMID 10377985. S2CID 28836138.
  18. ^ a b Romiguier, Jonathan; Ranwez, Vincent; Douzery, Emmanuel J. P.; Galtier, Nicolas (1 August 2010). "Contrasting GC-content dynamics across 33 mammalian genomes: Relationship with life-history traits and chromosome sizes". Genome Research. 20 (8): 1001–1009. doi:10.1101/gr.104372.109. ISSN 1088-9051. PMC 2909565. PMID 20530252.
  19. ^ Birdsell JA (1 July 2002). "Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution". Mol. Biol. Evol. 19 (7): 1181–97. CiteSeerX 10.1.1.337.1535. doi:10.1093/oxfordjournals.molbev.a004176. PMID 12082137.
  20. ^ International Human Genome Sequencing Consortium (February 2001). "Initial sequencing and analysis of the human genome". Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. hdl:2027.42/62798. PMID 11237011. (page 876)
  21. ^ Whole genome data of Saccharomyces cerevisiae on NCBI
  22. ^ Whole genome data of Arabidopsis thaliana on NCBI
  23. ^ Whole genome data of Plasmodium falciparum on NCBI
  24. ^ Musto H, Cacciò S, Rodríguez-Maseda H, Bernardi G (1997). "Compositional constraints in the extremely GC-poor genome of Plasmodium falciparum" (PDF). Mem. Inst. Oswaldo Cruz. 92 (6): 835–41. doi:10.1590/S0074-02761997000600020. PMID 9566216.
  25. ^ Duret L, Galtier N (2009). "Biased gene conversion and the evolution of mammalian genomic landscapes". Annu Rev Genom Hum Genet. 10: 285–311. doi:10.1146/annurev-genom-082908-150001. PMID 19630562. S2CID 9126286.
  26. ^ Huttener R, Thorrez L, Veld TI, et al. (2021). "Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution". BMC Ecol Evol. 21 (176): 176. doi:10.1186/s12862-021-01905-7. PMC 8449477. PMID 34537008.
  27. ^ Wayne LG; et al. (1987). "Report of the ad hoc committee on reconciliation of approaches to bacterial systematic". International Journal of Systematic Bacteriology. 37 (4): 463–4. doi:10.1099/00207713-37-4-463.
  28. ^ Taxonomy browser on NCBI
  29. ^ Whole genome data of Streptomyces coelicolor A3(2) on NCBI
  30. ^ Ghai R, McMahon KD, Rodriguez-Valera F (2012). "Breaking a paradigm: Cosmopolitan and abundant freshwater actinobacteria are low GC". Environmental Microbiology Reports. 4 (1): 29–35. Bibcode:2012EnvMR...4...29G. doi:10.1111/j.1758-2229.2011.00274.x. PMID 23757226.
  31. ^ Karimi K, Wuitchik D, Oldach M, Vize P (2018). "Distinguishing Species Using GC Contents in Mixed DNA or RNA Sequences". Evol Bioinform Online. 14 (January 1, 2018): 1176934318788866. doi:10.1177/1176934318788866. PMC 6052495. PMID 30038485.
  32. ^ Lehnert E, Mouchka M, Burriesci M, Gallo N, Schwarz J, Pringle J (2014). "Extensive differences in gene expression between symbiotic and aposymbiotic cnidarians". G3 (Bethesda). 4 (2): 277–95. doi:10.1534/g3.113.009084. PMC 3931562. PMID 24368779.
  1. Table with GC-content of all sequenced prokaryotes
  2. Taxonomic browser of bacteria based on GC ratio on NCBI website.
  3. GC ratio in diverse species.

Read other articles:

The topic of this article may not meet Wikipedia's notability guideline for stand-alone lists. Please help to demonstrate the notability of the topic by citing reliable secondary sources that are independent of the topic and provide significant coverage of it beyond a mere trivial mention. If notability cannot be shown, the article is likely to be merged, redirected, or deleted.Find sources: List of Monk characters – news · newspapers · books · scholar · …

Jam lilin adalah sebuah metode dasar untuk memberitahu waktu dan salah satu metode yang paling populer untuk memberitahu waktu dengan menggunakan lilin yang dibakar. Jam tersebut dipakai menghitung waktu sebelum jam mekanik diciptakan. Untuk memberitahu waktu, lilin tersebut cukup menyalakan pada piringan logam yang diberi penanda. Setiap lilin yang terkikis oleh panas api yang melewati penanda akan mengindikasikan waktu. Jam lilin juga dimungkinkan untuk membuat peringatan dengan menancapk…

مرت ثقافة الاتحاد السوفيتي بعدة مراحل خلال وجود الاتحاد السوفيتي الذي دام 69 عامًا. وقد ساهم فيه أشخاص من جنسيات مختلفة من كل جمهورية من جمهوريات الاتحاد الخمس عشرة، على الرغم من أن غالبية المتأثرين كانوا من الروس. دعمت الدولة السوفيتية المؤسسات الثقافية، لكنها مارست أيضًا ر…

Пляж в городе Бре-Дюн — самой северной точке Франции Ниже приведён список крайних точек Франции. Содержание 1 Континентальная территория страны 2 Европейская территория страны 3 Территория вместе с заморскими департаментами 4 Территория всех французских владений Конт…

Cinema of Pakistan List of Pakistani films Pakistani Animation Highest Grossing Pre 1950 1950s 1950 1951 1952 1953 19541955 1956 1957 1958 1959 1960s 1960 1961 1962 1963 19641965 1966 1967 1968 1969 1970s 1970 1971 1972 1973 19741975 1976 1977 1978 1979 1980s 1980 1981 1982 1983 19841985 1986 1987 1988 1989 1990s 1990 1991 1992 1993 19941995 1996 1997 1998 1999 2000s 2000 2001 2002 2003 20042005 2006 2007 2008 2009 2010s 2010 2011 2012 2013 20142015 2016 2017 2018 2019 2020s 2020 2021 2022 2023 …

This is a list of schools in the London Borough of Southwark, England. Scholars of Snowsfield School, Bermondsey, 1894 State-funded schools Primary schools Albion Primary School Alfred Salter Primary School Angel Oak Academy Ark Globe Academy The Belham Primary School Bellenden Primary School Bessemer Grange Primary School Boutcher CE Primary School Brunswick Park Primary School Camelot Primary School The Cathedral School of St Saviour and St Mary Overie Charles Dickens Primary School Charlotte …

American actor (1976–2012) Sage StalloneStallone at the premiere of Rocky Balboa in 2006BornSage Moonblood Stallone(1976-05-05)May 5, 1976Los Angeles, California, U.S.DiedJuly 13, 2012(2012-07-13) (aged 36)Studio City, California, U.S.Resting placeWestwood Village Memorial Park Cemetery, Westwood Village, California, U.S.EducationMontclair Preparatory SchoolAlma materNorth Carolina School of the ArtsOccupationsActorfilmmakerYears active1990–2010Spouse Starlin Wright ​…

Jackson RathboneLahirMonroe Jackson Rathbone VPekerjaanaktor, musisiTahun aktif2005-sekarang Monroe Jackson Rathbone V[1][2] (lahir 21 Desember 1984)[3] adalah aktor dan penyanyi asal Amerika Serikat. Ia berperan sebagai Jasper Hale dalam film yang diadaptasi dari novel karya Stephenie Meyer, Twilight (2008). Ia kembali akan berperan sebagai Jasper Hale dalam film New Moon (2009) dan Eclipse (2010). Ia juga berperan sebagai Sokka dalam film The Last Airbender (2010).…

Nikko NatividadNatividad pada tahun 2017LahirNicholai Seagal Natividad13 Februari 1993 (umur 31)Malolos, Bulacan, FilipinaPekerjaanModel, pemeran, penariTahun aktif2014–sekarangAgenStar Magic(2014–2022) Viva Artists Agency (2022–present)Dikenal atasNikko, #NikkoTinggi170 m (557 ft 9 in)Suami/istriCielo Mae Eusebio ​(m. 2021)​Anak1 Nicholai Seagal Natividad (lahir 13 Februari 1993) adalah model, pemeran, dan penari Filipina, yang menjadi…

Mauern Lambang kebesaranLetak Mauern di Freising NegaraJermanNegara bagianBayernWilayahOberbayernKreisFreisingMunicipal assoc.Mauern Pemerintahan • MayorAlfons Kipfelsberger (CSU)Luas • Total24,14 km2 (932 sq mi)Ketinggian435 m (1,427 ft)Populasi (2013-12-31)[1] • Total2.960 • Kepadatan1,2/km2 (3,2/sq mi)Zona waktuWET/WMPET (UTC+1/+2)Kode pos85419Kode area telepon08764Pelat kendaraanFSSitus webwww.mauern-o…

You can help expand this article with text translated from the corresponding article in Portuguese. (May 2022) Click [show] for important translation instructions. View a machine-translated version of the Portuguese article. Machine translation, like DeepL or Google Translate, is a useful starting point for translations, but translators must revise errors as necessary and confirm that the translation is accurate, rather than simply copy-pasting machine-translated text into the English Wikip…

ХристианствоБиблия Ветхий Завет Новый Завет Евангелие Десять заповедей Нагорная проповедь Апокрифы Бог, Троица Бог Отец Иисус Христос Святой Дух История христианства Апостолы Хронология христианства Раннее христианство Гностическое христианство Вселенские соборы Ни…

Penyuntingan Artikel oleh pengguna baru atau anonim untuk saat ini tidak diizinkan hingga 8 November 2024.Lihat kebijakan pelindungan dan log pelindungan untuk informasi selengkapnya. Jika Anda tidak dapat menyunting Artikel ini dan Anda ingin melakukannya, Anda dapat memohon permintaan penyuntingan, diskusikan perubahan yang ingin dilakukan di halaman pembicaraan, memohon untuk melepaskan pelindungan, masuk, atau buatlah sebuah akun. Artikel ini memiliki beberapa masalah. Tolong bantu memperbai…

† Египтопитек Реконструкция внешнего вида египтопитека Научная классификация Домен:ЭукариотыЦарство:ЖивотныеПодцарство:ЭуметазоиБез ранга:Двусторонне-симметричныеБез ранга:ВторичноротыеТип:ХордовыеПодтип:ПозвоночныеИнфратип:ЧелюстноротыеНадкласс:Четвероноги…

Patung Kisshoten atau Kichijoten (Dewi Laksmi menurut kepercayaan Shinto) di Joururi-ji, Jepang, sedang membawa Cintamani di tangan kirinya. Patung khiimori atau lung ta, kuda angin menurut kepercayaan Asia Timur, digambarkan mengangkut Cintamani. Cintamani[a] (Dewanagari: चिंतामणि; ,IAST: Cintāmaṇi, चिंतामणि), adalah jenis permata atau mestika yang dapat mengabulkan harapan dalam kepercayaan Hindu dan Buddha, yang dapat dipadankan dengan batu…

Cet article est une ébauche concernant le Concours Eurovision de la chanson et l’Azerbaïdjan. Vous pouvez partager vos connaissances en l’améliorant (comment ?) ; pour plus d’indications, visitez le projet Eurovision. Azerbaïdjanau Concours Eurovision 2019 Données clés Pays  Azerbaïdjan Chanson Truth Interprète Chingiz Langue Anglais Sélection nationale Radiodiffuseur İTV Type de sélection Sélection interne Date 8 mars 2019 Concours Eurovision de la chanson 2019 …

2007 American filmSuper Sleuth Christmas MovieDVD coverDirected by Don MacKinnon David Hartman Written by Nicole Dubuc Jeff Kline Brian Hohlfeld Produced byDorothy McKimStarring Chloë Grace Moretz (US) Kimberlea Berg (UK) Dee Bradley Baker Jim Cummings Travis Oates Peter Cullen Ken Sansom Kath Soucie Max Burkholder Oliver Dillon Jeffrey Tambor Mikaila Baumel Tara Strong Cinematography Jeremy Lasky Sharon Calahan[citation needed] Edited byJhoanne ReyesMusic byAndy Sturmer (score/songs)Pr…

Dua orang imam Katolik sedang merayakan misa Imamat adalah jabatan pelayan kerohanian yang dikuasakan (ditahbiskan) dengan Sakramen Imamat Kudus Gereja Katolik. Secara teknis, para uskup pun adalah imam, tetapi istilah imam dipahami umat awam sebagai sebutan khusus bagi para presbiter dan pastor (imam paroki). Doktrin Gereja Katolik pun adakalanya menyebut seluruh umat (awam) terbaptis sebagai imamat umum,[1][2] yang dapat saja dirancukan dengan imamat pelayanan rohaniwan tertahb…

此條目需要补充更多来源。 (2021年7月4日)请协助補充多方面可靠来源以改善这篇条目,无法查证的内容可能會因為异议提出而被移除。致使用者:请搜索一下条目的标题(来源搜索:美国众议院 — 网页、新闻、书籍、学术、图像),以检查网络上是否存在该主题的更多可靠来源(判定指引)。 美國眾議院 United States House of Representatives第118届美国国会众议院徽章 众议院旗帜…

2020年夏季奥林匹克运动会波兰代表團波兰国旗IOC編碼POLNOC波蘭奧林匹克委員會網站olimpijski.pl(英文)(波兰文)2020年夏季奥林匹克运动会(東京)2021年7月23日至8月8日(受2019冠状病毒病疫情影响推迟,但仍保留原定名称)運動員206參賽項目24个大项旗手开幕式:帕维尔·科热尼奥夫斯基(游泳)和马娅·沃什乔夫斯卡(自行车)[1]闭幕式:卡罗利娜·纳亚(皮划艇)[2…

Kembali kehalaman sebelumnya