Share to: share facebook share twitter share wa share telegram print page

HH-suite

HH-suite
Developer(s)Johannes Söding, Michael Remmert, Andreas Biegert, Andreas Hauser, Markus Meier, Martin Steinegger
Stable release
3.3.0 / 25 August 2020 (2020-08-25)
Repository
Written inC++
Operating systemUnix-like; Debian package available[1]
Available inEnglish
TypeBioinformatics tool
LicenseGPL v3
Websitehttps://github.com/soedinglab/hh-suite

The HH-suite is an open-source software package for sensitive protein sequence searching. It contains programs that can search for similar protein sequences in protein sequence databases. Sequence searches are a standard tool in modern biology with which the function of unknown proteins can be inferred from the functions of proteins with similar sequences. HHsearch and HHblits are two main programs in the package and the entry point to its search function, the latter being a faster iteration.[2][3] HHpred is an online server for protein structure prediction that uses homology information from HH-suite.[4]

The HH-suite searches for sequences using hidden Markov models (HMMs). The name comes from the fact that it performs HMM-HMM alignments. Among the most popular methods for protein sequence matching, the programs have been cited more than 5000 times total according to Google Scholar.[5]

Background

Proteins are central players in all of life's processes. Understanding them is central to understanding molecular processes in cells. This is particularly important in order to understand the origin of diseases. But for a large fraction of the approximately 20 000 human proteins the structures and functions remain unknown. Many proteins have been investigated in model organisms such as many bacteria, baker's yeast, fruit flies, zebra fish or mice, for which experiments can be often done more easily than with human cells. To predict the function, structure, or other properties of a protein for which only its sequence of amino acids is known, the protein sequence is compared to the sequences of other proteins in public databases. If a protein with sufficiently similar sequence is found, the two proteins are likely to be evolutionarily related ("homologous"). In that case, they are likely to share similar structures and functions. Therefore, if a protein with a sufficiently similar sequence and with known functions and/or structure can be found by the sequence search, the unknown protein's functions, structure, and domain composition can be predicted. Such predictions greatly facilitate the determination of the function or structure by targeted validation experiments.

Sequence searches are frequently performed by biologists to infer the function of an unknown protein from its sequence. For this purpose, the protein's sequence is compared to the sequences of other proteins in public databases and its function is deduced from those of the most similar sequences. Often, no sequences with annotated functions can be found in such a search. In this case, more sensitive methods are required to identify more remotely related proteins or protein families. From these relationships, hypotheses about the protein's functions, structure, and domain composition can be inferred. HHsearch performs searches with a protein sequence through databases. The HHpred server and the HH-suite software package offer many popular, regularly updated databases, such as the Protein Data Bank, as well as the InterPro, Pfam, COG, and SCOP databases.

Algorithm

Iterative sequence search scheme of HHblits

Modern sensitive methods for protein search utilize sequence profiles. They may be used to compare a sequence to a profile, or in more advanced cases such as HH-suite, to match among profiles.[2][6][7][8] Profiles and alignments are themselves derived from matches, using for example PSI-BLAST or HHblits. A position-specific scoring matrix (PSSM) profile contains for each position in the query sequence the similarity score for the 20 amino acids. The profiles are derived from multiple sequence alignments (MSAs), in which related proteins are written together (aligned), such that the frequencies of amino acids in each position can be interpreted as probabilities for amino acids in new related proteins, and be used to derive the "similarity scores". Because profiles contain much more information than a single sequence (e.g. the position-specific degree of conservation), profile-profile comparison methods are much more powerful than sequence-sequence comparison methods like BLAST or profile-sequence comparison methods like PSI-BLAST.[6]

HHpred and HHsearch represent query and database proteins by profile hidden Markov models (HMMs), an extension of PSSM sequence profiles that also records position-specific amino acid insertion and deletion frequencies. HHsearch searches a database of HMMs with a query HMM. Before starting the search through the actual database of HMMs, HHsearch/HHpred builds a multiple sequence alignment of sequences related to the query sequence/MSA using the HHblits program. From this alignment, a profile HMM is calculated. The databases contain HMMs that are precalculated in the same fashion using PSI-BLAST. The output of HHpred and HHsearch is a ranked list of database matches (including E-values and probabilities for a true relationship) and the pairwise query-database sequence alignments.

HHblits, a part of the HH-suite since 2001, builds high-quality multiple sequence alignments (MSAs) starting from a single query sequence or a MSA. As in PSI-BLAST, it works iteratively, repeatedly constructing new query profiles by adding the results found in the previous round. It matches against a pre-built HMM databases derived from protein sequence databases, each representing a "cluster" of related proteins. In the case of HHblits, such matches are done on the level of HMM-HMM profiles, which grants additional sensitivity. Its prefiltering reduces the tens of millions HMMs to match against to a few thousands of them, thus speeding up the slow HMM-HMM comparison process.[3]

The HH-suite comes with a number of pre-built profile HMMs that can be searched using HHblits and HHsearch, among them a clustered version of the UniProt database, of the Protein Data Bank of proteins with known structures, of Pfam protein family alignments, of SCOP structural protein domains, and many more.[9]

Applications

Applications of HHpred and HHsearch include protein structure prediction, complex structure prediction, function prediction, domain prediction, domain boundary prediction, and evolutionary classification of proteins.[10]

HHsearch is often used for homology modeling, that is, to build a model of the structure of a query protein for which only the sequence is known: For that purpose, a database of proteins with known structures such as the protein data bank is searched for "template" proteins similar to the query protein. If such a template protein is found, the structure of the protein of interest can be predicted based on a pairwise sequence alignment of the query with the template protein sequence. For example, a search through the PDB database of proteins with solved 3D structure takes a few minutes. If a significant match with a protein of known structure (a "template") is found in the PDB database, HHpred allows the user to build a homology model using the MODELLER software, starting from the pairwise query-template alignment.

HHpred servers have been ranked among the best servers during CASP7, 8, and 9, for blind protein structure prediction experiments. In CASP9, HHpredA, B, and C were ranked 1st, 2nd, and 3rd out of 81 participating automatic structure prediction servers in template-based modeling[11] and 6th, 7th, 8th on all 147 targets, while being much faster than the best 20 servers.[12] In CASP8, HHpred was ranked 7th on all targets and 2nd on the subset of single domain proteins, while still being more than 50 times faster than the top-ranked servers.[4]

Contents

In addition to HHsearch and HHblits, the HH-suite contains programs and perl scripts for format conversion, filtering of MSAs, generation of profile HMMs, the addition of secondary structure predictions to MSAs, the extraction of alignments from program output, and the generation of customized databases.

hhblits (Iteratively) search an HHblits database with a query sequence or MSA
hhsearch Search an HHsearch database of HMMs with a query MSA or HMM
hhmake Build an HMM from an input MSA
hhfilter Filter an MSA by maximum sequence identity, coverage, and other criteria
hhalign Calculate pairwise alignments, dot plots etc. for two HMMs/MSAs
reformat.pl Reformat one or many MSAs
addss.pl Add Psipred predicted secondary structure to an MSA or HHM file
hhmakemodel.pl Generate MSAs or coarse 3D models from HHsearch or HHblits results
hhblitsdb.pl Build HHblits database with prefiltering, packed MSA/HMM, and index files
multithread.pl Run a command for many files in parallel using multiple threads
splitfasta.pl Split a multiple-sequence FASTA file into multiple single-sequence files
renumberpdb.pl Generate PDB file with indices renumbered to match input sequence indices

The HMM-HMM alignment algorithm of HHblits and HHsearch was significantly accelerated using vector instructions in version 3 of the HH-suite.[13]

See also

References

  1. ^ Debian hhsuite package
  2. ^ a b Söding J (2005). "Protein homology detection by HMM-HMM comparison". Bioinformatics. 21 (7): 951–960. doi:10.1093/bioinformatics/bti125. hdl:11858/00-001M-0000-0017-EC7A-F. PMID 15531603.
  3. ^ a b Remmert M, Biegert A, Hauser A, Söding J (2011). "HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment" (PDF). Nat. Methods. 9 (2): 173–175. doi:10.1038/NMETH.1818. hdl:11858/00-001M-0000-0015-8D56-A. PMID 22198341. S2CID 205420247.
  4. ^ a b Söding J, Biegert A, Lupas AN (2005). "The HHpred interactive server for protein homology detection and structure prediction". Nucleic Acids Research. 33 (Web Server issue): W244–248. doi:10.1093/nar/gki408. PMC 1160169. PMID 15980461.
  5. ^ Citations to HHpred, to HHsearch, to HHblits
  6. ^ a b Jaroszewski L, Rychlewski L, Godzik A (2000). "Improving the quality of twilight-zone alignments". Protein Science. 9 (8): 1487–1496. doi:10.1110/ps.9.8.1487. PMC 2144727. PMID 10975570.
  7. ^ Sadreyev RI, Baker D, Grishin NV (2003). "Profile–profile comparisons by COMPASS predict intricate homologies between protein families". Protein Science. 12 (10): 2262–2272. doi:10.1110/ps.03197403. PMC 2366929. PMID 14500884.
  8. ^ Dunbrack RL Jr (2006). "Sequence comparison and protein structure prediction". Current Opinion in Structural Biology. 16 (3): 374–384. doi:10.1016/j.sbi.2006.05.006. PMID 16713709.
  9. ^ Li, Zhaoyu. "Some Notes about HHSuite". Archived from the original on 3 April 2019. Retrieved 3 April 2019.
  10. ^ Guerler A, Govindarajoo B, Zhang Y (2013). "Mapping Monomeric Threading to Protein–Protein Structure Prediction". Journal of Chemical Information and Modeling. 53 (3): 717–25. doi:10.1021/ci300579r. PMC 4076494. PMID 23413988.
  11. ^ Official CASP9 results for the template-based modeling category (121 targets)
  12. ^ Official CASP9 results for all 147 targets
  13. ^ Steinegger M, Meier M, Mirdita M, Vöhringer H, Haunsberger S, Söding J (2019). "HH-suite3 for fast remote homology detection and deep protein annotation". BMC Bioinformatics. 20 (1): 473. doi:10.1186/s12859-019-3019-7. PMC 6744700. PMID 31521110.

Read other articles:

Zadrak Tombeg Wakil Bupati Tana Toraja ke-6PetahanaMulai menjabat 26 Februari 2021PresidenJoko WidodoGubernurNurdin AbdullahAndi Sudirman SulaimanBahtiar Baharuddin (Pj.)BupatiTheofilus Allorerung PendahuluVictor Datuan BataraPenggantiPetahana Informasi pribadiLahir30 Agustus 1962 (umur 61)Makale, Tana Toraja, Sulawesi Selatan, IndonesiaKebangsaanIndonesiaPartai politikPartai Gerakan Indonesia RayaSuami/istriDR. Erni Yetti, SKM; M.KesPekerjaanDokter AkademisiSunting kotak info …

Czech RepublicKaptenJaroslav NavrátilPeringkat ITFPeringkat terkini2Penampilan pertama1921World GroupPenampilan28 (21-27)Hasil terbaik1 (1980)Runner-up2 (1975, 2009)Statistik pemainKemenangan terbanyakJan Kodeš (60-34)Menang terbanyak – TunggalRoderich Menzel (40-12)Menang terbanyak – GandaJan Kodeš (21-15)Tim ganda terbaikJaroslav Drobný dan Vladimír Černík (11-2)Bermain terbanyakJan Kodeš (39)Penampilan terbanyakJan Kodeš (15) Tim PIala Davis Republik Ceko adalah tim yang mewakili…

Labor strike in California, USA This article is missing information about Filipino AWOC's role in the strike. Please expand the article to include this information. Further details may exist on the talk page. (March 2023) Delano grape strikeCésar Chávez shakes hands with John Giumarra Jr. after signing an agreement to end the strikeDateSeptember 7, 1965 – July 29, 1970 (1965-09-07 – 1970-07-29)LocationDelano, CaliforniaGoalsIncreased wages and working con…

Governor of Texas from 1846 to 1847 J. Pinckney HendersonUnited States Senatorfrom TexasIn officeNovember 9, 1857 – June 4, 1858Appointed byElisha M. PeasePreceded byThomas Jefferson RuskSucceeded byMatthias Ward1st Governor of TexasIn officeFebruary 19, 1846 – December 21, 1847LieutenantAlbert Clinton HortonPreceded byAnson Jones (as president of the Republic of Texas)Succeeded byGeorge Tyler WoodMinister to England and France Republic of TexasIn office1837–1840 …

Untuk tempat lain yang bernama sama, lihat Mira (disambiguasi). Lambang Mira (pengucapan bahasa Portugis: [ˈmiɾɐ]) adalah kotamadya yang terletak di Distrik Coimbra, Portugal. Mira adalah kotamadya pesisir yang dikenal akan pantai, hutan, dan pertaniannya. Ekonomi Sebagai pusat perikanan dan akuakultur, aktivitas ekonomi Mira juga bersandar pada pertanian, kehutanan, dan pariwisata. Pada tahun 2007, perusahaan perikanan Pescanova dari Redondela, Spanyol mengumumkan maksudnya untuk membua…

Nunavut This is a list of airports in Nunavut. It includes all Nav Canada certified and registered water and land airports, aerodromes and heliports in the Canadian territory of Nunavut.[1][2] Airport names in italics are part of the National Airports System.[3] With the exception of Iqaluit and Sanikiluaq airports, all other airports in Nunavut are within the Northern Domestic Airspace. List of airports and heliports Pangnirtung Airport Kugluktuk Airport Kugaaruk Airport…

Junior Canadian football team Ottawa Junior Riders Established1995 (as Gloucester Redskins)Based inOttawa, OntarioHome stadiumNepean SportsplexHead coachRob BentoLeagueCanadian Junior Football League (2001–2005)Quebec Junior Football League (1995–2000, 2006–2013, 2017–present)DivisionOntario Football ConferenceColoursRed, Black, and White      League titles1998, 1999, 2000, 2006, 2007, 2008, 2010, 2018, 2019Websitewww.ottawajrriders.ca The Ottawa Junior Riders are a Canadi…

Cet article est une ébauche concernant une localité irakienne. Vous pouvez partager vos connaissances en l’améliorant (comment ?) selon les recommandations des projets correspondants. Doujaïl الدجيل (ar) Ad-Dujayl Administration Pays Irak Gouvernorat Salah ad-Din District (en) Doujaïl (en) Démographie Population 100 000 hab. (2015 est.) Géographie Coordonnées 33° 51′ 00″ nord, 44° 15′ 58″ est Altitude 185 m Loc…

Building in Cluj-Napoca, RomaniaBánffy PalaceExterior viewGeneral informationArchitectural styleBaroqueTown or cityCluj-NapocaCountryRomaniaConstruction started1774Completed1786ClientGyörgy Bánffy, governor of TransylvaniaDesign and constructionArchitect(s)Johann Eberhard Blaumann Bánffy Castle is a baroque building of the 18th century in Cluj-Napoca, designed by the German architect Johann Eberhard Blaumann.[1] Built between 1774 and 1775 it is considered the most representative for…

Beyond the WallEpisode Game of ThronesDaenerys Targaryen dan naga-naganya tiba di luar TembokNomor episodeMusim 7Episode 6SutradaraAlan TaylorPenulisDavid BenioffD. B. WeissMusikRamin DjawadiSinematografiJonathan FreemanPenyuntingTim PorterTanggal siar20 Agustus 2017 (2017-08-20)Durasi70 menit[1]Bintang tamu Richard Dormer sebagai Beric Dondarrion Paul Kaye sebagai Thoros of Myr Joseph Mawle sebagai Benjen Stark Richard Rycroft sebagai Maester Wolkan Vladimir Furdik sebagai Nig…

American civil rights activist Van JonesBornAnthony Kapel Jones (1968-09-20) September 20, 1968 (age 55)Jackson, Tennessee, U.S.EducationUniversity of Tennessee at Martin (BS)Yale University (JD)Occupation(s)News commentator, author, lawyerPolitical partyDemocraticSpouse Jana Carter ​ ​(m. 2005; div. 2019)​Children3WebsiteOfficial website Anthony Kapel Van Jones (born September 20, 1968) is an American political analyst, media personality, law…

Relationship between objects For information on citing sources in Wikipedia, see Wikipedia:Citing sources. A reference is a relationship between objects in which one object designates, or acts as a means by which to connect to or link to, another object. The first object in this relation is said to refer to the second object. It is called a name for the second object. The next object, the one to which the first object refers, is called the referent of the first object. A name is usually a phrase…

Serge Gumienny Informazioni personali Arbitro di Calcio Federazione  Belgio Professione Ufficiale di dogana Altezza 182 cm Peso 82 kg Attività nazionale Anni Campionato Ruolo 2000-20182004-2018 Jupiler LeagueEredivisie ArbitroArbitro Attività internazionale 2003-2018 UEFA e FIFA Arbitro Esordio Fær Øer - Svizzera 1-34 giugno 2005 Serge Gumienny (Bree, 14 aprile 1972) è un ex arbitro di calcio belga. Carriera Arbitro della Jupiler League, dove ha diretto ad oggi ben oltre 160 partite, G…

Judaism's day of rest This article is about the day of rest in Judaism. For the general day of rest in Abrahamic religions, see Sabbath. For Sabbath in the Bible, see Biblical Sabbath. For the Talmudic tractate, see Shabbat (Talmud). ShabbatKiddush cup, Shabbat candles and challah coverHalakhic texts relating to this articleTorah:Exodus 20:7–10, Deut 5:12–14, numerous others.[1]Mishnah:Shabbat, EruvinBabylonian Talmud:Shabbat, EruvinJerusalem Talmud:Shabbat, EruvinMishneh Torah:Sefer…

Halo, IreneLouie. Selamat datang di Wikipedia bahasa Indonesia! Memulai Memulai Para pengguna baru dapat melihat halaman Pengantar Wikipedia terlebih dahulu. Anda bisa mengucapkan selamat datang kepada Wikipediawan lainnya di Halaman perkenalan. Bingung mulai menjelajah dari mana? Kunjungi Halaman sembarang. Untuk mencoba-coba menyunting, silakan gunakan bak pasir. Baca juga aturan yang disederhanakan sebelum melanjutkan. Ini adalah hal-hal mendasar yang perlu diketahui oleh semua penyunting Wik…

Artikel ini perlu dikembangkan dari artikel terkait di Wikipedia bahasa Inggris. (Desember 2023) klik [tampil] untuk melihat petunjuk sebelum menerjemahkan. Lihat versi terjemahan mesin dari artikel bahasa Inggris. Terjemahan mesin Google adalah titik awal yang berguna untuk terjemahan, tapi penerjemah harus merevisi kesalahan yang diperlukan dan meyakinkan bahwa hasil terjemahan tersebut akurat, bukan hanya salin-tempel teks hasil terjemahan mesin ke dalam Wikipedia bahasa Indonesia. Janga…

Ini adalah nama Papua, (Dani), marganya adalah Tabuni Natalis Tabuni,S.S., M.Si. Bupati Intan Jaya Ke-1PetahanaMulai menjabat 2012PresidenS.B. YudhoyonoJoko WidodoGubernurLukas EnembeWakilYann Kabogoyauw Informasi pribadiLahirNatalis Tabuni11 Juli 1977 (umur 46)Soanggama, Irian JayaKewarganegaraanIndonesiaKebangsaanIndonesiaPartai politikNasDemSuami/istriHerawati PujiningsihAnakCarolus TabuniCarlos TabuniChristian TabuniOrang tuaHugo Tabuni (ayah)Hana Janamba (ibu)Alma materUniversi…

2017 BTS Live Trilogy Episode III (Final Chapter): The Wings TourTur World yang diadakan oleh BTSAlbum terkaitWings You Never Walk AloneTanggal awal18 Februari 2017Tanggal akhir10 Desember 2017Legs6Jumlah acara40Hadirin550,000Kronologi konser BTS 2016 BTS LIVE The Most Beautiful Moment in Life On Stage: Epilogue(2016) 2017 BTS Live Trilogy Episode III: The Wings Tour(2017) BTS World Tour: Love Yourself(2018-2019) The Wings Tour, juga dikenal sebagai 2017 BTS Live Trilogy Episode III (Final Chapt…

Азиатский барсук Научная классификация Домен:ЭукариотыЦарство:ЖивотныеПодцарство:ЭуметазоиБез ранга:Двусторонне-симметричныеБез ранга:ВторичноротыеТип:ХордовыеПодтип:ПозвоночныеИнфратип:ЧелюстноротыеНадкласс:ЧетвероногиеКлада:АмниотыКлада:СинапсидыКласс:Млеко…

This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages) Some of this article's listed sources may not be reliable. Please help improve this article by looking for better, more reliable sources. Unreliable citations may be challenged and removed. (May 2014) (Learn how and when to remove this message) This article needs a plot summary. Please add one in your own words. (July 2020) (Learn how and when …

Kembali kehalaman sebelumnya