Share to: share facebook share twitter share wa share telegram print page

LightGBM

LightGBM
Original author(s)Guolin Ke[1] / Microsoft Research
Developer(s)Microsoft and LightGBM contributors[2]
Initial release2016; 8 years ago (2016)
Stable release
v4.3.0[3] / January 15, 2024; 11 months ago (2024-01-15)
Repositorygithub.com/microsoft/LightGBM
Written inC++, Python, R, C
Operating systemWindows, macOS, Linux
TypeMachine learning, gradient boosting framework
LicenseMIT License
Websitelightgbm.readthedocs.io

LightGBM, short for Light Gradient-Boosting Machine, is a free and open-source distributed gradient-boosting framework for machine learning, originally developed by Microsoft.[4][5] It is based on decision tree algorithms and used for ranking, classification and other machine learning tasks. The development focus is on performance and scalability.

Overview

The LightGBM framework supports different algorithms including GBT, GBDT, GBRT, GBM, MART[6][7] and RF.[8] LightGBM has many of XGBoost's advantages, including sparse optimization, parallel training, multiple loss functions, regularization, bagging, and early stopping. A major difference between the two lies in the construction of trees. LightGBM does not grow a tree level-wise — row by row — as most other implementations do.[9] Instead it grows trees leaf-wise. It will choose the leaf with max delta loss to grow. [10] Besides, LightGBM does not use the widely used sorted-based decision tree learning algorithm, which searches the best split point on sorted feature values,[11] as XGBoost or other implementations do. Instead, LightGBM implements a highly optimized histogram-based decision tree learning algorithm, which yields great advantages on both efficiency and memory consumption.[12] The LightGBM algorithm utilizes two novel techniques called Gradient-Based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB) which allow the algorithm to run faster while maintaining a high level of accuracy.[13]

LightGBM works on Linux, Windows, and macOS and supports C++, Python,[14] R, and C#.[15] The source code is licensed under MIT License and available on GitHub.[16]

Gradient-based one-side sampling

Gradient-based one-side sampling (GOSS) is a method that leverages the fact that there is no native weight for data instance in GBDT. Since data instances with different gradients play different roles in the computation of information gain, the instances with larger gradients will contribute more to the information gain. So to retain the accuracy of the information, GOSS keeps the instances with large gradients and randomly drops the instances with small gradients.[13]

Exclusive feature bundling

Exclusive feature bundling (EFB) is a near-lossless method to reduce the number of effective features. In a sparse feature space many features are nearly exclusive, implying they rarely take nonzero values simultaneously. One-hot encoded features are a perfect example of exclusive features. EFB bundles these features, reducing dimensionality to improve efficiency while maintaining a high level of accuracy. The bundle of exclusive features into a single feature is called an exclusive feature bundle.[13]

See also

References

  1. ^ "Guolin Ke". GitHub.
  2. ^ "microsoft/LightGBM". GitHub. 7 July 2022.
  3. ^ "Releases · microsoft/LightGBM". GitHub.
  4. ^ Brownlee, Jason (March 31, 2020). "Gradient Boosting with Scikit-Learn, XGBoost, LightGBM, and CatBoost".
  5. ^ Kopitar, Leon; Kocbek, Primoz; Cilar, Leona; Sheikh, Aziz; Stiglic, Gregor (July 20, 2020). "Early detection of type 2 diabetes mellitus using machine learning-based prediction models". Scientific Reports. 10 (1): 11981. Bibcode:2020NatSR..1011981K. doi:10.1038/s41598-020-68771-z. PMC 7371679. PMID 32686721 – via www.nature.com.
  6. ^ "Understanding LightGBM Parameters (and How to Tune Them)". neptune.ai. May 6, 2020.
  7. ^ "An Overview of LightGBM". avanwyk. May 16, 2018.
  8. ^ "Parameters — LightGBM 3.0.0.99 documentation". lightgbm.readthedocs.io.
  9. ^ The Gradient Boosters IV: LightGBM – Deep & Shallow
  10. ^ "Features". LightGBM Official Documentation. Nov 3, 2024.
  11. ^ Manish, Mehta; Rakesh, Agrawal; Jorma, Rissanen (Nov 24, 2020). "SLIQ: A fast scalable classifier for data mining". International Conference on Extending Database Technology: 18–32. CiteSeerX 10.1.1.89.7734.
  12. ^ "Features — LightGBM 3.1.0.99 documentation". lightgbm.readthedocs.io.
  13. ^ a b c Ke, Guolin; Meng, Qi; Finley, Thomas; Wang, Taifeng; Chen, Wei; Ma, Weidong; Ye, Qiwei; Liu, Tie-Yan (2017). "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems. 30.
  14. ^ "lightgbm: LightGBM Python Package". 7 July 2022 – via PyPI.
  15. ^ "Microsoft.ML.Trainers.LightGbm Namespace". docs.microsoft.com.
  16. ^ "microsoft/LightGBM". October 6, 2020 – via GitHub.

Further reading

Read other articles:

Bernard GoumouGoumou pada 2022 Perdana Menteri GuineaPetahanaMulai menjabat 16 Juli 2022Penjabat: 16 Juli 2022 – 20 Agustus 2022PresidenMamady Doumbouya PendahuluMohamed BéavoguiPenggantiBah Oury Informasi pribadiLahir8 September 1980 (umur 43)Abidjan, Pantai GadingPartai politikIndependenSunting kotak info • L • B Bernard Goumou (lahir 8 September 1980) adalah seorang politikus Guinea yang menjabat sebagai Perdana Menteri Guinea, diangkat setelah perdana menteri sement…

Carl Menger, salah satu penggagas pemikiran ekonomi neoklasik. Ekonomi neoklasik adalah istilah yang digunakan untuk mendefinisikan beberapa aliran pemikiran ilmu ekonomi yang mencoba menjabarkan pembentukan harga, produksi, dan distribusi pendapatan melalui mekanisme penawaran dan permintaan pada suatu pasar. Asumsi maksimalisasi utilitas mendekatkan teori ini pada aliran ekonomi marjinalis yang lahir pada akhir abad ke-19 Masehi. Tiga penggagas utama mazhab ini adalah Carl Menger (1840-1941) d…

قرية أندوفر الإحداثيات 42°09′31″N 77°47′43″W / 42.1586°N 77.7953°W / 42.1586; -77.7953  [1] تقسيم إداري  البلد الولايات المتحدة[2]  التقسيم الأعلى مقاطعة ألليغاني  خصائص جغرافية  المساحة 2.631886 كيلومتر مربع2.631887 كيلومتر مربع (1 أبريل 2010)  ارتفاع 506 متر،  و509 متر[…

قرية كانيستو الإحداثيات 42°16′13″N 77°36′21″W / 42.27035°N 77.60582°W / 42.27035; -77.60582   [1] تاريخ التأسيس 1789  تقسيم إداري  البلد الولايات المتحدة[2]  التقسيم الأعلى مقاطعة ستوبين  خصائص جغرافية  المساحة 2.418195 كيلومتر مربع2.418196 كيلومتر مربع (1 أبريل 2010)  ارتف…

  هذه المقالة عن اللغات التي يتكلم بها سكان قارة أوروبا وليس جزيرة أروبا في البحر الكاريبي. لمعانٍ أخرى، طالع لغات أروبا. اللغات المتكلمة في أوروبا أغلبها من العائلة اللغوية الهندية الأوروبية أو الفينية الأوغرية.[1] أيضًا اللغات تركية موجودة بكثرة في أوروبا. توزع ال…

Artikel ini tidak memiliki referensi atau sumber tepercaya sehingga isinya tidak bisa dipastikan. Tolong bantu perbaiki artikel ini dengan menambahkan referensi yang layak. Tulisan tanpa sumber dapat dipertanyakan dan dihapus sewaktu-waktu.Cari sumber: Keajaiban Cinta – berita · surat kabar · buku · cendekiawan · JSTOR Keajaiban CintaAlbum studio karya HelenaDirilis2 Oktober 2005GenrePopLabelPro SoundKronologi Helena Keajaiban Cinta (2005) Helena(2011…

Kuil Yasukuni, dibangun pada abad ke-19, menghormati orang-orang yang tewas atas perantaraan kaisar Jepang. Tempat tersebut dipandang oleh beberapa orang sebagai sebuah penggambaran dari nasionalisme Bendera Jepang Nasionalisme Jepang (Jepang: 国家主義code: ja is deprecated , Hepburn: Kokka shugi) adalah nasionalisme yang memandang bahwa Jepang adalah sebuah bangsa dan mempromosikan penyatuan kebudayaan Jepang. Ini meliputi serangkaian besar gagasan dan sentimen yang dilabuhkan oleh orang Je…

Joko AnwarLahir3 Januari 1976 (umur 48)Medan, Sumatera Utara, IndonesiaAlmamaterInstitut Teknologi BandungPekerjaanSutradarapemeranpenulis skenarioproduser filmTahun aktif2002—sekarangTanda tangan Penghargaan Festival Film Indonesia Sutradara Terbaik 2015 A Copy of My Mind 2020 Perempuan Tanah Jahanam Penulis Skenario Terbaik 2008 fiksi. — bersama Mouly Surya Joko Anwar (lahir 3 Januari 1976) adalah sutradara, pemeran, penulis skenario, dan produser film Indonesia. Awal kehidupan J…

  هذه المقالة عن المسجد الأقصى. لمعانٍ أخرى، طالع أقصى (توضيح). المسجد الأقصى منظور جوي للمسجد الأقصى من الجنوب. إحداثيات 31°46′39″N 35°14′13″E / 31.7775067°N 35.2368801°E / 31.7775067; 35.2368801 معلومات عامة الموقع جبل المعبد  القرية أو المدينة البلدة القديمة، القدس الدولة  فل…

Constituency of Bangladesh's Jatiya Sangsad Chittagong-10Constituencyfor the Jatiya SangsadDistrictChittagong DistrictDivisionChittagong DivisionElectorate469,314 (2018)[1]Current constituencyCreated1973Parliamentary PartyBangladesh Awami LeagueMember of ParliamentMd. Mohiuddin BacchuCity Council areaChattogram City CorporationPrev. ConstituencyChittagong-9 (Constituency 286)Next ConstituencyChittagong-11 (Constituency 288) Chittagong-10 is a constituency represented in the Jatiya Sangsa…

Ferdinando III d'AsburgoFrans Luycx, ritratto dell'imperatore Ferdinando III, 1637 circa; Kunsthistorisches MuseumImperatore Eletto dei RomaniStemma In carica15 febbraio 1637 –2 aprile 1657 Incoronazione18 novembre 1637 PredecessoreFerdinando II SuccessoreLeopoldo I Re d'Ungheria e CroaziaRe di BoemiaIn carica15 febbraio 1637 –2 aprile 1657 Incoronazione8 dicembre 1625 (Ungheria)21 novembre 1627 (Boemia) PredecessoreFerdinando II SuccessoreLeopoldo I Altri titoliRe in Germani…

Hernán Salgado Pesantes Hakim Pengadilan Hak Asasi Manusia Antar-AmerikaMasa jabatan1991–2003 Informasi pribadiKebangsaanEkuadorProfesiHakim, yurisSunting kotak info • L • B Hernán Salgado Pesantes adalah seorang yuris (ahli hukum) asal Ekuador yang dikenal akan kiprahnya sebagai hakim di Pengadilan Hak Asasi Manusia Antar-Amerika. Ia mulai menjabat sebagai hakim di mahkamah tersebut pada tahun 1991. Masa baktinya sebagai hakim berakhir pada tahun 2003. [1] Referensi ^ …

Sporting event delegationMozambique at the2020 Summer ParalympicsIPC codeMOZNPCParalympic Committee Mozambiquein TokyoCompetitors2 in 1 sportMedals Gold - Silver - Bronze - Total Summer appearances2012 • 2016 • 2020 Mozambique competed at the 2020 Summer Paralympics in Tokyo, Japan, from 24 August to 5 September 2021.[1][2][3] Athletics Main article: Athletics at the 2020 Summer Paralympics Track Athlete Event Heats Final Result Rank Result Rank Hilario Chavela Men's …

Kualuh HilirKecamatanPeta lokasi Kecamatan Kualuh HilirNegara IndonesiaProvinsiSumatera UtaraKabupatenLabuhanbatu UtaraPemerintahan • Camat-Populasi • Total30,052 jiwa (2.001) jiwaKode Kemendagri12.23.03 Kode BPS1223060 Luas385,48 km²Desa/kelurahan7 Kualuh Hilir adalah sebuah kecamatan di Kabupaten Labuhanbatu Utara, Sumatera Utara, Indonesia. Pranala luar (Indonesia) Keputusan Menteri Dalam Negeri Nomor 050-145 Tahun 2022 tentang Pemberian dan Pemutakhiran Kode, Da…

Ibéromaurusien Définition Auteur Paul Pallary Caractéristiques Répartition géographique Maghreb Période Paléolithique supérieur Chronologie 25 000 à 10 000 ans AP Type humain associé Homo sapiens Tendance climatique Dernier maximum glaciaireet Tardiglaciaire modifier Extension géographique de la culture ibéromaurusienne L’Ibéromaurusien[a] est une culture archéologique préhistorique qui s'est développée sur l'actuel Maghreb, occupant une bande littorale all…

Extinct genus of dinosaurs Not to be confused with Cratoavis. CratonavisTemporal range: Early Cretaceous (Aptian), ~120 Ma PreꞒ Ꞓ O S D C P T J K Pg N ↓ Life restoration of Cratonavis (left) Scientific classification Domain: Eukaryota Kingdom: Animalia Phylum: Chordata Clade: Dinosauria Clade: Saurischia Clade: Theropoda Clade: Avialae Family: †Jinguofortisidae Genus: †CratonavisLi et al., 2023 Species: †C. zhui Binomial name †Cratonavis zhuiLi et al., 2023 Cr…

Railway station in County Durham, England HeighingtonGeneral informationLocationHeighington, County DurhamEnglandCoordinates54°35′50″N 1°34′54″W / 54.5971091°N 1.5817510°W / 54.5971091; -1.5817510Grid referenceNZ271224Owned byNetwork RailManaged byNorthern TrainsPlatforms2Tracks2Other informationStation codeHEIClassificationDfT category F2HistoryOriginal companyStockton and Darlington RailwayPre-groupingNorth Eastern RailwayPost-grouping London and North Easte…

The South Korean honors system includes orders of merit, medals of honor, and commendations conferred by the South Korean government onto its citizens and foreigners. Orders Orders (Korean: 훈장; Hanja: 勳章) are given by the president of South Korea to people who rendered distinguished services to the country. The first honor, the Grand Order of Mugunghwa, was established in 1949.[1][2] Grand Order of Mugunghwa Order of Merit for National Foundation Ord…

Historic battle on the Iberian peninsula Battle of Las BabiasPart of Razias of Hirsham Ithe ReconquistaA map of the Iberian Peninsula around the time of the conflict.Date18 September 795LocationBabia, near Astorga, SpainResult Muslim VictoryBelligerents Kingdom of Asturias Emirate of CórdobaCommanders and leaders Alfonso II of Asturias Hisham I of CórdobaAbd al-Karim ibn Abd al-Wahid ibn Mugit Farach ibn KinanahStrength Unknown 10,000Casualties and losses Unknown Unknown vteBattles in the Reco…

坐标:43°11′38″N 71°34′21″W / 43.1938516°N 71.5723953°W / 43.1938516; -71.5723953 此條目需要补充更多来源。 (2017年5月21日)请协助補充多方面可靠来源以改善这篇条目,无法查证的内容可能會因為异议提出而被移除。致使用者:请搜索一下条目的标题(来源搜索:新罕布什尔州 — 网页、新闻、书籍、学术、图像),以检查网络上是否存在该主题的更多可靠来源(…

Kembali kehalaman sebelumnya