International standard for three-letter codes identifying languages
ISO 639-3:2007, Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages, is an international standard for language codes in the ISO 639 series. It defines three-letter codes for identifying languages. The standard was published by International Organization for Standardization (ISO) on 1 February 2007.[1]
ISO 639-3 extends the ISO 639-2 alpha-3 codes with an aim to cover all known natural languages. The extended language coverage was based primarily on the language codes used in the Ethnologue (volumes 10–14) published by SIL International, which is now the registration authority for ISO 639-3.[2] It provides an enumeration of languages as complete as possible, including living and extinct, ancient and constructed, major and minor, written and unwritten.[1] However, it does not include reconstructed languages such as Proto-Indo-European.[3]
ISO 639-3 is intended for use as metadata codes in a wide range of applications. It is widely used in computer and information systems, such as the Internet, in which many languages need to be supported. In archives and other information storage, it is used in cataloging systems, indicating what language a resource is in or about. The codes are also frequently used in the linguistic literature and elsewhere to compensate for the fact that language names may be obscure or ambiguous.
Find a language
Enter an ISO 639-3 code to find the corresponding language article.
ISO 639-3 includes all languages in ISO 639-1 and all individual languages in ISO 639-2. ISO 639-1 and ISO 639-2 focused on major languages, most frequently represented in the total body of the world's literature. Since ISO 639-2 also includes language collections and Part 3 does not, ISO 639-3 is not a superset of ISO 639-2. Where B and T codes exist in ISO 639-2, ISO 639-3 uses the T-codes.
As of 23 January 2023[update], the standard contains 7,916 entries.[6] The inventory of languages is based on a number of sources including: the individual languages contained in 639-2, modern languages from the Ethnologue, historic varieties, ancient languages and artificial languages from the Linguist List,[7] as well as languages recommended within the annual public commenting period.
Machine-readable data files are provided by the registration authority.[6] Mappings from ISO 639-1 or ISO 639-2 to ISO 639-3 can be done using these data files.
ISO 639-3 is intended to assume distinctions based on criteria that are not entirely objective.[8] It is not intended to document or provide identifiers for dialects or other sub-language variations.[9] Nevertheless, judgments regarding distinctions between languages may be subjective, particularly in the case of language varieties without established literary traditions, usage in education or media, or other factors that contribute to language conventionalization. Therefore, the standard should not be regarded as an authoritative statement of what distinct languages exist in the world (about which there may be substantial disagreement in some cases), but rather simply one useful way for identifying different language varieties precisely.
Code space
Since the code is three-letter alphabetic, one upper bound for the number of languages that can be represented is 26 × 26 × 26 = 17,576. Since ISO 639-2 defines special codes (4), a reserved range (520) and B-only codes (22), 546 codes cannot be used in part 3. Therefore, a stricter upper bound is 17,576 − 546 = 17,030.
The upper bound gets even stricter if one subtracts the language collections defined in 639-2 and the ones yet to be defined in ISO 639-5.
There are 58 languages in ISO 639-2 which are considered, for the purposes of the standard, to be "macrolanguages" in ISO 639-3.[10]
Some of these macrolanguages had no individual language as defined by ISO 639-3 in the code set of ISO 639-2, e.g. ara (Generic Arabic). Others like nor (Norwegian) had their two individual parts (nno (Nynorsk), nob (Bokmål)) already in ISO 639-2.
That means some languages (e.g. arb, Standard Arabic) that were considered by ISO 639-2 to be dialects of one language (ara) are now in ISO 639-3 in certain contexts considered to be individual languages themselves.
This is an attempt to deal with varieties that may be linguistically distinct from each other, but are treated by their speakers as two forms of the same language, e.g. in cases of diglossia.
"A collective language code element is an identifier that represents a group of individual languages that are not deemed to be one language in any usage context."[12] These codes do not precisely represent a particular language or macrolanguage.
While ISO 639-2 includes three-letter identifiers for collective languages, these codes are excluded from ISO 639-3. Hence ISO 639-3 is not a superset of ISO 639-2.
ISO 639-5 defines 3-letter collective codes for language families and groups, including the collective language codes from ISO 639-2.
Special codes
Four codes are set aside in ISO 639-2 and ISO 639-3 for cases where none of the specific codes are appropriate. These are intended primarily for applications like databases where an ISO code is required regardless of whether one exists.
mis (uncoded languages, originally an abbreviation for 'miscellaneous') is intended for languages which have not (yet) been included in the ISO standard.
mul (multiple languages) is intended for cases where the data includes more than one language, and (for example) the database requires a single ISO code.
und (undetermined) is intended for cases where the language in the data has not been identified, such as when it is mislabeled or never had been labeled. It is not intended for cases such as Trojan where an unattested language has been given a name.
zxx (no linguistic content / not applicable) is intended for data which is not a language at all, such as animal calls.[13]
In addition, 520 codes in the range qaa–qtz are 'reserved for local use'. For example, Rebecca Bettencourt assigns a code to constructed languages, and new assignments are made upon request.[14] The Linguist List uses them for extinct languages. Linguist List has assigned one of them a generic value: qnp, unnamed proto-language. This is used for proposed intermediate nodes in a family tree that have no name.
Maintenance processes
The code table for ISO 639-3 is open to changes. In order to protect stability of existing usage, the changes permitted are limited to:[15]
modifications to the reference information for an entry (including names or categorizations for type and scope),
addition of new entries,
deprecation of entries that are duplicates or spurious,
merging one or more entries into another entry, and
splitting an existing language entry into multiple new language entries.
The code assigned to a language is not changed unless there is also a change in denotation.[16]
Changes are made on an annual cycle. Every request is given a minimum period of three months for public review.
The ISO 639-3 Web site has pages that describe "scopes of denotation"[17] (languoid types) and types of languages,[18] which explain what concepts are in scope for encoding and certain criteria that need to be met. For example, constructed languages can be encoded, but only if they are designed for human communication and have a body of literature, preventing requests for idiosyncratic inventions.
The registration authority documents on its Web site instructions made in the text of the ISO 639-3 standard regarding how the code tables are to be maintained.[19] It also documents the processes used for receiving and processing change requests.[20]
A change request form is provided, and there is a second form for collecting information about proposed additions. Any party can submit change requests. When submitted, requests are initially reviewed by the registration authority for completeness.
When a fully documented request is received, it is added to a published Change Request Index. Also, announcements are sent to the general LINGUIST discussion list at Linguist List and other lists the registration authority may consider relevant, inviting public review and input on the requested change. Any list owner or individual is able to request notifications of change requests for particular regions or language families. Comments that are received are published for other parties to review. Based on consensus in comments received, a change request may be withdrawn or promoted to "candidate status".
Three months prior to the end of an annual review cycle (typically in September), an announcement is sent to the LINGUIST discussion list and other lists regarding Candidate Status Change Requests. All requests remain open for review and comment through the end of the annual review cycle.
Decisions are announced at the end of the annual review cycle (typically in January). At that time, requests may be adopted in whole or in part, amended and carried forward into the next review cycle, or rejected. Rejections often include suggestions on how to modify proposals for resubmission. A public archive of every change request is maintained along with the decisions taken and the rationale for the decisions.[21]
Criticism
Linguists Morey, Post and Friedman raise various criticisms of ISO 639, and in particular ISO 639-3:[16]
The three-letter codes themselves are problematic, because while officially arbitrary technical labels, they are often derived from mnemonic abbreviations for language names, some of which are pejorative. For example, Yemsa was assigned the code jnj, from pejorative "Janejero". These codes may thus be considered offensive by native speakers. However, codes can be changed with a request submission on SIL's website.
The administration of the standard is problematic because SIL is a missionary organization with inadequate transparency and accountability. Decisions as to what deserves to be encoded as a language are made internally. While outside input may or may not be welcomed, the decisions themselves are opaque, and many linguists have given up trying to improve the standard.
Permanent identification of a language is incompatible with language change.
Languages and dialects often cannot be rigorously distinguished, and dialect continua may be subdivided in many ways, whereas the standard privileges one choice. Such distinctions are often based instead on social and political factors.
ISO 639-3 may be misunderstood and misused by authorities that make decisions about people's identity and language, abolishing the right of speakers to identify or identify with their speech variety. Though SIL is sensitive to such issues, this problem is inherent in the nature of an established standard, which may be used (or mis-used) in ways that ISO and SIL do not intend.
Martin Haspelmath agrees with four of these points, but not the point about language change.[22] He disagrees because any account of a language requires identifying it, and we can easily identify different stages of a language. He suggests that linguists may prefer to use a codification that is made at the languoid level since "it rarely matters to linguists whether what they are talking about is a language, a dialect or a close-knit family of languages." He also questions whether an ISO standard for language identification is appropriate since ISO is an industrial organization, while he views language documentation and nomenclature as a scientific endeavor. He cites the original need for standardized language identifiers as having been "the economic significance of translation and software localization", for which purposes the ISO 639-1 and 639-2 standards were established. But he raises doubts about industry need for the comprehensive coverage provided by ISO 639-3, including as it does "little-known languages of small communities that are never or hardly used in writing and that are often in danger of extinction".
BCP 47: Best Current Practice 47,[26] which includes RFC 5646
RFC 5646, which superseded RFC 4646, which superseded RFC 3066. (Therefore, all standards which depend on any of these 3 IETF standards now use ISO 639-3.)
The ePub 3.0 standard for language metadata[27] uses Dublin Core Metadata elements. These language metadata elements in ePubs must contain valid RFC 5646 codes for languages.[27] RFC5646 points to ISO 639-3 for languages without shorter IANA codes.
Internet Assigned Numbers Authority (IANA) The W3C's internationalization effort recommends the use of the IANA Language Subtag Registry for selecting codes for languages.[29] The IANA Language Subtag Registry[30] depends on ISO 639-3 codes for languages which did not previously have codes in other parts of the ISO 639 standard.
^ ab"ISO 639-3 status and abstract". International Organization for Standardization. 20 July 2010. Archived from the original on 14 January 2012. Retrieved 14 June 2012.
Good, Jeff; Cysouw, Michael (2013). "Languoid, doculect, and glossonym: formalizing the notion 'language'". Language Documentation & Conservation. 7: 331–359. hdl:10125/4606.
Anna ShafferShaffer, 2011Lahir15 Maret 1992 (umur 32)London, InggrisPekerjaanAktrisTahun aktif2009–sekarangSuami/istriJimmy Stephenson (m. 2021) Anna Shaffer (lahir 15 Maret 1992) adalah seorang aktris Inggris, dikenal karena perannya sebagai Ruby Button dalam sinetron remaja Hollyoaks dan Romilda Vane dalam Harry Potter.[1][2] Dia berperan sebagai Triss Merigold dalam serial Netflix The Witcher. Referensi ^ Ruby isn't leaving 'Hollyoaks…
العلاقات الغينية الموزمبيقية غينيا موزمبيق غينيا موزمبيق تعديل مصدري - تعديل العلاقات الغينية الموزمبيقية هي العلاقات الثنائية التي تجمع بين غينيا وموزمبيق.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية للدولتين: وجه المقارنة غ…
1419 treaty between the Ottoman Empire and the Republic of Venice Ottoman–Venetian peace treatyThe Balkans and western Anatolia in 1410. Ottoman and other Turkish territories are marked in shades of brown, Venetian or Venetian-influenced ones in shades of green.Signed6 November 1419 (1419-11-06)Mediators Manuel II PalaiologosSignatories Republic of Venice Ottoman Empire The Ottoman–Venetian peace treaty of 1419 was signed between the Ottoman Empire and Republic of Venice…
Non-profit medical organization Remote Area Medical Logo Remote Area Medical (RAM) is a non-profit provider of mobile medical clinics delivering free dental, vision, and medical care (as well as veterinary services when available) to under-served and uninsured individuals. Founded by British philanthropist Stan Brock, it was originally conceived to treat people in the developing world, but turned its attention to those in need of health care in the United States.[1] History RAM was found…
Gujarat has both private and public universities, many of which are supported by the Government of India and the state government - Government of Gujarat. Apart from these there are private universities supported by various bodies and societies. Here is a list of research organisations and educational institutions of Gujarat. Universities Gujarat University in Ahmedabad is the largest university in Gujarat. Kala Bhavan, Maharaja Sayajirao University of Baroda Resource centre in Dhirubhai Ambani …
Tea garden in Sreemangal Bangladesh is an important tea-producing country. It is the 12th[1] largest tea producer in the world. Its tea industry dates back to British rule, when the East India Company initiated the tea trade in the hills of the Sylhet region.[2] In addition to that, tea cultivation was introduced to Greater Chittagong in 1840.[3] Today, the country has 166 commercial tea estates, including many of the world's largest working plantations.[4][5&…
UlpianusPatung Ulpianus dari abad ke-19 di Palais de Justice di kota Brussels, Belgia.LahirSekitar tahun 170Meninggal223RomaKebangsaanRomawiPekerjaanAhli hukum Ulpianus (bahasa Latin: Gnaeus Domitius Annius Ulpianus; sekitar tahun 170 – 223) adalah seorang ahli hukum Romawi keturunan Tirus. Ia dianggap sebagai salah satu ahli hukum terbaik pada masanya. Ia juga merupakan salah satu dari lima ahli hukum yang dijadikan sebagai otoritas hukum menurut lex citationum yang dikeluark…
Political party in Ukraine Party of Christian Socialists Партія Xристиянські соціалістиChairmanArthur Martin[1]FounderMykhailo DobkinFounded15 February 2018 (2018-02-15)Split fromOpposition BlocHeadquartersKyivIdeologyChristian socialism[2]Christian left[3]Regionalism[4]Euroscepticism[5]Political positionLeft-wing[3][6]ReligionRussian Orthodoxy[7]National affiliationOpposition …
Bus system serving the Greater Toronto Area in Ontario, Canada Toronto Transit Commission bus systemAn Orion VII Next Generation hybrid electric bus in Downtown TorontoParentCorporation of the City of TorontoFounded1921HeadquartersWilliam McBrien Building1900 Yonge StreetToronto, Ontario, CanadaLocaleTorontoService areaToronto, Mississauga, Vaughan, MarkhamService type10-minute network, Local, Express, Night, Shuttle, Paratransit, Express bus serviceAllianceGO Transit, MiWay, York Region Transit…
French film director, screenwriter and film critic Olivier AssayasAssayas in 2010Born (1955-01-25) 25 January 1955 (age 69)Paris, FranceOccupation(s)Film director, screenwriter, film criticYears active1977–presentSpouse Maggie Cheung (m. 1998; div. 2001)PartnerMia Hansen-Løve (2002–2017)Children1 Olivier Assayas (French: [ɔlivje asajas]; born 25 January 1955) is a French film director, screenwriter and film critic. Assay…
Go-HorikawaKaisar JepangBerkuasa29 Juli 1221 – 17 November 1232PendahuluChūkyōPenerusShijōInformasi pribadiKelahiran22 Maret 1212Kematian31 Agustus 1234(1234-08-31) (umur 22)PemakamanKannon-ji no Misasagi (Kyoto)WangsaYamatoAyahPangeran MorisadaPasanganFujiwara no ArikoFujiwara no ChōshiFujiwara no ShunshiAnakKaisar Shijō Emperor Go-Horikawa (後堀河天皇code: ja is deprecated , Go-Horikawa-tennō) (22 Maret 1212 – 31 Agustus 1234) adalah kaisar Jepang ke-86, menur…
Entrée du club. Le West Side Tennis Club est un club de tennis privé situé à Forest Hills, un quartier de l'arrondissement de Queens à New York. Le club possède 38 courts de tennis en terre battue, en gazon, en Har-Tru (terre battue américaine) et en dur ainsi que plusieurs équipements collectifs[1]. Le club est particulièrement connu pour avoir hébergé l'US Open de tennis à soixante reprises, d'abord de 1915 à 1920 puis de 1924 à 1977. En outre, dix finales de Coupe Davis se sont …
دائرة تونس الثانية الانتخابية خارطة الدائرة الانتخابية.الموقع على المستوى الوطني. جغرافيا الدولة تونس الولاية تونس التقسيم الإداري المعتمديات 10 التمثيل النواب 8 المجلس الأول تعديل مصدري - تعديل دائرة تونس الثانية الانتخابية (تونس 2) كانت سابقا واحدة من 27 دائرة انتخابية …
Tom KelleyKelley with A New Wrinkle calendarBornTom Kelley(1914-12-12)December 12, 1914Philadelphia, Pennsylvania, U.S.DiedJanuary 8, 1984(1984-01-08) (aged 69)Los Angeles, California, U.S.OccupationPhotographerKnown forEntertainment, commercial, and advertising photographyChildren1 Tom Kelley Sr. (December 12, 1914 – January 8, 1984) was an American photographer who photographed Hollywood celebrities in the 1940s and 1950s. He is best known for his iconic 1949 nude photographs of Ma…
Soccer club in Major League Soccer This article is about the Major League Soccer team. For the USL Pro team, see Orlando City SC (2010–2014). Soccer clubOrlando City SCNickname(s)The Lions[1]FoundedNovember 19, 2013; 10 years ago (2013-11-19)[a 1]StadiumInter&Co Stadium Orlando, FloridaCapacity25,500OwnerZygi, Leonard and Mark WilfHead coachÓscar ParejaLeagueMajor League Soccer2023Eastern Conference: 2nd Overall: 2nd Playoffs: Conference SemifinalsWebsite…
This article is about the interstate trade in enslaved people, especially after 1808. For the general topic of American chattel slavery, see Slavery in the United States. For the international routes, see Atlantic slave trade. (1) Charleston S.C. 4th March 1833 The land of the free & home of the brave watercolor by British naval officer Henry Byam Martin; (2a) Broadside advertising sale of slaves in the rotunda of the St. Louis Hotel in New Orleans (1858), and (2b) postcard depicting the Old…
American train robber (1867–1908) Sundance KidThe Sundance Kid and Etta Place before they left for South America (c. 1901)BornHarry Alonzo Longabaugh1867 (1867)Mont Clare, Pennsylvania, United StatesDiedNovember 7, 1908(1908-11-07) (aged 40–41)San Vicente Canton, BoliviaCause of deathGunshotResting placeSan Vicente CemeteryNationalityAmericanOccupation(s)Thief, bank robber, train robber, criminal gang leaderAllegianceButch Cassidy's Wild BunchCriminal chargeTheft (1887)P…
Palace in Jodhpur, India Umaid Bhawan Palace, Jodhpurउम्मैद भवन पैलेसFront façade of the Umaid Bhawan PalaceGeneral informationArchitectural styleIndo-Saracenic ArchitectureTown or cityJodhpurCountryIndiaConstruction started1928Completed1943OwnerGaj SinghTechnical detailsStructural systemGolden yellow or dun-coloured sandstoneDesign and constructionArchitect(s)Er. Mohan Lal Nepalia, Budhmal Rai and Sir Samuel Swinton JacobEngineerHenry Vaughan Lanchester Umaid Bhawan…