Cette page n'est pas disponible en Français.
This registry defines standardized codes used for identifying the language varieties of the world. It is part of the Harvest Information System (HIS).
The previous adopted standard was called HIS Registry of Dialects (ROD).
The Editor and Steward of the Registry Of Language Varieties is Allan Starling.
Overview
The function of the Registry is to:
- Identify and verify specific varieties of given languages defined by ISO 639-3
- Provide unique, standardized codes for these varieties
Contents of the Registry
The Registry contains a set of varieties of living languages. A code in this set represents a unique variety of a language.
By definition the scope of a language variety code is always a smaller group of speakers than the group represented by the assigned language as a whole.
Two codes are provided:
- ROLV Code. This is a standardized five-digit code (including leading zeros, when necessary) for uniquely referring to a particular variety.
- BCP-47 code. IETF Language Tags defined in BCP-47 identify not only the language variety, but also script and spelling conventions. This scheme enables users to tap into a wider set of language research and tools.
Information about the varieties is available in the Global Recordings Network website. Users have full access to those descriptions as follows:
- To search by language name - https://globalrecordings.net/search/language
For example: https://globalrecordings.net/en/search/language?search=asmat will list all language varieties that include "Asmat" as part of the name or alternate name.
- To search by ROLV code - https://globalrecordings.net/language/vvvvv where vvvvv is the ROLV code
For example: https://globalrecordings.net/language/4231 will show information for Asmat: Waganu (ROLV code "4231") including links to GRN material and other sites.
- To search by BCP-47 code - https://globalrecordings.net/language/langtag where langtag is the BCP-47 language tag
For example: https://globalrecordings.net/language/asc-x-HIS01514 will show information for Asmat: Shuwru (language tag "asc-x-HIS01514").
- To search by ISO 639-3 code https://globalrecordings.net/language/xxx where xxx is the ISO 639-3 code
For example: https://globalrecordings.net/language/asc will show information about all the varieties of the Asmat language (ISO language code "asc").
Changes to the Registry
Language varieties may be added or retired from the registry as a result of the following:
- The International Organization for Standards (ISO) has dropped, divided, merged, or retired an ISO 639-3 code.
- The ISO has generated a new code.
- An item is determined to be a duplicate of another item.
- A new language variety has been identified
Verification of language-related data
Data can be verified as language varieties by any of the following:
- Has audio/video recordings.
- Has literature in any form of media, including Bible translations.
- Is described in the Ethnologue, but not just in a list of names.
- Is adequately described in Wikipedia, Joshua Project, Glottolog, Google Search, or similar references.
- Has a documented percentage intelligibility with another variety.
- An individual or organization working among the language variety has made a documented request for an ROLV code.
- Changes are reported by qualified field workers.
Language Varieties may have differences in any or all of vocabulary, grammatical construction, idioms, or marked accents. Differences may also be marked by religious or social prejudices.
Downloads
The Registry consists of three lists:
1. Code List
Column | Format | Description |
---|---|---|
Language Code | 3 characters | The ISO 639-3 code of the language of which this is a variety |
ROLV Code | 5 digits | A unique identifier of this language variety |
Language Tag | 20 characters | A unique identifier of this language variety using BCP-47 |
Country Code | 2 characters | The ISO 3166-1 code of the country in which this variety is predominantly spoken |
Dialect Name | 75 characters | The name of the unique variety of this language |
Language Name | 75 characters | The name of the language of which this is a variety |
Location Name | 250 characters | The name of the location in which this variety is primarily spoken |
Download the Code List in JSON format.
2. Alternate Name List
Column | Format | Description |
---|---|---|
ROLV Code | 5 digits | A unique identifier of this language variety |
Language Tag | 20 characters | A unique identifier of this language variety using BCP-47 |
Alternate Name | 75 characters | An alternate name or spelling for the variety including names in other languages and scripts |
Download the Alternate Name List in JSON format.
3. Changes List
Column | Format | Description |
---|---|---|
ROLV Code | 5 digits | A unique identifier of this language variety |
Language Tag | 20 characters | A unique identifier of this language variety using BCP-47 |
Date | yyyy-mm-dd | The change date |
Change Type | 1 character | The type of change that has occurred: A = Added - The code is newly created M = Moved - The variety has been assigned to a different language U = Updated - The definition of the variety has extended, contracted or changed in some other way R = Retired - The code should no longer be used |
Prev Language Code | 3 characters | The ISO 639-3 code of the language to which the variety was previously assigned |
Explanation | text | A more detailed description of what has changed and why |
Download the Changes List in JSON format.
Updates
Use the ROLV Request Form to to identify new language varieties and/or requesting a code.
The form is available online or as a Word document