HIS Registry Of Language Varieties (ROLV)

HIS Registry Of Language Varieties (ROLV)

Cette page n'est pas disponible en Français.

This registry defines standardized codes used for identifying the language varieties of the world. It is part of the Harvest Information System (HIS).

The previous adopted standard was called HIS Registry of Dialects (ROD).

The Editor and Steward of the Registry Of Language Varieties is Allan Starling.

Overview

The function of the Registry is to:

  • Identify and verify specific varieties of given languages defined by ISO 639-3
  • Provide unique, standardized codes for these varieties

Contents of the Registry

The Registry contains a set of varieties of living languages. A code in this set represents a unique variety of a language.

By definition the scope of a language variety code is always a smaller group of speakers than the group represented by the assigned language as a whole.

Two codes are provided:

  • ROLV Code. This is a standardized five-digit code (including leading zeros, when necessary) for uniquely referring to a particular variety.
  • BCP-47 code. IETF Language Tags defined in BCP-47 identify not only the language variety, but also script and spelling conventions. This scheme enables users to tap into a wider set of language research and tools.

Information about the varieties is available in the Global Recordings Network website. Users have full access to those descriptions as follows:

Changes to the Registry

Language varieties may be added or retired from the registry as a result of the following:

  • The International Organization for Standards (ISO) has dropped, divided, merged, or retired an ISO 639-3 code.
  • The ISO has generated a new code.
  • An item is determined to be a duplicate of another item.
  • A new language variety has been identified

Verification of language-related data

Data can be verified as language varieties by any of the following:

  • Has audio/video recordings.
  • Has literature in any form of media, including Bible translations.
  • Is described in the Ethnologue, but not just in a list of names.
  • Is adequately described in Wikipedia, Joshua Project, Glottolog, Google Search, or similar references.
  • Has a documented percentage intelligibility with another variety.
  • An individual or organization working among the language variety has made a documented request for an ROLV code.
  • Changes are reported by qualified field workers.

Language Varieties may have differences in any or all of vocabulary, grammatical construction, idioms, or marked accents. Differences may also be marked by religious or social prejudices.

Downloads

The Registry consists of three lists:

1. Code List

ColumnFormatDescription
Language Code3 charactersThe ISO 639-3 code of the language of which this is a variety
ROLV Code5 digitsA unique identifier of this language variety
Language Tag20 charactersA unique identifier of this language variety using BCP-47
Country Code2 charactersThe ISO 3166-1 code of the country in which this variety is predominantly spoken
Dialect Name75 charactersThe name of the unique variety of this language
Language Name75 charactersThe name of the language of which this is a variety
Location Name250 charactersThe name of the location in which this variety is primarily spoken

Download the Code List in JSON format.

2. Alternate Name List

ColumnFormatDescription
ROLV Code5 digitsA unique identifier of this language variety
Language Tag20 charactersA unique identifier of this language variety using BCP-47
Alternate Name75 charactersAn alternate name or spelling for the variety including names in other languages and scripts

Download the Alternate Name List in JSON format.

3. Changes List

ColumnFormatDescription
ROLV Code5 digitsA unique identifier of this language variety
Language Tag20 charactersA unique identifier of this language variety using BCP-47
Dateyyyy-mm-ddThe change date
Change Type1 characterThe type of change that has occurred:
A = Added - The code is newly created
M = Moved - The variety has been assigned to a different language
U = Updated - The definition of the variety has extended, contracted or changed in some other way
R = Retired - The code should no longer be used
Prev Language Code3 charactersThe ISO 639-3 code of the language to which the variety was previously assigned
ExplanationtextA more detailed description of what has changed and why

Download the Changes List in JSON format.

Updates

Use the ROLV Request Form to to identify new language varieties and/or requesting a code.

The form is available online or as a Word document

Informations reliées

Speech Varieties Research - Find out about the different varieties of speech in the world.