Indian languages appear to be incorrectly cataloged #41

@tohagan

Describe the bug

I downloaded this file, reformatted it to make it more readable, and discovered some data errors.

For many (most?) of the Indian languages, there appears to be confusion between the script used to write the language (Sanskrit) and the language itself (Assamese, Burmese, Gujarati, Kannada, Bengali).

All of these show the same language value:

"language": { "name": "Sanskrit", "code": "san", "level": "Common - Literary" },

Instead, they should use the iso3 code for the corresponding language, not the script. Otherwise there is no way to reliably find a translation for a given Indian language. You could try searching the description field, I guess, but a database lookup needs a consistent iso3 value.
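To illustrate the lookup problem, here is a minimal sketch. It assumes the file is a JSON array of objects that each carry a `language` object shaped like the one above. The sample entries, their `id` values, and the `index_by_language_code` helper are made up for illustration and are not taken from the real file.

```python
import json
from collections import defaultdict

# Made-up sample data: only the shape of "language" comes from this issue.
# The ids are hypothetical placeholders, not real catalog ids.
SAMPLE_JSON = """
[
  {"id": "example-assamese",
   "language": {"name": "Sanskrit", "code": "san", "level": "Common - Literary"}},
  {"id": "example-gujarati",
   "language": {"name": "Sanskrit", "code": "san", "level": "Common - Literary"}}
]
"""


def index_by_language_code(entries):
    """Group entry ids by their language code, as a database lookup would."""
    index = defaultdict(list)
    for entry in entries:
        index[entry["language"]["code"]].append(entry["id"])
    return dict(index)


index = index_by_language_code(json.loads(SAMPLE_JSON))
print(index)
```

With data like this, every entry lands under `san`, so a lookup by a code such as `asm` (Assamese) or `guj` (Gujarati) finds nothing.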

There are also some entries whose description indicates a different script but that again show the same incorrect language value. This suggests that there is a bug in the code that generates this file.

  {
    "id": "sa-Latn-001-laukika-sanco",
    "version": "Sanskrit Bible (NT) in Cologne Script (satyavEdaH|)",
    "description": "New Testament in Sanskrit Language; printed in Cologne Script", <<Cologne Script
    "scope": "New Testament",
    "language": { "name": "Sanskrit", "code": "san", "level": "Common - Literary" },  << Incorrect
    "country": { "name": "India", "code": "IN" },
    "numeralSystem": "Roman",
    "script": "Latin",
    "archivist": "Manu Mathew Thomas",
    "copyright": "",
    "localVersionName": "Sanskrit Bible (NT) in Cologne Script",
    "localVersionAbbreviation": "SANCO"
  },
....
  {
    "id": "sa-Zzzz-KH-laukika-sankh",
    "version": "Sanskrit Bible (NT) in Khmer Script (សត្យវេទះ។)",
    "description": "New Testament in Sanskrit Language; printed in Khmer Script", << Khmer Script
    "scope": "New Testament",
    "language": { "name": "Sanskrit", "code": "san", "level": "Common - Literary" },   << Incorrect 
    "country": { "name": "Cambodia", "code": "KH" },
    "numeralSystem": "Roman",
    "script": "Code for uncoded script",
    "archivist": "Manu Mathew Thomas",
    "copyright": "",
    "localVersionName": "Sanskrit Bible (NT) in Khmer Script",
    "localVersionAbbreviation": "SANKH"
  },
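A rough way to list the affected entries is sketched below. It uses trimmed copies of the two entries above and pulls the script name out of the description so it can be shown next to the `language.code` and `script` fields. The `described_script` and `scripts_by_language_code` helpers are hypothetical, and the "printed in X Script" wording is assumed from these two descriptions only; it may not hold for the whole file.

```python
import re
from collections import defaultdict

# Trimmed copies of the two entries quoted in this issue.
ENTRIES = [
    {
        "id": "sa-Latn-001-laukika-sanco",
        "description": "New Testament in Sanskrit Language; printed in Cologne Script",
        "language": {"name": "Sanskrit", "code": "san", "level": "Common - Literary"},
        "script": "Latin",
    },
    {
        "id": "sa-Zzzz-KH-laukika-sankh",
        "description": "New Testament in Sanskrit Language; printed in Khmer Script",
        "language": {"name": "Sanskrit", "code": "san", "level": "Common - Literary"},
        "script": "Code for uncoded script",
    },
]

# Assumption: descriptions follow the "printed in X Script" pattern.
SCRIPT_IN_DESCRIPTION = re.compile(r"printed in (.+?) Script")


def described_script(entry):
    """Return the script named in the description, or None if absent."""
    match = SCRIPT_IN_DESCRIPTION.search(entry.get("description", ""))
    return match.group(1) if match else None


def scripts_by_language_code(entries):
    """Map each language code to (id, described script, script field) tuples."""
    grouped = defaultdict(list)
    for entry in entries:
        code = entry["language"]["code"]
        grouped[code].append((entry["id"], described_script(entry), entry["script"]))
    return dict(grouped)


report = scripts_by_language_code(ENTRIES)
for code, rows in report.items():
    for row in rows:
        print(code, *row, sep=" | ")
```

Running this over the full file would show how many distinct scripts share a single language code, which could help narrow down where the generator goes wrong.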
