Language dictionary for ICEM data

ICEM_LANG_CODEBOOK     (6565576)
Documentation Author: Paula Aucott
The data cover the period 1881 to 1911.


  1. This meta-data file "language dictionary" was downloaded in spreadsheet format on the 26th November 2015 from the website of the Integrated Census Microdata (I-CeM) project based at the University of Essex.


  1. This table covers England and Wales 1891-1911 and Scotland 1891-1901.
  2. Numeric codes identifying language used within the I-CeM project are as follows:
       10 = Welsh only
       11 = Welsh and English
       12 = Welsh, English and Other
       13 = Welsh and Gaelic
       14 = Welsh and Other
       15 = Welsh, Gaelic and English
       20 = Gaelic only
       21 = Gaelic and English
       22 = Gaelic, English and Other
       23 = Gaelic and Other
       30 = Manx only
       31 = Manx and English
       32 = Manx, English and Other
       33 = Manx and Other
       40 = English and Yiddish
       45 = Yiddish only or Yiddish and Other
       50 = English and Other(s)
       60 = Other languages only
       99 = Answer not applicable or codable
  3. According to the I-CeM guide, this variable records the language spoken, as asked in Wales, Scotland and the Isle of Man. From 1901 onwards the question was asked of those aged 3 and above. It relates only to Welsh, Gaelic, Manx and English.
  4. The I-CeM guide specifies that the 1891 Scottish language information came from the birthplace data and is a variable in that dataset.
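
The numeric codes listed in note 2 can be held in a simple lookup when decoding the langcode column. The following is a minimal Python sketch, not part of the I-CeM materials: the names LANG_CODES, describe_langcode and codes_mentioning are hypothetical, and the "Unknown code" fallback is an assumption for values outside the published list.

```python
# Code labels copied from note 2 of this documentation.
LANG_CODES = {
    10: "Welsh only",
    11: "Welsh and English",
    12: "Welsh, English and Other",
    13: "Welsh and Gaelic",
    14: "Welsh and Other",
    15: "Welsh, Gaelic and English",
    20: "Gaelic only",
    21: "Gaelic and English",
    22: "Gaelic, English and Other",
    23: "Gaelic and Other",
    30: "Manx only",
    31: "Manx and English",
    32: "Manx, English and Other",
    33: "Manx and Other",
    40: "English and Yiddish",
    45: "Yiddish only or Yiddish and Other",
    50: "English and Other(s)",
    60: "Other languages only",
    99: "Answer not applicable or codable",
}


def describe_langcode(code):
    """Return the label for a langcode (hypothetical helper).

    The fallback string is an assumption, not an I-CeM convention.
    """
    return LANG_CODES.get(code, "Unknown code")


def codes_mentioning(language):
    """List the codes whose label names the given language (hypothetical helper)."""
    return [code for code, label in LANG_CODES.items() if language in label]
```

For example, codes_mentioning("Gaelic") collects every code whose label includes Gaelic, which spans both the 1x (Welsh) and 2x (Gaelic) ranges, so the leading digit alone does not identify every speaker of a language.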


  1. The data have not been checked.


Index                     Type          Column(s) indexed
icem_lang_codebook_pkey   Primary key   rec_num
icem_lang_codebook_idx    Unique        langcode, rec_num


The table has the following associated constraints:

icem_lang_codebook_pkey   Primary Key   See details above for primary key index

Columns within table:

lang       Text string (max.len.=64).   Language as given in the transcription from the Census between 1891 and 1911.
langcode   Integer number.              Standardised two-digit numeric code identifying language used within the I-CeM project.
rec_num    Integer number.              Unique number identifying row in table.
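
The columns, primary key and unique index described above can be reproduced locally for experimentation. The following is a minimal sketch using Python's built-in sqlite3 module. The use of SQLite is an assumption (this documentation does not state which database system hosts the table), the CHECK constraint stands in for the 64-character limit, and the rows in SAMPLE_ROWS are hypothetical illustrations, not values taken from the real table.

```python
import sqlite3


def build_codebook(rows):
    """Create an in-memory copy of the table layout and load the given rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE icem_lang_codebook (
            lang     TEXT CHECK (length(lang) <= 64),  -- max.len.=64
            langcode INTEGER,
            rec_num  INTEGER,
            CONSTRAINT icem_lang_codebook_pkey PRIMARY KEY (rec_num)
        )
        """
    )
    # Unique index on (langcode, rec_num), as listed in the index table.
    conn.execute(
        "CREATE UNIQUE INDEX icem_lang_codebook_idx "
        "ON icem_lang_codebook (langcode, rec_num)"
    )
    conn.executemany(
        "INSERT INTO icem_lang_codebook (lang, langcode, rec_num) "
        "VALUES (?, ?, ?)",
        rows,
    )
    return conn


# Hypothetical rows for illustration only; real lang strings are
# census transcriptions and are not reproduced here.
SAMPLE_ROWS = [
    ("Welsh", 10, 1),
    ("Both", 11, 2),
    ("Gaelic & English", 21, 3),
]

conn = build_codebook(SAMPLE_ROWS)

# Look up every transcribed string standardised to a given code.
matches = conn.execute(
    "SELECT lang FROM icem_lang_codebook WHERE langcode = ? ORDER BY rec_num",
    (11,),
).fetchall()
```

Because rec_num is the primary key, inserting a second row with an existing rec_num raises sqlite3.IntegrityError in this sketch, whereas several different lang strings may share one langcode.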