Skip to content

Biographical data of selected insurance agencies in Germany (BASiD)

The Biographical Data of Social Insurance Agencies in Germany (BASiD) is a 1% sample of the population of the German Pension insurance related to the Sample of insured persons and their insurance accounts 2007 (Versichertenkontenstichprobe - VSKT). The VSKT contains all individuals which have one of the following employment episode during the observation period: Employment liable to social security (in the data since 1951), Marginal employment (in the data since 1999), Self-employment with voluntary pension insurance, Receipt of benefit according to the German Social Code Book III (in the data since 1975) or German Social Code Book II (in the data since 2005).

All persons who are in the VSKT will be identified in the data of the Integrated Employment Biographies (IEB). Additional information is merged to the VSKT: times of job-seeking registered at BA (in the data since 2000) or (planned) participation in active labour market policies.

For a brief overview of the weakly anonymous version of the BASiD data please refer to the outline (pdf).

Differences between the versions

The Biographical Data of Social Insurance Agencies in Germany (BASiD) is available in two different versions:

  • weakly anonymous version (BASiD 5109)
  • factually anonymous version

The factually anonymous version and the weakly anonymous version were produced in different ways. The main differences are the degree of anonymisation and the amount of variables. The factually anonymous version contains aggregated variables (e.g. region, classification of occupations and classification of industries). Furthermore the format of the data is different. The weakly anonymous version is available in long format and the factually anonymous data set in wide format.

The weakly anonymous version may only be used during research visits (included remote data access). The factually anonymous version can be downloaded by the applying institutions as a Scientific Use File (SUF) at the Research Data Center of the Pension Insurance.

Dataset Descriptions and Frequencies

Version
FDZ Datenreport
Frequencies and Labels
weakly anonymous version
BASiD 5109
 FDZ-Datenreport 09/2011 (pdf in German)
 FDZ-Datenreport 09/2011 (pdf in English)
 Auszählungen und Labels (zip in German)
 Frequencies and labels (zip in English)

Research papers with BASiD

Data Access

The Biographical Data of Social Insurance Agencies in Germany (BASiD) is available via the following ways of access:

Weakly anonymous version

On-site Use included Remote Data Access. Further information on Applying for on-site use

Factually anonymous version

The factually anonymous version of the BASiD data will only be available at the Research Data Center of the Pension Insurance by the end of January 2012.

Test Data of the weakly anonymous version

Test data in Stata are available in order to allow for the preparation of programs to facilitate remote data access as well as on-site use:

Test data for Stata (zip, 36 MB)

Test data are random and therefore not eligible for analysis!

Other Working Tools

A brief description can be found in: Hochfellner, Daniela; Müller, Dana; Wurdack, Anja (2012): Biographical Data of Social Insurance Agencies in Germany – Improving the Content of Administrative Data. In: Schmollers Jahrbuch 132 (2012), 443 – 451.

Further working tools as the overview on classifications of economic activities, a list of upper earnings limits and marginal part-time income thresholds and papers on working with the FDZ data can be found at key working tools of the FDZ.

Error correction

An error has been detected in the labels for the variable 'estatvor' (employment status prior to job search). Please read the following info sheet (pdf) to learn more and find guidelines to correct the error.