The Biographical Data of Social Insurance Agencies in Germany (BASiD) is a 1% sample of the population of the German Pension insurance related to the Sample of insured persons and their insurance accounts 2007 (Versichertenkontenstichprobe - VSKT). The VSKT contains all individuals which have one of the following employment episode during the observation period: Employment liable to social security (in the data since 1951), Marginal employment (in the data since 1999), Self-employment with voluntary pension insurance, Receipt of benefit according to the German Social Code Book III (in the data since 1975) or German Social Code Book II (in the data since 2005).
All persons who are in the VSKT will be identified in the data of the Integrated Employment Biographies (IEB). Additional information is merged to the VSKT: times of job-seeking registered at BA (in the data since 2000) or (planned) participation in active labour market policies.
For a brief overview of the weakly anonymous version of the BASiD data please refer to the outline (pdf).
Differences between the versions
The Biographical Data of Social Insurance Agencies in Germany (BASiD) is available in two different versions:
- weakly anonymous version (BASiD 5109)
- factually anonymous version
The factually anonymous version and the weakly anonymous version were produced in different ways. The main differences are the degree of anonymisation and the amount of variables. The factually anonymous version contains aggregated variables (e.g. region, classification of occupations and classification of industries). Furthermore the format of the data is different. The weakly anonymous version is available in long format and the factually anonymous data set in wide format.
The weakly anonymous version may only be used during research visits (included remote data access). The factually anonymous version can be downloaded by the applying institutions as a Scientific Use File (SUF) at the Research Data Center of the Pension Insurance.
Dataset Descriptions and Frequencies
Version | FDZ Datenreport | Frequencies and Labels |
---|---|---|
weakly anonymous version BASiD 5109 |
Research papers with BASiD
Data Access
The Biographical Data of Social Insurance Agencies in Germany (BASiD) is available via the following ways of access:
Weakly anonymous version
On-site Use included Remote Data Access. Further information on Applying for on-site use.
Factually anonymous version
The factually anonymous version of the BASiD data will only be available at the Research Data Center of the Pension Insurance by the end of January 2012.
Test Data of the weakly anonymous version
Test data in Stata are available in order to allow for the preparation of programs to facilitate remote data access as well as on-site use:
Test data for Stata (zip, 36 MB)
Test data are random and therefore not eligible for analysis!
Other Working Tools
A brief description can be found in: Hochfellner, Daniela; Müller, Dana; Wurdack, Anja (2012): Biographical Data of Social Insurance Agencies in Germany – Improving the Content of Administrative Data. In: Schmollers Jahrbuch 132 (2012), 443 – 451.
Further working tools as the overview on classifications of economic activities, a list of upper earnings limits and marginal part-time income thresholds and papers on working with the FDZ data can be found at key working tools of the FDZ.
Error correction
An error has been detected in the labels for the variable 'estatvor' (employment status prior to job search). Please read the following info sheet (pdf) to learn more and find guidelines to correct the error.