MaFiText-Bundestag speeches: Processing stenographic protocols of the German Bundestag

  • 1. Justus Liebig Universität Gießen

Published: August 7, 2024

Version v1

Description

This dataset contains the replication data for the study "Fiscal Policy in the Bundestag: Textual Analysis and Macroeconomic Effects" by Albina Latifi, Viktoriia Naboka-Krell, Peter Tillmann, and Peter Winker. It covers all speeches held in the German Bundestag from September 7, 1949, to September 7, 2021, a total of 877,140 speeches.

Files

Name Size
all_bundestag_speeches_replication_data.csv 1.7 GB
md5:91aa1451299885ea78e49c42d3ae0f00
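
Because the file is large, it can be worth checking the download against the MD5 checksum listed above before working with it. The following is a minimal sketch, not part of the dataset record: the local path in `CSV_PATH` and the helper name `md5_of_file` are assumptions made for illustration.

```python
import hashlib
import os

# File name and checksum are copied from the file listing above.
# CSV_PATH assumes the file was downloaded into the working directory.
CSV_PATH = "all_bundestag_speeches_replication_data.csv"
EXPECTED_MD5 = "91aa1451299885ea78e49c42d3ae0f00"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size blocks so the whole file never sits in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Only run the check if the file is actually present locally.
if os.path.exists(CSV_PATH):
    actual = md5_of_file(CSV_PATH)
    if actual == EXPECTED_MD5:
        print("checksum OK")
    else:
        print(f"checksum mismatch: {actual}")
```

Reading in chunks keeps memory use constant regardless of file size.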

Variables

Name Description
doc_id The overall document ID.
doc_lp_id ID for each legislature period.
speech_identification_ent Entity used for speech identification, obtained from a named entity recognition model.
date Date of the speech.
period The legislative period.
session Specifies the particular session.
pos_speechbeginning Potential identification of the beginning of a speech. Needed to disaggregate the corpus into individual speeches.
Party Party affiliation of the speaker. Values: {'no-text' (e.g. chair), 'CDU/CSU', 'KPD', 'SPD', 'FDP', 'BP', 'Cabinet' (e.g. members of government), 'DP', 'Zentrum', 'NR', 'WAV', 'parteilos', 'NS', 'DRP', 'SRP', 'GB/BHE', 'fraktionslos', 'FU', 'DPB', 'DA', 'GRÜNE', 'PDS', 'LINKE', 'AfD'}
Role Role of the speaker. Values: {'Alterspraesident', 'MdB', 'Bundestagspraesident', 'Schriftfuehrer', 'Bundeskanzler', 'Bundesminister', 'Vizepraesident', 'Staatssekretär', 'Staatsminister', 'Landesminister', 'Senator', 'Buergermeister', 'Gastredner', 'Wehrbeauftragter', 'Beauftragter'}
governing_Party Indicates whether the speaker belongs to a governing party. Values: {0: opposition, 1: governing party, 'no-text': without assignment (e.g. chair), nan: non-party members / non-affiliated members of parliament}
text Speech content.
text_length Total number of words in a speech.
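
As an illustration of how the variables above might be used, the sketch below streams the CSV row by row and counts speeches per value of the Party column. It is a hedged example rather than the authors' code: it assumes the file has a header row whose column names match the variable names listed above, is comma-delimited, and is UTF-8 encoded. The local path and the helper name `count_speeches_by_party` are hypothetical.

```python
import csv
import os
from collections import Counter

# Assumed local path of the downloaded file.
CSV_PATH = "all_bundestag_speeches_replication_data.csv"

# Speech texts can be long; raise the csv module's per-field size limit.
csv.field_size_limit(2**31 - 1)


def count_speeches_by_party(path, party_column="Party", encoding="utf-8"):
    """Count rows per party without loading the whole file into memory.

    Assumes a header row containing `party_column` (an assumption based on
    the variable list, not verified against the file).
    """
    counts = Counter()
    # newline="" lets the csv module handle line breaks inside quoted fields.
    with open(path, newline="", encoding=encoding) as handle:
        for row in csv.DictReader(handle):
            counts[row[party_column]] += 1
    return counts


# Only run if the file is present locally.
if os.path.exists(CSV_PATH):
    for party, n in count_speeches_by_party(CSV_PATH).most_common():
        print(party, n)
```

Streaming with the standard library avoids holding the full 1.7 GB file in memory; the same loop can be adapted to filter on `Role` or `governing_Party`.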

Details

Resource type Funded Research project dataset
Title MaFiText-Bundestag speeches: Processing stenographic protocols of the German Bundestag
Creators
  • Latifi, Albina (1)
Research Fields Economics, Political Science, Economic & Social History
Size 1.6 GB
Formats Comma-separated values (CSV) (.csv)
License(s) Creative Commons Attribution 4.0 International
Countries Germany

Additional Details

Related works

Is cited by
Peer review: 10.1016/j.euroecorev.2024.104827 (DOI)