Publications

2017
  • Stefanie Dipper and Sandra Kübler (2017). German Treebanks: TIGER and TüBa-D/Z. In: Nancy Ide and James Pustejovsky (eds.), Handbook of Linguistic Annotation. Berlin: Springer.

  • Stefanie Dipper and Sandra Waldenberger (2017). Investigating Diatopic Variation in a Historical Corpus. In: Proceedings of the EACL-Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), Valencia, Spain. PDF

2016
  • Marcel Bollmann, Stefanie Dipper and Florian Petran (2016). Evaluating Inter-Annotator Agreement on Historical Spelling Normalization. In: Proceedings of the ACL Linguistic Annotation Workshop (LAW X), pp. 89–98, Berlin, Germany. PDF

  • Stefanie Dipper (2016). Variationen über ein Thema: statistisch bestimmte linguistische Ähnlichkeiten und Differenzen der Textzeugen des Anselm-Textes. In: Sarah Kwekkeboom & Sandra Waldenberger (eds.), PerspektivWechsel oder: Die Wiederentdeckung der Philologie (Vol. 1), pp. 71–94. Berlin: Erich Schmidt.

  • Stefanie Dipper (2016). Althochdeutsch‐Datengenerator. In: Carmen Bluhm, Jens Hopperdietzel and Lars Erik Zeige (eds.), Glossarium amicorum. Festschrift für Karin Donhauser, pp. 1–4. Institut für deutsche Sprache und Linguistik, HU Berlin. PDF

  • Ronja Laarmann-Quante and Stefanie Dipper (2016): An Annotation Scheme for the Comparison of Different Genres of Social Media with a Focus on Normalization. In: Proceedings of the LREC-Workshop on Normalisation and Analysis of Social Media Texts (NormSoMe), pp. 23–30, Portoroz, Slovenia. PDF

  • Ronja Laarmann-Quante, Lukas Knichel, Stefanie Dipper and Carina Betken (2016). Annotating Spelling Errors in German Texts Produced by Primary School Children. In: Proceedings of the ACL Linguistic Annotation Workshop (LAW X), pp. 32–43, Berlin, Germany. PDF

2015
  • Stefanie Dipper (2015). Annotierte Korpora für die Historische Syntaxforschung: Anwendungsbeispiele anhand des Referenzkorpus Mittelhochdeutsch. Zeitschrift für Germanistische Linguistik 43(3):516–563. Link

  • Stefanie Dipper, Julia Krasselt and Simone Schultz-Balluff (2015). Creating synopses of ‘parallel’ historical manuscripts and early prints. Alignment guidelines, evaluation, and applications. In: Jost Gippert and Ralf Gehrke (eds.), Historical Corpora, Challenges and Perspectives (= CLIP 5). Tübingen: Narr. PDF preprint

  • Julia Krasselt, Marcel Bollmann, Stefanie Dipper and Florian Petran (2015). Guidelines für die Normalisierung historischer deutscher Texte / Guidelines for Normalizing Historical German Texts. Bochumer Linguistische Arbeitsberichte (BLA), Vol. 15. PDF

2014
  • Stefanie Dipper (2014). Querying topological fields in the TIGER scheme with TIGERSearch. In: Proceedings of the 13th International Workshop on Treebanks and Linguistic Theories (TLT13), pp. 37-50, Tübingen, Germany. PDF

  • Marcel Bollmann, Florian Petran, Stefanie Dipper and Julia Krasselt (2014). CorA: A web-based annotation tool for historical and other non-standard language data. In: Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), pp. 86–90, Gothenburg, Sweden. PDF

  • Marcel Bollmann, Florian Petran and Stefanie Dipper (2014). Applying Rule-Based Normalization to Different Types of Historical Texts — An Evaluation. In: Zygmunt Vetulani and Joseph Mariani (eds.), Human Language Technology Challenges for Computer Science and Linguistics. 5th Language and Technology Conference, LTC 2011. Revised Selected Papers, pp. 166–177. Lecture Notes in Computer Science, Volume 8387. Springer. PDF preprint (The final publication is available at www.springerlink.com (Revised version of Bollmann et al. (2011)).

2013
  • Stefanie Dipper, Karin Donhauser, Thomas Klein, Sonja Linde, Stefan Müller and Klaus-Peter Wegera (2013). HiTS: ein Tagset für historische Sprachstufen des Deutschen. In: Journal for Language Technology and Computational Linguistics, Special Issue, 28(1):85–137. PDF

  • Stefanie Dipper, Maria Liakata and Antonio Pareja-Lora (eds.) (2013). Proceedings of the 7th Linguistic Annotation Workshop & Interoperability with Discourse (LAW VII & ID). ACL, Sofia. PDF

  • Stefanie Dipper, Heike Zinsmeister and Bonnie Webber (eds.) (2013). Beyond Semantics: The Challenges of Annotating Pragmatic and Discourse Phenomena, Special Issue of Dialogue & Discourse 4(2). Website

  • Stefanie Dipper, Anke Lüdeling and Marc Reznicek (2013). NoSta-D: A Corpus of German Non-Standard Varieties. In: Marcos Zampieri and Sascha Diwersy (eds.), Non-Standard Data Sources in Corpus-Based Research, pp. 69–76. Shaker. (= ZSM-Studien 5), PDF

  • Stefanie Dipper and Simone Schultz-Balluff (2013). The Anselm Corpus: Methods and Perspectives of a Parallel Aligned Corpus. In: Proceedings of the NODALIDA Workshop on Computational Historical Linguistics. Oslo, Norway. PDF

  • Simone Schultz-Balluff and Stefanie Dipper (2013). St. Anselmi Fragen an Maria – Schritte zu einer (digitalen) Erschließung, Auswertung und Edition der gesamten deutschsprachigen Überlieferung (14.-16. Jh.). In: Anne Bohnenkamp et al. (eds.), Medienwandel/Medienwechsel in der Editionswissenschaft. 13. internationale Tagung der Arbeitsgemeinschaft für germanistische Edition 17.–20. Februar 2010, Beihefte zu editio, pp. 173–191.

2012
  • Marcel Bollmann, Stefanie Dipper, Julia Krasselt and Florian Petran (2012). Manual and Semi-automatic Normalization of Historical Spelling – Case Studies from Early New High German. In: Proceedings of the KONVENS-Workshop on Language Technology for Historical Text(s) (LThist2012), Wien, Austria. PDF

  • Stefanie Dipper, Melanie Seiss and Heike Zinsmeister (2012). The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English. In: Proceedings of the 8th LREC-2012 Conference, pp. 138–145, Istanbul, Turkey. PDF

  • Stefanie Dipper and Heike Zinsmeister (2012). Annotating Abstract Anaphora. In: Language Resources and Evaluation 46(1):37–52. (Revised and extended version of Dipper and Zinsmeister (2009a)). PDF (preprint) (The final publication is available at www.springerlink.com; published Online First 2011, Sep 3.)

  • Lara Kresse, Stefan Kirschner, Stefanie Dipper and Eva Belke (2012). Towards exploring the specific influences of wordform frequency, lemma frequency and OLD20 on visual word recognition and reading aloud. In: Kay-Michael Würzner, Edmund Pohl (eds.), Lexical Resources in Psycholinguistic Research: Proceedings of the Workshop Held at the Forth Conference on Quantitative Investigations in Theoretical Linguistics (qitl4). Potsdam Cognitive Science Series (vol. 3), pp. 9–22. Universitätsverlag Potsdam. PDF

  • Heike Zinsmeister, Stefanie Dipper and Melanie Seiss (2012). Abstract pronominal anaphors and label nouns in German and English: selected case studies and quantitative investigations. In: TC3 2(1):47–80. PDF

2011
  • Marcel Bollmann, Florian Petran and Stefanie Dipper (2011). Applying Rule-Based Normalization to Different Types of Historical Texts – An Evaluation. In: Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznan, Poland. PDF

  • Marcel Bollmann, Florian Petran and Stefanie Dipper (2011). Rule-Based Normalization of Historical Texts. In: Proceedings of the RANLP Workshop on Language Technologies for Digital Humanities and Cultural Heritage, pp. 34–42, Hissar, Bulgaria. PDF

  • Stefanie Dipper (2011). Morphological and Part-of-Speech Tagging of Historical Language Data: A Comparison. In: Journal for Language Technology and Computational Linguistics, Special Issue, 26(2):25–37. (= Proceedings of the TLT-Workshop on Annotation of Corpora for Research in the Humanities, 2012). PDF

  • Stefanie Dipper (2011). Digitale Korpora in der Lehre – Anwendungsbeispiele aus der Theoretischen Linguistik und der Computerlinguistik. In: Journal for Language Technology and Computational Linguistics, Special Issue, 26(1):81–95. PDF

  • Stefanie Dipper, Christine Rieger, Melanie Seiss and Heike Zinsmeister (2011). Abstract Anaphors in German and English. In: Iris Hendrickx et al. (eds.), Anaphora Processing and Applications: 8th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2011. Revised selected papers, pp. 96–107. Lecture Notes in Computer Science. Springer. PDF (preprint) (The final publication is available at www.springerlink.com.)

  • Stefanie Dipper and Martin Schnurrenberger (2011). OTTO: A Tool for Diplomatic Transcription of Historical Texts. In: Zygmunt Vetulani (ed.), Human Language Technology: Challenges for Computer Science and Linguistics. 4th Language and Technology Conference, LTC 2009. Revised Selected Papers, pp. 456–467. Lecture Notes in Computer Science. Springer. PDF (preprint) (The final publication is available at www.springerlink.com.) (Revised version of Dipper and Schnurrenberger (2009)).

  • Stefanie Dipper and Heike Zinsmeister (eds.) (2011). Beyond Semantics: Corpus-based Investigations of Pragmatic and Discourse Phenomena. Proceedings of the DGfS Workshop, Göttingen. Volume 3 of BLA (Bochumer Linguistische Arbeiten). Link

2010
  • Stefanie Dipper (2010). POS-Tagging of Historical Language Data: First Experiments. In: Semantic Approaches in Natural Language Processing. Proceedings of the 10th Conference on Natural Language Processing (KONVENS-10), pp. 117–121, Saarbrücken, Germany. PDF (preprint)

  • Stefanie Dipper, Lara Kresse, Martin Schnurrenberger and Seong-Eun Cho (2010). OTTO: a Transcription and Management Tool for Historical Texts. In: Proceedings of the ACL Linguistic Annotation Workshop (LAW IV), Uppsala, Sweden. PDF

  • Stefanie Dipper and Heike Zinsmeister (2010). Towards a Standard for Annotating Abstract Anaphora. In: Proceedings of the LREC Workshop on Language Resource and Language Technology Standards – state of the art, emerging needs, and future developments, pp. 54–59, Malta. PDF\ (NOTE: this is a fixed version! – by mistake, the study by Hedberg et al. (2007) is missing from the survey table in the workshop proceedings)

2009
  • Stefanie Dipper and Martin Schnurrenberger (2009). OTTO: A Tool for Diplomatic Transcription of Historical Texts. In: Proceedings of the 4th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 516–520, Poznan, Poland. PDF

  • Stefanie Dipper and Heike Zinsmeister (2009a). Annotating Discourse Anaphora. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), ACL-IJCNLP, pp. 166–169, Singapore. PDF

  • Stefanie Dipper and Heike Zinsmeister (2009b). The Role of the German Vorfeld for Local Coherence: A Pilot Study. In: Christian Chiarcos, Richard Eckart de Castilho and Manfred Stede (eds.), From Form to Meaning: Processing Texts Automatically. Proceedings of the Biennial GSCL Conference 2009, pp. 69–79. Tübingen: Narr. PDF (preprint)

2008
  • Heike Bieler and Stefanie Dipper (2008). Measures for Term and Sentence Relevances: an Evaluation for German. In: Proceedings of the 6th LREC-2008 Conference, Marrakech, Morocco. PDF

  • Christian Chiarcos, Stefanie Dipper, Michael Götze, Ulf Leser, Anke Lüdeling, Julia Ritz and Manfred Stede (2008a). A Flexible Framework for Integrating Annotations from Different Tools and Tagsets. In: Traitement Automatique des Langues 49(2):271–293. (Revised and extended version of Chiarcos et al. (2008b)) PDF

  • Christian Chiarcos, Stefanie Dipper, Michael Götze, Julia Ritz and Manfred Stede (2008b). A Flexible Framework for Integrating Annotations from Different Tools and Tagsets. In: Proceedings of the First International Conference on Global Interoperability for Language Resources (ICGL-08), pp. 43–50, Hongkong. PDF

  • Stefanie Dipper (2008). Theory-driven and Corpus-driven Computational Linguistics, and the Use of Corpora. In: Anke Lüdeling and Merja Kytö (eds.), Corpus Linguistics. An International Handbook. Handbooks of Linguistics and Communication Science, pp. 68–96. Berlin: Mouton de Gruyter. Link, PDF (preprint, with incorrect affiliation (should be Bochum))

  • Stefanie Dipper and Bettina Schrader (2008). Computing Distance and Relatedness of Medieval Text Variants from German. In: Angelika Storrer, Alexander Geyken, Alexander Siebert and Kay-Michael Würzner (eds.), Text Resources and Lexical Knowledge. Selected Papers from the 9th Conference on Natural Language Processing (KONVENS-08), pp. 39–51. Berlin: Mouton de Gruyter. PDF (preprint)

  • Julia Ritz, Stefanie Dipper and Michael Götze (2008). Annotation of Information Structure: An Evaluation Across Different Types of Texts. In: Proceedings of the 6th LREC-2008 Conference, pp. 2137–2142, Marrakech, Morocco. PDF

2007
  • Heike Bieler, Stefanie Dipper and Manfred Stede (2007). Identifying Formal and Functional Zones in Film Reviews. In: Simon Keizer, Harry Bunt and Tim Paek (eds.), Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, pp 75–78, Antwerp, Belgium. PDF

  • Stefanie Dipper, Michael Götze, Uwe Küssner and Manfred Stede (2007). Representing and Querying Standoff XML. In: Georg Rehm, Andreas Witt and Lothar Lemnitzer (eds.), Data structures for linguistic resources and applications. Proceedings of the Biennial GLDV Conference 2007, pp. 337–346. Tübingen: Narr. PDF (preprint)

  • Stefanie Dipper, Michael Götze and Stavros Skopeteas (eds.) (2007). Information Structure in Cross-Linguistic Corpora: Annotation Guidelines for Phonology, Morphology, Syntax, Semantics, and Information Structure. Volume 7 of Interdisciplinary Studies on Information Structure (ISIS), Working Papers of the SFB 632. Universitätsverlag Potsdam, Germany. PDF

  • Stefanie Dipper, Heike Zinsmeister and Miriam Butt (eds.) (2007). Hybrid methods in computational linguistics. Volume 26(2) of Zeitschrift für Sprachwissenschaft (Themenheft). Berlin, New York: de Gruyter. Link

  • Kepa Joseba Rodriguez, Stefanie Dipper, Michael Götze, Massimo Poesio, Giuseppe Riccardi, Christian Raymond and Joanna Rabiega-Wisniewska (2007). Standoff Coordination for Multi-Tool Annotation in a Dialogue Corpus. In: Proceedings of the ACL Linguistic Annotation Workshop, pp. 148–155, Prague, Czech Republic. PDF

2006
  • Stefanie Dipper, Erhard Hinrichs, Thomas Schmidt, Andreas Wagner and Andreas Witt (2006). Sustainability of Linguistic Resources. In: Erhard Hinrichs, Nancy Ide, Martha Palmer and James Pustejovsky (eds.), Proceedings of the LREC 2006 Workshop on Merging and Layering Linguistic Information, pp 48–54, Genoa, Italy. PDF

  • Stefanie Dipper and Manfred Stede (2006). Disambiguating potential connectives. In: Miriam Butt (ed.), Proceedings of the KONVENS-2006 Workshop on the Lexicon-Discourse Interface, pp. 167–173, Konstanz, Germany. PDF

  • Michael Götze and Stefanie Dipper (2006). ANNIS: Complex Multilevel Annotations in a Linguistic Database. In: Proceedings of the EACL Workshop on Multi-dimensional Markup in Natural Language Processing (NLPXML-2006), pp. 61–64, Triento, Italy. PDF

  • Manfred Stede, Heike Bieler and Stefanie Dipper (2006). Automatische Textzusammenfassung. wisu – das Wirtschaftsstudium 5/06:674–680.

  • Manfred Stede, Heike Bieler, Stefanie Dipper and Arthit Suriyawongkul (2006). SUMMaR: Combining Linguistics and Statistics for Text Summarization. In: Proceedings of the 17th European Conference on Artificial Intelligence (ECAI-06), pp. 827–828, Riva del Garda, Italy. PDF

  • Kai Wörner, Andreas Witt, Georg Rehm and Stefanie Dipper (2006). Modelling Linguistic Data Structures. In: Proceedings of the Extreme Markup Languages 2006, Montréal, Canada. PDF

2005
  • Stefanie Dipper (2005). German Quantifiers: Determiners or Adjectives? In: Miriam Butt and Tracy Holloway King (eds.), Proceedings of the LFG05 Conference, Bergen, Norway. CSLI Publications. PDF

  • Stefanie Dipper (2005). XML-based Stand-off Representation and Exploitation of Multi-Level Linguistic Annotation. In: Proceedings of Berliner XML Tage 2005 (BXML 2005), pp. 39–50, Berlin, Germany. PDF

  • Stefanie Dipper and Michael Götze (2005). Accessing Heterogeneous Linguistic Data – Generic XML-based Representation and Flexible Visualization. In: Proceedings of the 2nd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 206–210, Poznan, Poland. PDF

  • Stefanie Dipper, Michael Götze and Manfred Stede (eds.) (2005). Heterogeneity in Focus: Creating and Using Linguistic Databases. Volume 2 of Interdisciplinary Studies on Information Structure (ISIS), Working Papers of the SFB 632. Universitätsverlag Potsdam, Germany. PDF

2004
  • Sabine Brants, Stefanie Dipper, Peter Eisenberg, Silvia Hansen, Esther König, Wolfgang Lezius, Christian Rohrer, George Smith and Hans Uszkoreit (2004). TIGER: Linguistic Interpretation of a German Corpus. Research on Language and Computation, Special Issue, 2(4):597–620. (Revised and extended version of Brants et al. (2002)) Link

  • Stefanie Dipper (2004). Grammar Modularity and its Impact on Grammar Documentation. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING), pp. 1–7, Geneva, Switzerland. PDF

  • Stefanie Dipper, Lukas Faulstich, Ulf Leser and Anke Lüdeling (2004). Challenges in Modelling a Richly Annotated Diachronic corpus of German. In: Proceedings of the LREC Workshop on XML-based Richly Annotated Corpora, pp. 21–29, Lisbon, Portugal. PDF

  • Stefanie Dipper, Michael Götze and Stavros Skopeteas (2004). Towards User-Adaptive Annotation Guidelines. In: Proceedings of the COLING Workshop on Linguistically Interpreted Corpora LINC-2004, pp. 23–30, Geneva, Switzerland. PDF

  • Stefanie Dipper, Michael Götze and Manfred Stede (2004). Simple Annotation Tools for Complex Annotation Tasks: an Evaluation. In: Proceedings of the LREC Workshop on XML-based Richly Annotated Corpora, pp. 54–62, Lisbon, Portugal. PDF

  • Stefanie Dipper, Michael Götze, Manfred Stede and Tillmann Wegst (2004). ANNIS: A Linguistic Database For Exploring Information Structure. In: Interdisciplinary Studies on Information Structure (ISIS), Working Papers of the SFB 632, pp. 245–279. Universitätsverlag Potsdam, Germany. PDF

  • Tracy Holloway King, Stefanie Dipper, Anette Frank, Jonas Kuhn and John Maxwell (2004). Ambiguity management in grammar writing. Research on Language and Computation, Special Issue, 2(2):259–280. (Revised and extended version of King et al. (2000)) Link

2003
  • Stefanie Dipper (2003). Implementing and Documenting Large-Scale Grammars – German LFG. Doctoral Dissertation, IMS, University of Stuttgart. Arbeitspapiere des Instituts für Maschinelle Sprachverarbeitung (AIMS), Vol. 9(1). PDF Reduced PDF (2 pages on 1)

  • Stefanie Albert et al. (2003). TIGER Annotationsschema. Draft, Universities of Saarbrücken, Stuttgart and Potsdam. PDF

  • Ulrich Schade, Stefan Barattelli, Beate Lingnau, Kerstin Hadelich and Stefanie Dipper (2003). Relativsatzproduktion. Linguistische Berichte 193:33–55.

2002
  • Sabine Brants, Stefanie Dipper, Silvia Hansen, Wolfgang Lezius and George Smith (2002). The TIGER Treebank. In: Proceedings of the Workshop on Treebanks and Linguistic Theories, pp. 24–41, Sozopol, Bulgaria . PDF

  • Heike Zinsmeister, Jonas Kuhn and Stefanie Dipper (2002). Utilizing LFG Parses for Treebank Annotation. In: Miriam Butt and Tracy Holloway King (eds.), Proceedings of the LFG02 Conference, pp. 427–447, Athens, Greece. CSLI Publications. PDF

2000
  • Stefanie Dipper (2000). Grammar-based Corpus Annotation. In: Anne Abeillé, Thorsten Brants and Hans Uszkoreit (eds.), Proceedings of the COLING Workshop on Linguistically Interpreted Corpora LINC-2000, pp. 56–64, Luxembourg. PDF

  • Tracy Holloway King, Stefanie Dipper, Anette Frank, Jonas Kuhn and John Maxwell (2000). Ambiguity management in grammar writing. In: Erhard Hinrichs, Detmar Meurers and Shuly Wintner (eds.), Proceedings of the ESSLLI Workshop on Linguistic Theory and Grammar Implementation, pp. 5–19, Birmingham, UK. PDF

  • Wolfgang Lezius, Stefanie Dipper and Arne Fitschen (2000). IMSLex – Representing Morphological and Syntactical Information in a Relational Database. In: Ulrich Heid, Stefan Evert, Egbert Lehmann and Christian Rohrer (eds.), Proceedings of the 9th EURALEX International Congress, pp. 133–139, Stuttgart, Germany. PDF

1999
  • Norbert Bröker and Stefanie Dipper (1999). Zur Konstruktion von Lexika für die maschinelle syntaktische Analyse. In: Jost Gippert and Peter Olivier (eds.), Multilinguale Corpora – Codierung, Strukturierung, Analyse. 11. Jahrestagung der Gesellschaft für Linguistische Datenverarbeitung, pp. 159–168. Enigma corporation, Prag. PDF

  • Stefanie Dipper (1999). How a Grammar is Written – a Case Study in German Compound Nouns. = Section 2 from: Butt, Miriam, Stefanie Dipper, Anette Frank and Tracy Holloway King (1999). Writing Large-Scale Parallel Grammars for English, French and German. In: Miriam Butt and Tracy Holloway King (eds.), Proceedings of the LFG99 Conference, Manchester, UK. CSLI Publications. PDF

1998
  • Judith Berman, Stefanie Dipper, Christian Fortmann and Jonas Kuhn (1998). Argument Clauses and Correlative ‘es’ in German: Deriving Discourse Properties in a Unification Analysis. In: Miriam Butt and Tracy Holloway King (eds.), Proceedings of the LFG98 Conference, Brisbane, Australia. CSLI Publications. PDF
1997
  • Stefanie Dipper (1997). Zur Selektion von Fragesatz-Komplementen. Master thesis, University of Tübingen. Arbeitspapiere des SFB 340 (AIMS), Vol. 122. PDF (24MB)