Zu Hauptinhalt springen

Dr.-Ing. Tanja Auge

I am a PostDoc at the chair for?Data Engineering?at the University of Regensburg. Since October 2022 I am focusing on reconstructability/reproducibility and plausibility in the field of data engineering. For this, I am cooperating with researches from different fields at the University of Regensburg as well as international colleges from the US, Germany or Austria.

Before joining University of Regensburg, I was a PhD student at the University of Rostock advised by Andraes Heuer at the Data Research Group. My PhD thesis is available?here. Prior to that, I studied Computer Science (M.Sc.) and Mathematics (M.Sc. and B.Sc.) at the University of Hamburg and the University of Rostock.

In my PhD thesis I combined the Chase -- a family of algorithms for database transformation -- with data provenance and additional annotations to compute an anonymized (minimal) sub-database of an original (research) database for a given evaluation query. To ensure the reproducibility, replicability or plausibility of a query result, the evaluations performed on the original database must also be feasible on the reconstructed sub-database. For this purpose I used a version of Chase&Backchase, an extension of Chase.


Projects

  • Research Data Management (Website)
  • ProSA: Provenance Management using Schema mappings with Annotations (Website)?


Research interests

My interests include a variety of subjects, such as

  • provenance, in particular data provenance,
  • research data management,
  • Chase algorithm,
  • schema evolution and data updates, and.
  • data provenance for AI.

For me, access to data and its analysis in science and society is the cornerstone of free and sound research. The open, FAIR and privacy-compliant provision of this data is therefore one of our most important basic obligations. My goal in provenance research is to ensure the traceability of a (published) result throughout the entire data science life cycle back to its (possibly physical) source. Especially for machine-generated analysis results, it is particularly important to know where the data used comes from, as it is often (pre-)processed in several stages and may originate from unclear sources. Data from the internet or social media in particular often has a very short half-life and is then no longer traceable. Possible boundary conditions that must be taken into account here include the evolution of the databases at schema and data level, the database type and compliance with privacy aspects. Areas of application for this include research data management and various data engineering, data science or AI applications. With my research, I would like to contribute to making scientific, but also economic processes and evaluations more transparent and thus reduce or prevent errors in data processing as well as ambiguities in their interpretation, hallucinations (of LLMs) and the like.?


Publications (extraction)

  • L. Waltersdorfer, D. Hausler, T. Auge:
    Provenance Question-based Selection for AI Transparency and Accountable AI Governance.
    (Accepted for AIGOV@AAAI 2025)
  • T. Auge, S. Genehr, M. Klettke, F.?Krüger, M.?Schr?der:
    Towards dimensions and granularity in a unified workflow and data provenance framework.
    (Presented at LWDA 2024)
  • T. Auge, L. Waltersdorfer, E.?Michels, S. Feistel, S.?Jürgensmann, F. J. Ekaputra, M. Klettke:
    Towards an Integrated Provenance Framework -?A Scenario for Marine Data.?
    TaPP@EuroS&P Workshops, 2024 (DOI)
  • T. Auge, F. J. Ekaputra, S. Feistel, S. Jürgensmann, M. Klettke, L. Waltersdorfer:
    Challenges of Tracking Provenance in Marine Data.?
    IMDIS, 2024 (pdf)
  • S. Diemt, T. Auge:
    Anforderungsanalyse für ein universit?tsweites nutzerorientiertes Forschungsdatenmanagementsystem basierend auf einer Nutzerumfrage. Softwaretechnik-Trends 44(1), 2024 (pdf)
  • M. L. M?ller, D. Hausler, S. Strasser, T. Auge and M. Klettke:?
    Heterogeneity in NoSQL Databases: Challenges of Handling schema-less Data.?
    LWDA, 2023 (pdf)
  • T. Auge, G. Bali, M.?Klettke, B. Lud?scher, W. S?ldner, S. Weish?upl, T. Wettig:
    Provenance for Lattice QCD workflows.
    TaPP@WWW, 2023 (DOI)

  • T. Auge:
    ProSA - A provenance system for reproducing query results.
    TaPP@WWW, 2023 (DOI)


  • T. Auge:
    Provenance Management unter Verwendung von Schemaabbildungen mit Annotationen.
    PhD Thesis,?University of Rostock, 2023 (pdf)

  • T. Auge, A. Heuer:?
    Tracing the History of the Baltic Sea Oxygen Level?Evolution and Provenance for Research Data Management.?
    BTW, 2021 (DOI)

  • T. Auge:?
    Extended Provenance Management for Data Science Applications.?
    PhD@VLDB, 2020 (pdf)

For further publications see dblp or the list below.


Talks

  • ProSA Pipeline — Provenance conquers the CHASE.?University of Illinois at Urbana-Champaign, School of Information Sciences, 2022

  • Schema Evolution in Research Data. Spring Symposium Databaeses,?2024


Full list of publications

Articles and Workshop papers:

  • Tanja Auge, Sascha Genehr, Meike Klettke, Frank Krüger, Max Sch?rder:
    Towards dimensions and granularity in a unified workflow and data provenance framework.
    (Presented at LWDA 2024)

  • Tanja Auge, Laura Waltersdorfer, Emil Michels, Susanne?Feistel, Susanne Jürgensmann, Fajar J. Ekaputra, Meike Klette:
    Towards an Integrated Provenance Framework - A Scenario for Marine Data. TaPP at EuroS&P Workshops, 2024 (DOI)

  • Tanja Auge, Fajar J. Ekaputra, Susanne Feistel, Susanne Jürgensmann, Meike Klettke, Laura Walterdorfer:
    Challenges of Tracking Provenance in Marine Data.
    IMDIS, 2024 (pdf)

  • Mark Lukas M?ller, Dominique Hausler, Sebastian Strasser, Tanja Auge and Meike Klettke:?
    Heterogeneity in NoSQL Databases: Challenges of Handling schema-less Data.
    LWDA 2023 (pdf)

  • Tanja Auge:?
    ProSA -?A provenance system for reproducing query results.
    TaPP@WWW, 2023 (DOI)

  • Tanja Auge, Gunnar Bali, Meike Klettke, Bertram?Lud?scher, Wolfgang S?ldner, Simon Weish?upl, Tilo Wettig:?
    Provenance for Lattice QCD workflows.?
    TaPP@WWW,?2023 (DOI)

  • Tanja Auge,?Moritz Hanzig,?Andreas Heuer:?
    ProSA Pipeline: Provenance Conquers the Chase.?
    ADBIS, 2022 (DOI)

  • Tanja Auge,?Andreas Heuer:?
    Enhanced Inversion of Schema Evolution with Provenance.?
    CoRR?abs/2211.13810, 2022?(pdf)

  • Tanja Auge, Nic?Scharlau, Andreas?G?rres, Jakob, Zimmer, Andreas?Heuer:
    ChaTEAU -?A Universal Toolkit for Applying the?Chase.?
    CoRR?abs/2206.01643, 2022 (pdf)

  • Tanja Auge, Nic Scharlau, Andreas?Heuer:?
    Provenance and Privacy in ProSA -?A Guided Interview on Privacy-Aware Provenance.?
    DEXA Workshops, 2021. (DOI)

  • Tanja Auge, Nic?Scharlau, Andreas?Heuer:?
    Privacy Aspects of Provenance Queries.?
    IPAW, 2021 (DOI)

  • Tanja Auge, Andreas?Heuer:?
    Tracing the History of the Baltic Sea Oxygen Level?Evolution and Provenance for Research Data Management.?
    BTW, 2021 (DOI)

  • Tanja Auge, Erik Manthey, Susanne?Jürgensmann, Susanne Feistel, Andreas?Heuer:?
    Schema Evolution and Reproducibility of Long-term Hydrographic Data Sets at the IOW.?
    LWDA, 2020 (pdf)

  • Tanja Auge:
    Extended Provenance Management for Data Science Applications.
    PhD@VLDB, 2020?(pdf)

  • Tanja Auge, Andreas Heuer:
    ProSA - Using the CHASE for Provenance Management.
    ADBIS, 2019 (DOI)

  • Tanja Auge, Andreas?Heuer:?
    The Theory behind Minimizing Research Data -?Result equivalent CHASE-inverse Mappings.?
    LWDA, 2018 (pdf)

  • Tanja Auge, Andreas?Heuer:?
    Combining Provenance Management and Schema Evolution, Provenance and Annotation of Data and Processes.
    IPAW,?2018 (DOI)

  • Tanja Auge, Andreas Heuer:?
    Inverse im Forschungsdatenmanagement - Eine Kombination aus Provenance Management, Schema- und Daten-Evolution.?
    Grundlagen von Datenbanken, 2018 (pdf)

  • Robin Nicolay,?Nikolaj Troels Graf von Malotky,?Tanja Auge,?Alke Martens:
    Autonomous Semantic Structuring of Lecture Topics -?Synthesis of Knowledge Models.?
    CSEDU,?2017 (DOI)

Thesis:

  • Tanja Auge:
    Provenance Management unter Verwendung von Schemaabbildungen mit Annotationen.
    PhD Thesis,?University of Rostock. 2023 (pdf)
  • Tanja Auge:?
    Umsetzung von Provenance-Anfragen in Big-Data-Analytics-Umgebungen.
    Master Thesis, University of Rostock, 2017 (pdf)

Scientific Services

Commitees

  • LWDA 2023 (Track Chair FGDB)

University self-administration

  • Member in the faculty council, representative of the scientific staff, University of Regensburg, Faculty for Informatics and Data Science

  • Representative for the equality of women in academia and the arts, University of

    Regensburg, Faculty for Informatics and Data Science

  • Board member of the UR Data Hub


Teaching

Lectures and Seminars:

Ongoing:

  • ?bungen zu Data Engineering (WS24/25)

Past:

  • ?bungen zu Datenbanken (SS24)

  • ?bungen zu Programmieren I (WS23/24)

  • Seminar Data Engineering (SS23)


Thesis:

Work in Progress:

  • Schema-Evolution von Graphdatenbanken in ProSA (Bachelorarbeit)
  • Untersuchung und Systematisierung von Forschungsdatenmanagementsystemen (Masterarbeit)

  • Ontologien, Standards und Methode in der Welt der NFDIs (Bachelorarbeit)

OPEN TOPICS:

PAST:

  • Aufarbeitung von Standards und Methoden im Forschungsdatenmanagement (Bachelorarbeit)
  • Studie zur Anforderungsanalyse für ein eigenes uni-weites Forschungsdatenmanagementsystem (Bachelorarbeit)
  • Untersuchung und Systematisierung von Forschungsdatenmanagmentsystemen (Masterarbeit)


Curriculum vitae

Scientific career

June 2023 PhD in Computer Science,
Universiy of Rostock, Germany
February 2023 Research Visit at TU Wien, Austria
since October 2022

Postdoc,
Faculty for Informatics and Data Science,
Data Engineering,
University of Regensburg, Germany

August 2022

Research Visit at?School of Information Sciences,
University of Illinois at Urbana-Champaign,?USA

2017-2022

PhD student,
Institute for?Computer Science,
Database and Information Systems,
University of Rostock, Germany

2017-2018

Research assistant,
Institute for Computer Science,
Practical Informatics,
University of Rostock, Germany

2017

Graduate student of Computer Science,
University of Rostock, Germany?

2016

Graduate student of Mathematics,
University of Rostock, Germany

2014

Undergraduate student of Mathematics,
University of Hamburg, Germany



  1. Fakult?t für Informatik und Data Science

Lehrstuhl Data Engineering

Dr.-Ing. Tanja Auge


Telefon: 0941 943-68616

E-Mail: tanja.auge@ur.de

Raum 636