Skip to main content


Dr.-Ing. Tanja Auge

I am a PostDoc at the chair for Data Engineering at the University of Regensburg. Since October 2022 I am focusing on reconstructability/reproducibility and plausibility in the field of data engineering. For this, I am cooperating with researches from different fields at the University of Regensburg as well as international colleges from the US, Germany or Austria.

Before joining University of Regensburg, I was a PhD student at the University of Rostock advised by Andraes Heuer at the Data Research Group (external link, opens in a new window). In my PhD thesis (external link, opens in a new window) I combined the Chase -- a family of algorithms for database transformation -- with data provenance and additional annotations to compute an anonymized (minimal) sub-database of an original (research) database for a given evaluation query. To ensure the reproducibility, replicability or plausibility of a query result, the evaluations performed on the original database must also be feasible on the reconstructed sub-database. For this purpose I used a version of Chase&Backchase, an extension of Chase.

Dr.-Ing. Auge, Tanja

Wissenschaftliche Mitarbeiterin

Research Interests

My interests include a variety of subjects, such as

  • provenance, in particular data provenance,
  • research data management,
  • Chase algorithm,
  • schema evolution and data updates, and.
  • data provenance for AI.

For me, access to data and its analysis in science and society is the cornerstone of free and sound research. The open, FAIR and privacy-compliant provision of this data is therefore one of our most important basic obligations. My goal in provenance research is to ensure the traceability of a (published) result throughout the entire data science life cycle back to its (possibly physical) source. Especially for machine-generated analysis results, it is particularly important to know where the data used comes from, as it is often (pre-)processed in several stages and may originate from unclear sources. Data from the internet or social media in particular often has a very short half-life and is then no longer traceable. Possible boundary conditions that must be taken into account here include the evolution of the databases at schema and data level, the database type and compliance with privacy aspects. Areas of application for this include research data management and various data engineering, data science or AI applications. With my research, I would like to contribute to making scientific, but also economic processes and evaluations more transparent and thus reduce or prevent errors in data processing as well as ambiguities in their interpretation, hallucinations (of LLMs) and the like. 

Projects

  • Integrated Provenance (Website)
  • Research Data Management (Website)
  • ProSA: Provenance Management using Schema mappings with Annotations (Website) 

Invited Talks

Teaching

Lectures and Seminars:

  • Exercise Datenbanken II (SS25)
  • Exercise Data Engineering (WS24/25)
  • Exercise Datenbanken (SS24)
  • Exercise Programmieren I (WS23/24)
  • Exercise Engineering (SS23)

Thesis (finished):

  • Schema-Evolution von Graphdatenbanken in ProSA (Bachelorarbeit)
  • Untersuchung und Systematisierung von Forschungsdatenmanagementsystemen (Masterarbeit)
  • Ontologien, Standards und Methode in der Welt der NFDIs (Bachelorarbeit)
  • Aufarbeitung von Standards und Methoden im Forschungsdatenmanagement (Bachelorarbeit)
  • Studie zur Anforderungsanalyse für ein eigenes uni-weites Forschungsdatenmanagementsystem (Bachelorarbeit)
  • Untersuchung und Systematisierung von Forschungsdatenmanagmentsystemen (Masterarbeit)

Open Topics:

Scientific Services

Commitees

University Self-Administration

  • Member in the faculty council, representative of the scientific staff, University of Regensburg, Faculty for Informatics and Data Science (since October 2023)
  • Representative for the equality of women in academia and the arts, University of Regensburg, Faculty for Informatics and Data Science (since October 2022)
  • Board member of the UR Data Hub

Curriculum Vitae

DurationInstitution
June 2023

PhD in Computer Science,

Institute for Computer Science
University of Rostock, Germany

February 2023Research Visit,
TU Wien, Austria
since October 2022Postdoc,
Faculty for Informatics and Data Science,
Data Engineering,
University of Regensburg, Germany
August 2022Research Visit,
School of Information Sciences,
University of Illinois at Urbana-Champaign, USA
2017-2022Research assistant,
Institute for Computer Science,
Database and Information Systems,
University of Rostock, Germany
2017-2018Research assistant,
Institute for Computer Science,
Practical Informatics,
University of Rostock, Germany
2017M.Sc. Computer Science,
University of Rostock, Germany 
2016M.Sc. Mathematics,
University of Rostock, Germany
2014B.Sc. Mathematics,
University of Hamburg, Germany

Publications (extraction)

To top