Differential Privacy in the Era of Foundation Models
Abstract:
In recent years, foundation models such as GPT, LLaMA, DALL-E, or Stable Diffusion have transformed the field of machine learning, particularly in large-scale tasks like natural language processing and computer vision. These models, trained on vast datasets, are capable of transferring their learned knowledge to a wide range of applications, making them incredibly powerful and versatile. However, training on such vast datasets also raises significant privacy concerns when sensitive data is involved.
This seminar will explore how differential privacy (DP), the leading standard for privacy protection, can be applied to foundation models to mitigate these risks. DP ensures that the presence or absence of any individual data point in a model's training data has only a bounded effect on the model's predictions, providing a safeguard for privacy even in the most data-intensive models. We will dive into the fundamentals of both DP and foundation models, study how they intersect, and explore strategies for integrating privacy guarantees into these cutting-edge systems. Key topics will include the theory behind DP, practical privacy-preserving mechanisms, and case studies of DP implementations in advanced foundation models.
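For reference, the intuition above is formalized by the standard (ε, δ)-DP definition, which Topic 2 covers in depth. A minimal statement (following Dwork et al.) is:

```latex
% (epsilon, delta)-differential privacy:
% A randomized mechanism M satisfies (epsilon, delta)-DP if, for every
% pair of datasets D and D' differing in a single record, and for every
% set S of possible outputs,
\[
  \Pr[M(D) \in S] \;\le\; e^{\varepsilon}\,\Pr[M(D') \in S] + \delta ,
\]
% where epsilon bounds the privacy loss and delta is a small failure
% probability; smaller values of both mean stronger privacy.
```

Intuitively, anyone observing the mechanism's output cannot confidently tell whether any single individual's record was part of the training data.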
Learning Objective:
There are two main learning objectives of this course.
1) Learning the foundations of Differential Privacy and Foundation Models: what they are, how they play together, and how we can leverage them to achieve privacy preservation in machine learning.
2) Getting a glimpse into how to be a successful researcher. As part of research, you have to read papers, understand what they are about, and be able to apply their ideas, ideally to your own research. Additionally, you will learn how to give a good (research) presentation, how to identify, ask, and answer the relevant questions, and how to do scientific writing.
Time:
The seminar will take place on Wednesdays, 4:05 PM to 6:00 PM, in the CISPA building (Stuhlsatzenhaus 5, 66123 Saarbrücken). Please make sure to be on time; we start at 4:05 PM sharp.
Rooms, Dates, and Topics:
15.10.2025 (Room 0.07): Introduction: Presentation of Seminar Topics, and How to Give a Presentation
22.10.2025 (Room 0.02): Topic 1: Introduction to Foundation Models & The Pre-train/Adapt Paradigm
29.10.2025 (Room 0.02): Topic 2: Introduction to Differential Privacy
5.11.2025 (Room 0.02): Topic 3: Privacy Risks in Foundation Models (Data Extraction)
12.11.2025 (Room 0.02): Topic 4: Privacy Risks in Foundation Models (Membership Inference)
19.11.2025 (Room 0.02): Topic 5: Memorization in Foundation Models
26.11.2025 (Room 0.02): Topic 6: Privately Pre-Training Diffusion Models
7.1.2026 (Room 0.02): Topic 7: Privately Training Large Language Models
14.1.2026 (Room 0.02): Topic 8: Other Private Language Model Adaptations
21.1.2026 (Room 0.02): Topic 9: Differential Privacy Auditing
28.1.2026 (Room 0.02): Topic 10: Unlearning
4.2.2026 (Room 0.02): Topic 11: Problems and Open Research Directions in Privacy-Preserving Machine Learning for Foundation Models
4.2.2026 (Room 0.02): Topic 12: Technical and Societal Impact of Foundation Model Privacy
20.2.2026: Report Due
Papers:
Topic 1: Introduction to Foundation Models & The Pre-train/Adapt Paradigm
Muhammad Shayan: Diffusion Models: Denoising Diffusion Probabilistic Models (https://arxiv.org/abs/2006.11239)
Ivo: Large Language Models: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (https://arxiv.org/abs/1810.04805)
Summary of the course in G-Doc: Nuren, Melih
Topic 2: Introduction to Differential Privacy
Harsha: Differential Privacy: Differential privacy (https://www.comp.nus.edu.sg/~tankl/cs5322/readings/dwork.pdf)
Maitri: Differential Privacy in Machine Learning: Deep learning with differential privacy (https://arxiv.org/abs/1607.00133)
Summary of the course in G-Doc:
Topic 3: Privacy Risks in Foundation Models (Data Extraction)
Geronimo: Data extraction from Language Models: Extracting training data from large language models (https://www.usenix.org/system/files/sec21-carlini-extracting.pdf)
Michel: Data extraction from Diffusion Models: Extracting training data from diffusion models (https://www.usenix.org/system/files/usenixsecurity23-carlini.pdf)
Summary of the course in G-Doc:
Topic 4: Privacy Risks in Foundation Models (Membership Inference)
Teodora: Membership inference against LLMs and shortcomings: Do membership inference attacks work on large language models? (https://arxiv.org/pdf/2402.07841). Everyone should also briefly read: https://arxiv.org/pdf/1610.05820
Ved Rahul: Stronger membership inference attacks: Strong Membership Inference Attacks on Massive Datasets and (Moderately) Large Language Models (https://arxiv.org/pdf/2505.18773)
Summary of the course in G-Doc:
Topic 5: Memorization in Foundation Models
Ivan: Memorization Scaling Laws: How much do language models memorize? (https://arxiv.org/pdf/2505.24832)
Sayali: Predicting Memorization: Emergent and predictable memorization in large language models (https://proceedings.neurips.cc/paper_files/paper/2023/file/59404fb89d6194641c69ae99ecdf8f6d-Paper-Conference.pdf)
Summary of the course in G-Doc:
Topic 6: Privately Pre-Training Diffusion Models
XX: Pre-training using semantics: PrivImage: Differentially private synthetic image generation using diffusion models with semantic-aware pretraining (https://www.usenix.org/system/files/usenixsecurity24-li-kecen.pdf)
Lavanya Ratna Sirisha: Pre-training using advanced noise composition: dp-promise: Differentially private diffusion probabilistic models for image synthesis (https://www.usenix.org/system/files/sec24fall-prepub-1157-wang-haichen.pdf)
Summary of the course in G-Doc:
Topic 7: Privately Training Large Language Models
Adarsh Kumar Reddy: Private Pretraining: Large-scale differentially private BERT (https://arxiv.org/pdf/2108.01624)
XX: Private Fine-Tuning: Large language models can be strong differentially private learners (https://arxiv.org/pdf/2110.05679)
Summary of the course in G-Doc:
Topic 8: Other Private Language Model Adaptations
Nuren: Private Low Rank Training: Differentially private fine-tuning of language models (https://arxiv.org/pdf/2110.06500)
Rithvika: Private Prompting: Flocks of stochastic parrots: Differentially private prompt learning for large language models (https://proceedings.neurips.cc/paper_files/paper/2023/file/f26119b4ffe38c24d97e4c49d334b99e-Paper-Conference.pdf)
Summary of the course in G-Doc:
Topic 9: Differential Privacy Auditing
Vishnuvasan: Efficient Audits: Privacy auditing with one (1) training run (https://proceedings.neurips.cc/paper_files/paper/2023/file/9a6f6e0d6781d1cb8689192408946d73-Paper-Conference.pdf)
XX: Bayesian Estimation: Bayesian Estimation of Differential Privacy (https://proceedings.mlr.press/v202/zanella-beguelin23a/zanella-beguelin23a.pdf)
Summary of the course in G-Doc:
Topic 10: Unlearning
CATHERIN: Unlearning in LLMs: Rethinking machine unlearning for large language models (https://arxiv.org/pdf/2402.08787)
Nima: Evaluating Unlearning: Inexact unlearning needs more careful evaluations to avoid a false sense of privacy (https://arxiv.org/pdf/2403.01218)
Summary of the course in G-Doc:
Topic 11: Problems and Open Research Directions in Privacy-Preserving Machine Learning for Foundation Models
Yashashri: Problems in private LLMs: What does it mean for a language model to preserve privacy? (https://dl.acm.org/doi/pdf/10.1145/3531146.3534642)
OMKAR RAJEEV: Position on privacy in the pretrain-adapt paradigm: Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining (https://openreview.net/pdf?id=ncjhi4qAPV)
Summary of the course in G-Doc:
Topic 12: Technical and Societal Impact of Foundation Model Privacy
Kashan: Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice (https://arxiv.org/pdf/2412.06966)
XX: Extracting memorized pieces of (copyrighted) books from open-weight language models (https://arxiv.org/pdf/2505.12546)
Summary of the course in G-Doc:
Peer Groups:
The two students who present on the same day form the peer group for that day.
Questions:
Questions can be posed here: https://docs.google.com/document/d/1JoY78ReTzjzcNYWgxvrDQpB3WSd2bF0VwmGF6H4-VQ0/edit?usp=sharing
All presentations should be uploaded here: https://drive.google.com/drive/folders/1yWcWRWdICfMCOCLhzO4-HML0IEHdB1fq?usp=sharing
Topic signup sheet (sign up by 22.10., 8 PM): https://docs.google.com/spreadsheets/d/1nwtRpc1EwCkqtdcDM79RNBVl0LOJ20SQTH4TdE8XhQ4/edit?gid=0#gid=0
Requirements:
This seminar is open to senior Bachelor's, Master's, and doctoral students. Ideally, students should have a solid mathematical background from the base lectures and a strong interest in deep learning.
TL;DR What you need to do:
As a participant:
- Read the two papers every week.
- Post your questions in the Google Doc by Monday to give the speaker enough time to prepare.
- Attend at least n-1 of the n seminar sessions (i.e., you may miss at most one).
- If you cannot attend, you need to write a summary (one page, i.e., half a page per paper) of the papers you miss. Things build on each other, so it is important to catch up. Email this summary to boenisch@cispa.de.
- At the end of the semester, write your report and hand it in on time through CMS.
As a speaker:
- Read your paper extra carefully and make sure you know all the concepts you talk about.
- Prepare your presentation (30 min).
- Review the questions and, if possible, integrate them into your presentation.
- Meet with your peer (the person who presents on the same day) and send the filled-in peer feedback form to boenisch@cispa.de by one week before your presentation (i.e., the Wednesday before).
Administration and LSF Registration:
Registration in LSF is required if you want to receive credit. As the instructor, I do not have access to the system myself, so questions need to be directed to the administration: studium@cs.uni-saarland.de. A guide for registration can be found here: https://saarland-informatics-campus.de/studium-studies/#lehrveranstaltungen-ansprechpartnerInnen
