Registration for this course is open until Friday, 01.11.2024 23:59.

News

Currently, no news are available

Let’s Role Play in the Deep: Do we really need overparameterization in deep learning?

It's time to connect to your inner kid in this fun and engaging seminar format (https://colinraffel.com/blog/role-playing-seminar.md.html): Several students read the same paper, but each student takes on a specific role, which defines the lens through which they contribute to the discussion. Students cycle through roles throughout the course of this seminar on overparameterization in deep learning.

 

Content: Do we really need overparameterization in deep learning?

Deep learning continues to impress us with breakthroughs across disciplines and is a major driving force behind a multitude of industry innovations like ChatGPT. Most of its successes are achieved by increasingly large neural networks that are trained on massive data sets and still achieve zero training loss. This recent trend to overparameterize neural networks defies classic concepts of statistical learning theory that suggest to avoid overfitting by reducing the number of trainable parameters. We will look into recent explanations of this puzzling phenomenon, discuss related insights, and challenge the modern belief that scaling up neural networks is always the best way to move forward. Are the simplest models always the best choice? And is counting parameters really the best way to measure model complexity? Please join the seminar if you enjoy thinking about this kind of questions.

 

Important Dates

* Kick-off meeting in the first week of the semester (to be held online, via zoom).
* Two reviews (and 3 questions) must be submitted during the semester.
* The presentations will be either organized in a block format during the semester break or as a bi-weekly meeting during the semester. Participation is mandatory.

 

Deliverables

* Short reviews from the perspective of your respective role and regular presentations of your findings: (40% of your final grade): Write a short review (max 1 page) on one of the papers (not the one that you are presenting) that addresses the following questions:
    1. What is the problem addressed by the paper?
    2. What was done before, and how does the paper improve on previous work?
    3. What are the strengths and the limitations of the techniques in the paper
    4. What part of the paper was difficult to understand?
    5. What are possible improvements or extensions of the techniques in the paper?
* In addition to your review you will have to submit 3 questions that you will ask during the discussion (ideally in the same document).

* Discussion: (20%). How actively you engage in discussions during the seminar will also influence your grade.
* Seminar Paper: (20%) You will write a seminar paper on a paper that you have studied from the perspective of an author. It must not be longer than 6 pages, not counting references and appendices. Note that appendices are not meant to provide information that is absolutely necessary to understand the paper, but rather to provide auxiliary material. Papers can be shorter, but in general the provided page limit is a good indicator of how long a paper should be.

Privacy Policy | Legal Notice
If you encounter technical problems, please contact the administrators.