Thanks for checking in for the second update - the first one with a weekly review.

If this is your first time reading an “NLP Journey in 75 Weekly Steps” update, welcome to being my real-time weekly witness on my journey toward getting a Ph.D.!

These weekly updates are always structured in the same way. So let’s continue.

You can follow these updates on: Substack Blog Telegram WhatsApp LinkedIn Medium Twitter Calendly

What Happened Since Last Week?

I participated in the Frankfurt Data Science Cocktail Night With DALL-E 2 this week. Made a couple of new connections there and talked to many old ones. I recommend the Frankfurt Data Science Meetup, make sure to check it out if and when you are in Frankfurt! The organizer is Eldar Rakhmatullaev, and this is their Meetup page: Frankfurt Data Science Meetup. I met Saurabh Chakravorty, Zheng Ma, Shivam Agrawal, Marcel Grimm, and Katharina Sturm there!

Also, I had a good conversation with Christian Strässle from ValueFocus AG, a Swiss asset manager that cooperates with the Chair of Data Science and Natural Language Processing. We will announce details of the company cooperation in the coming months.

What Were the Biggest Obstacles?

Unlike last week, there were no major distractions by my corporate work. Some minor private distractions (but worthy ones). All good.

Which Goals Did I Meet?

Goal 2 from last week: I’ve set up the GitLab documentation for the dissertation code. It’s working well, and I may make the page publicly available in the future.

Which Goals Did I Miss?

Goal 1 from last week: The appointment with my supervisors is still not scheduled. This is due to things outside my direct control, but not a big issue. I’ll carry this goal over to the next week.

Goal 3 from last week: End-to-end document processing for the Python code is still not working. I simply did not put enough time into this one.

Was It a Good Week?

Well, 2/3 goals were missed, so it was not a good week. The appointment goal is not fully in my control, and for the MVP goal, I think my schedule was too scattered to complete the MVP task.

I need a couple of hours of deep concentration to finish the MVP task. While I did have enough gross time allocated to Ph.D. tasks, I did not have a long, uninterrupted period of time when I could focus on this task.

So next week, I will lock myself in a room for a day without access to my phone and will not schedule any calls on that day.

Short-Term Tasks for The Coming Week

  1. Carried over from last week: Make an appointment with my supervisor to discuss my Ph.D. dissertation proposal.
  2. Carried over from last week: First successful end-to-end document processing with the Python module (MVP). (With >80% test coverage and all tests passing.)
  3. Literature review:
    1. Add the following retrieval-augmented generation (RAG) papers to my literature database: Guu et al., 2020 (REALM), Yogatama et al., 2021, Bordeaud et al., 2021, Izacard & Grave, 2020, Min et al., 2020
    2. Add this text-to-text framework paper to my literature database: Raffel et al., 2019
    3. Add Contriever paper (Izacard et al., 2022, unsupervised training & continuous dense embeddings) to my literature database.

About “75-Step Journey Toward a Ph.D. in Natural Language Processing”

You will, from now on, witness my grind. Feel my blood, sweat, and tears.

With this series of articles, you become a real-life weekly witness of my dissertation progress, all in 75 steps. This has multiple purposes:

1) Forcing myself to keep moving through the power of public shame!

2) Helping other (prospective) Ph.D. students to stay motivated and to show that hard times are normal when going through this process.

3) Getting support from the community when I go through hard times.

Share this with your Ph.D. student friends: Substack Blog Telegram WhatsApp LinkedIn Medium Twitter Calendly.

Read More From the 75 Steps Toward a Ph.D. in NLP Series

2022-08-20: Update 1/75 - Kicking Off the Journey Toward a Ph.D. in NLP

2022-09-04: Update 3/75 - Back on Track and Back to Vallendar

2022-09-10: Update 4/75 - Long Test Runtime; Retriever Works

2022-09-18: Update 5/75 - Jour Fixe Joy

2022-09-26: Update 6/75 - Reading Group

2022-10-02: Update 7/75 - Leaving the Phone at Home

2022-10-09: Update 8/75 - Finding a Conference

2022-10-16: Update 9/75 - Dataset - Make or Take

2022-10-23: Update 10/75 - Still Unsure About the Dataset

2022-10-30: Update 11/75 - NVIDIA DGX-2 and Swiss Cheese

2022-11-10: Update 12/75 - Three Days of Conference via Zoom

2022-11-24: Update 13/75 - Vacation and Roadmap for 2023

2022-11-30: Update 14/75 - Supervising B.Sc. and M.Sc. Theses

2022-12-14: Update 15/75 - A Rather Uneventful Week

2022-12-24: Update 16/75 - Year-End Cleanup Sprint

2023-01-01: Update 17/75 - New Year’s Resolutions

2023-07-20: Update 18-28/75 - A Long Gap and Two Papers Handed In!

2023-12-12: Update 29-50/75 - First On-Site Conference Visit and Increased Focus

2023-12-25: Update 51-52/75 - Merry Christmas!