All signs indicate that we will create our own dataset. There is no dataset yet for our domain-task combination. (I’m almost certain about that by now.)

That’s a good thing because building a dataset allows us to venture into uncharted territory and possibly leave a big scientific mark in our field.

I hope to finalize the “dataset - make or take” question, which I originally planned to take this week, in the coming week… As mentioned, I am now even more certain than last week that we will “make” and not “take.”

We want to be sure before we go in this direction. Building a dataset would involve a significant commitment of our resources.

You can follow these updates: Substack Blog Telegram WhatsApp LinkedIn Medium Twitter Calendly

I usually don't do Ph.D. work from Commerzbank's office, but it was convenient that day. I usually don’t do Ph.D. work from Commerzbank’s office, but it was convenient that day.

What Happened Since Last Week?

I re-read a paper (FinBERT, by Yang, Uy, Huang from 2020) that I may cite in our research. To my delight, I noticed that since I last read this paper, I have understood much more.

It’s always good to see that one is making progress.

Apart from that, I set up everything to start a website that can serve us to advertise and inform about our planned new dataset. These benchmarking/dataset websites usually contain a leaderboard (with the scores each research team has received on the dataset with their algorithms) and instructions for downloading, using, and evaluating the dataset.

What Were the Biggest Obstacles?

All good. Great week!

Which Goals Did I Meet?

  1. Align the outlet (conference) with my supervisor. (I.e., ask him if he likes the conference and thinks it fits my research question.
  2. Prepare a website for advertising our dataset. (This goal was not in the previous update, it came up spontaneously.)
  3. Write instructions for using our dataset in the format of a mini-paper. (This goal was not in the previous update, it came up spontaneously.)

Which Goals Did I Miss?

  1. Decide on whether to prepare a dataset ourselves or take an existing dataset.

Was It a Good Week?

Yes.

I re-activated my YouTube Premium subscriptions and noticed that listening to music without interruptions is a significant productivity boost for me.

Short-Term Tasks for The Coming Week

  1. None. I'll wait and see what comes up in Tuesday's meeting with my supervisor. Now that I have meetings with my supervisor on Tuesdays, I have less certainty about my weekly to-dos on Sundays.

About “75-Step Journey Toward a Ph.D. in Natural Language Processing”

You will, from now on, witness my grind. Feel my blood, sweat, and tears.

With this series of articles, you become a real-life weekly witness of my dissertation progress, all in 75 steps. This has multiple purposes:

1) Forcing myself to keep moving through the power of public shame!

2) Helping other (prospective) Ph.D. students to stay motivated and to show that hard times are normal when going through this process.

3) Getting support from the community when I go through hard times.

Share this with your Ph.D. student friends: Substack Blog Telegram WhatsApp LinkedIn Medium Twitter Calendly.

Read More From the 75 Steps Toward a Ph.D. in NLP Series

2022-08-20: Update 1/75 - Kicking Off the Journey Toward a Ph.D. in NLP

2022-08-28: Update 2/75 - Literature Review

2022-09-04: Update 3/75 - Back on Track and Back to Vallendar

2022-09-10: Update 4/75 - Long Test Runtime; Retriever Works

2022-09-18: Update 5/75 - Jour Fixe Joy

2022-09-26: Update 6/75 - Reading Group

2022-10-02: Update 7/75 - Leaving the Phone at Home

2022-10-09: Update 8/75 - Finding a Conference

2022-10-16: Update 9/75 - Dataset - Make or Take

2022-10-30: Update 11/75 - NVIDIA DGX-2 and Swiss Cheese

2022-11-10: Update 12/75 - Three Days of Conference via Zoom

2022-11-24: Update 13/75 - Vacation and Roadmap for 2023

2022-11-30: Update 14/75 - Supervising B.Sc. and M.Sc. Theses

2022-12-14: Update 15/75 - A Rather Uneventful Week

2022-12-24: Update 16/75 - Year-End Cleanup Sprint

2023-01-01: Update 17/75 - New Year’s Resolutions

2023-07-20: Update 18-28/75 - A Long Gap and Two Papers Handed In!

2023-12-12: Update 29-50/75 - First On-Site Conference Visit and Increased Focus

2023-12-25: Update 51-52/75 - Merry Christmas!