diff --git a/labs/l1/NLP-L1.ipynb b/labs/l1/NLP-L1.ipynb
index b3c38b8ca883bdb3aa3506130676c7d38822328b..498416e09ed7e8a4831994271400b415dfb7634d 100644
--- a/labs/l1/NLP-L1.ipynb
+++ b/labs/l1/NLP-L1.ipynb
@@ -162,6 +162,15 @@
     "To test your code, print the sizes of the vocabularies constructed from the two datasets, as well as the count totals. The correct vocabulary size for the minimal dataset is 3,231; for the full dataset, the correct vocabulary size is 73,339. The correct totals are 155,818 for the minimal dataset and 17,297,355 for the full dataset."
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# TODO: Test your code here."
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
@@ -229,6 +238,15 @@
     "Test your code by comparing the total number of tokens in the preprocessed version of each dataset with the corresponding number for the original data. The former should be ca. 59% of the latter for the minimal dataset, and ca. 69% for the full dataset. The exact percentage will vary slightly because of the randomness in the sampling. You may want to repeat your computation several times."
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# TODO: Test your code here."
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
@@ -341,6 +359,15 @@
     "To test your code, compare the total number of positive samples (across all batches) to the total number of tokens in the (un-preprocessed) minimal dataset. The ratio between these two values should be ca. 2.64. If you can spare the time, you can make the same comparison on the full dataset; here, the expected ratio is 3.25. As before, the numbers may vary slightly because of randomness, so you may want to run the comparison more than once."
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# TODO: Test your code here."
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
@@ -400,6 +427,15 @@
     "Test your code by creating an instance of the model, and check that `forward` returns the expected result on random input tensors *w* and *c*. To help you, the following function will return a random example from the first 100 examples produced by `training_examples`."
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# TODO: Test your code here."
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": null,
@@ -495,6 +531,15 @@
     "Training on the full dataset will take some time – on a CPU, you should expect 10–40 minutes per epoch, depending on hardware. To give you some guidance: The total number of positive examples is approximately 58M, and the batch size is chosen so that each batch contains roughly 10% of these examples. To speed things up, you can train using a GPU; our reference implementation runs in less than 2 minutes per epoch on [Colab](http://colab.research.google.com)."
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# TODO: Train your model on the full dataset here."
+   ]
+  },
   {
    "cell_type": "markdown",
    "metadata": {},
@@ -560,7 +605,7 @@
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "Python 3 (ipykernel)",
+   "display_name": "Python 3",
    "language": "python",
    "name": "python3"
   },
@@ -574,7 +619,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.6.8"
   }
  },
  "nbformat": 4,
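
As a rough illustration of what the first "Test your code" cell asks for, a minimal sketch follows. It assumes the preceding exercise produces a token-to-count dictionary; the names `make_vocab`, `minimal_dataset`, and `full_dataset` are hypothetical stand-ins for whatever the lab actually defines:

    # Hypothetical test cell: `make_vocab`, `minimal_dataset`, and
    # `full_dataset` are placeholders for the names defined earlier in the lab.
    for name, data in [('minimal', minimal_dataset), ('full', full_dataset)]:
        counts = make_vocab(data)  # maps each token to its occurrence count
        print(f'{name}: vocabulary size = {len(counts):,}, '
              f'total count = {sum(counts.values()):,}')
    # Expected (per the lab text): 3,231 / 155,818 for the minimal dataset
    # and 73,339 / 17,297,355 for the full dataset.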
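
In the same spirit, the positive-sample ratio check might look as follows. This sketch assumes the dataset is a list of token lists and that `training_examples` (the generator mentioned in the lab) yields batches as pairs of equal-length tensors *w* and *c*; both assumptions may need adapting to the lab's actual interfaces:

    # Hypothetical ratio check. Assumes `minimal_dataset` is a list of token
    # lists and that `training_examples` yields (w, c) batch pairs, where
    # len(w) equals the number of positive samples in the batch.
    n_tokens = sum(len(sentence) for sentence in minimal_dataset)
    n_positive = sum(len(w) for w, c in training_examples(minimal_dataset))
    print(f'positive samples / tokens = {n_positive / n_tokens:.2f}')
    # The lab text expects a ratio of ca. 2.64 on the minimal dataset.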