Exercise 2 - NLP Text Preprocessing

Laboratory ExerciseITC C508Prelims
Submitted:August 06, 2025 11:36:00 PM

Entries

File Entry

Exer2_Sarmiento_CharlesAaron.ipynb

File Entry

Exer2_SARMIENTO_CharlesAaron.doc

File Entry

Phishing_Email.csv

Learning Reflections

In this activity, I learned the steps of preprocessing, from how it works and how it affects the textual data, to how important it is to ensuring the quality of said data. That said, going into the activity, I had the thought that text preprocessing is a "one size fits all" method, thinking that I can just apply preprocessing steps known to a dataset and it will result in the highest quality of data possible. After going through the activity, it enlightened me to the fact that these preprocessing steps should be applied with proper justification to it.

This activity also made me relearn how to write proper technical papers. While my output is not the highest quality that I can produce, I aim to utilize any feedback I can receive from these activities to further enhance the papers that I might write and the machine learning models that I create in the future.