Explore projects
-
Led development of an AWS-hosted spam detection system. Utilized Hive, Pig, PySpark on EMR for ETL. Processed email data stored in S3. Implemented TF-IDF for accurate classification. SQL in Hive for data manipulation and tracking top spam/ham accounts. Integrated bag of words for spam keyword identification. Utilized PySpark to compute TF-IDF scores. Stored results in S3 for analysis. Demonstrated proficiency in cloud services, data mining techniques, and programming languages for effective spam detection.
Updated -
Updated
-
Updated
-
Terminal Text Editor for Novice Programmers (TTENS)
Updated -
Updated
-
Updated
-
Updated