No description
Find a file
2024-06-13 18:46:32 +10:00
data Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
docs Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
report Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
src Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
.gitignore Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
README.md Project restructure, fix path issues 2024-06-13 18:46:32 +10:00
requirements.txt Project restructure, fix path issues 2024-06-13 18:46:32 +10:00

COMP20008 2021 Semester 1 Assignment 1

Student name: Rory Healy

Student ID: 964275

Description

Part A: COVID-19 data analysis

Using the Our World In Data (OWID) COVID-19 dataset (available here), this part of the project deals with pre-processing, visualisation, and discussion of the dataset.

Part B: Cricket news search engine

Using the cricket dataset from the LMS, this part of the project deals with building up a basic (and advanced) search engine to search through the news articles to find keywords, and provides rankings.

Dependencies:

Python version used: 3.9.4

- pandas >= 1.2.4
- matplotlib >= 3.4.1
- numpy >= 1.20.2
- regex >= 2021.4.4
- scikit-learn >= 0.24.1
- nltk >= 3.6.1
  - Must install punkt and stemwords through nltk.download()