Compare commits

..

No commits in common. "cb7c0351fb23d29b87eb205269f13b096d8b3835" and "4a04643bc7a14f4b0371267456cff754ffad444b" have entirely different histories.

149 changed files with 3529 additions and 4008 deletions

8
.gitignore vendored
View file

@ -1,8 +0,0 @@
__pycache__
bin
include
lib
lib64
share
pyvenv.cfg
.venv

View file

@ -1,27 +1,15 @@
# COMP20008 2021 Semester 1 Assignment 1
Student name: Rory Healy
Student ID: 964275
## Description
### Part A: COVID-19 data analysis
Using the Our World In Data (OWID) COVID-19 dataset (available [here](https://covid.ourworldindata.org/data/owid-covid-data.csv)), this part of the project deals with pre-processing, visualisation, and discussion of the dataset.
### Part B: Cricket news search engine
Using the cricket dataset from the LMS, this part of the project deals with building up a basic (and advanced) search engine to search through the news articles to find keywords, and provides rankings.
## Dependencies:
Dependencies:
Python version used: 3.9.4
- pandas >= 1.2.4
- matplotlib >= 3.4.1
- numpy >= 1.20.2
- regex >= 2021.4.4
- scikit-learn >= 0.24.1
- nltk >= 3.6.1
- Must install punkt and stemwords through nltk.download()
- pandas >= 1.2.2

Some files were not shown because too many files have changed in this diff Show more