Compare commits

No commits in common. "cb7c0351fb23d29b87eb205269f13b096d8b3835" and "4a04643bc7a14f4b0371267456cff754ffad444b" have entirely different histories.

149 changed files with 3529 additions and 4008 deletions

8
.gitignore vendored

@@ -1,8 +0,0 @@
-__pycache__
-bin
-include
-lib
-lib64
-share
-pyvenv.cfg
-.venv

@@ -1,27 +1,15 @@
 # COMP20008 2021 Semester 1 Assignment 1
 Student name: Rory Healy
 Student ID: 964275
 ## Description
 ### Part A: COVID-19 data analysis
 Using the Our World In Data (OWID) COVID-19 dataset (available [here](https://covid.ourworldindata.org/data/owid-covid-data.csv)), this part of the project deals with pre-processing, visualisation, and discussion of the dataset.
 ### Part B: Cricket news search engine
 Using the cricket dataset from the LMS, this part of the project deals with building up a basic (and advanced) search engine to search through the news articles to find keywords, and provides rankings.
-## Dependencies:
-Python version used: 3.9.4
-- pandas >= 1.2.4
-- matplotlib >= 3.4.1
-- numpy >= 1.20.2
-- regex >= 2021.4.4
-- scikit-learn >= 0.24.1
-- nltk >= 3.6.1
-- Must install punkt and stemwords through nltk.download()
+Dependencies:
+- pandas >= 1.2.2

Some files were not shown because too many files have changed in this diff.