Project restructure, fix path issues
parent a171eb1ca1
commit cb7c0351fb
142 changed files with 3883 additions and 3842 deletions
8
.gitignore
vendored
Normal file
@@ -0,0 +1,8 @@
__pycache__
bin
include
lib
lib64
share
pyvenv.cfg
.venv

@@ -1,13 +1,17 @@
# COMP20008 2021 Semester 1 Assignment 1

Student name: Rory Healy

Student ID: 964275

## Description

### Part A: COVID-19 data analysis

Using the Our World In Data (OWID) COVID-19 dataset (available [here](https://covid.ourworldindata.org/data/owid-covid-data.csv)), this part of the project deals with pre-processing, visualisation, and discussion of the dataset.

### Part B: Cricket news search engine

Using the cricket dataset from the LMS, this part of the project deals with building up a basic (and advanced) search engine to search through the news articles to find keywords, and provides rankings.

## Dependencies:

@@ -20,4 +24,4 @@ Python version used: 3.9.4
- regex >= 2021.4.4
- scikit-learn >= 0.24.1
- nltk >= 3.6.1
- punkt model and Porter stemmer needed
- Must install punkt and stemwords through nltk.download()

Some files were not shown because too many files have changed in this diff.