comp20008-project01/README.md

24 lines
846 B
Markdown
Raw Normal View History

2021-03-01 17:57:17 +11:00
# COMP20008 2021 Semester 1 Assignment 1
2021-04-09 22:44:53 +10:00
Student name: Rory Healy
2021-04-09 22:49:08 +10:00
2021-04-09 22:44:53 +10:00
Student ID: 964275
## Description
### Part A: COVID-19 data analysis
2021-04-09 22:48:25 +10:00
Using the Our World In Data (OWID) COVID-19 dataset (available [here](https://covid.ourworldindata.org/data/owid-covid-data.csv)), this part of the project deals with pre-processing, visualisation, and discussion of the dataset.
2021-04-09 22:44:53 +10:00
### Part B: Cricket news search engine
Using the cricket dataset from the LMS, this part of the project deals with building up a basic (and advanced) search engine to search through the news articles to find keywords, and provides rankings.
2021-04-12 17:31:44 +10:00
## Dependencies:
2021-04-09 22:49:08 +10:00
Python version used: 3.9.4
- pandas >= 1.2.4
- matplotlib >= 3.4.1
- numpy >= 1.20.2
2021-04-12 17:31:44 +10:00
- regex >= 2021.4.4
- scikit-learn >= 0.24.1
2021-04-12 17:31:44 +10:00
- nltk >= 3.6.1
- punkt model and Porter stemmer needed