UPDATE-03

In this update, I’ll describe the dataset I’ll be using and define each of its variables.

Total items: The dataset has 23,204 items in total.

Data types: The dataset contains both string and numeric (integer and float) values.

Columns: There are twelve columns in the dataset:

NAME: The staff members’ names.
DEPARTMENT_NAME: The names of the departments in which the staff members work.
TITLE: The staff members’ job titles.
REGULAR: Regular pay.
RETRO: Retroactive pay.
OTHER: Other forms of pay.
OVERTIME: Overtime pay.
HURT: Pay related to injuries sustained.
DETAIL: Extra information regarding pay.
QUINN_EDUCATION: Remuneration associated with education.
TOTAL_GROSS: The total amount of gross pay.
POSTAL: Postal codes.
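
As a rough sketch of how the item count and the string/numeric split described above could be checked, here is a small example using only Python's standard library. The sample rows, the `SAMPLE_CSV` name, and the `summarize` and `is_number` helpers are all made up for illustration; they are not the real data or the code I actually used, and only a few of the twelve columns are shown.

```python
import csv
import io

# Hypothetical sample rows for illustration only; the real file has
# 23,204 items and twelve columns.
SAMPLE_CSV = """NAME,DEPARTMENT_NAME,TITLE,REGULAR,TOTAL_GROSS,POSTAL
"Doe,Jane",Example Department,Analyst,50000.00,52500.50,02118
"Roe,Richard",Example Department,Clerk,41000,41000,02127
"""


def is_number(text):
    """Return True if text parses as a float (currency symbols are not handled)."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def summarize(csv_file):
    """Return (row_count, {column: 'numeric' or 'string'}) for a CSV file object."""
    reader = csv.DictReader(csv_file)
    rows = list(reader)
    kinds = {}
    for column in reader.fieldnames:
        # Ignore empty cells so a missing value does not force a column to 'string'.
        values = [row[column] for row in rows if row[column] != ""]
        all_numeric = bool(values) and all(is_number(v) for v in values)
        kinds[column] = "numeric" if all_numeric else "string"
    return len(rows), kinds


row_count, kinds = summarize(io.StringIO(SAMPLE_CSV))
print("items:", row_count)
for column, kind in kinds.items():
    print(f"{column}: {kind}")
```

Note that this naive check labels POSTAL as numeric because the sample codes are all digits. Postal codes are probably better kept as strings so that leading zeros are not lost.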

In the upcoming update, I will discuss data cleansing.
