Finance and Socio-economic Datasets
Titanic Dataset
- The Titanic dataset provides information about the passengers aboard the Titanic, used to predict survival rates. It includes features such as passenger class, age, gender, ticket fare, and whether they had family on board.
- This dataset is popular for binary classification and feature engineering tasks.
Adult Census Income Dataset
- Also known as the “Census Income” dataset, it contains demographic information from the 1994 Census database to predict whether an individual earns more than $50,000 a year.
- It has 48,842 instances with 14 attributes like age, work class, education, marital status, and occupation.
- It can be obtained from official website.
Dataset for Classification
Classification is a type of supervised learning where the objective is to predict the categorical labels of new instances based on past observations. The goal is to learn a model from the training data that can predict the class label for unseen data accurately. Classification problems are common in many fields such as finance, healthcare, marketing, and more. In this article we will discuss some popular datasets used for classification.