データセット解説#
データセット一覧#
Dataset |
Description |
Associated Tasks |
Target Column |
Number of Columns |
Number of Rows |
---|---|---|---|---|---|
AutoGluon Example Dataset |
Binary/Multiclass classification |
class (binary), occupation (multiclass) |
15 |
39,073 (train), 9,769 (test) |
|
Bank Marketing Dataset. |
Binary classification |
y |
21 |
28,831 (train), 12,357(test) |
|
Vehicle Coupon Recommendation Dataset. |
Multiclass classification |
coupon |
26 |
8,878 (train), 3,806 (test) |
|
Online Retail Transactional Dataset. |
Regression (CLTV prediction), RFM |
cltv |
11 |
2,230 (train), 956 (test) |
|
Telco Churn Event Dataset |
Binary classification (Churn prediction) |
churn |
21 |
4,930 (train), 2,113 (test) |
|
House Price Dataset of California. |
Regression |
median_house_value |
10 |
14,448 (train), 6,192 (test) |
|
Sample Transition Dataset of Web Access. |
Network Analysis |
- |
3 |
12 |
|
Time Series Airline Passenger Dataset. |
Timeseries Forecasting (Univariate) |
number_of_airline_passengers |
2 |
100 (train), 44 (test) |
|
Quartierly Time Series of M4 Dataset |
Timeseries Forecasting (Multivariate) |
v7 (or any v?) |
867 |
33,600 (train), 14,400 (test) |
|
Next Best Action Dataset |
Next Best Action |
- |
6 |
43,196 (train), 12,829 (test) |
|
DP6 Dataset for Marketing Attribution Models |
Multi-Touch Attribution |
- |
4 |
500,000 |
|
Dermatology Diseases Dataset. |
Multi-class classification, Clustering |
class |
35 |
366 |
|
Credit Card Fraud Dataset. |
Binary classification (Fraud detection) |
fraud |
29 |
199,364 (train), 85,443 (test) |
|
Cluto Dataset for Clustering |
Clustering |
class |
3 |
10,000 |
|
Forest Cover Type Dataset. |
Multiclass classification |
target |
55 |
406,708 (train),174,304(test) |
|
20 Newsgroup Documents Dataset. |
Multiclass classification |
target |
301 |
11,314 (train), 7,532 (test) |
|
Cometics Shop E-Commerce Events History Dataset |
RFM analysis, Clustering |
- |
5 |
1,287,007 |