Dataset Repository Explorer

Browse your dataset repository structure and view detailed file information.

Folders: 1
CSV Files: 3
Total Size: 12.4 MB
Last Updated: 2 days ago

Repository Structure

my_dataset_repository (4 items)
  • README.md: 2.4 KB
  • train.csv: 8.2 MB
  • test.csv: 2.1 MB
  • validation.csv: 1.7 MB

File Details

train.csv

Training dataset for the machine learning model

Type: CSV
Category: Training Data
Size: 8.2 MB
Dataset Visualization

Dataset Overview

  • Rows: 42,840
  • Columns: 18
  • Missing Values: 0.2%
  • Last Modified: 2023-07-15
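The row count, column count, and missing-value percentage above could be derived along the following lines. This is a minimal sketch, not the explorer's actual implementation: the function name `summarize_csv`, the rule that an empty or whitespace-only cell counts as missing, and the two-row sample are all assumptions made for illustration.

```python
import csv
import io


def summarize_csv(text):
    """Return row count, column count, and percentage of missing cells."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    rows = [row for row in reader if row]  # skip completely blank lines
    total_cells = len(rows) * len(header)
    # Assumption: a cell is "missing" when it is empty or whitespace only.
    # Rows shorter than the header are treated as missing their trailing cells.
    missing = sum(
        1
        for row in rows
        for i in range(len(header))
        if i >= len(row) or not row[i].strip()
    )
    pct = 100.0 * missing / total_cells if total_cells else 0.0
    return {"rows": len(rows), "columns": len(header), "missing_pct": pct}


# Hypothetical sample: two data rows, one empty cell out of eight.
sample = "ID,Feature 1,Feature 2,Target\n1,0.84,Category A,1\n2,,Category B,0\n"
summary = summarize_csv(sample)
print(summary)
```

For a file on disk, the same function can be fed the result of reading the file as text. Other definitions of "missing" (for example, treating `NA` or `null` as missing) would change the percentage.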

Column Types

Numerical: 12
Categorical: 5
Text: 1

Preview (First 5 rows)

ID  Feature 1  Feature 2   Target
1   0.84       Category A  1
2   0.67       Category B  0
3   0.92       Category A  1
4   0.45       Category C  0
5   0.78       Category B  1

Recent Activity

New file added

validation.csv was added to the repository

2 days ago

File updated

train.csv was modified with additional data

5 days ago

Documentation updated

README.md was updated with new information

1 week ago

Data Quality Report

Overlapping Rows: 0
Rows present in both train and test sets

Total Unique Rows: 85,680
Combined unique rows from both datasets

Overlap Percentage: 0%
Percentage of overlapping rows
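One way the three overlap figures above might be computed is with set operations on whole rows. The sketch below is illustrative only: the names `load_rows` and `overlap_report` are hypothetical, rows are compared as exact string tuples, and the percentage is taken relative to the combined unique rows, which is an assumption since the report does not state its denominator.

```python
import csv
import io


def load_rows(text):
    """Parse CSV text into a set of row tuples, skipping the header line."""
    reader = csv.reader(io.StringIO(text))
    next(reader, None)  # drop the header
    return {tuple(row) for row in reader if row}


def overlap_report(train_text, test_text):
    """Count rows shared by both splits and the combined unique rows."""
    train, test = load_rows(train_text), load_rows(test_text)
    overlapping = train & test          # rows present in both sets
    total_unique = len(train | test)    # combined unique rows
    # Assumption: percentage is measured against the combined unique rows.
    pct = 100.0 * len(overlapping) / total_unique if total_unique else 0.0
    return {
        "overlapping_rows": len(overlapping),
        "total_unique_rows": total_unique,
        "overlap_pct": pct,
    }


# Hypothetical two-row splits with no shared rows.
train_csv = "f1,f2,target\n0.84,A,1\n0.67,B,0\n"
test_csv = "f1,f2,target\n0.92,A,1\n0.45,C,0\n"
report = overlap_report(train_csv, test_csv)
print(report)
```

Note that an exact whole-row comparison will miss leaked rows if the files carry a per-file ID column or differ in formatting, so it may be worth dropping identifier columns before comparing.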

Analysis Summary

No Data Leakage Detected

There are no overlapping rows between the training and test datasets, which indicates that the data was partitioned correctly.

Recommendation

The train/test partitioning shows no row overlap, so model evaluation on the test set can proceed without concern about leaked training rows.
