ProjectsDatasetsContact

Datasets

Some of the datasets I created over the past few years. Find more on my Kaggle

"Mar 28, 2023"
39

YouTube Thumbnail Dataset

Most versatile dataset of YouTube thumbnails. Goes hand in hand with the YouTubers-Saying-Things dataset

Read more →

34

YouTubers Saying Things

Biggest dataset containing real youtube video's transcripts, alongside other interesting attributes such as Channel type (manually annotated), Channel Name, Views etc

10+

Amazon Question/Answer Dataset

Dataset containing Customer question and answers, initially collected by Prof. Julian McAuley. Wrangled and simplied for Kaggle use