Looking for a Really Large Data Set – Criteo’s 1TB Click Prediction Dataset Now Available
Criteo has released a real-world sample data set of over 1TB. It provides over 4 billion examples with binary labels (click vs. no-click), including over 156 billion total (dense) feature values and over 800 million unique attribute values.
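To get a feel for these figures, here is a rough back-of-envelope sketch in Python. It treats the quoted "over ..." numbers as exact and assumes 1TB means 10**12 bytes, so the results are only approximate per-example averages. The function and variable names are illustrative and not part of the data set or any Criteo tooling.

```python
# Figures quoted in the announcement. They are lower bounds ("over ..."),
# so everything computed here is a rough estimate, not an official statistic.
TOTAL_BYTES = 10**12              # assumption: 1TB taken as 10**12 bytes
NUM_EXAMPLES = 4 * 10**9          # "over 4 billion examples"
NUM_FEATURE_VALUES = 156 * 10**9  # "over 156 billion total (dense) feature values"


def per_example_estimates(total_bytes, num_examples, num_feature_values):
    """Return (feature values per example, bytes per example) as averages."""
    values_per_example = num_feature_values / num_examples
    bytes_per_example = total_bytes / num_examples
    return values_per_example, bytes_per_example


values_per_example, bytes_per_example = per_example_estimates(
    TOTAL_BYTES, NUM_EXAMPLES, NUM_FEATURE_VALUES
)
print(f"feature values per example: {values_per_example:.0f}")
print(f"bytes per example: {bytes_per_example:.0f}")
```

Under those assumptions, each example carries on the order of a few dozen feature values in a few hundred bytes, which suggests that reading the data in a streaming fashion is more practical than loading it into memory.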