Criteo has released a real world sample data set of over 1TB and provides over 4 billion examples with binary labels (click vs. no-click) including over 156 billion total (dense) feature-values and over 800 million unique attribute values
With the speed at which technology changes today, keeping up todate cannot only be a challenge, but also expensive. There are 2 sites listed below that make eBooks available in a number of formats and also over a number of technologies.
The Kimball methodology is the go to process for architecting your BI process. This book is the latest version and covers a great deal of what you need to know.