I added R to my toolkit and have had great success using it on many projects, for not only discovery but also for visualizations. R provides many packages and functions that let you perform data discovery over a data set.
Posts from the ‘Resources’ Category
There are many tools that you can purchase to generate documentation, however, also costs money or requires an install on client machines or servers. I wanted something fast which would not leave a foot print or require weeks of negotiations to get installed. The T-SQL script generates an HTML document out of the schema results that you can paste into Word to kickstart the documentation process.
I was working with a client the other day who was using the Microsoft Azure Storage Explore. They were trying to create a folder in an Azure blob storage location and could not find the option to do it. We seemed to be talking about 2 different products. Funny enough we were.
When you do a search for Microsoft Azure Storage Explore, the CodePlex version still ranks very high on the search results even though it has not been updated since August 2014. People get tripped up if they remember the CodePlex version.
The Latest version is currently updated and is available on Windows, Mac and Linux. If you are using the free Azure Storage explorer, use this one.
Forbes Tech has just released a list of 33 free data sources that you can use for your next proof of concept or demo. The list is a wide and varied source of Canadian, US and European open data initiatives.
Bernard Marr has written the article which is available on the Forbes website: Big Data: 33 Brilliant And Free Data Sources For 2016.
A short post, but a great resource.
Google have announced changes with a major algorithm update which has just gone live, April 21, 2015. This can change how your site is returned based on the device that the user is using for their search. Mobile is where most searches are trending and as such Google is going to rank sites that have a friendlier experience to mobile users higher if that user is searching from a mobile device.
Criteo has released a real world sample data set of over 1TB and provides over 4 billion examples with binary labels (click vs. no-click) including over 156 billion total (dense) feature-values and over 800 million unique attribute values
When you're in the planning phase of a database project, one of the important questions is how much space your data will take up. Great MSDN resource for SQL database sizing.