Narrowing down a large dataset

I have a dataset of approximately 8 million rows and 7 variables. To make my analysis feasible, I want to take a sample from it. The data contain dates (the first semester of a year), days, and times, and I want a sample that is representative of the original and loses as little information as possible. Two of the variables are numeric and the rest are factors. Help!
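
To make the goal concrete, here is a minimal sketch of one common option, proportional stratified sampling, written in plain Python. It is only an illustration under assumptions: the column names `month` and `group`, the toy rows, and the 10% fraction are all hypothetical stand-ins, and it assumes the rows fit in memory. The idea is to group rows by the variables whose distribution should be preserved (here a date part and one factor) and draw the same fraction from each group.

```python
import random
from collections import defaultdict

def stratified_sample(rows, key, fraction, seed=0):
    """Draw about `fraction` of the rows from each stratum defined by key(row)."""
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    strata = defaultdict(list)
    for row in rows:
        strata[key(row)].append(row)
    sample = []
    for group_rows in strata.values():
        # Keep at least one row per stratum so rare groups are not dropped.
        k = max(1, round(len(group_rows) * fraction))
        sample.extend(rng.sample(group_rows, k))
    return sample

# Hypothetical toy data: 6 months x 2 factor levels x 100 rows each.
rows = [
    {"month": m, "group": g, "value": float(i)}
    for m in range(1, 7)
    for g in ("A", "B")
    for i in range(100)
]

# Stratify on month and factor level, keeping 10% of each stratum.
sample = stratified_sample(rows, key=lambda r: (r["month"], r["group"]), fraction=0.1)
```

Because each stratum contributes in proportion to its size, the sample keeps the same mix of months and factor levels as the full data. Whether that counts as "representative" for a given analysis depends on which variables matter, so the stratification key is a choice to make, not a given.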



from Recent Questions - Stack Overflow https://ift.tt/2KQWPfv
