Rochester Institute of Technology 2575 and 2585 Hugh Carey Hall
We recommend that every member of the team bring a laptop, if possible. You might find it helpful to have a mix of PCs and Macs, since they have different strengths. Make sure beforehand that the software you will be using throughout the weekend is properly installed and running on your computer. You will be working with a large dataset, so make sure that you have enough free space for it on your drive.
We will have snacks and munchies. Feel free to bring anything additional you might want. You are of course free to come and go as you please, but the first night in particular (up until midnight) will be fairly structured.
You might want to bring some favorite statistical or computational reference books, if you have them, or bookmark some pages that you routinely refer to.
Large Data Advice
The dataset you will be working with is quite large. If you type a variable name to view it, it will take a while to display.
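Therefore, remember these R commands: head() shows the first few rows, tail() shows the last few rows, and str() gives a compact summary of an object's structure.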
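As a rough illustration of these commands, here is a minimal sketch in R. The data frame `big` and its columns are simulated stand-ins invented for this example; they are not the dataset you will receive.

```r
# Hypothetical stand-in for a large dataset: 1,000,000 simulated rows.
set.seed(1)
n <- 1e6
big <- data.frame(
  id    = seq_len(n),
  group = sample(c("a", "b", "c"), n, replace = TRUE),
  value = rnorm(n)
)

# Typing `big` alone would try to print a very long table.
# These calls show only a small piece of it instead.
first_rows <- head(big)      # first 6 rows (the default)
last_rows  <- tail(big, 3)   # last 3 rows
print(first_rows)
print(last_rows)

# Compact overview: number of rows and columns, column types, first few values.
str(big)
```

Both head() and tail() accept a second argument giving the number of rows to show, so head(big, 20) would display the first 20 rows.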
We strongly recommend you create a small dataset that you can use to test things on. Then, if it works out, you can apply your procedure to the large dataset. Some procedures can take a frustratingly long time to run on large datasets, so it will be comforting to know, while you wait, that your procedure works because you tested it on a smaller dataset. We recommend taking a random sample of rows from the original dataset, but there might be other approaches you find useful.
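One possible way to draw such a random sample of rows in R is sketched below. The data frame `big`, the sample size of 1,000, and the lm() model are all assumptions made up for illustration; substitute your own data and procedure.

```r
# Hypothetical stand-in for the full dataset.
set.seed(42)                                  # makes the sample reproducible
big <- data.frame(id = seq_len(1e5), value = rnorm(1e5))

# Draw 1,000 row indices without replacement (assumed size; adjust as needed).
n_small <- 1000
rows    <- sample(nrow(big), n_small)
small   <- big[rows, , drop = FALSE]

# Try the procedure on the small sample first...
fit_small <- lm(value ~ id, data = small)

# ...and only once it runs cleanly, apply the same call to the full data.
fit_big <- lm(value ~ id, data = big)
```

Calling set.seed() before sampling means everyone on the team can reproduce the same small dataset, which makes it easier to compare results.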