Do statistical analysis of data

IU provides powerful supercomputers that can accommodate analysis at scale, with popular statistics software such as SAS, SPSS, Stata, and R pre-installed. To take full advantage of the performance these systems offer, you may need to optimize and/or parallelize your code. To request programming support for your research project, contact the Research Applications and Deep Learning group.
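As a rough illustration of what parallelizing an analysis can look like, the sketch below splits a bootstrap estimate of a mean across several worker processes using only the Python standard library. The function names, the sample data, and the worker count are all hypothetical and chosen for illustration; this is not a recommended configuration for any particular IU system, and the same idea applies to R, SAS, or Stata workloads using their own parallel facilities.

```python
import random
import statistics
from concurrent.futures import ProcessPoolExecutor


def bootstrap_means(data, n_resamples, seed):
    """Return n_resamples bootstrap means of data, using a private seeded RNG."""
    rng = random.Random(seed)
    size = len(data)
    return [statistics.fmean(rng.choices(data, k=size)) for _ in range(n_resamples)]


def parallel_bootstrap_means(data, n_resamples, n_workers):
    """Split the resamples across n_workers processes and combine the results."""
    base, extra = divmod(n_resamples, n_workers)
    counts = [base + (1 if i < extra else 0) for i in range(n_workers)]
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        # Each worker gets its own seed so that workers do not repeat
        # the same sequence of resamples.
        futures = [
            pool.submit(bootstrap_means, data, count, seed)
            for seed, count in enumerate(counts)
        ]
        means = []
        for future in futures:
            means.extend(future.result())
    return means


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    sample = [2.1, 3.4, 1.9, 5.6, 4.2, 3.3, 2.8, 4.9]
    means = sorted(parallel_bootstrap_means(sample, n_resamples=10_000, n_workers=4))
    # Approximate 95% percentile interval for the mean.
    lower = means[int(0.025 * len(means))]
    upper = means[int(0.975 * len(means)) - 1]
    print(f"95% bootstrap interval for the mean: ({lower:.2f}, {upper:.2f})")
```

Because each resample is independent of the others, this kind of workload divides cleanly across cores. Analyses whose steps depend on one another usually need more careful restructuring before they benefit from additional processors.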

Prerequisites 

  1. Secure your own environment.
  2. Decide which supercomputer best fits your needs.
    1. Note: For most workloads, Carbonate is the supercomputer to use. It lets you use SAS, SPSS, Stata, etc. via the web or the command line.
  3. Request an account on the supercomputer you will be using.

Directions

  1. Access the system. Most systems require access via SSH; on Carbonate you also have the option of using Research Desktop (RED).
  2. Transfer the data from your workstation to the appropriate location.
  3. Perform your analysis using the software available on the system.
  4. Encrypt your data while it is not being worked on to help ensure its confidentiality.
  5. Complete your analysis and transfer the data back to your workstation as necessary.
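The directions above can be sketched as a handful of command lines. The example below only builds and prints the commands; it does not run them. It assumes OpenSSH (`ssh`, `scp`) and GnuPG (`gpg`) as the tools, which this recipe does not prescribe, and the username, hostname, and paths are hypothetical placeholders rather than real IU values.

```python
import shlex


def ssh_command(user, host):
    """Command to open an interactive SSH session on the remote system."""
    return ["ssh", f"{user}@{host}"]


def upload_command(local_path, user, host, remote_dir):
    """Command to copy a local file to a directory on the remote system."""
    return ["scp", local_path, f"{user}@{host}:{remote_dir}"]


def download_command(user, host, remote_path, local_dir):
    """Command to copy a remote file back to the local workstation."""
    return ["scp", f"{user}@{host}:{remote_path}", local_dir]


def encrypt_command(path):
    """Command to encrypt a file at rest with a passphrase (assumes GnuPG)."""
    return [
        "gpg", "--output", f"{path}.gpg",
        "--cipher-algo", "AES256", "--symmetric", path,
    ]


if __name__ == "__main__":
    # Hypothetical placeholders; substitute your own account and system.
    user, host = "username", "supercomputer.example.edu"
    steps = [
        ssh_command(user, host),
        upload_command("survey.csv", user, host, "project/data/"),
        encrypt_command("project/data/survey.csv"),
        download_command(user, host, "project/results/summary.csv", "."),
    ]
    for step in steps:
        print(shlex.join(step))
```

Keeping each command as a list of arguments, rather than a single string, avoids quoting problems if a path contains spaces and makes the commands safe to hand to `subprocess.run` later if you decide to automate the workflow.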

Other Considerations

  • If you are based on the IUPUI campus and are performing biostatistical analysis for health research, contact the IUSM Department of Biostatistics for advice on data analysis, interpretation of results, statistical software, and more.

We want your feedback

Please email securemyresearch@iu.edu to report errors or omissions and to send critiques, suggestions for improvements, new use cases or recipes, or any other feedback, positive or negative. It will be your contribution to the Cookbook and will be appreciated by all who use it.