Handling large datasets with cryoDRGN preprocess