Currently it blocks, but we just got lucky: https://github.com/pytorch/text/blob/caaa8e3c08309fe2c40ae40efaf7c2adcf1a2c8a/torchtext/datasets/stsb.py#L85 Please change the order of `filter` and `end_caching`