Embedded files for the 1M-documents version MS MARCO dataset. The url to the corpus: https://public.ukp.informatik.tu-darmstadt.de/kwang/datasets/ir/msmarco-1m.zip To load the corpus, one can use the [BeIR](https://github.com/UKPLab/beir) repo: ```python from beir.datasets.data_loader import GenericDataLoader corpus, queries, qrels = GenericDataLoader(data_folder=data_path).load(split="valid") ```