To help you remember what’s in your corpus years down the line, and to help other people understand it, you should create a document that contains metadata for the files in your corpus. Metadata is often organized as a table, with one row per file and columns for information such as filename (a unique identifier), author, title, and year of publication.
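As a rough illustration of such a table, here is a minimal sketch that stores the metadata as a CSV file using Python's standard library. The column names, the helper names `write_metadata` and `read_metadata`, and the two sample rows are assumptions made for this example, not a required schema; adapt them to your own corpus.

```python
import csv

# Hypothetical column set; "filename" serves as the unique identifier.
FIELDNAMES = ["filename", "author", "year", "title"]

# Illustrative sample rows only; replace with the files in your corpus.
SAMPLE_ROWS = [
    {"filename": "austen_pride.txt", "author": "Jane Austen",
     "year": "1813", "title": "Pride and Prejudice"},
    {"filename": "shelley_frankenstein.txt", "author": "Mary Shelley",
     "year": "1818", "title": "Frankenstein"},
]


def write_metadata(rows, path):
    """Write one row per corpus file to a CSV metadata table."""
    filenames = [row["filename"] for row in rows]
    # Refuse duplicate filenames, since each one must identify a single file.
    if len(filenames) != len(set(filenames)):
        raise ValueError("each filename must appear only once")
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(rows)


def read_metadata(path):
    """Read the metadata table back as a list of dictionaries."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))
```

Calling `write_metadata(SAMPLE_ROWS, "metadata.csv")` would produce a file that opens in any spreadsheet program. Note that `csv.DictReader` returns every value as a string, so the year is kept as text here.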
If your corpus already has a metadata file, great! Be sure to update it with any changes you make to the corpus. Add any additional columns of information to the metadata file that would be helpful to your project.
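Keeping an existing metadata file up to date could look something like the sketch below. It assumes the metadata has already been loaded as a list of dictionaries with a `filename` key and that the corpus files are `.txt` files in a single folder; the helper names `add_column` and `unlisted_files` and the `genre` column in the usage note are hypothetical.

```python
from pathlib import Path


def add_column(rows, column, values_by_filename, default=""):
    """Return new rows with an extra column, looked up by filename."""
    return [
        {**row, column: values_by_filename.get(row["filename"], default)}
        for row in rows
    ]


def unlisted_files(corpus_dir, rows, pattern="*.txt"):
    """Return names of corpus files that have no row in the metadata."""
    listed = {row["filename"] for row in rows}
    return sorted(
        p.name for p in Path(corpus_dir).glob(pattern) if p.name not in listed
    )
```

For example, `add_column(rows, "genre", {"austen_pride.txt": "novel"})` adds a `genre` column and leaves it empty for files without a value, while `unlisted_files("corpus", rows)` lists files that were added to the folder but not yet described in the metadata.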
If you want to learn more about how to organize, clean, and wrangle the metadata you have collected about your corpus, check out these webpages:


