Large-scale manual curation and harmonization of metadata from metagenomic and cancer genomic repositories: challenges and solutions