Merges text files from Document AI output shards into a single text file corresponding to the parent document.

merge_shards(source_dir = getwd(), dest_dir = getwd())

Arguments

source_dir

folder path for input files

dest_dir

folder path for output files

Value

no return value, called for side effects

Details

The function works on .txt files generated from .json output files, not on .json files directly. It also presupposes that the .txt filenames have the same name stems as the .json files from which they were extracted. For the v1 API, this means files ending with "-0.txt", "-1.txt", "-2.txt", and so forth. The safest approach is to generate .txt files using get_text() with the save_to_file parameter set to TRUE.

Examples

if (FALSE) {
merge_shards()

merge_shards(tempdir(), getwd())
}