I have thousands of PDF files that are mostly in these formats:
TITLE - FIRSTNAME [url removed, login to view]
TITLE - SUBTITLE - FIRSTNAME [url removed, login to view]
[url removed, login to view]
I am seeking a freelancer to create an Automator action and/or a script to do the following:
1) Parse each part of the filename into separate variables (i.e., TITLE, SUBTITLE, FIRSTNAME, LASTNAME);
2) Use these variables to write the relevant PDF metadata; and
3) Repeat this for all PDF files in the folder.
Also, the script needs to be smart enough to understand when there is no SUBTITLE or FIRSTNAME LASTNAME info so it doesn't fail or write nulls.
One important requirement: The end result must NOT use MacOS's native PDF handler, because many of the PDF files have Adobe Clearscan OCR layers. Unfortunately, there is a known bug in which MacOS's native PDF handler corrupts the OCR layer. The end result should use some type of command line tool like Exiftool or something similar.