I have a stack of 4 PDF forms that have been filled in. It is the same empty form filled in. I would like to take all the PDF files in a folder.
I would like to end up with
(a)an Excel file or
(b) comma-separated-variables file with quote marks around cell entries or
(c) a pipe "|" separated file
(d) a tab-separated file
any of which can be used in statistical packages.
The first row of the target file would be the names of the fields. Then there would be 1 row from each of the PDF files.
What I have tried:
A year and a half ago, I had worked out enough Python to get the data in a scrambled text file. I gave up after being stuck at that point.
The scrambled file looked a lot like an Algol heap from the mid-1970s.
I was hoping someone had worked this out. I wanted to find out if this had been
done before jumping back into it.
I do not see how to attach files on this forum, but I can supply a PDF file that has not been filled in, 4 PDF files that have been filled in, and an example of what I want to end up with.