
Check PDF files for text from Excel file.

Hi.
I have an XLS document with a list of "tags". I want to search through a set of PDF files to see which tags appear in which PDF files.
For example:
My Excel file contains the following tags:
AA-1234
BB-2311
CC-2222
I then want to get a list showing which PDF files contain which tags, for example:
HHHHHHH.pdf contains; AA-1234
AAAAAAAA.pdf contains; AA-1234, CC-2222
SSSSSSSS.pdf contains; BB-2311, CC-2222

And so on and so on. Is this possible in any way?
A bonus would be if it could also handle PDF files whose content is not stored as text (if there is no solution for this, my PDF program can convert the content of those files to text first).

Thanks
 Posted by on
 
Replies
If you download the application from this post, you'll find an open source VBA routine that loops through all PDF files in a directory and creates text files from them. It also analyzes the text content in each file.

You can modify this routine, or use it as an engine, so that it checks for the tags in your Excel file and writes a log of the PDF files in which matches are found. If you are unable to do this development yourself, you can always open it up to the community by creating a small project.

I hope this helps.
 Excel Business Forums Administrator
 Posted by on
The problem is that I have over 1000 PDF files. It would take forever to do it manually...
 Posted by on
The best option is to import the PDF text into Excel (or into memory via VBA) and then use a lookup routine to check the tags from Excel against the imported text of each PDF file.

Two steps are required:
  1. Convert the PDFs to text - see the "importing data from PDF files" thread.
  2. VBA routine to loop through text files and check for content.
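
As a rough illustration of step 2, here is a minimal sketch written in Python rather than VBA. It assumes step 1 has already produced one .txt file per PDF in a single folder, and that the tags have been copied out of the Excel file into a list. The folder name "converted_text" and the function names are hypothetical, chosen only for this example; the matching is a plain case-insensitive substring check.

```python
from pathlib import Path


def find_tags_in_text_files(text_dir, tags):
    """Return {file name: [tags found]} for every .txt file in text_dir.

    Files that contain none of the tags are left out of the result.
    """
    results = {}
    for path in sorted(Path(text_dir).glob("*.txt")):
        # Ignore undecodable bytes; converted PDF text is often messy.
        content = path.read_text(encoding="utf-8", errors="ignore").upper()
        # Case-insensitive substring match for each tag.
        found = [tag for tag in tags if tag.upper() in content]
        if found:
            results[path.name] = found
    return results


def format_report(results):
    """Build report lines in the 'file contains; tag, tag' style."""
    return [f"{name} contains; {', '.join(found)}" for name, found in results.items()]


if __name__ == "__main__":
    # Hypothetical inputs: the tags from the Excel file and the folder
    # holding the text files produced by the PDF-to-text conversion.
    tags = ["AA-1234", "BB-2311", "CC-2222"]
    for line in format_report(find_tags_in_text_files("converted_text", tags)):
        print(line)
```

Because the files are only read once each and the check is a simple substring search, this approach should cope with a thousand or more files without manual work. The reported names are those of the text files, so they need mapping back to the original PDF names if the conversion step renames them.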
 Excel Business Forums Administrator
 Posted by on