Importing Data from PDF Files
Business Spreadsheets has developed a free Excel program that extracts PDF data and imports it into Excel. It can be downloaded and used without restriction.

There is a common need to extract specific data from PDF files and import it into Excel. Since Excel cannot read PDF content natively, a utility is needed to convert the PDF content into a format Excel can use. Several commercial applications do this; however, it is often the case that only specific data needs to be imported from multiple PDF files into one structured format. We created such an application using VBA code in conjunction with an open source PDF-to-text conversion utility, which can be found at Foolabs.

[Download the free PDF data import Excel program here]

The program requires the conversion utility (included in the download) and all PDF files to reside in the same directory as the Excel application. The text or data to extract is defined in the Control sheet by specifying start text, end text and multiple replacement routines with wildcard support. This gives the flexibility to obtain comparable data from multiple PDF files based on patterns, independent of the differing PDF file structures. As many extraction rules as required can be set, producing a table of imported information organized by extraction rule and PDF file name.

Information on how to set up rules is available within the Excel application through a help icon and cell comments. The VBA code is commented and open for modification. Improvements or new features to the code are welcome to be posted here so that we can update the download version for the benefit of everyone.
Excel Business Forums Administrator
Posted by Excel Helper
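As a rough illustration of how a start text / end text rule with wildcard replacements might behave, here is a minimal Python sketch. It is not the program's VBA code. The function names `extract` and `wildcard_to_regex`, the sample text, and the assumption that `*` matches any run of characters and `?` matches a single character are all hypothetical, chosen only to show the idea.

```python
import re


def wildcard_to_regex(pattern):
    """Translate a simple wildcard pattern (* and ?) into a regex string.

    Assumption: * means "any run of characters" and ? means "one character".
    """
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*?")  # shortest possible run of characters
        elif ch == "?":
            parts.append(".")    # exactly one character
        else:
            parts.append(re.escape(ch))
    return "".join(parts)


def extract(text, start, end, replacements=()):
    """Return the text between the first `start` and the next `end`.

    Each (pattern, substitute) pair in `replacements` is then applied
    in order. Returns None when either marker is missing.
    """
    begin = text.find(start)
    if begin == -1:
        return None
    begin += len(start)
    stop = text.find(end, begin)
    if stop == -1:
        return None
    value = text[begin:stop]
    for pattern, substitute in replacements:
        value = re.sub(wildcard_to_regex(pattern), substitute, value)
    return value.strip()


# Hypothetical text, standing in for the output of a PDF-to-text conversion.
sample = "Invoice No: INV-0042 (draft)\nTotal: 1,250.00 EUR\n"

# Rule 1: take the rest of the line, then drop anything in parentheses.
invoice = extract(sample, "Invoice No:", "\n", [("(*)", "")])

# Rule 2: take the rest of the line, then strip the comma and the currency.
total = extract(sample, "Total:", "\n", [(",", ""), (" EUR", "")])
```

Because the same rule is applied to every file, one row per PDF and one column per rule gives the kind of table described above. Note that in this sketch a trailing `*` matches as little as possible, so a pattern that ends in `*` should be followed by a literal character to be useful.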
Replies - Displaying 41 to 50 of 88
This really is a one-of-a-kind program; thanks to the developers for that. I have managed to enter the commands quite successfully, but I have a problem: the program overwrites the old data every time I run the extraction. Is that supposed to happen? How can it be changed?
Posted by Alekzi13
Example:
------------------------------
(Name of 1st School) (*5-10 spaces in between*) (full URL of school website) (*line break*)
Address: (School Address) (*5-10 spaces in between*) County: (County) (*5-10 spaces in between*) Phone: (Phone #) (*line break*)
Grade Range: (Grade Range) (*5-10 spaces in between*) Enrollment: (Enrollment #) (*5-10 spaces in between*) District: (District) (*5-10 spaces in between*) Fax:(Fax) (*line break*)
Personnel: (Person's name Person's position)
(There may also be additional people listed below, or just one person listed, with a line break for each person and position title) (*line break*)
(Name of 2nd School)
etc.
-----------------------------------------------------
There is only a single line break between the last entry in the "Personnel" section and the name of the following school.

Update: the actual number of spaces between the personnel name and the personnel position is between 37 and 40, so it is not a consistent number of spaces.
Posted by xcfreek58
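One way to cope with a varying number of spaces between fields, as in the layout above, is to split on any run of two or more spaces rather than on a fixed count. The Python sketch below only illustrates that idea and is not part of the Excel program; the function names and the sample values are hypothetical.

```python
import re

# Two or more spaces count as a field separator, so single spaces
# inside a value (e.g. in an address) are left alone.
GAP = re.compile(r" {2,}")


def parse_fields(line):
    """Split a line of 'Label: value' pairs separated by runs of spaces."""
    fields = {}
    for chunk in GAP.split(line.strip()):
        label, colon, value = chunk.partition(":")
        if colon:  # ignore chunks that carry no label
            fields[label.strip()] = value.strip()
    return fields


def split_person(line):
    """Split a personnel line into (name, position) at the first wide gap."""
    parts = GAP.split(line.strip(), maxsplit=1)
    name = parts[0]
    position = parts[1] if len(parts) > 1 else ""
    return name, position


# Hypothetical sample lines following the described layout.
address_line = "Address: 1 Example Road      County: Sample        Phone: 555-0100"
person_line = "Jane Doe" + " " * 38 + "Principal"

address_fields = parse_fields(address_line)
person = split_person(person_line)
```

This assumes that no value contains two consecutive spaces; if one does, the split would have to key on the labels themselves instead.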
I hope that this helps.
Excel Business Forums Administrator
Posted by Excel Helper
To extract rows, we need to specify the start and end text. The end text is simple: it is the end of the line, set with the [new line] option. The start text is more complicated, because the extraction returns whatever is found after the start text. We can specify the first column's text as the start text and replace it afterward if there is a common pattern. Alternatively, if a common pattern exists in the last column of the previous row or text, we can use that with [new line] to extract the entire subsequent row.
Excel Business Forums Administrator
Posted by Excel Helper
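The two approaches to row extraction described above can be sketched in Python as follows. This is only an illustration of the logic, not the program's VBA code; the function names and the sample table are hypothetical, and `"\n"` stands in for the [new line] option.

```python
def row_from_first_column(text, first_column, newline="\n"):
    """Approach 1: start at the first column's text, end at the new line.

    The extraction yields only what follows the start text, so the
    first column's text is put back in front of the result.
    """
    begin = text.find(first_column)
    if begin == -1:
        return None
    stop = text.find(newline, begin)
    if stop == -1:
        stop = len(text)
    return first_column + text[begin + len(first_column):stop]


def row_after(text, previous_row_ending, newline="\n"):
    """Approach 2: start at the end of the previous row plus a new line.

    Everything up to the next new line is the entire subsequent row.
    """
    start = previous_row_ending + newline
    begin = text.find(start)
    if begin == -1:
        return None
    begin += len(start)
    stop = text.find(newline, begin)
    if stop == -1:
        stop = len(text)
    return text[begin:stop]


# Hypothetical converted text containing a small table.
table = "Item      Qty   Price\nWidget    2     9.50\nGadget    1     4.25\n"

first_row = row_after(table, "Price")                # row following the header
gadget_row = row_from_first_column(table, "Gadget")  # row found by its first column
```

The second approach is convenient when the first column varies from file to file but the preceding line (for example a table header) always ends the same way.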