There's no easy way I know of, but having had to do that with a PDF report containing lots of data (12 pages), I had to select all the tabular data on each page, copy it, and paste it into a text file. I then removed all commas before doing a find and replace on spaces, replacing them with commas. Finally I saved the file as a CSV file and imported it into my spreadsheet. Even after all of that I spent a fair amount of time aligning the data into proper columns. The above assumes that the PDF is text-based. If it's image-based you need to OCR it first before doing what I wrote. Then it is essential to carefully proofread everything to make sure the OCR didn't screw up some numbers.
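If you'd rather script the comma-removal and space-to-comma steps than do them by hand in a text editor, here is a minimal Python sketch. It assumes the table text has already been copied out of the PDF into a string; the function name `pasted_text_to_csv` and the sample data are made up for illustration. It also shows why the columns still need aligning afterwards: any label that contains a space gets split into separate fields.

```python
import csv
import io


def pasted_text_to_csv(text: str) -> str:
    """Turn whitespace-separated text pasted from a PDF into CSV text.

    Hypothetical helper: mirrors the manual steps (strip commas, then
    turn runs of spaces into field separators). It does not fix labels
    that contain spaces, so columns may still need aligning by hand.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for line in text.splitlines():
        cleaned = line.replace(",", "")  # drop thousands separators etc.
        fields = cleaned.split()         # split on any run of whitespace
        if fields:                       # skip blank lines
            writer.writerow(fields)
    return out.getvalue()


# Made-up sample rows standing in for text copied from one PDF page.
sample = "Widget A 1,200 3.5\nGadget 950 12.25\n"
print(pasted_text_to_csv(sample))
# Widget,A,1200,3.5
# Gadget,950,12.25
```

Note that the first row ends up with four fields and the second with three, which is exactly the kind of misalignment that has to be cleaned up in the spreadsheet afterwards.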
You may have to spend considerable time and effort editing the text to make sure the data is correct before sending it out to the ITAs. One could argue that once the ITA is done you're basically done for the day and may as well let the ITAs handle it, but that is your choice. This may sound like an extremely time-consuming and laborious task, and you only have so much time. You have to get the data there and onto the tables that the ITAs are using. Of course, you can save from Excel, OCR the data, and import it into R instead, but that is also very time-intensive. In the following video I will describe the steps I took to get the data from Excel as CSVs into LibreOffice and then use a spreadsheet editor to manipulate the data. Please note: A) I have shown.