<html><head></head><body><div class="ydp6dd48f81yahoo-style-wrap" style="font-family:Helvetica Neue, Helvetica, Arial, sans-serif;font-size:16px;"><div></div>
<div dir="ltr" data-setdir="false">Wow your pdftotext settings worked.</div><div dir="ltr" data-setdir="false">You said I need to make my own script to strip the transactions from the file. Do you have any hints for me?</div><div dir="ltr" data-setdir="false">I can open the file in a text editor and just delete the unneeded text, but I was looking for something more hands off that will clean it up and move this to comma separated.</div><div dir="ltr" data-setdir="false">Thanks,</div><div dir="ltr" data-setdir="false">Aaron</div><div><br></div>
</div><div id="yahoo_quoted_9529723218" class="yahoo_quoted">
<div style="font-family:'Helvetica Neue', Helvetica, Arial, sans-serif;font-size:13px;color:#26282a;">
<div>
On Monday, December 28, 2020, 10:14:42 AM EST, pjfarley3@earthlink.net <pjfarley3@earthlink.net> wrote:
</div>
<div><br></div>
<div><br></div>
<div><div id="yiv3216725230"><style>#yiv3216725230 #yiv3216725230 --
_filtered {}
_filtered {}
_filtered {}
#yiv3216725230
#yiv3216725230 p.yiv3216725230MsoNormal, #yiv3216725230 li.yiv3216725230MsoNormal, #yiv3216725230 div.yiv3216725230MsoNormal
{margin:0in;font-size:11.0pt;font-family:sans-serif;}
#yiv3216725230 a:link, #yiv3216725230 span.yiv3216725230MsoHyperlink
{color:#0563C1;text-decoration:underline;}
#yiv3216725230 span.yiv3216725230EmailStyle18
{font-family:sans-serif;color:windowtext;}
#yiv3216725230 .yiv3216725230MsoChpDefault
{font-size:10.0pt;}
_filtered {}
#yiv3216725230 div.yiv3216725230WordSection1
{}
#yiv3216725230 </style><div><div class="yiv3216725230WordSection1"><p class="yiv3216725230MsoNormal">I had the same issue. My bank statements are also PDF and my bank provides no online access from KMM or any checking program other than Quicken, so I download the PDF from my bank website and use pdftotext (yes, there is a Windows version) to extract the statement to a text file that can then be processed by any text-processing language of your choice. My original bank statement text processing code was written (by me) in gawk, then later I switched to an awk derivative called miller.</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">For my bank’s PDF’s I found this to be the most effective way to extract the text, which (for the most part) preserves columns and headings from the PDF version. The key is the “-layout” and “-enc UTF-8” options:</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">C:\MyBankFolder> pdftotext -eol dos -cfg sample-xpdfrc -layout -nopgbrk -enc UTF-8 Bank-statement.PDF bank-statement.txt</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">Writing text-processing code to strip out only the transactions from the text file is something you have to write yourself.</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">The pdftotext executable download for Windows can be found here:</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal"><a rel="nofollow noopener noreferrer" shape="rect" target="_blank" href="https://www.xpdfreader.com/download.html">https://www.xpdfreader.com/download.html</a></p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">Select the “Windows 32/64-bit” download under “Download the Xpdf command line tools:”.</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">If you are interested, a very good Windows gawk can be found here:</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal"><a rel="nofollow noopener noreferrer" shape="rect" target="_blank" href="https://sourceforge.net/projects/ezwinports/files/">https://sourceforge.net/projects/ezwinports/files/</a></p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">The miller language executable is available here:</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal"><a rel="nofollow noopener noreferrer" shape="rect" target="_blank" href="https://github.com/johnkerl/miller">https://github.com/johnkerl/miller</a></p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">HTH</p><p class="yiv3216725230MsoNormal"> </p><p class="yiv3216725230MsoNormal">Peter</p><p class="yiv3216725230MsoNormal"> </p><div class="yiv3216725230yqt7645174602" id="yiv3216725230yqt58103"><div style="border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt;"><div><div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in;"><p class="yiv3216725230MsoNormal"><b>From:</b> KMyMoney <kmymoney-bounces@kde.org> <b>On Behalf Of </b>Aaron Mehl<br clear="none"><b>Sent:</b> Monday, December 28, 2020 8:47 AM<br clear="none"><b>To:</b> KMyMoney Users' Mailing List <kmymoney@kde.org><br clear="none"><b>Subject:</b> pdf to kmymoney</p></div></div><p class="yiv3216725230MsoNormal"> </p><div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">Hi all,</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">My bank statements are in pdf format.</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">I am looking for a clean easy way to import them into KMyMoney.</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">I googled and found a non-clean answer, paste it as text in excel and export as csv. </span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">The problem was the the huge amount of manual clean up I had to do.</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">I see a command line utility pdftotext but I still need a csv file to import. Is there a utility that will turn this text file into csv or some other way to do this.</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">Thanks,</span></p></div><div><p class="yiv3216725230MsoNormal"><span style="font-size:12.0pt;font-family:sans-serif;">Aaron</span></p></div></div></div></div></div></div></div></div>
</div>
</div></body></html>