<html>
<head>
<style>
.hmmessage P
{
margin:0px;
padding:0px
}
body.hmmessage
{
FONT-SIZE: 10pt;
FONT-FAMILY:Tahoma
}
</style>
</head>
<body class='hmmessage'>
...yes, sorry to dampen the euphoria, but my experience with tesseract was the same as Richard's: much better, but still far from what you get with the (effectively) free stuff you get automatically when you purchase your scanner. <BR>
While it's true that the best OCR is likely to be commercial, it's frustrating that there's stuff which does a very good job and is virtually given away, but can't be run under linux. <BR>
Peter<BR><BR>> From: cms0009@gmail.com<BR>> To: kde-linux@kde.org<BR>> Date: Fri, 23 Nov 2007 16:23:33 -0500<BR>> Subject: Re: [kde-linux] kooka scanning software<BR>> <BR>> On Thursday 22 November 2007 12:00:22 pm Bruce Bales wrote:<BR>> > On Thursday 22 November 2007 hh:mm:ss Emanoil Kotsev wrote:<BR>> > > hello, everybody.<BR>> > > I want to share my experience/opinion with/on gocr.<BR>> > ><BR>> > > I've tried this 2 years ago and still monitor the mailing list. Despite<BR>> > > of the fact that there is very slow improvement, I was/am disappointed by<BR>> > > the fact, that it only works with the latin character set. Above this it<BR>> > > is not possible to add own code for other character sets without<BR>> > > rewriting some parts of the code.<BR>> > ><BR>> > > 2 years ago the developers promised, they were planning to replace the<BR>> > > algorithms with such using vectors, which would make extending of j/gocr<BR>> > > (with plugins) easier, but as far as I know it has not been done yet.<BR>> > ><BR>> > > So I think it is a waste of time to discuss on gocr... The C code is<BR>> > > also very complicated to read ...<BR>> > ><BR>> > > shortly - disaster and a pity that there is no linux ocr program. I have<BR>> > > not ever heard if something commercial works under linux.<BR>> > ><BR>> > > I was using OmniPage Pro ... and few weeks ago I tried this in a VMWare<BR>> > > (it was working an year ago with a great success) . Now I have an error<BR>> > > that says that the license period has expired and it is also not working.<BR>> > > Well the program license was obtained in '97 .... but I don't remember to<BR>> > > have read something about 10 year period ... any way commercial OCR costs<BR>> > > and linux OCR sucks<BR>> > ><BR>> > > Let us hope that the feature will be better at least for the linux world.<BR>> > ><BR>> > > regards<BR>> ><BR>> > Emanoil,<BR>> > Linux does have a great OCR program -- it is called tesseract. It works<BR>> > beautifully with my English scans. I do not know if it works with other<BR>> > languages and fonts, but I think you should investigate it.<BR>> ><BR>> <BR>> Been trying this out with xsane, and its still not as good as the cheap window <BR>> OCR scanning software you get with the purchase of a scanner.<BR>> <BR>> there are some excellent commercial program for scanning under windows,<BR>> that will have a 99% scanning rate, for under a $100.00 and yet,<BR>> we don't have anything that can do that, for Linux ( Strike one )<BR>> <BR>> Linux needs a Full Blown OCR application on the same par as windows or better, <BR>> plus that can interface with any scanner.<BR>> <BR>> Richard<BR>> ___________________________________________________<BR>> This message is from the kde-linux mailing list.<BR>> Account management: https://mail.kde.org/mailman/listinfo/kde-linux.<BR>> Archives: http://lists.kde.org/.<BR>> More info: http://www.kde.org/faq.html.<BR><BR><br /><hr />Invite your mail contacts to join your friends list with Windows Live Spaces. It's easy! <a href='http://spaces.live.com/spacesapi.aspx?wx_action=create&wx_url=/friends.aspx&mkt=en-us' target='_new'>Try it!</a></body>
</html>