The Lounge is rated Safe For Work. If you're about to post something inappropriate for a shared office environment, then don't post it. No ads, no abuse, and no programming questions. Trolling, (political, climate, religious or whatever) will result in your account being removed.
Working on an app that is very much table driven. I have "table data"; but not in text form; only images.
Windows 10 has OCR!
Using a PDF reader (and a nice zoom), I use the "Snipping Tool" to snip my "table", and save it as an image file. Then using the C# sample OCR app, I load the image and it extracts the text. I'm getting 95 to 100% recognition. The app highlights what it extracted from the image so you can see what "noise" was the issue (you could then pixel adjust the image, for example). Since it's possible to access clip board images, you could eliminate the file step altogether. Brilliant stuff, IMO.
I'm planning to use Azure Cognitive Services for a small project for some OCR.
More specifically, I'd like to automatically read dates, descriptions and prices on invoices when I upload them to my system.
It's probably a hell of a lot of work and will save me little time, but I just wouldn't feel like a developer if I'd let this chance slip to automate in two weeks what I can do in three minutes
It can be hard to get an exact like for like OCR but it can often work well enough.
I just tested the TIFF IFilter OCR ability and it seems to actually work (mostly). I took a screen shot of this forum page and saved it as a .tiff. I then used Windows Search to search for text within the file. I tried "must admit that I've never tested it" and it found it! (I admit that I had a lot of hits on the phrase so I added "kind:picture" to narrow it down to picture files so as to not have to scroll through pages of hits).
I tried various words in the screenshot and one word that the TIFF IFilter failed to OCR was "masochists". Don't ask me why.
I'll upload the TIFF somewhere if you want it but you can trivially create your own to test it with. You'll need to manually install the TIFF IFilter if you haven't already done so.
** edit **
I tested with some more search strings. When I accidentally searched for "makr them use EDI" (quotes included) it found nothing. When I corrected the spelling to "make them use EDI" (quotes included again) it instantly found it. Windows Search really is searching the OCR'd and indexed contents of the image file.
I was having a conversation about a client who uses Rust, and was curious as to why they chose the Rust language until it was pointed out they are very much into high performance web development. You can use Rust to create WebAssembly.
Ryan pointed me to AssemblyScript[^] which, given my new crush on TypeScript, looks really interesting. Still lots of holes in it, but looks fairly awesome.