Automated OCR for Synapse

Graham · Nov 26, 2007

did some timing and it takes 10 mins to correctly recognize 5 forms including grabbing patient names, test dates and ids.

Jason · Nov 26, 2007

Were they all from the same lab ?

Graham · Nov 26, 2007

No, four different sources.

Graham · Nov 26, 2007

Actually, rather than giving up on not being able to allocate the scan correctly, it might be better to attach to a dummy patient and then move it later on .. that would be quicker.

Jason · Nov 26, 2007

Graham said:

Actually, rather than giving up on not being able to allocate the scan correctly, it might be better to attach to a dummy patient and then move it later on .. that would be quicker.
Click to expand...

Except if the enduser forgets about the dummy patient ! = lost scan.

I'd pass on the dummy patient idea.

Graham · Nov 26, 2007

No, because the dummy patient's results still end up in your inbox.

Jason · Nov 26, 2007

Graham said:

No, because the dummy patient's results still end up in your inbox.
Click to expand...

What a handy little inbox we have.

ID, in your example is the name of the lab. Is that the plan ? ID = lab name ?

Graham · Nov 26, 2007

The ID field refers to a unique text string found on the form that will identify that form.

In some cases where the lab uses a graphic, I have used a PO Box number, or a fax number on their form to identify them.

Graham · Nov 27, 2007

Here's a screen capture of the auto-filing utility in progress. The fields below the status area show the details for the last patient identified.

Jason · Nov 27, 2007

[quote user="Graham"]

Here's a screen capture of the auto-filing utility in progress. The fields below the status area show the details for the last patient identified.

[/quote]

Cool GUI. I love the Trying Rule updates

Graham · Nov 28, 2007

Updated screenshot with data masked out

Graham · Nov 28, 2007

I wonder if it might be too difficult to allow users to specify bitsets for the SSN/NHI.

Eg: [#"0" - #"9" #"A" - #"Z" #"a" - #"z"]

specifies that the SSN/NHI can only contain alphanumeric characters.

or,<pre> [#"0" - #"9"]</pre>

is numeric only ....

Jason · Nov 28, 2007

[quote user="Graham"]

I wonder if it might be too difficult to allow users to specify bitsets for the SSN/NHI.

Eg: [#"0" - #"9" #"A" - #"Z" #"a" - #"z"]

specifies that the SSN/NHI can only contain alphanumeric characters.

[/quote]

OCR stuff tends to make letter/number errors ALOT. 1 = l and 0 = O. (See ! - hard to tell).

I think it is a good idea to do the restriction if it will yield better results.

Graham · Nov 28, 2007

Latest version allows you to specify alpha, alphanumeric etc.

This has improved the results.

Graham · Nov 29, 2007

Added a new rule for one of my major lab vendors today - not really necessary since I get their labs via HL7.

But .. there were two problems:

There was no text I could use to identify the lab because they used a blue small font for the text I would otherwise use - and this doesn't OCR. In the end I ended up using the year string of the test date as it always appears in the same position. Just have to change it for next year.

The first name and surname are on different parts of the form. I don't really want to add another field for the first name .. so I am just using the surname. This with the NHI number is sufficient for me to ID the patient.

Jason · Nov 29, 2007

One method of identification of the scans is to OCR the scan ... and with the IDENTIFIED extracted text .. you can probably figure out what it is.

For me, I think this would be superior to the current proposed method.

But theories are just theories ... testing is the key.

Graham · Dec 1, 2007

the flaw in this is that the keywords I want can't be ocr'd because they're in some tiny colored font and produce garbage in the ocr'd text.

I don't know why our labs here want to produce printed results in what looks like a 8 point font!

I am currently rewriting the OCR to use asynchronous tcp, as currently it uses synchronous tcp which leaves the GUI non responsive during the OCR process.

Log in or Sign up

Automated OCR for Synapse

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Attached Files:

synapseOCR.medlab.png

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Attached Files:

synapse.autofiling.ID.name.NHI.date.png

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Share This Page

Log in or Sign up

Automated OCR for Synapse

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Attached Files:

synapseOCR.medlab.png

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Attached Files:

synapse.autofiling.ID.name.NHI.date.png

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Graham Developer Staff Member

Jason Developer / Handyman Staff Member

Graham Developer Staff Member

Share This Page

Useful Searches