did some timing and it takes 10 mins to correctly recognize 5 forms including grabbing patient names, test dates and ids.
Actually, rather than giving up on not being able to allocate the scan correctly, it might be better to attach to a dummy patient and then move it later on .. that would be quicker.
Except if the enduser forgets about the dummy patient ! = lost scan. I'd pass on the dummy patient idea.
What a handy little inbox we have. ID, in your example is the name of the lab. Is that the plan ? ID = lab name ?
The ID field refers to a unique text string found on the form that will identify that form. In some cases where the lab uses a graphic, I have used a PO Box number, or a fax number on their form to identify them.
Here's a screen capture of the auto-filing utility in progress. The fields below the status area show the details for the last patient identified.
[quote user="Graham"] Here's a screen capture of the auto-filing utility in progress. The fields below the status area show the details for the last patient identified. [/quote] Cool GUI. I love the Trying Rule updates
I wonder if it might be too difficult to allow users to specify bitsets for the SSN/NHI. Eg: [#"0" - #"9" #"A" - #"Z" #"a" - #"z"] specifies that the SSN/NHI can only contain alphanumeric characters. or,<pre> [#"0" - #"9"]</pre> is numeric only ....
[quote user="Graham"] I wonder if it might be too difficult to allow users to specify bitsets for the SSN/NHI. Eg: [#"0" - #"9" #"A" - #"Z" #"a" - #"z"] specifies that the SSN/NHI can only contain alphanumeric characters. [/quote] OCR stuff tends to make letter/number errors ALOT. 1 = l and 0 = O. (See ! - hard to tell). I think it is a good idea to do the restriction if it will yield better results.
Added a new rule for one of my major lab vendors today - not really necessary since I get their labs via HL7. But .. there were two problems: There was no text I could use to identify the lab because they used a blue small font for the text I would otherwise use - and this doesn't OCR. In the end I ended up using the year string of the test date as it always appears in the same position. Just have to change it for next year. The first name and surname are on different parts of the form. I don't really want to add another field for the first name .. so I am just using the surname. This with the NHI number is sufficient for me to ID the patient.
One method of identification of the scans is to OCR the scan ... and with the IDENTIFIED extracted text .. you can probably figure out what it is. For me, I think this would be superior to the current proposed method. But theories are just theories ... testing is the key.
the flaw in this is that the keywords I want can't be ocr'd because they're in some tiny colored font and produce garbage in the ocr'd text. I don't know why our labs here want to produce printed results in what looks like a 8 point font! I am currently rewriting the OCR to use asynchronous tcp, as currently it uses synchronous tcp which leaves the GUI non responsive during the OCR process.