Optical character recognition: Difference between revisions

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.
Revision as of 18:03, 25 October 2007 by 199.72.115.66 (no edit summary), compared with the revision as of 18:04, 25 October 2007 by 199.72.115.66 (blanked the page).
{{SpecialChars}}

'''Optical character recognition''', usually abbreviated to '''OCR''', is the mechanical or electronic translation of images of handwritten or typewritten text (usually captured by a scanner) into machine-editable text.

OCR is a [...] Recognition.

Recognition of cursive text is an active area of research, with recognition rates even lower than those for hand-printed text. Higher recognition rates for general cursive script will likely not be possible without the use of contextual or grammatical information. For example, recognizing entire [...]

[...] the mid [...] at [...] and other institutions. Successive efforts were made to localize and remove musical staff lines, leaving symbols to be recognized and parsed. The first proprietary music-scanning program, MIDISCAN, was released in 1991. Three proprietary products are currently available. At this time, OCR software does not recognize handwritten scores.
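The staff-line removal step described above can be illustrated with a minimal sketch. It assumes a binarized page stored as a list of rows of 0/1 pixels and treats any row whose ink fraction exceeds a threshold as a staff line. The function name `remove_staff_lines` and the 0.8 threshold are illustrative assumptions, not taken from MIDISCAN or any other program mentioned here, and the sketch assumes perfectly horizontal lines.

```python
def remove_staff_lines(image, threshold=0.8):
    """Blank out rows that look like staff lines in a binary image.

    image: list of equal-length rows, each a list of 0 (paper) / 1 (ink).
    threshold: hypothetical cutoff; a row counts as a staff line when
    more than this fraction of its pixels are ink.
    Returns (cleaned_image, sorted_staff_row_indices).
    """
    height = len(image)

    # Pass 1: horizontal projection. Rows that are almost entirely ink
    # are taken to be staff lines.
    staff_rows = {
        y for y, row in enumerate(image)
        if row and sum(row) / len(row) > threshold
    }

    # Pass 2: erase staff-line pixels, but keep a pixel when ink continues
    # into a neighbouring non-staff row, i.e. a symbol crosses the line.
    cleaned = [row[:] for row in image]
    for y in sorted(staff_rows):
        for x, pixel in enumerate(image[y]):
            above = (y > 0 and (y - 1) not in staff_rows
                     and image[y - 1][x] == 1)
            below = (y + 1 < height and (y + 1) not in staff_rows
                     and image[y + 1][x] == 1)
            cleaned[y][x] = 1 if (pixel == 1 and (above or below)) else 0

    return cleaned, sorted(staff_rows)


# A 4x6 toy page: row 1 is a staff line crossed by a vertical stem at x = 2.
page = [
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
cleaned_page, staff = remove_staff_lines(page)
```

In the toy page the full-width row is detected as a staff line and erased, while the pixel where the stem crosses it is kept. A practical system would also have to cope with skewed or curved staves and with thick lines, which this sketch does not attempt.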

== Magnetic ink character recognition ==

One area where the accuracy and speed of computer input of character information exceed those of humans is magnetic ink character recognition, where error rates are around one read error for every 20,000 to 30,000 checks.
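To put the quoted figure in more familiar terms, one read error per 20,000 to 30,000 checks corresponds to a per-check error rate of roughly 0.003% to 0.005%. The short sketch below does that conversion; the function names and the batch size of one million checks are illustrative assumptions only.

```python
def read_error_rate(checks_per_error):
    """Per-check read error rate for 'one error in every N checks'."""
    if checks_per_error <= 0:
        raise ValueError("checks_per_error must be positive")
    return 1.0 / checks_per_error


def expected_errors(total_checks, checks_per_error):
    """Expected number of read errors in a batch of total_checks."""
    return total_checks * read_error_rate(checks_per_error)


# The two ends of the range quoted in the text.
for n in (20_000, 30_000):
    rate = read_error_rate(n)
    batch = expected_errors(1_000_000, n)  # hypothetical batch size
    print(f"1 error per {n:,} checks = {rate:.4%}, "
          f"about {batch:.0f} errors per million checks")
```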

== Optical Character Recognition in Unicode ==

In Unicode, ''Optical Character Recognition'' symbol characters are placed in the hexadecimal range 0x'''2440'''–0x'''245F''', as shown below:

{| class="wikitable"
! Symbol !! Hex !! Name
|- class="Unicode"
| ⑀ || 0x2440 || OCR Hook
|- class="Unicode"
| ⑁ || 0x2441 || OCR Chair
|- class="Unicode"
| ⑂ || 0x2442 || OCR Fork
|- class="Unicode"
| ⑃ || 0x2443 || OCR Inverted Fork
|- class="Unicode"
| ⑄ || 0x2444 || OCR Belt Buckle
|- class="Unicode"
| ⑅ || 0x2445 || OCR Bow Tie
|- class="Unicode"
| ⑆ || 0x2446 || OCR Branch Bank Identification
|- class="Unicode"
| ⑇ || 0x2447 || OCR Amount Of Check
|- class="Unicode"
| ⑈ || 0x2448 || OCR Customer Account Number
|- class="Unicode"
| ⑉ || 0x2449 || OCR Dash
|- class="Unicode"
| ⑊ || 0x244A || OCR Double Backslash
|-
| &nbsp; || 0x244B || <small>Classified</small>
|-
| &nbsp; || 0x244C || <small>Not Defined</small>
|-
| &nbsp; || 0x244D || <small>Not Defined</small>
|-
| &nbsp; || 0x244E || <small>Not Defined</small>
|}
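The code points in this range can also be listed programmatically. The sketch below uses Python's standard `unicodedata` module to walk the range 0x2440–0x245F and print every code point that has an assigned name; the constant and function names are illustrative assumptions. The names printed come from the Unicode database bundled with the Python interpreter and may not match the labels in the table one-for-one (the formal names for 0x2448 and 0x2449 in particular appear to be assigned the other way round).

```python
import unicodedata

OCR_BLOCK_START = 0x2440
OCR_BLOCK_END = 0x245F  # inclusive, as stated in the text


def ocr_block_entries():
    """Yield (code_point, character, name) for each named character
    in the range; code points without a name are skipped."""
    for cp in range(OCR_BLOCK_START, OCR_BLOCK_END + 1):
        ch = chr(cp)
        name = unicodedata.name(ch, None)  # None when unassigned
        if name is not None:
            yield cp, ch, name


for cp, ch, name in ocr_block_entries():
    print(f"0x{cp:04X}  {ch}  {name}")
```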

== OCR software ==

* FineReader OCR
* VERUS

== See also ==

* [...] - optical character recognition technology system used in clinical trials

== External links ==

* [...], a comprehensive conference on all aspects of document recognition

