StudentShare
Contact Us
Sign In / Sign Up for FREE
Search
Go to advanced search...
Free

Optical Character Recognition System - Assignment Example

Cite this document
Summary
The paper "Optical Character Recognition System " highlights that the current OCR software is able to recognize source material with different fonts. However, the software experiences problems in recognizing script and handwriting fonts that imitate handwriting…
Download full paper File format: .doc, available for editing
GRAB THE BEST PAPER98.8% of users find it useful

Extract of sample "Optical Character Recognition System"

Name Optical character recognition system Tutor Institution Date Introduction Optical character recognition system is kind of software that was engineered to convert typewritten or hand-written text documents into machine text that can be formatted. Entering data through optical character recognition is quite fast, more efficient and more accurate. In most cases of conversion of text, the OCR offers a substitute to keyboarding or data entry. Before using the optical character recognition, the source material is first scanned specifically using an optical scanner but sometimes a particular circuit board in the PC. The purpose of the scan is to read the source material as a pattern of dots or rather as a bitmap. Software that recognizes the images is also needed. The OCR software afterwards processes the scans so as to differentiate between text and images and also establish what letters are represented in the dark as well as light areas (Lais, 2000). Image Segmentation Image segmentation is a pizel classification process that is purposed to segment or extract regions or objects from the background. It described as a significant preprocessing step to the ultimate success of image recognition, image compression, image visualization and image retrieval. One should note that there is no set sequence to the approach to segmentation. Various types of parts of a scene can act as the segments upon which descriptions are based on and there are a variety of ways a person can utilize to extract the parts from the image. Appropriate selection of segmentation technique is based on the type of applications and images. In optical character recognition, the text is detached from the document image and further divided into lines, words, columns, and connected constituents. In the process of building character sub-images, a person is confronted with broken or touching characters that appear in degraded documents like scan, fax and photocopy. The evaluation that has been made concerning segmentation algorithms is subjective and hence people make their judgment based on the outcome of several segmented images or according to their intuition. However, the segmentation algorithms can be evaluated through the use of symmetric along with asymmetric distance metric alternatives (Shih, 2009). In today’s OCR engines, the technology multiple algorithms of neutral network is added to examine the discontinuity line between the text characters, the stroke edge and the background. With the presence of irregularities of having printed ink on paper, every algorithm then makes an average of the dark and light along the stroke side and through matching it to known character it makes a smart guess concerning what character it is. Further, the OCR software polls the results or finds the average from the entire algorithm to get a single reading (Lais, 2002). Methods of segmentation Thresholding Thresholding is one of the different methods of segmentation. This method is based on the hypothesis that clusters in the histogram correlate to either objects or background of interest that can be got by separating the histogram clusters. This is a process where the gray-scale or color- image is minimized to a binary image by use of an optimal threshold. The purpose is to extract the pixels from an image which characterize an object such as a line image like a map or graph or a text image. Even though the information is represented in binary form, the pixels represent a wide range of intensities. Therefore the purpose of binarization is to mark those pixels that belong to the background regions with diverse intensities and true foreground regions with a definite intensity (Devi, 2006)). An effective thresholding algorithm must preserve semantic and logical content. There are two forms of thresholding algorithms namely adaptive or local thresholding algorithms and global thresholding algorithms. In local thresholding, diverse threshold values for various local areas are applied. In global thresholding, a particular threshold for each and every image pixels is used (Devi, 2006). It is not possible to segment all images successfully into background and foreground when simple thresholding is used. In cases where the intensity distribution of objects in both the foreground and background is different, this method is quite applicable. In adaptive thresholding, various threshold values for various local areas are applied. There are diverse methods of selecting the threshold value (Shih, 2009). Edge detection In edge detection method, the edges classify boundaries and the regions are enclosed within these edges. Edge detection is an important tool in computer vision especially in areas of feature extraction and feature detection. The purpose is to identify the regions where the brightness of the image changes sharply in a digital image. Determining the position of the edges is the first step in segmenting images into regions (Lais, 2002). When edge detector is applied in an image, the result is a set of connected curves that show the boundaries of objects and the boundaries of surface markings. It means that when edge detection algorithms is used, the amount of data purposed to be processed in the image may be reduced hence filtering out information regarded as irrelevant and at the same time preserving the significant structural properties of an image (Shih, 2009). Split and merge operations Segmentation algorithms that employ the split and merge operations are basically restricted to a split stage which is consequently followed by a merge stage. The pure merging methods are computationally expensive since they begin from such small individual points. However, this can be made efficient by recursively dividing the image into much smaller and smaller regions until the point where single regions are coherent and then merge the images recursively to come up with larger coherent regions. The first step therefore is splitting the image where the entire image is considered as one region. In cases where the entire region is coherent that is all the pixels represented in the region are similar, then the region is left unmodified. However, where the region is not adequately coherent, splitting must be done (Govindaraju & Setlur, 2009). Feature extraction Selecting the feature extraction method is basically the most fundamental factor in the achievement of high recognition pattern. Feature extraction is defined as problem of extracting the most relevant information from raw data for classification purposes. The sense is to enhance the between -class pattern variability while minimizing the pattern within- class. It should be noted that various feature extraction methods conform to this requirement to certain degree, depending on the available data and particular recognition problem (Jain, 1996). In OCR, a feature extraction method may prove to be successful in a single application domain but fail to work successfully in another application domain (Yampolskiy, 2007). A good feature extraction method should be that which give the best selection method for a particular application. It is wise to consider whether the characters under scrutiny have known size and orientation, whether they are typed, machine-printed or handwritten and to what level they are degraded. Some methods of feature extraction work best on gray-level sub-images of particular characters while other do best on solid four-or- eight- attached symbols divided from the binary raster image, thinned symbols or symbol contours. Furthermore, the format of the extracted features should match with the requirements of the selected classifier. Grammar or graph based descriptions of the characters are best for syntactic or structural classifier. On the other hand discrete features that may take two or three different values are well suited for decision trees (Jain, 1996). Pattern recognition is a discipline whose main goal is classifying objects into various classes or categories. Depending on the application, the objects can either be signal waveforms or images or any other form of measurement that require to be classified. Character recognition is a very significant area of pattern recognition. The optical character recognition basically has a front end device which is composed of a scan lens, a light source, a detector and a document transport. The variation of light intensity is translated into numbers at the output of the light sensitive detector and hence an image array is formed. In the process, a series of image processing procedures are applied resulting to line and character segmentation (Govindaraju & Setlur, 2009). Next, the pattern recognition software then proceeds to recognize the characters a process which involves classifying each and every character in its correct number, letter and punctuation class. There is an advantage of storing the recognized document rather than the scanned image. There is a great interest invested in systems that identify handwriting besides the printed character recognition systems. A very good example of an application of such systems is in the machine that reads bank checks. It is a requirement that the machine recognizes the amount stated in digits and figures and match them appropriately. In addition, the machine could check whether the account to be credited corresponds to the payee. Much labor could be saved by the machine even though only half of the checks are checked correctly. A similar application is in the postal code identification where automatic mail-sorting machines are used in the post offices (Jain, 1996). Classification Image classification explores the numerical properties of different image features and further organizes data into groups. One mode of classification in optical character recognition is classification of the blocks which is a form of text and non-text block reconstruction. Spatial domain features opens a door way for good block segmentation of grayscale postal images. The classification error is reduced when first pass classification is combined with context-based classification. The document classification trials automate the process of recognizing, sorting as well as routing of documents to their right destination (Shih, 2009). Conclusion The current OCR software is able to recognize source material with different fonts. However, the software experiences problems in recognizing script and handwriting fonts that imitate handwriting. This will not be a problem soon as developers are considering various approaches that will improve handwriting as well as script recognition. Advances have made the system reliable as a minimum of 90% accuracy is expected for average quality documents. The initial step in achieving better recognition starts with the scanner. The quality of the scanner’s light arrays device ultimately affects the OCR results. The more tightly the arrays are packed, the more the scanner can detect more distinct colors and finer image. Background or smudges color can make a fool to the recognition software and therefore to improve the recognition rate or refine the image the scan resolution can be adjusted. Quadratic Integral Ratio (QIR) algorithm (Global threshold) Three sub images of QIR method References Devi, A. 2006. Thresholding: A Pixel-Level Image Processing Methodology Preprocessing Technique for an OCR System for the Brahmi Script. http://www.ancient-asia- journal.com/index.php/aa/article/viewArticle/13/25 Retrieved 21st April, 2011 Govindaraju, V.& Setlur, S. 2009. Guide to OCR for Indic Scripts: Document Recognition and Retrieval. New York, Splinger. Jain, A.1996 FEATURE EXTRACTION METHODS FOR CHARACTER RECOGNITION--A SURVEYhttp://www.cs.sfu.ca/cc/414/li/material/refs/OCR-survey- 96.pdf Retrieved 21st April, 2011 Lais, S. 2002. QuickStudy: Optical Character Recognition http://www.computerworld.com/s/article/73023/Optical_Character_Recognition July 29, 2002 Retrieved 21st April, 2011 Shih, F. 2009. Image Processing and Pattern Recognition: Fundamentals and Techniques. New York,Wiley-IEEE Yampolskiy, R. 2007. Feature Extraction Approaches for Optical Character Recognition. Briviba Scientific Press, Read More
Cite this document
  • APA
  • MLA
  • CHICAGO
(Optical Character Recognition Steps (System) Example | Topics and Well Written Essays - 1846 words, n.d.)
Optical Character Recognition Steps (System) Example | Topics and Well Written Essays - 1846 words. https://studentshare.org/logic-programming/2046019-optical-character-recognition-steps-system
(Optical Character Recognition Steps (System) Example | Topics and Well Written Essays - 1846 Words)
Optical Character Recognition Steps (System) Example | Topics and Well Written Essays - 1846 Words. https://studentshare.org/logic-programming/2046019-optical-character-recognition-steps-system.
“Optical Character Recognition Steps (System) Example | Topics and Well Written Essays - 1846 Words”. https://studentshare.org/logic-programming/2046019-optical-character-recognition-steps-system.
  • Cited: 0 times

CHECK THESE SAMPLES OF Optical Character Recognition System

Human Factors in the Design of the Input Systems

In this scenario, given below are some of the important aspects which can have a significant influence on the system design: (Kirwan, et al.... SYSTEMS ANALYSIS AND DESIGN Systems Analysis and Design Author Author Affiliation Date Question No 1 Discuss why human factors should or should not be included into the design of the input systems....
4 Pages (1000 words) Assignment

Pattern Recognition Using Neural Network

Pattern Recognition or optical character recognition (OCR) is a pipelined process consisting of several stages in proper sequence.... Pattern recognition is one of the most challenging processes in today's technology that includes character recognition, handwriting identification and facial image analysis etc. ... There are various algorithm or computer processes available for pattern recognition.... One such example is Brian Sanderson's Pattern recognition (PR) Algorithm....
12 Pages (3000 words) Essay

Pattern Recognition Using Neural Network

Pattern Recognition or optical character recognition (OCR) is a pipelined process consisting of several stages in proper sequence.... Pattern recognition is one of the most challenging processes in.... ach character is represented as a combination of pixels.... Figure 3 depicts the way pixel forms one particular character.... A typical character image is 6464 pixels large and for each such pixel 256 grey values are required making feature space large....
7 Pages (1750 words) Essay

How a Document Imaging Concept Has Evolving in Modern Civilization

From the paper "How a Document Imaging Concept Has Evolving in Modern Civilization" it is clear that the document imaging concept has offered the users with reduced cost for managing large volumes of documents.... The document imaging concept also enhances the retrieval efficiency of a document.... ...
7 Pages (1750 words) Essay

Computer Studies

The paper "Computer Studies" presents that the meaning of the term Computer is a system that performs the calculation on data & processes it to give the desired output.... The starting was from SSI (Small Scale Integration), but, soon the density of the ICs in a system increased that fell under medium-scale integration (MSI).... We will understand the organization of the basic computer system step by step.... But further if we think about the speed of operation of various units, it is difficult to achieve synchronization between units in our present system containing only three components....
22 Pages (5500 words) Research Paper

Urdu Script Recognition

An Optical Character Recognition System allows us to take a magazine or book or article, feed it straightly into an electronic computer data file, moreover edit the file by means of a word processor (webopedia, 2009).... In this research, I will discuss the optical character recognition.... Here I will discuss different algorithms and techniques regarding the Urdu optical character recognition presented for the enhanced character recognition....
5 Pages (1250 words) Assignment

The Difference between the Summary Report and Exception Report

optical character recognition is the kind of technology that enable imaging and scanning systems the ability to turn images of characters that are machine-printed into characters that are machine readable.... The graphs are able to assist the users of the system to grasp data and trends relationships that are not easily decipherable in number columns.... source document is a document that contains the data that has to be entered into the system database....
6 Pages (1500 words) Math Problem

Car Number Plate Recognition

The first introduces the origins of digital image processing, also a little resume about the following algorithms that are needed for developing the system ANPR.... It is evaluated the system's speed and error rate.... This report "Car Number Plate recognition" sheds some light on the car number plate photo that was filtered and new images produced had the same numbers but color and characters, it can be stated that they are almost similar to the original image....
6 Pages (1500 words) Report
sponsored ads
We use cookies to create the best experience for you. Keep on browsing if you are OK with that, or find out how to manage cookies.
Contact Us