Document Analysis Systems VII [electronic resource] : 7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006. Proceedings / edited by Horst Bunke, A. Lawrence Spitz.Material type: TextLanguage: English Series: Lecture Notes in Computer Science: 3872Publisher: Berlin, Heidelberg : Springer Berlin Heidelberg, 2006Description: XIII, 630 p. Also available online. online resourceContent type: text Media type: computer Carrier type: online resourceISBN: 9783540321576Subject(s): Computer science | Information storage and retrieval systems | Computer simulation | Computer vision | Optical pattern recognition | Information systems | Computer Science | Pattern Recognition | Information Storage and Retrieval | Image Processing and Computer Vision | Simulation and Modeling | Computer Appl. in Administrative Data ProcessingAdditional physical formats: Printed edition:: No titleDDC classification: 006.4 LOC classification: Q337.5TK7882.P3Online resources: Click here to access online
Session 1: Digital Libraries -- Retrieval from Document Image Collections -- A Semi-automatic Adaptive OCR for Digital Libraries -- Session 2: Image Processing -- Contribution to the Discrimination of the Medieval Manuscript Texts: Application in the Palaeography -- Restoring Ink Bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique -- Networked Document Imaging with Normalization and Optimization -- Gray-Scale Thinning Algorithm Using Local Min/Max Operations -- Session 3: Handwriting 1 -- Automated Scoring of Handwritten Essays Based on Latent Semantic Analysis -- Aligning Transcripts to Automatically Segmented Handwritten Manuscripts -- Virtual Example Synthesis Based on PCA for Off-Line Handwritten Character Recognition -- Extraction of Handwritten Text from Carbon Copy Medical Form Images -- Session 4: Document Structure and Format -- Document Logical Structure Analysis Based on Perceptive Cycles -- A System for Converting PDF Documents into Structured XML Format -- XCDF: A Canonical and Structured Document Format -- Structural Analysis of Mathematical Formulae with Verification Based on Formula Description Grammar -- Session 5: Tables -- Notes on Contemporary Table Recognition -- Handwritten Artefact Identification Method for Table Interpretation with Little Use of Previous Knowledge -- Session 6: Handwriting 2 -- Writer Identification for Smart Meeting Room Systems -- Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme ‘th’ -- Segmentation of On-Line Handwritten Japanese Text Using SVM for Improving Text Recognition -- Application of Bi-gram Driven Chinese Handwritten Character Segmentation for an Address Reading System -- Session 7: Language and Script Identification -- Language Identification in Degraded and Distorted Document Images -- Bangla/English Script Identification Based on Analysis of Connected Component Profiles -- Script Identification from Indian Documents -- Finding the Best-Fit Bounding-Boxes -- Session 9: Systems and Performance Evaluation -- Towards Versatile Document Analysis Systems -- Exploratory Analysis System for Semi-structured Engineering Logs -- Ground Truth for Layout Analysis Performance Evaluation -- On Benchmarking of Invoice Analysis Systems -- Semi-automatic Ground Truth Generation for Chart Image Recognition -- Session 10: Retrieval and Segmentation -- Efficient Word Retrieval by Means of SOM Clustering and PCA -- The Effects of OCR Error on the Extraction of Private Information -- Combining Multiple Classifiers for Faster Optical Character Recognition -- Performance Comparison of Six Algorithms for Page Segmentation -- Posters -- HVS Inspired System for Script Identification in Indian Multi-script Documents -- A Shared Fragments Analysis System for Large Collections of Web Pages -- Offline Handwritten Arabic Character Segmentation with Probabilistic Model -- Automatic Keyword Extraction from Historical Document Images -- Digitizing a Million Books: Challenges for Document Analysis -- Toward File Consolidation by Document Categorization -- Finding Hidden Semantics of Text Tables -- Reconstruction of Orthogonal Polygonal Lines -- A Multiclass Classification Framework for Document Categorization -- The Restoration of Camera Documents Through Image Segmentation -- Cut Digits Classification with k-NN Multi-specialist -- The Impact of OCR Accuracy and Feature Transformation on Automatic Text Classification -- A Method for Symbol Spotting in Graphical Documents -- Groove Extraction of Phonographic Records -- Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval -- Robust Chinese Character Recognition by Selection of Binary-Based and Grayscale-Based Classifier -- Segmentation-Driven Recognition Applied to Numerical Field Extraction from Handwritten Incoming Mail Documents -- Performance Evaluation of Text Detection and Tracking in Video -- Document Analysis System for Automating Workflows -- Automatic Assembling of Cadastral Maps Based on Generalized Hough Transformation -- A Few Steps Towards On-the-Fly Symbol Recognition with Relevance Feedback -- The Fuzzy-Spatial Descriptor for the Online Graphic Recognition: Overlapping Matrix Algorithm.
DAS 2006 is the Seventh International Association for Pattern Recognition WorkshoponDocumentAnalysisSystemsandwasheldinNelson,NewZealand. DAS 2006 built on the tradition of past workshops held in Kaiserslautern, G- many (1994), Malvern,PA (1996), Nagano,Japan (1998),Rio de Janeiro,Brazil (2000),Princeton,NJ(2002),andFlorence,Italy(2004).Thegoalofthismeeting was to bring together those who have designed systems, or systems components, to solve real-world problems in document analysis. Documentanalysissystemsisinherentlyaninterdisciplinary?eldencompa- ing such diverse disciplines as image processing, pattern recognition, document structure and natural language processing. DAS 2006 attempted to bring these disciplinestogetherandtoprovideinteractionsbetweensystemsdevelopers,s- pliers and end users. We received 78 papers from 19 countries. Each submission was reviewed by three reviewers. In addition to the Program Committee members, 42 other reviewers helped in this process. From those submissions and their reviews, we went through the di?cult and sometimes painful process of ranking papers for acceptance or rejection. In the end we accepted 33 papers for oral presentation and 22 for presentation at poster sessions. We, the Co-chairmen of DAS 2006, wish to express our gratitude to all of our colleagues who have reviewed the papers submitted for this conference. We are proud to have brought two distinguished speakers to Nelson for keynote addresses: Ian Witten of the University of Waikato, the father of the NewZealandDigitalLibrary,andJamesFruchterman,apioneerinmodernc- mercial optical character recognition and currently CEO of Benetech. WeoweaspecialdebtofgratitudetoMarcusLiwickioftheUniversityofBern for his tireless workat maintaining the website,managingthe ?ow ofpapers and reviewsintotheConfMansystemandassemblingtheproceedingsforpublication by Springer. He was ably assisted by Andreas Schlapbach.