Welcome!

News Feed Item

Fujitsu Develops Image Restoration Technology Capable of Making A3-Sized PDFs Using A4 Scanner

Creates PDFs of double-sided A3 documents with minimal distortion and less than 20% of the manual effort

Kawasaki, Japan, Nov 13, 2012 - (JCN Newswire) - Fujitsu Laboratories Limited today announced the development of image restoration technology that can generate PDFs from multi-page A3 documents fed as a batch into an A4 scanner.

Typically, scanning both sides of a double-sided A3 document with an A4 scanner involves folding each sheet in half and manually feeding it twice into the scanner. With Fujitsu Laboratories' new technology, by simply cutting A3 sheets in half and scanning them using a scanner's automatic feeder, the original A3 layout of the scanned images can be automatically detected and composite A3 images assembled. At the same time, image correction is applied so that the boundaries between left and right halves are inconspicuous. As a result, A3 documents can be converted to PDFs using an A4 scanner with less than 20% of the effort.

Details of this technology are being presented at the International Conference on Pattern Recognition (ICPR) 2012, beginning November 12 at the Tsukuba International Congress Center, and at the December Study Group of CVIM2012, beginning December 3 at Yokohama National University.

Background

With the spread of the paperless office, more and more existing paper documents are being converted to PDFs for electronic storage. While compact desktop scanners can efficiently handle the PDF conversion of paper documents, A4-sized scanners are most frequently employed, and there has been no easy way using them to scan A3 documents. The typical approach for scanning an A3 document has been to fold the document in half and then manually feed each sheet carefully into a two-sided scanner. This requires considerable effort, particularly for multi-page documents.

Technological Issues

By cutting an A3 document in half, the automatic sheet feeder on a scanner can be used for batch-mode scanning to avoid the effort of folding and manually feeding each sheet. At the same time, this approach creates its own problems:

1. Batch-scanning a multi-page document means the left and right halves of each A3 document can easily wind up being mixed together in no particular order, making it difficult to reassemble the original A3 images.

2. When paper is being fed into the scanner, sheets may slide around or be fed in at slightly different speeds. After having scanned the left and right halves separately, this will create mismatches in text and figures at the boundary between the two when reassembling the original.

About the New Technology

Fujitsu Laboratories has developed a technology that, after cutting multi-page A3 documents in half and batch-scanning the images, they are restored to their original A3 layout. Key features of this technology are as follows.

1. Automatic estimation of image grouping to restore A3 document image

From the intermixed scanned images of the left and right halves of an A3 document, the technology will automatically estimate how images are grouped to recreate the original A3-page layout.

2. Correction of localized stretching in scanned images

This technology corrects localized stretching in scanned images, thereby enabling lines, text and diagrams to come together naturally at the boundary when joining left- and right-side scanned images of an original A3 document (Figure 3).

Results

This technology makes it possible to easily scan multi-page A3 documents with a compact A4-sized scanner. Compared to the previous approach of folding in half and scanning each A3 sheet, the manual labor involved in this method requires less than 20% of the effort and produces composite A3 documents with fewer boundary mismatches than existing methods.

Future Plans

To further accelerate image processing, Fujitsu Laboratories is aiming to equip A4-size scanners with this functionality. The company will also move forward on developing technology that generates scans the same size as the original image, even for documents larger than A3 cut into more than two pieces, simply by scanning their separate parts.

About Fujitsu Laboratories

Founded in 1968 as a wholly owned subsidiary of Fujitsu Limited, Fujitsu Laboratories Limited is one of the premier research centers in the world. With a global network of laboratories in Japan, China, the United States and Europe, the organization conducts a wide range of basic and applied research in the areas of Next-generation Services, Computer Servers, Networks, Electronic Devices and Advanced Materials. For more information, please see: http://jp.fujitsu.com/labs/en.

About Fujitsu Limited

Fujitsu is the leading Japanese information and communication technology (ICT) company offering a full range of technology products, solutions and services. Over 170,000 Fujitsu people support customers in more than 100 countries. We use our experience and the power of ICT to shape the future of society with our customers. Fujitsu Limited (TSE:6702) reported consolidated revenues of 4.5 trillion yen (US$54 billion) for the fiscal year ended March 31, 2012. For more information, please see www.fujitsu.com.



Source: Fujitsu Limited

Contact:
Fujitsu Limited
Public and Investor Relations
www.fujitsu.com/global/news/contacts/
+81-3-3215-5259

Technical Contacts

Fujitsu Laboratories Ltd.
Media Processing System Laboratories
Image Computing Lab.
E-mail: [email protected]


Copyright 2012 JCN Newswire. All rights reserved. www.japancorp.net

More Stories By JCN Newswire

Copyright 2008 JCN Newswire. All rights reserved. Republication or redistribution of JCN Newswire content is expressly prohibited without the prior written consent of JCN Newswire. JCN Newswire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.

Latest Stories
Redis is not only the fastest database, but it has become the most popular among the new wave of applications running in containers. Redis speeds up just about every data interaction between your users or operational systems. In his session at 18th Cloud Expo, Dave Nielsen, Developer Relations at Redis Labs, shared the functions and data structures used to solve everyday use cases that are driving Redis' popularity.
20th Cloud Expo, taking place June 6-8, 2017, at the Javits Center in New York City, NY, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy.
Internet-of-Things discussions can end up either going down the consumer gadget rabbit hole or focused on the sort of data logging that industrial manufacturers have been doing forever. However, in fact, companies today are already using IoT data both to optimize their operational technology and to improve the experience of customer interactions in novel ways. In his session at @ThingsExpo, Gordon Haff, Red Hat Technology Evangelist, will share examples from a wide range of industries – includin...
WebRTC is the future of browser-to-browser communications, and continues to make inroads into the traditional, difficult, plug-in web communications world. The 6th WebRTC Summit continues our tradition of delivering the latest and greatest presentations within the world of WebRTC. Topics include voice calling, video chat, P2P file sharing, and use cases that have already leveraged the power and convenience of WebRTC.
Without lifecycle traceability and visibility across the tool chain, stakeholders from Planning-to-Ops have limited insight and answers to who, what, when, why and how across the DevOps lifecycle. This impacts the ability to deliver high quality software at the needed velocity to drive positive business outcomes. In his general session at @DevOpsSummit at 19th Cloud Expo, Phil Hombledal, Solution Architect at CollabNet, discussed how customers are able to achieve a level of transparency that e...
Much of the value of DevOps comes from a (renewed) focus on measurement, sharing, and continuous feedback loops. In increasingly complex DevOps workflows and environments, and especially in larger, regulated, or more crystallized organizations, these core concepts become even more critical. In his session at @DevOpsSummit at 18th Cloud Expo, Andi Mann, Chief Technology Advocate at Splunk, showed how, by focusing on 'metrics that matter,' you can provide objective, transparent, and meaningful f...
"We build IoT infrastructure products - when you have to integrate different devices, different systems and cloud you have to build an application to do that but we eliminate the need to build an application. Our products can integrate any device, any system, any cloud regardless of protocol," explained Peter Jung, Chief Product Officer at Pulzze Systems, in this SYS-CON.tv interview at @ThingsExpo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
"We are the public cloud providers. We are currently providing 50% of the resources they need for doing e-commerce business in China and we are hosting about 60% of mobile gaming in China," explained Yi Zheng, CPO and VP of Engineering at CDS Global Cloud, in this SYS-CON.tv interview at 19th Cloud Expo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at 20th Cloud Expo, Ed Featherston, director/senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
Between 2005 and 2020, data volumes will grow by a factor of 300 – enough data to stack CDs from the earth to the moon 162 times. This has come to be known as the ‘big data’ phenomenon. Unfortunately, traditional approaches to handling, storing and analyzing data aren’t adequate at this scale: they’re too costly, slow and physically cumbersome to keep up. Fortunately, in response a new breed of technology has emerged that is cheaper, faster and more scalable. Yet, in meeting these new needs they...
"Once customers get a year into their IoT deployments, they start to realize that they may have been shortsighted in the ways they built out their deployment and the key thing I see a lot of people looking at is - how can I take equipment data, pull it back in an IoT solution and show it in a dashboard," stated Dave McCarthy, Director of Products at Bsquare Corporation, in this SYS-CON.tv interview at @ThingsExpo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
@DevOpsSummit taking place June 6-8, 2017 at Javits Center, New York City, is co-located with the 20th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. @DevOpsSummit at Cloud Expo New York Call for Papers is now open.
IoT is rapidly changing the way enterprises are using data to improve business decision-making. In order to derive business value, organizations must unlock insights from the data gathered and then act on these. In their session at @ThingsExpo, Eric Hoffman, Vice President at EastBanc Technologies, and Peter Shashkin, Head of Development Department at EastBanc Technologies, discussed how one organization leveraged IoT, cloud technology and data analysis to improve customer experiences and effici...
"We are an all-flash array storage provider but our focus has been on VM-aware storage specifically for virtualized applications," stated Dhiraj Sehgal of Tintri in this SYS-CON.tv interview at 19th Cloud Expo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
Fact is, enterprises have significant legacy voice infrastructure that’s costly to replace with pure IP solutions. How can we bring this analog infrastructure into our shiny new cloud applications? There are proven methods to bind both legacy voice applications and traditional PSTN audio into cloud-based applications and services at a carrier scale. Some of the most successful implementations leverage WebRTC, WebSockets, SIP and other open source technologies. In his session at @ThingsExpo, Da...