MARCONet: Revolutionising image enhancement through Text Image Super-Resolution | Nanyang Technological University | Innovation and Entrepreneurship

Synopsis

MARCONet, an artificial intelligence (AI)-driven image enhancement technology, leverages advanced neural networks to improve document digitisation, archival preservation, data extraction, medical imaging and publishing quality.

Opportunity

In today's digital landscape, demand for clear, high-resolution text within images keeps growing. Our technology, the artificial intelligence (AI)-driven MARCONet, addresses this need by improving the quality of text in images, even when the source image is blurry or its degradation is unknown. This capability has implications across industries, from improving legibility in archival documents to enabling businesses and individuals to extract more value from their visual data.

Technology

Our Blind Text Image Super-Resolution technology, MARCONet, uses neural networks to enhance text within images. It introduces a novel prior that focuses on character structure. MARCONet learns this generative structure prior by reformulating a StyleGAN, and uses a transformer-based encoder to jointly predict the font style, the character bounding boxes and the characters' indexes in a codebook.
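The joint prediction described above can be pictured with a small data-structure sketch. This is not the authors' implementation: the names `CODEBOOK`, `EncoderPrediction` and `gather_structure_priors` are hypothetical, the codebook entries are made-up two-element lists standing in for learned features, and the StyleGAN-based generation step is omitted entirely. It only shows how one font style, per-character bounding boxes and per-character codebook indexes might fit together.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical toy codebook: one discrete structure feature per character class.
# Real entries would be learned feature tensors; short lists stand in here.
CODEBOOK: List[List[float]] = [
    [0.1, 0.9],  # index 0
    [0.7, 0.3],  # index 1
    [0.5, 0.5],  # index 2
]


@dataclass
class EncoderPrediction:
    """The three outputs the text says the encoder predicts jointly."""
    font_style: List[float]                 # one style vector shared by the text line
    boxes: List[Tuple[int, int, int, int]]  # (x0, y0, x1, y1) for each character
    code_indexes: List[int]                 # one codebook index for each character


def gather_structure_priors(pred: EncoderPrediction) -> List[Dict[str, object]]:
    """Pair each character's codebook entry with its box and the shared font style."""
    if len(pred.boxes) != len(pred.code_indexes):
        raise ValueError("need exactly one codebook index per bounding box")
    priors = []
    for box, idx in zip(pred.boxes, pred.code_indexes):
        priors.append({"box": box, "code": CODEBOOK[idx], "style": pred.font_style})
    return priors


# Example: a two-character line with illustrative (invented) values.
pred = EncoderPrediction(
    font_style=[0.2, 0.4],
    boxes=[(0, 0, 32, 32), (32, 0, 64, 32)],
    code_indexes=[2, 0],
)
priors = gather_structure_priors(pred)
```

In this sketch each entry of `priors` holds what a generator would need for one character: where it sits, which structure code to use, and the font style to render it in.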

Figure 1: Blind Chinese Text Super-Resolution.

Applications & Advantages

Document Digitisation: Enhances the quality of scanned documents, improving the accuracy of digitised text and data.
Archival Preservation: Preserves historical manuscripts and documents by enhancing faded or deteriorated text.
Data Extraction: Improves the performance of OCR systems for data extraction from images, streamlining data entry processes.
Medical Imaging: Enhances text readability in medical images, such as X-rays and MRIs, aiding diagnosis and analysis.
Publishing and Printing: Elevates the quality of printed materials by enhancing text in low-resolution source images.

Reference

https://www.mmlab-ntu.com/project/marconet/index.html
Xiaoming Li, Wangmeng Zuo and Chen Change Loy, Learning Generative Structure Prior for Blind Text Image Super-Resolution, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023. https://openaccess.thecvf.com/content/CVPR2023/papers/Li_Learning_Generative_Structure_Prior_for_Blind_Text_Image_Super-Resolution_CVPR_2023_paper.pdf
Tero Karras, Samuli Laine and Timo Aila, A Style-Based Generator Architecture for Generative Adversarial Networks, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019. https://openaccess.thecvf.com/content_CVPR_2019/papers/Karras_A_Style-Based_Generator_Architecture_for_Generative_Adversarial_Networks_CVPR_2019_paper.pdf

Inventor

Prof LOY Chen Change
