University of Maryland Division of Research

Mellon Grant Funds Continuation of Islamicate Text Digitization Project

A $1.75 million grant will help expand digital access to Arabic, Persian, Ottoman Turkish and Urdu manuscripts and books.

July 05, 2022

The University of Maryland has received a $1.75 million grant from the Mellon Foundation to continue development of open-source technology to expand digital access to manuscripts and books from the premodern Islamicate world in Arabic, Persian, Ottoman Turkish and Urdu.

Matthew Thomas Miller, assistant professor in the Roshan Institute for Persian Studies in the School of Languages, Literatures, and Cultures, leads the interdisciplinary team of researchers, including David Smith from Northeastern University, Sarah Bowen Savant from Aga Khan University (AKU) in London, Taylor Berg-Kirkpatrick from the University of California, San Diego, and Raffaele Viglianti from the Maryland Institute for Technology in the Humanities at Maryland. The Mellon Foundation has been funding the project, known as “OpenITI AOCP,” since 2019.

“Over the past four years we have made incredible progress on the creation of digital infrastructure for Islamicate studies, and that is thanks in large part to the Mellon Foundation,” Miller said. “We are honored that the foundation continues to support our efforts to expand access to and digitally preserve such a rich and important cultural tradition.”

There are currently hundreds of thousands, perhaps even millions, of premodern Islamicate books and manuscripts that academics and the public cannot access digitally, Miller said.

Thus far, the project team, made up of computer science and humanities experts, has improved the accuracy of open-source Persian and Arabic optical character recognition (OCR) software, which converts images of printed documents into machine-readable text. Under the new grant, the team will use this OCR software to produce 2,500 new digitized Persian and Arabic texts and will expand the OCR system’s capabilities to Ottoman Turkish and Urdu.

They also aim to improve the accuracy of open-source handwritten text recognition (HTR) for Arabic-script manuscripts. HTR, a subfield of OCR technology, is designed to read a wide range of handwriting styles with high accuracy.

The team will also roll out a user-friendly redesign of its eScriptorium platform, which hosts the open-source tools. This latest Mellon grant will last three years. (Last year, Miller also received a grant from the National Endowment for the Humanities to support the project.)

Though he hopes the project’s next phase of development marks a major improvement for Arabic, Persian, Ottoman Turkish and Urdu texts, Miller said the ultimate goal is for the open-source tools to be used across a wide variety of languages.

“We really hope the technology will be reused by other users, especially those working in other under-resourced languages,” he said. “It’s designed to meet the needs of varied users.”

Image description: Persian ruba‘i (quatrain) calligraphy dating between circa 1610 and circa 1620. Gift in honor of Madeline Neves Clapp; Gift of Mrs. Henry White Cannon by exchange; Bequest of Louise T. Cooper; Leonard C. Hanna Jr. Fund; From the Catherine and Ralph Benkaim Collection.

Original news story written by Jessica Weiss