Skip site navigation
University of Maryland Division of Research
Who We Are Capabilities Partnerships Resources News
Analytical Nuclear Magnetic Resonance (NMR) Service & Research Center Biomolecular Nuclear Magnetic Resonance (NMR) Facility Biosciences Cores: Genomics, Imaging, and Flow Cytometry BioWorkshop Brain & Behavior Institute - Advanced Genomic Technologies Core CALCE Test Services and Failure Analysis Laboratory Center For Innovative Biomedical Resources (CIBR) Clarice Smith Performing Arts Center Daikin Energy Innovation Lab DLAR Imaging Core Exposome Small Molecule Core Facility Glenn L. Martin Wind Tunnel Herschel S. Horowitz Center for Health Literacy KIT-Maryland MEG Lab Maryland Fire and Rescue Institute (MFRI) Maryland NanoCenter Maryland Neuroimaging Center Mass Spectrometry Facility Michelle Smith Collaboratory for Visual Culture Neutral Buoyancy Research Facility (NBRF) Surface Analysis Center The Laboratory for Biological Ultrastructure The University of Maryland Center for Health Equity The University of Maryland Prevention Research Center X-ray Crystallographic Center (XCC)
Africa Through Language and Area Studies (ATLAS) Anti-Black Racism Initiative Effective and Equitable Weather Forecasting in a Changing Climate with Machine Learning Encuentros: A University-Community Partnership to Mitigate the Mental Health Crisis for Latino Immigrant Youth Fostering Inclusivity through Technology (FIT) Helping Our Bodies Clear Respiratory Infections The Maryland Safe Drinking WATER Study Modeling the Evolution of Avian Influenza Viruses Music Education for All Through Personalized AI and Digital Humanities Observing Wildfires Through UAVs and Fire Imaging Technologies Programmable Design of Sustainable, All-Natural Plastic Substitutes Racial and Social Justice Research-Practice Partnership Collaborative Remediation of Methane, Water, and Heat Waste Seizing Opportunities: Social Capital, Businesses, and Communities Using Machine Learning to Measure and Improve Equity in K-12 Mathematics Classrooms Water Emergency Team
Accurate, Equitable, and Transparent Genetic Ancestry Inference Advancing Environmental Justice By Evaluating Climate-Ready Urban Street Trees In Historically Redlined Neighborhoods AFTER: A Hospital Violence Intervention Program For Youth Victims of Gunshot Injury An Innovative Intervention to Help Asian American Families Cope with Racism and Mental Health Difficulties Bridging the Gaps in Satellite Observations of Earth Systems to Support Climate Monitoring and Prediction Climate Change and Political Conflict Climate Mitigation and Land-Use Digital Equity Mapping Research and Training Program Establishing a Role for Psilocybin in Frontal Lobe Function Fetal Mammary Stem Cell Programming and Hormone Dysfunction Forecasting Acute Malnutrition for Anticipatory Action Genetic and Lifestyle Risk Factors of Accelerated Brain Aging in Severe Mental Illness How Does Statistical Learning Interact with Socioeconomic Status to Shape Literacy Development? Human Rights Politics and Policies: Lessons from Latin America Increasing Sustainability, Accessibility, and Equity in Urban Mobility with A Self-driving E-Scooter Increasing Participation of Minorities and Women In STEM Through Sports Performance Analytics Research Market Design, Energy Storage, and Interconnection to the U.S. Power Grid On-board Energy Harvesting for Long-endurance Earth Observation UAVs Promoting Youth Mental Wellbeing in Rural Honduras by Engaging Teachers as Catalysts Relating Attitudes on Democracy to Attitudes on Race and Ethnicity An Innovative Approach to Remove Emerging Organic Contaminants from the Environment Role of Mitochondria Dynamics in Opioid Addiction Towards an Early Warning System for Increased Probability of Community Infection by SARS-Cov-2 Variants Understanding the Impact of Wind on Fire Dynamics in Mass-Timber Compartment Visualizing Urban Flooding Due To Climate Change
Search
Who We Are Capabilities Partnerships Resources News
Research Announcements

Mellon Grant Funds Continuation of Persian and Arabic Digitization Project

The grant supports the development of user-friendly, open-source software to produce high-quality digital transcriptions of printed texts.

September 03, 2021

The $100k grant supports the development of user-friendly, open-source software to produce high-quality digital transcriptions of printed texts.

The Andrew W. Mellon Foundation has awarded a $100,000 grant to support the continued development of user-friendly, open-source software capable of creating digital texts from Persian and Arabic books.

Matthew Thomas Miller, assistant professor in the Roshan Institute for Persian Studies in the School of Languages, Literatures, and Cultures, leads an interdisciplinary team of researchers from Northeastern University, Aga Khan University (AKU) in London and the Maryland Institute for Technology in the Humanities at Maryland. The Mellon Foundation has been funding the team’s work since 2019.

“We are honored that The Andrew W. Mellon Foundation has again supported our efforts,” Miller said. “They have been global leaders in building open-source tools and open-access collections for the expansion in access to and digital preservation of cultural traditions across the world, and we are delighted to be a part of these efforts.”

The project, known as “OpenITI AOCP,” aims to enable the digitization of texts from the premodern Islamicate world—an enormous tradition stretching over 1,000 years. The tools being created by the project team will be free and open to use and will allow academics and the public to produce high-quality digital transcriptions of Persian and Arabic printed texts, from poetry to the Quran.

“Premodern Islamicate textual production is a massive and understudied archive that remains particularly underrepresented in the field of digital humanities,” Miller said. “This democratization of access to digital text production will change the landscape of Islamicate studies.”

Thus far, the project team—made up of computer science and humanities experts—has successfully improved the accuracy of Persian and Arabic optical character recognition (OCR) tools, which are tools that transfer printed text into machine-encoded text, and have begun experimenting on Ottoman Turkish and Urdu. They are integrating those tools into a platform called eScriptorium. They also held a training session at the University of Maryland in 2020 for OCR experts from all over the world. And they taught a Spring 2021 Global Classrooms course, “The Islamicate World 2.0: Studying Islamic Cultures through Computational Textual Analysis,” on the basics of computational textual analysis as it relates to textual data about the Islamicate world.

Next steps include finalizing the open-source software for widespread use, as well as holding additional workshops and community building activities around the new tools. This latest Mellon grant will last one year.

Earlier this year, Miller was awarded $282,905 by the National Endowment for the Humanities to support the project.

Image description: The introduction to George B. Whiting's Kitab fi al-Imtina‘ ‘an Shurb al-Muskirat, published in Beirut by American Mission Press in 1838 and housed at Harvard's Houghton Library (*98Miss168). Licensed for non-commercial use.

Original news story written by Jessica Weiss