Humanist Discussion Group, Vol. 39, No. 201.
Department of Digital Humanities, University of Cologne
Hosted by DH-Cologne
www.dhhumanist.org
Submit to: humanist@dhhumanist.org
Date: 2025-11-03 11:35:12+00:00
From: Róbert PÉTER <robert.peter@ieas-szeged.hu>
Subject: AVOBMAT beta launched – Multilingual text mining platform at GWDG – Feedback invited
Dear All,
I would like to draw your attention to the open beta launch of AVOBMAT
<https://avobmat.hu/> (Analysis and Visualization of Bibliographic
Metadata and Text), a multilingual text and metadata mining platform
developed for DH research and teaching.
Designed in close collaboration with DH scholars, AVOBMAT supports
transparent and reproducible workflows across a wide range of textual
and bibliographic corpora. It handles large-scale datasets that are
difficult or impossible to analyse with commercial LLMs. The service runs
on an extensible, scalable, and modular cloud-based infrastructure hosted
by the Gesellschaft für wissenschaftliche Datenverarbeitung Göttingen
(GWDG). AVOBMAT is developed at the University of Szeged, Hungary.
Key Features:
- Multilingual preprocessing, analysis, and visualization in 24+
languages
- Bibliographic metadata analysis with network visualizations, gender
analysis
- Named entity recognition, disambiguation & linking
- Topic modelling, POS tagging, N-gram viewer, corpus comparison, KWIC,
lexical diversity
- Export/import of configurations and results for reproducible workflows
- Support for both public and private corpora
- Integrated Help with interface overview, workflow, configuration
settings, glossary and appendices
- Upload templates, example corpora, and corrected/enriched metadata for
ELTeC novel and DraCor drama collections are available on this GitHub
repository <https://github.com/avobmat/general>
Current content:
- 1,708 novels in 15 languages (ELTeC)
- 4,113 dramas in 12 languages (DraCor)
- Upcoming: corrected/enriched corpora including CoNSSA (Spanish
novels), ECCO TCP, EEBO TCP, and Early American Imprints TCP
Free to use during the pilot phase with GWDG (until 25 March, 2026).
Community & webinars:
GWDG will host the first AVOBMAT webinar on 12 November (15.00-16.30,
CET). For registration, please visit the event page
<https://events.gwdg.de/event/1267/>. The second seminar will take place on
8 December (15.00-16.30, CET).
Explore AVOBMAT: https://avobmat.hu
Read our intro article
<https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.175> in
the Journal of Open Humanities Data.
See AVOBMAT-related research projects and publications here
<https://avobmat.hu/news/>.
We warmly welcome your feedback, as we aim to continue refining and
enhancing AVOBMAT in close dialogue with the DH community.
We would also be grateful for your help in recommending curated,
open-access text collections that we could process and make publicly
available for teaching DH. Although AVOBMAT currently supports 24
languages, we only have sample databases for 15. You can view the list
of supported languages and available features in this chart
<https://avobmat.hu/features/>. Please feel free to reply via private
message.
We’re also happy to collaborate on research and teaching projects related
to (multilingual) DH.
We would appreciate your sharing this information with interested parties.
Thank you in advance for your support and suggestions.
All the best,
Róbert
Róbert Péter, Ph.D.
associate professor
Institute of English and American Studies
Head of Digital Humanities Laboratory
University of Szeged
Bluesky: @robertpeter.bsky.social
Robert_Peter@Fedihum.org
Researchgate <https://www.researchgate.net/profile/Robert-Peter-3>,
Academia.edu <https://u-szeged.academia.edu/RobertPeter>
_______________________________________________
Unsubscribe at: http://dhhumanist.org/Restricted
List posts to: humanist@dhhumanist.org
List info and archives at at: http://dhhumanist.org
Listmember interface at: http://dhhumanist.org/Restricted/
Subscribe at: http://dhhumanist.org/membership_form.php