Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules-including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives-tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology.

Small Codes: a platform for digital resources and tools for minority languages and dialects

Greta Mazzaggio;
2025-01-01

Abstract

Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules-including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives-tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology.
2025
978-3-03868-277-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11387/205385
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact