Accounting is a routine activity. Through repetition, the scribes of the Ebla Archives (Syria, 24th cent. BCE) have been able to record thousands of transactions. They organized and stored accounting data referred to more than thirty years of the Palace G activities. The recurring textual patterns characterizing the administrative corpus are a byproduct of this routine-based approach. The ability to see recurring patterns in the textual record is fundamental when dealing with an administrative corpus: however, this ability fails when the patterns are buried in data. In this paper, I argue that theoretical aspects of data mining are not far from theoretical and methodological tenets of the historical approach. Data mining is a useful technique for the identification of document clusters and relevant information which would otherwise remain hidden. Furthermore, textual pattern recognition is critical to address topics such as the study of society: belonging to a category of complex problems, any socio-historical investigation requires dealing with multiple interconnected variables. However, not all research topics require such an approach. I define the line beyond which digital approaches are extremely useful (if not indispensable) as 'visibility threshold’. The position of this interface is relative and subjective.

Visibility Threshold: Some Considerations on Data Mining Applied to the Study of Eblaite Society

Scarpa, Erica
2021-01-01

Abstract

Accounting is a routine activity. Through repetition, the scribes of the Ebla Archives (Syria, 24th cent. BCE) have been able to record thousands of transactions. They organized and stored accounting data referred to more than thirty years of the Palace G activities. The recurring textual patterns characterizing the administrative corpus are a byproduct of this routine-based approach. The ability to see recurring patterns in the textual record is fundamental when dealing with an administrative corpus: however, this ability fails when the patterns are buried in data. In this paper, I argue that theoretical aspects of data mining are not far from theoretical and methodological tenets of the historical approach. Data mining is a useful technique for the identification of document clusters and relevant information which would otherwise remain hidden. Furthermore, textual pattern recognition is critical to address topics such as the study of society: belonging to a category of complex problems, any socio-historical investigation requires dealing with multiple interconnected variables. However, not all research topics require such an approach. I define the line beyond which digital approaches are extremely useful (if not indispensable) as 'visibility threshold’. The position of this interface is relative and subjective.
File in questo prodotto:
File Dimensione Formato  
Scarpa2021c - Visibility Threshold - H2D 3.pdf

accesso aperto

Tipologia: Versione dell'editore
Licenza: Accesso libero (no vincoli)
Dimensione 549.54 kB
Formato Adobe PDF
549.54 kB Adobe PDF Visualizza/Apri

I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10278/3751188
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact