Approximate Aggregations in Trajectory Data Warehouses

The widespread diffusion of modern technologies such as low-cost sensors, wireless, ubiquitous and locationaware mobile devices, allows for collecting overwhelming amounts of data about trajectories of moving objects. Such data are usually produced at different rates, and arrive in streams in an unpredictable and unbounded way. In this paper we discuss how data warehousing technology can be used to store aggregate information about trajectories and perform OLAP operations over them. To this end, we define a data cube with spatial and temporal dimensions, discretized according to a regular grid. We investigate in depth some issues related to the computation of a holistic aggregate function, i.e, the presence, which returns the number of distinct trajectories occurring in a given spatio-temporal area. In particular, we introduce a novel way to compute an approximate, but nevertheless very accurate, presence aggregate function, which uses only a bounded amount of measures stored in the base cells of our cuboid. We also concentrate on the loading phase of our data warehouse, which has to deal with an unbounded stream of trajectory observations. We suggest how the complexity of this phase can be reduced, and we analyse the errors that this procedure induces at the level of the subaggregates stored in the base cells. These errors and the accuracy of our approximate aggregate functions are carefully evaluated by means of tests performed on synthetic trajectory datasets.

Approximate Aggregations in Trajectory Data Warehouses

F. BRAZ;ORLANDO, Salvatore;ORSINI, Renzo;RAFFAETA', Alessandra;RONCATO, Alessandro;SILVESTRI, Claudio

2007-01-01

Abstract

The widespread diffusion of modern technologies such as low-cost sensors, wireless, ubiquitous and locationaware mobile devices, allows for collecting overwhelming amounts of data about trajectories of moving objects. Such data are usually produced at different rates, and arrive in streams in an unpredictable and unbounded way. In this paper we discuss how data warehousing technology can be used to store aggregate information about trajectories and perform OLAP operations over them. To this end, we define a data cube with spatial and temporal dimensions, discretized according to a regular grid. We investigate in depth some issues related to the computation of a holistic aggregate function, i.e, the presence, which returns the number of distinct trajectories occurring in a given spatio-temporal area. In particular, we introduce a novel way to compute an approximate, but nevertheless very accurate, presence aggregate function, which uses only a bounded amount of measures stored in the base cells of our cuboid. We also concentrate on the loading phase of our data warehouse, which has to deal with an unbounded stream of trajectory observations. We suggest how the complexity of this phase can be reduced, and we analyse the errors that this procedure induces at the level of the subaggregates stored in the base cells. These errors and the accuracy of our approximate aggregate functions are carefully evaluated by means of tests performed on synthetic trajectory datasets.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2007
			
	Titolo del volume
	
				Proceedings of ICDE Workshop on Spatio-Temporal Data Mining
			
	Appare nelle tipologie:
	
				4.1 Articolo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
raffaeta.pdf non disponibili Tipologia: Documento in Post-print Licenza: Accesso chiuso-personale Dimensione 154.85 kB Formato Adobe PDF Visualizza/Apri	154.85 kB	Adobe PDF	Visualizza/Apri

I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10278/31951

Citazioni

ND

23

8

social impact