The widespread diffusion of modern technologies such as low-cost sensors, wireless, ubiquitous and locationaware mobile devices, allows for collecting overwhelming amounts of data about trajectories of moving objects. Such data are usually produced at different rates, and arrive in streams in an unpredictable and unbounded way. In this paper we discuss how data warehousing technology can be used to store aggregate information about trajectories and perform OLAP operations over them. To this end, we define a data cube with spatial and temporal dimensions, discretized according to a regular grid. We investigate in depth some issues related to the computation of a holistic aggregate function, i.e, the presence, which returns the number of distinct trajectories occurring in a given spatio-temporal area. In particular, we introduce a novel way to compute an approximate, but nevertheless very accurate, presence aggregate function, which uses only a bounded amount of measures stored in the base cells of our cuboid. We also concentrate on the loading phase of our data warehouse, which has to deal with an unbounded stream of trajectory observations. We suggest how the complexity of this phase can be reduced, and we analyse the errors that this procedure induces at the level of the subaggregates stored in the base cells. These errors and the accuracy of our approximate aggregate functions are carefully evaluated by means of tests performed on synthetic trajectory datasets.
Approximate Aggregations in Trajectory Data Warehouses
ORLANDO, Salvatore;ORSINI, Renzo;RAFFAETA', Alessandra;RONCATO, Alessandro;SILVESTRI, Claudio
2007-01-01
Abstract
The widespread diffusion of modern technologies such as low-cost sensors, wireless, ubiquitous and locationaware mobile devices, allows for collecting overwhelming amounts of data about trajectories of moving objects. Such data are usually produced at different rates, and arrive in streams in an unpredictable and unbounded way. In this paper we discuss how data warehousing technology can be used to store aggregate information about trajectories and perform OLAP operations over them. To this end, we define a data cube with spatial and temporal dimensions, discretized according to a regular grid. We investigate in depth some issues related to the computation of a holistic aggregate function, i.e, the presence, which returns the number of distinct trajectories occurring in a given spatio-temporal area. In particular, we introduce a novel way to compute an approximate, but nevertheless very accurate, presence aggregate function, which uses only a bounded amount of measures stored in the base cells of our cuboid. We also concentrate on the loading phase of our data warehouse, which has to deal with an unbounded stream of trajectory observations. We suggest how the complexity of this phase can be reduced, and we analyse the errors that this procedure induces at the level of the subaggregates stored in the base cells. These errors and the accuracy of our approximate aggregate functions are carefully evaluated by means of tests performed on synthetic trajectory datasets.File | Dimensione | Formato | |
---|---|---|---|
raffaeta.pdf
non disponibili
Tipologia:
Documento in Post-print
Licenza:
Accesso chiuso-personale
Dimensione
154.85 kB
Formato
Adobe PDF
|
154.85 kB | Adobe PDF | Visualizza/Apri |
I documenti in ARCA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.