A Global Paradigm for Designing Parallel Relational Data Warehouses in Distributed Environments

Benkrid, Soumia
•
Bellatreche, Ladjel
•
CUZZOCREA, Alfredo Massimiliano
2014
  • book part

Abstract
Designing a Parallel Relational Data Warehouse (PRDW) consists of a set of tasks: (i) choosing the hardware architecture; (ii) fragmenting the data warehouse schema; (iii) allocating the generated fragments; (iv) replicating fragments to ensure high performance; (v) defining the strategies for load balancing and query processing. The major drawback of this life cycle is that it does not consider the interdependency among the sub-problems of PRDW design, and it uses heterogeneous metrics to evaluate the “quality” of the final design. In previous research, we introduced an analytical cost model for parallel OLAP query processing in cluster environments. In a second effort, we took into account the interdependency between fragmentation and allocation. In this paper, we propose a novel methodology, called F&A&R, which extends these previous results and defines an approach in which the main PRDW design phases (i.e., fragmentation, allocation, and replication) are performed simultaneously, in a global fashion. In particular, our approach determines whether the fragmentation pattern currently generated is relevant to the allocation process. An original method for supporting data replication, based on fuzzy k-means clustering, is also proposed and successfully integrated within the whole design framework. Finally, we experimentally assess the performance of F&A&R against a well-known data warehouse benchmark, with very promising results.
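
The replication idea in the abstract rests on fuzzy clustering, where one item may belong to several clusters at once. The sketch below is not the authors' F&A&R method, whose details the abstract does not give. It is a generic fuzzy c-means (the usual formulation of fuzzy k-means) in plain Python, under the assumption that each fragment is described by a hypothetical numeric feature vector and that each cluster stands for one processing node. The function names, the 0.3 membership threshold, and the sample data are all illustrative.

```python
import math


def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Generic fuzzy c-means. Returns (centers, memberships).

    memberships[i][j] is the degree (0..1) to which point i belongs to
    cluster j; each row sums to 1. m > 1 is the usual fuzzifier.
    """
    centers = [tuple(c) for c in centers]
    dim = len(points[0])
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(iterations):
        # Membership update: inversely related to distance from each center.
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if any(d == 0.0 for d in dists):
                # The point sits exactly on one or more centers.
                hits = [1.0 if d == 0.0 else 0.0 for d in dists]
                row = [h / sum(hits) for h in hits]
            else:
                inv = [d ** -exponent for d in dists]
                row = [v / sum(inv) for v in inv]
            memberships.append(row)
        # Center update: mean of the points weighted by membership ** m.
        new_centers = []
        for j in range(len(centers)):
            weights = [row[j] ** m for row in memberships]
            total = sum(weights)
            new_centers.append(tuple(
                sum(w * p[k] for w, p in zip(weights, points)) / total
                for k in range(dim)
            ))
        centers = new_centers
    return centers, memberships


def replica_placement(memberships, threshold=0.3):
    """Assumed rule: copy a fragment to every node whose membership
    reaches the threshold, falling back to the single best node."""
    placement = []
    for row in memberships:
        nodes = [j for j, u in enumerate(row) if u >= threshold]
        if not nodes:
            nodes = [max(range(len(row)), key=row.__getitem__)]
        placement.append(nodes)
    return placement


# Hypothetical fragment features (made-up values, not from the paper).
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0), (5.0, 5.5)]
centers, memberships = fuzzy_c_means(points, [(0.0, 0.5), (10.0, 10.5)])
for point, nodes in zip(points, replica_placement(memberships)):
    print(point, "-> nodes", nodes)
```

In this made-up data set the last fragment lies midway between the two groups, so its membership is split about evenly and the assumed threshold rule places a copy on both nodes, while the other fragments stay on a single node.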
DOI
10.1007/978-3-662-45761-0_3
Archive
http://hdl.handle.net/11368/2896374
info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-84917736007
http://springerlink.com/content/0302-9743/copyright/2005/
Rights
metadata only access
Subjects
  • Allocation

  • Analytical cost model...

  • Data warehouse

  • Design methodology

  • Distributed environme...

  • Fragmentation

  • Load balancing

  • Replication

  • Theoretical Computer ...

  • Computer Science (all...

Scopus© citations
2
Date acquired
Jun 7, 2022