The GRelC Project: Advanced Grid-Database Management
Within the proposed tutorial, we will talk about the Grid Relational Catalog (GRelC) Project, an integrated environment for grid database management, highlighting the vision/approach, architecture, components, services and technological issues.
The key topic of the tutorial, as well as a demo, will be the GRelC Data Access and Integration Service. The GRelC DAIS is GSI/VOMS enabled web service addressing extreme performance, interoperability and security. It efficiently, securely and transparently manages databases on the grid across VO’s, with regard to emerging and consolidated grid standards and specifications as well as production grid middleware (gLite & Globus).
It provides a uniform access interface, in grid, both to access and integrate relational (MySQL, Oracle, Postgresql, IBM/DB2, SQlite) and non-relational data sources (XML DB engines such as eXist, XIndice and libxml2 based documents).
The GRelC DAIS provides:
- Basic functionalities (query submission, grid-db management, user/VO/ACL management, etc.) to access and manage grid-databases
- Efficient delivery mechanisms leveraging streaming, chunking, prefetching, etc. to retrieve data from databases in grid providing high level of performance (in terms of query response time, number of concurrent accesses, etc.)
- Advanced functionalities to transparently and securely integrate heterogeneous,
- ted and geographically spread grid data sources (through P2P connected GRelC DAIS nodes)
- Additional functionalities such as asynchronous queries
- A data Grid Portal (GRelC Portal) to ease the access, management and integration of grid-databases, as well as user/VO/ACL management, etc. This Data Grid Portal completely replaces the Command Line Interface, does not need additional configuration/installation of software, and it provides a seamless and ubitquitous way to manage Grid Databases.
The GReIC DAIS is very versatile so it can be used both at VO and site level. It can/was used in both ways depending on VO/user/database constraints and requirements. There is no single point of failure and no centralized management for this service due to the scalable P2P architecture.
After the talk, during the demo we will show how the GRelC DAIS successfully provides data access and integration service for gLite. We will demonstrate how distributed DB integration, distributed accounting and monitoring can easily be performed by using the GRelC Portal (web interface).
Today the GReIC DAIS is part of the GILDA release (EGEE t-Infrastructure) and is candidate at the EGEE Respect Program since is tightly coupled with the gLite middleware, the EGEE architecture and the EGEE Training Infrastructure.
Currently the GRelC DAIS is used as the Euro-Mediterranean Centre of Climate Change (MCCC) Data Grid framework.
The topics covered in the tutorial are:
- Grid Database Management System
- Grid Data Access Service, Grid Data Integration Service, Data Grid Portal
- Grid Middleware: gLite, Globus
- Grid Architectures and Systems
- Peer-to-Peer Systems
Organisers:
- 1S.Fiore, A.Negro, 1S.Vadacca, 1 G.Alosio
Presenters:
- S.Fiore (sandro.fiore@unile.it), A.Negro, S.Vadacca
Affiliations:
- Euro-Mediterranean Centre of Climate Change (CMCC) and University of Salento (Italy)
Short Biography
Sandro Luigi Fiore was born in Galatina (Italy) in 1976. He received a summa cum laude Laurea degree in Computer Engineering from the University of Lecce (Italy), 2001. He received a PhD degree in Informatic Engineering – “Innovative Materials and Technologies” at the ISUFI- University of Lecce, Italy, in 2004. His research activity focuses on Parallel and Distributed Computing, specifically with regard to “Advanced Data Management in a Grid environment”. He was directly involved in the EGEE, EGEE-II projects (“Enabling Grids for E-science”) and currently he is involved in the EGEE-III project and in other national (on bioinformatics, LIBI) and international (on grid storage management, INTERSTORE) projects. Since June 2006, he leads the Data Management group at the Euro-Mediterranean Centre for Climate Change (CMCC) in Lecce (Italy). He is author and co-author of about 45 papers in refereed journals/proceedings on parallel & grid computing as well as of 1 patent concerning advanced data management. He is a Member of ACM.
Salvatore Vadacca was born in Galatina (LE) in 1982. He received summa cum laude bachelor and master degrees in Computer Engineering from the University of Lecce, Italy in 2003 and 2006, respectively. His research interests include data management; distributed, peer-to-peer and grid computing; as well as web design and development and social networks. Since 2003, he has been a team member of the GRelC Project. In 2006 he joined the Euro-Mediterranean Centre for Climate Change (CMCC) in Lecce, Italy, where he works in the Data Grid group. Since April 2008 he’s a Ph.D. student in Interdisciplinary Sciences and Technologies at the ISUFI, Lecce. He is a member of ACM, IEEE, IEEE Computer Society and IEEE Communications Society.
Alessandro Negro was born in S. Pietro Vernotico (BR) in 1981. He received a summa cum laude bachelor degree in Computer Engineering from the University of Lecce, Italy in February 2004 and a summa cum laude master degree in Computer Engineering from the same University in April 2006. His research interests include GUI Development, Data Management, Distributed, Grid Computing and Web Services. He is also interested in web and pattern-oriented design. Since 2003 he is a Team Member of the GRelC Project. Since September 2008 he holds a contract position at the CMCC.
Potential attendee profile
The targeted audience are people interested in the access, management and integration of databases (relational and/or XML) in a grid environment (gLite and/or Globus based). Some examples could be: bioinformatics (molecules/protein DBs), astrophysics (astronomic DBs for Earth Science), systems administrators (for accounting and monitoring purposes), etc.
Prerequisites
- SQL for RDBMS and XPath for XML DBs.
- Usage of Grid Certificates (X509V3)
Tutorial format
- Talks and demo
- Optionally: training on Command Line Interface and GRelC Portal if a set of machines for trainees + network connection are available)
Last modified: Monday, 02-Feb-2009 14:27:36 NZDT