IBM is to join CERN in building a massive data grid to help scientists understand the origins of the Universe.
CERN1 and IBM today announced that IBM is joining the CERN openlab for DataGrid applications to collaborate in creating a massive data-management system built on Grid computing.
IBM's innovative storage virtualization and file management technology, will play a pivotal role in this collaboration, which aims to create a data file system far larger than exists today to help scientists at CERN understand some of the most fundamental questions about the nature of matter and the Universe.
Conceived in IBM Research as Storage Tank®2, the new technology is designed to provide scalable, high-performance and highly available management of huge amounts of data using a single file namespace regardless of where or on what operating system the data reside. IBM and CERN will work together to extend Storage Tank's capabilities so it can manage and provide access from any location worldwide to the unprecedented torrent of data - billions of gigabytes a year - that CERN's Large Hadron Collider (LHC) is expected to produce when it goes online in 2007. The LHC is the next-generation particle accelerator. It will recreate - on a tiny scale - conditions that existed shortly after the Big Bang, enabling researchers to answer outstanding questions about what the Universe is made of and the laws that govern its behaviour.
The very same CERN spirit that invented the World Wide Web in 1990 is now developing a new application for the Internet - Grid computing - that will push technology limits with its data processing requirements for the LHC. CERN openlab is a collaboration between CERN and leading industrial partners, which aims to create and implement data-intensive Grid-computing technologies that will aid the LHC scientists. Because the same issues facing CERN are becoming increasingly important to the IT industry, the CERN openlab and its innovative partners - which include Enterasys Networks, HP and Intel - are eager to explore new computing and data management solutions far beyond today's Internet-based computing.
By 2005, the CERN openlab collaboration with IBM is expected to be able to handle up to a petabyte (a million gigabytes) of data, which is equivalent to the information stored in 20 million four-draw filing cabinets full of paper, or 500 million floppy disks, or 1.5 million CD-ROMs.
"CERN has a long-standing collaborative relationship with IBM, and we are delighted that IBM is joining the CERN openlab for DataGrid applications," said Wolfgang von Rüden, Information Technology Division Leader and Head of the CERN openlab. "Together with IBM, we aim to achieve a one petabyte storage solution and integrate it with the Grid that CERN is building to handle the extreme data challenges of the LHC project."
"CERN's scientists and colleagues want to be able to get to their data wherever it may be - local or remote and regardless of which operating system on which it may reside," said Jai Menon, IBM Fellow at IBM's Almaden Research Center (San Jose, California, USA) and co-director of IBM's Storage Systems Institute joint programme between IBM Research and the company's product division. "This is the perfect environment for us to enhance Storage Tank to meet the demanding requirements of large-scale Grid computing systems."
As part of the agreement, several leading storage management experts from IBM's Almaden Research Center and Haifa Research Lab (Haifa, Israel) will work with the CERN openlab team. In addition, through its Shared University Research (SUR) programme, IBM will supply CERN with the system's initial 20 terabytes of high-performance disk storage, a cluster of six eServer xSeries systems running Linux and on-site engineering support and services by IBM Switzerland. The SUR award is valued at 1.5 million dollars for the first year.
Storage Tank employs policy-based storage management to help lower its total cost of ownership. Clustering and specialized protocols that detect network failures enable very high reliability and availability.
In this initiative, IBM is following a collaboration strategy initiated in 2001 with the European Union-sponsored European DataGrid project, which is also led by CERN.
CERN is the European Organization for Nuclear Research, the world's largest particle physics centre near Geneva, Switzerland. Technological development at CERN has given the world advances as varied as medical imaging and the World Wide Web. Founded in 1954, the laboratory was one of Europe's first joint ventures and has become a shining example of international collaboration. From the original 12 signatories of the CERN convention, membership has grown to the present 20 member states.
IBM is the world's largest information technology company, with 80 years of leadership in helping businesses innovate. Drawing on resources from across IBM and key Business Partners, IBM offers a wide range of services, solutions and technologies that enable customers, large and small, to take full advantage of the new era of e-business. IBM's Shared University Research (SUR) programme provides computing equipment and research collaborations to institutions around the world for advanced research projects in areas of mutual interest and benefit, including Grid Computing, Autonomic Computing, Life Sciences and Deep Computing. The SUR awards also connect top academic researchers with IBM Research personnel and representatives from product-development and solution-provider communities. IBM awards about 50 SUR grants per year worldwide.
Contacts at CERN
CERN openlab for DataGrid applications
CERN openlab for DataGrid applications