Syntax Highlighter

Friday, April 9, 2010

A new grid computing paradigm explained

Copper represents a new paradigm for grid computing.

Why? Copper flips some of the standard practices in grid computing on their head. I'll illustrate this by starting with a typical cluster environment.

(click for larger version)

In this environment, users log in to a cluster headnode (or login node) and submit jobs (as scripts) to a scheduler that copies those scripts to several pre-configured slave nodes (a.k.a. compute nodes).

The software environment that runs on the headnode must be carefully crafted by an administrator to meet all of the users' needs and accommodate each application that they may want to run. Note that users do not have any administrative control over the cluster in this context, so they may not freely install new software. This leads to users building software in their personal home directories, using large, statically-linked binaries and often banging their head against the wall.

Additionally, administrators must ensure that the slave nodes are kept in sync with the headnode. If a new software package is installed on the headnode, it must be installed on each of the slaves. If a configuration or library changes, then it must be changed on each of the slaves.

A Copper computing environment is fundamentally different.
(click for larger version)

In this case, users may each be given a virtual headnode on which they have complete administrative control. This means that they can freely change the configuration and install new software. They are also free to use the distribution and tools that they are familiar with (and possibly have on their workstations).

When jobs are submitted to a queue in a Copper cluster, slave nodes are created on the fly, which are exact clones of the headnode at the time. This means that the user does not need to worry about updating software on any slaves nodes, as everything will be exactly as the same as it was on their virtual headnode. If their software ran on the headnode, then it is guaranteed to run on their slave nodes. When the jobs complete, the slave nodes are automatically destroyed and their associated resources are freed.

How is this possible? Our technology allows virtual machines to clone nearly instantaneously and with almost zero overhead. Using this cloning for grid computing means that you don't have to worry about keeping environments in sync. Ever.

In fact, the above diagram is not really accurate. Scripts are not copied to the slave VMs, because they are already there. Because the slaves are guaranteed to up-to-date from the point when you actually submitted the job, they can just run a command that you specify instead of having to copy a script. That's fundamentally cool; it's like we copy the whole headnode whenever you run a job.

As an example, this will run flawless on Copper, but not on traditional clusters:
apt-get install python     # Install python.
vim myprogram.py # Edit my python program.
gcqsub python myprogram.py # Submit my python program to the queue.
In the above, I install python, edit a program and run it through the grid queue in steps that are just are natural as using my workstation.

Thursday, April 8, 2010

GridCentric Inc. Announces Copper™ Cluster Management Software

Copper™ Combines Cloud and Grid Computing Technologies to Provide Unprecedented Ease-of-Use and Flexibility

Toronto, Canada – April 8th, 2010 – GridCentric Inc., a Toronto-based software company, today announced the availability of Copper™, a cluster management system for high performance computing workloads. Copper™ combines virtualization and grid computing technologies to enable simple, efficient and flexible cluster deployment, management, and use.

The GridCentric Copper™ platform enables real-time, on-demand sharing of high performance computing resources and makes cluster administration a one-click task. With Copper™, organizations reduce IT expenditures through significantly lower installation and management costs, higher utilization of their physical infrastructure, and improved sharing of their data assets.

Operators of large clusters face enormous challenges due to system complexity – compute clusters are typically composed of hundreds or even thousands of individual computers, each requiring separate software components for hardware provisioning and management, resource allocation, job scheduling, and application support. Initial setup of a compute cluster can take several months because of these challenges.

By combining technologies from cloud and grid computing, GridCentric’s Copper™ platform provides server provisioning, resource management, job control, and high-level application support in a single integrated software package. With Copper™, the time required to setup and configure a compute cluster goes from months to days. Additionally, Copper™ has built-in support for operating hundreds of “virtual clusters” on the same set of physical resources, scaling their compute footprints on-demand in real time – the industry’s first true high-performance cloud computing platform. Copper™ gives cluster users the power of a supercomputer with the ease-of-use of a PC.

“GridCentric's approach to cluster management will enable many organizations to service users in ways that would have otherwise been impractical or impossible. Our experiences with it to date have been very positive. Copper™ is the way of the future for many organizations,” said Professor Michael Bauer of the University of Western Ontario, and Associate Director of the SHARCNET academic supercomputing consortium.

Copper™ has been in limited trial on a SHARCNET compute cluster located at York University in Toronto, Canada since late 2009, and is now open to all researchers from SHARCNET’s 17 academic member organizations.

“GridCentric’s Copper™ product represents a new class of cluster management software,” said Tim Smith, co-founder and CEO of GridCentric Inc. “Systems integrators, cluster operators, and end-users will all benefit from Copper’s unprecedented ease of installation, management, and use.”

Availability and Pricing

Copper is immediately available through authorized systems integrators and resellers at a list price of $1250.00 per compute node. Volume and Academic discounts may apply. For more information, please visit www.gridcentriclabs.com/copper or e-mail sales@gridcentriclabs.com.

About GridCentric Inc.

GridCentric Inc. is a technology-leading systems, networking, and virtualization software company. It is the mission of GridCentric to make high performance computing easy, without sacrificing performance and flexibility. GridCentric is a privately held corporation, and is funded in part by Rogers Ventures. GridCentric was recently named as one of the “Top 25 Canadian ICT Up and Comers” by the Branham Group. For more information, please visit www.gridcentriclabs.com.

About SHARCNET

SHARCNET was established in 2001. It is one of seven world-leading Compute Canada (www.computecanada.org) supercomputing consortia. SHARCNET currently serves 14 universities, 2 colleges, and one research institute across western Ontario. For more information, please visit www.sharcnet.ca.