The definition of cloud storage has been on my mind lately, and I think some attention to this topic is still called for. From an article in CXO Today: Cloud storage is not a disk array that you own, lease, or manage neither is it a virtual logical unit number (LUN) from a larger disk array. It is in fact it is offered via an application programming interface (API) through which you can send and receive data without having to actively manage the storage.
I see many "Cloud Storage" services and vendors of cloud storage infrastructure products that do not do or provide for what is described in the preceding observation. For example, some cloud storage services are really offerings of storage that are associated with cloud computing. One major requirement is storing and retrieving cloud computing images. Since these are "bootable" the typical storage approach is an iSCSI connected storage resource. A cloud computing image may require files for its application, and these are often stored on shared storage systems, and may be accessed in a variety of ways, but not necessarily via Web services APIs.
Often, IT service providers call this shared storage; however, when it is accessed by cloud computing images, it is often referred to as cloud storage. Finally, block data, like a data base, is often required for the application running on a cloud computing image, and is accessed via iSCSI, and may be referred to as cloud storage.
So, where do these observations lead us?
There are many benefits of a storage cloud, and for the user these include ease of access (in a variety of ways) to various amounts of storage on an as needed basis, with instant or nearly instant provisioning, with little if any traditional storage management requirements for the user. IT service providers, and enterprise IT organizations are fundamentally organized around the premise of service delivery. So, for both of these entities, a service offering like cloud storage is an important business asset, and the primary differences in the deployed infrastructure is associated with multi-tenancy (which, among other things, drives different security requirements) and billing.
For many months, I have relied on the following definition of cloud storage: a persistent storage solution for objects (also called files or unstructured data) accessed via Web services APIs via a network (LAN or WAN).
Today, I would like to move forward and offer a new definition, more encompassing, and reflecting not just a purist view but attempting to capture what is truly important for an IT service provider, in house or as a focus of a business (hosters, telcos, and cloud providers):
"Cloud Storage provides whatever amount of storage you require, on an immediate basis. It is persistent. It can be accessed in a variety of ways, both in the data center where the cloud is housed, as well as via the Internet. If you obtain this from an external provider, it is purchased on a pay as you go basis. You do not manage it, you use it, and the service provider manages it."
Here is how we depict this at Mezeo:
I strongly believe that obtaining, using, and decommissioning persistent storage in a simple, easy way, available in any quantity on a pay-for-use basis, and accessible in a variety of ways, via the Internet or at the data center where your application runs, is the heart of the matter. If you get that service in house, or from an IT service provider, it should include the aforementioned characteristics. This is a very inclusive definition, and it provides for traditional access methods, as well as programmable access (Web services APIs).
Finally, here are three more points that are very important:
1) By Web Services API access, we mean API access to stored content! This is different from APIs for storage management and is specific to a way of working with stored content.
2) New applications, and retrofits, will ultimately expect "programmable" storage. This is a classic "Innovators Dilemma" scenario, I see it every day, and it is coming.
3) HTTP access (Web services API or "programmable") is not slower than other access, but it does tolerate the latency of the Internet. As a result, you will ultimately see that HTTP access of storage in a data center will be a preferred approach, because of it's "programmability" and the desired performance. This will not happen overnight, but it will happen.
A hat-tip to Stephen Foskett is in order as well. Take a look at this entertaining article in which he struggles to find an appropriate name for "cloud storage."
I see many "Cloud Storage" services and vendors of cloud storage infrastructure products that do not do or provide for what is described in the preceding observation. For example, some cloud storage services are really offerings of storage that are associated with cloud computing. One major requirement is storing and retrieving cloud computing images. Since these are "bootable" the typical storage approach is an iSCSI connected storage resource. A cloud computing image may require files for its application, and these are often stored on shared storage systems, and may be accessed in a variety of ways, but not necessarily via Web services APIs.
Often, IT service providers call this shared storage; however, when it is accessed by cloud computing images, it is often referred to as cloud storage. Finally, block data, like a data base, is often required for the application running on a cloud computing image, and is accessed via iSCSI, and may be referred to as cloud storage.
So, where do these observations lead us?
There are many benefits of a storage cloud, and for the user these include ease of access (in a variety of ways) to various amounts of storage on an as needed basis, with instant or nearly instant provisioning, with little if any traditional storage management requirements for the user. IT service providers, and enterprise IT organizations are fundamentally organized around the premise of service delivery. So, for both of these entities, a service offering like cloud storage is an important business asset, and the primary differences in the deployed infrastructure is associated with multi-tenancy (which, among other things, drives different security requirements) and billing.
For many months, I have relied on the following definition of cloud storage: a persistent storage solution for objects (also called files or unstructured data) accessed via Web services APIs via a network (LAN or WAN).
Today, I would like to move forward and offer a new definition, more encompassing, and reflecting not just a purist view but attempting to capture what is truly important for an IT service provider, in house or as a focus of a business (hosters, telcos, and cloud providers):
"Cloud Storage provides whatever amount of storage you require, on an immediate basis. It is persistent. It can be accessed in a variety of ways, both in the data center where the cloud is housed, as well as via the Internet. If you obtain this from an external provider, it is purchased on a pay as you go basis. You do not manage it, you use it, and the service provider manages it."
Here is how we depict this at Mezeo:
I strongly believe that obtaining, using, and decommissioning persistent storage in a simple, easy way, available in any quantity on a pay-for-use basis, and accessible in a variety of ways, via the Internet or at the data center where your application runs, is the heart of the matter. If you get that service in house, or from an IT service provider, it should include the aforementioned characteristics. This is a very inclusive definition, and it provides for traditional access methods, as well as programmable access (Web services APIs).
Finally, here are three more points that are very important:
1) By Web Services API access, we mean API access to stored content! This is different from APIs for storage management and is specific to a way of working with stored content.
2) New applications, and retrofits, will ultimately expect "programmable" storage. This is a classic "Innovators Dilemma" scenario, I see it every day, and it is coming.
3) HTTP access (Web services API or "programmable") is not slower than other access, but it does tolerate the latency of the Internet. As a result, you will ultimately see that HTTP access of storage in a data center will be a preferred approach, because of it's "programmability" and the desired performance. This will not happen overnight, but it will happen.
A hat-tip to Stephen Foskett is in order as well. Take a look at this entertaining article in which he struggles to find an appropriate name for "cloud storage."







