The next BriefingsDirect deep-dive discussion explores how one of the most costly and complex parts of 
any enterprise's IT infrastructure -- storage -- is being dramatically 
improved by the accelerating adoption of 
software-defined storage (SDS). 
The ability to choose low-cost hardware, to manage across different 
types of storage, and to radically simplify data storage via intelligent
 automation amounts to a virtual rewriting of the economics of data.
But
 even as IT leaders seek to simultaneously tackle the storage pain points of
 scalability, availability, agility, and cost, software-defined storage 
is also providing significant strategic- and architectural-level 
benefits.
We're joined by two executives from 
VMware to unpack these efficiencies and examine the broad innovation behind 
the rush to exploit software-defined storage: 
Alberto Farronato, Director of Product Marketing for Cloud Infrastructure Storage and Availability at VMware, and 
Christos Karamanolis, Chief Architect and a Principal Engineer in the Storage and 
Availability Engineering Organization at VMware. The discussion is moderated by me, 
Dana Gardner, Principal Analyst at 
Interarbor Solutions.
Here are some excerpts:
Gardner:
 Software-defined storage is changing something more fundamental than just data 
and the economics of data. How do you see the wider implications 
of what’s happening now that software-defined storage is becoming more common?
Farronato: Software-defined 
storage is certainly about addressing the cost issue of storage, but 
more importantly, as you said, it’s also about operations. In fact, the 
overarching goal that VMware has is to bring to storage the efficient 
operational model that we brought to compute with server virtualization.
 So we have a set of initiatives around improving storage on all levels, 
and building a parallel evolution of storage to what we did with 
compute. We're very excited about what’s coming.
Gardner: Christos, one of my favorite sayings is that "architecture is IT destiny." How do you see software-defined storage at that 
architectural level? How does it change the game? 
Concept of flexibility
Karamanolis:
 The fundamental architectural principle behind software-defined storage
 is the concept of flexibility. It's the idea of being able to adapt to 
different hardware resources, whether those are 
magnetic disks, 
flash storage, or other types of non-volatile memories in the future.
How
 does the end user adapt their storage platform to the needs 
they have in terms of the capabilities of the hardware, the ratios of 
the different types of storage, the networking, the 
CPU resources, and the memory resources needed to execute and deliver their service going forward?
That’s
 one part of flexibility, but there is another very interesting part, 
which addresses a very acute problem for VMware customers today: the 
operational complexity of provisioning storage for applications and 
virtual machines (VMs), which have become the common way of packaging applications. 
Today, customers 
virtualize
 their environments, but they still have to provision physical storage 
containers. They have to anticipate their usage over time and make 
an investment up front in resources that they'll need over a long 
period of time. So they create logical unit numbers (
LUNs), file services, or whatever else is needed, for a period of time that spans anything from weeks to years.
Software-defined
 storage advocates a new model, where applications and VMs are 
provisioned at the time that the user needs them. The storage resources 
that they need are provisioned on-demand, exactly what the 
application and the user need -- nothing more and nothing less. 
The
 idea is that you do this in a way that is really intuitive to 
the end-user, in a way that reflects the abstractions that user 
understands -- applications, the data containers that the applications 
need, and the characteristics of the application workloads.
So those two aspects of flexibility are the two fundamental aspects of any software-defined storage.
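To make the contrast concrete, here is a minimal sketch in Python. The class and function names are hypothetical, not any real VMware API; it simply contrasts pre-provisioning a fixed storage container up front against declaring per-VM requirements as a policy and allocating exactly what the application needs when it is created.

```python
# Illustrative sketch only -- the names below are hypothetical, not a real
# VMware API. It contrasts up-front LUN provisioning with on-demand,
# policy-driven provisioning.

from dataclasses import dataclass


@dataclass
class StoragePolicy:
    """Application-level requirements, expressed declaratively."""
    failures_to_tolerate: int   # e.g., survive 1 host or disk failure
    iops_reservation: int       # performance floor for the workload
    capacity_gb: int            # what the VM actually needs today


def provision_lun_up_front(expected_vms: int, per_vm_gb: int, years: int) -> int:
    """Old model: guess future demand and carve out a container for it now."""
    growth_factor = 1 + 0.3 * years          # crude guess at future growth
    return int(expected_vms * per_vm_gb * growth_factor)


def provision_on_demand(policy: StoragePolicy) -> dict:
    """SDS model: allocate exactly what this VM's policy asks for, when asked."""
    return {
        "capacity_gb": policy.capacity_gb,
        "replicas": policy.failures_to_tolerate + 1,
        "iops_reservation": policy.iops_reservation,
    }


if __name__ == "__main__":
    # Up front: storage sized for demand that may never materialize.
    print("Carved up front:", provision_lun_up_front(100, 50, years=3), "GB")
    # On demand: each VM gets only what its policy specifies, at creation time.
    web_vm = StoragePolicy(failures_to_tolerate=1, iops_reservation=500, capacity_gb=40)
    print("Provisioned for one VM:", provision_on_demand(web_vm))
```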
Gardner:
 As we see this increased agility, flexibility, the on-demand 
nature of virtualization now coupled with software-defined storage, how 
are organizations benefiting at a business level?
Farronato:
 There are several benefits and several outcomes of adopting 
software-defined storage. The first that I would call out is the ability
 to be much more responsive to the business needs -- and the changing 
business needs -- by delivering what your applications need, faster. 
As
 Christos was saying, in the old model, you had to guess ahead of time 
what the applications would need, spend a lot of time trying to 
preconfigure and predetermine the various service levels -- 
performance, availability, and other things that would be required of your 
storage by the application -- and so spend a lot of time setting things up,
 and then hopefully, down the line, consume it the way you thought you 
would.
Difficult change management
In
 many cases, this causes long provisioning cycles. It causes difficult 
change management after you provision the application. You find that you
 need to change things around, because either the business needs have 
changed or what you guessed was wrong. For example, customers have to 
face constant 
data migration.
With
 the policy-driven approach that Christos has just described -- with the 
ability to create these storage services on-the-fly -- 
you don't have to do all that pre-provisioning and 
preconfiguring. As you create the VMs and specify the requirements, the
 system responds accordingly. When you have to change things, you 
just modify the policy and everything in the underlying infrastructure changes accordingly.
Responsiveness, in my opinion,
 is the single biggest benefit that IT will deliver to the business by 
shifting to software-defined storage. There are many others, but I want 
to focus on the most important one.
Gardner: Can
 you explain what happens when software-defined storage becomes strategic at the 
applications level, perhaps with implications across the entire data lifecycle?
Karamanolis:
 One thing we already see, not only among VMware customers, but as a more 
generic trend, is that infrastructure administrators -- the guys who do 
the heavy-lifting in the 
data centers
 day in and day out, and who manage much more than just the traditional servers and applications -- are getting more and more 
into managing networks and data storage.
Talking
 about changing models here, what we see is that tools have to be 
developed and software-defined storage is a key technology evolution 
behind that. These are tools for those administrators to manage all 
those resources that they need to make their day-to-day jobs happen.
Here, software-defined storage is playing a key role. With technology like 
Virtual SAN,
 we make the management of storage accessible to people who are not 
necessarily experts in the esoteric details of a certain vendor's hardware. It 
allows more IT professionals to specify the requirements 
of their applications. 
Then, the software storage 
platform can apply those requirements on the fly to provision, 
configure, and dynamically monitor and enforce compliance for the policy
 and requirements that are specified for the applications. This is a 
major shift we see in the IT industry today, and it’s going to be 
accelerated by technologies like Virtual SAN.
Gardner: When you go to software-defined storage, you 
can get to policy level, automation, and intelligence when it comes to 
how you're executing on storage. How does software-defined storage simplify storage overall? 
Distributed platform
Karamanolis:
 That's an interesting point, because if you think about this 
superficially, we’ll now go from a single, monolithic storage entity to a
 storage platform that is distributed, controlled by software, and can 
span tens or sometimes hundreds of physical nodes and/or entities. Isn't
 complexity greater in the latter case?
The reality is 
that, whether out of necessity or because we've learned a
 lot over the last 10 to 15 years about how to manage and control large 
distributed systems, there has been a parallel evolution in how
 you manage your infrastructure, including the management of storage.
As
 we alluded to already, the fundamental model here is that the end user,
 the IT professional that manages this infrastructure, expresses in a 
descriptive way, what they need for their applications in terms of CPU, 
memory, networking, and, in our case, storage. 
What do
 I mean by descriptive? The IT professional does not need to understand 
all the internal details of the technologies or the hardware used at any
 point in time, and which may evolve over a period of time. 
Instead,
 they express at a high level a set of requirements -- we call 
them policies -- that capture the requirements of the application. For 
example, in the case of storage, they specify the level of availability 
that is 
required for certain applications and performance goals, and they can 
also specify things like the data protection policies for certain data 
sets.
Of
 course, for all those things, nothing comes for free. So the user 
has to be exposed to the consequences of the policy that they choose. There 
is a cost there for every one of those services.
But 
the key point is that the software platform automatically configures the
 appropriate resources, whether that means spreading data across multiple physical
 devices, across the network, or replicating
 data asynchronously to a remote location in order to comply with certain 
disaster recovery (DR) policies.
All
 those things are done by the software, without the user having to worry
 about whether the storage underneath is highly available, in 
which case only two copies of the data need to be created, 
or whether it is low-end hardware that would require 
three or four copies of the data. All those things are determined 
automatically by the platform.
This is the new model. 
Perhaps I'm oversimplifying some of these problems, but the idea is that
 the user should really not have to know the specific hardware configuration of
 a disk array. If the requirements cannot be met, it is because the
 necessary technologies are not incorporated into the storage platform.
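As a rough illustration of the "two copies versus three or four copies" decision Christos describes, the sketch below shows how a platform might translate the same availability policy into different configurations depending on the resilience of the underlying hardware. The thresholds, hardware classes, and function names are invented for illustration; this is not Virtual SAN's actual placement logic.

```python
# Hypothetical sketch of policy-to-configuration translation; the hardware
# classes and rules are invented for illustration and are not VMware's
# actual placement algorithm.

def replicas_needed(failures_to_tolerate: int, device_reliable: bool) -> int:
    """Map an availability requirement onto a device-specific copy count."""
    if device_reliable:
        # Highly available hardware: tolerate N failures with N+1 copies.
        return failures_to_tolerate + 1
    # Low-end commodity hardware: keep an extra copy as a safety margin.
    return failures_to_tolerate + 2


def apply_policy(policy: dict, hardware: dict) -> dict:
    """The platform, not the admin, decides how to satisfy the policy."""
    copies = replicas_needed(policy["failures_to_tolerate"],
                             hardware["highly_available"])
    return {
        "copies": copies,
        "placed_on": hardware["name"],
        "async_replica_site": policy.get("dr_site"),  # e.g., for DR policies
    }


if __name__ == "__main__":
    policy = {"failures_to_tolerate": 1, "dr_site": "site-b"}
    print(apply_policy(policy, {"name": "enterprise-array", "highly_available": True}))
    print(apply_policy(policy, {"name": "commodity-disks", "highly_available": False}))
```

The point of the sketch is only the division of labor: the admin states the requirement once, and the software works out the device-specific configuration.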
Policy driven
Farronato: Virtual SAN is a completely policy-driven product, and we call it VM-centric or 
application-centric. The whole management paradigm for storage, when you
 use Virtual SAN, is predicated around the VM and the policies that you 
create and you assign to the VMs as you create your VMs, as you scale 
your environment.
One of the great things that you can 
achieve with Virtual SAN is providing differentiated service levels to 
individual VMs from a single data store. In the past, you had to create 
individual LUNs or volumes, assign data services like replication 
or 
RAID levels to each individual volume, and then map the application to them.
With
 Virtual SAN, you're simply going to have a capacity container that 
happens to be distributed across a number of nodes in your cluster -- and 
everything that happens from that point on is just dropping your VMs into 
this container. It automatically instantiates all the data services by 
virtue of having built-in intelligence that interprets the requirements 
of the policy.
That makes this system extremely simple 
and intuitive to use. In fact, one of the core design objectives of 
Virtual SAN is simplicity. If you look at a short description of the 
system -- radically simple 
hypervisor-converged storage -- it means taking the idea of eliminating the complexity of storage to the next level.
Gardner:
 We've talked about simplicity, policy-driven management, automation, and 
optimization. It seems to me that those add up very quickly to a 
fit-for-purpose approach to storage, so that we are not 
under-provisioning or over-provisioning, and that can lead to 
significant cost-savings. 
So let's translate this back to economics. Alberto, do you have any thoughts on how we lower 
total cost of ownership (TCO) through these SDS approaches of simplicity, optimization, policy-driven management, and intelligence?
Farronato: There are always two sides of the equation. There
 is a 
CAPEX and an 
OPEX
 component. Looking at how a product like Virtual SAN reduces CAPEX, 
there are several ways, but I can mention a couple of key components or 
drivers.
First, I'd call out the fact that it is an 
x86
 server-based storage area network (SAN). So it leverages server-side components to deliver shared 
storage. By virtue of using server-side resources, right off the bat 
there are significant savings that you can achieve through lower-cost 
hardware components. The same hard drive or solid-state drive (SSD) that you would otherwise deploy in a shared 
external storage array can be on the order of 80 percent cheaper when purchased as a server component.
The other aspect that I would call out that 
reduces the overall CAPEX is, as you said, the 
consume-on-demand approach or, as we also put it, grow-as-you-go. 
With a scale-out model, you can start with a small deployment and a 
small upfront investment. 
You can then progressively 
scale out as your environment grows, with much finer granularity than 
you would have with a monolithic array. And as you scale, you scale both 
compute and 
IOPS, which often goes hand in hand with the number of VMs that you are running in your cluster.
System growth
 
So the system grows with the size of your 
environment, rather than requiring you to buy a lot of resources upfront 
that many times remain under-utilized for a long time.
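A minimal sketch of the grow-as-you-go idea follows. The per-node figures are placeholders chosen purely to show the shape of the model, not benchmarks or sizing guidance: capacity, IOPS, and the number of supported VMs grow in step with each node added, instead of being bought up front.

```python
# Illustrative only: the per-node numbers are placeholders, not benchmarks.

from dataclasses import dataclass


@dataclass
class Node:
    capacity_tb: float
    iops: int
    vms_supported: int


@dataclass
class Cluster:
    nodes: list

    def totals(self) -> dict:
        """Aggregate resources scale linearly with the number of nodes."""
        return {
            "capacity_tb": sum(n.capacity_tb for n in self.nodes),
            "iops": sum(n.iops for n in self.nodes),
            "vms_supported": sum(n.vms_supported for n in self.nodes),
        }


if __name__ == "__main__":
    node_spec = Node(capacity_tb=4.0, iops=20_000, vms_supported=25)  # placeholder figures
    cluster = Cluster(nodes=[node_spec] * 3)   # start small
    print("3 nodes:", cluster.totals())
    cluster.nodes.append(node_spec)            # grow as demand grows
    print("4 nodes:", cluster.totals())
```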
On
 the OPEX side, when things become simpler, it means that overall 
administration productivity increases. So we expect a trend where 
individual administrators will be able to manage a greater amount of capacity, 
and to do so in conjunction with management of the virtual infrastructure 
to achieve additional benefits.
Gardner: Virtual SAN has been in general availability now for several 
months, since March 2014, after being 
announced last year at VMworld 2013. Now that it’s in place and growing in the 
market, are there any unintended benefits or unintended consequences 
from that total-cost perspective in real-world day-in, day-out 
operations?
I'm looking for ways in which a typical 
organization is seeing software-defined storage benefiting them 
culturally and organizationally in terms of skills, labor, and that sort
 of softer metric.
Karamanolis: That's a very 
interesting point. We technologists sometimes tend to overlook the 
cultural shifts that technology causes in the field. In the case of 
Virtual SAN, we see a lot of what one customer described as being empowered to
 manage their own storage, within the vertical they control in 
their IT organization, without having to depend on the centralized 
storage organization in the company. 
What
 we really see here is a paradigm shift in how our customers use 
Virtual SAN today: it enables them to have a much faster turnaround for 
trying new applications and new workloads, and for getting them from test and 
dev into production, without being constrained by the processes 
and the timelines that are imposed by a central storage IT organization.
 
This is a major achievement, and a major tool for VMware administrators in the field, 
which we believe is going to lead the way to a much wider adoption of 
Virtual SAN and software-defined storage in general.
Gardner: How does this simplification and automation have a 
governance, risk, and compliance (GRC) benefit?
Farronato:
 With this approach, you have a more granular way to control the service 
levels that you deliver to your customers -- your internal customers -- 
and a more efficient way to do it, by standardizing through policies rather 
than trying to standardize service levels across a category of hardware. 
Self-service consumption
You
 can more easily keep track of what each individual application is 
receiving and whether it's in compliance with the particular policy that you
 specified. You can also now enable self-service consumption more easily
 and effectively. 
We have, as part of our Policy-Based Management Engine, 
APIs that will allow for integration with cloud automation frameworks, such as 
vCloud Automation Center or 
OpenStack, where end users will be able to consume a predefined category of service. 
It
 will speed up the provisioning process, while at the same time, 
enabling IT to maintain that control and visibility that all the admins 
want to maintain over how the resources are consumed and allocated.
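To show the self-service idea in miniature, here is a hedged sketch in Python. The service tiers, catalog, and function names are hypothetical and do not reflect the actual Policy-Based Management Engine, vCloud Automation Center, or OpenStack APIs; the point is only that the end user picks a predefined category of service while IT keeps control over what each tier means.

```python
# Hypothetical self-service sketch; tier names and functions are invented for
# illustration and do not reflect VMware's actual APIs.

SERVICE_CATALOG = {
    # Tiers predefined by IT; consumers never see the hardware details.
    "gold":   {"failures_to_tolerate": 2, "iops_reservation": 2000},
    "silver": {"failures_to_tolerate": 1, "iops_reservation": 500},
}


def request_vm_storage(tier: str, capacity_gb: int) -> dict:
    """End-user self-service request: pick a tier, get policy-backed storage."""
    if tier not in SERVICE_CATALOG:
        raise ValueError(f"Unknown service tier: {tier}")
    policy = dict(SERVICE_CATALOG[tier], capacity_gb=capacity_gb)
    # IT keeps visibility: every grant is recorded against the chosen tier.
    return {"tier": tier, "policy": policy, "status": "provisioned"}


if __name__ == "__main__":
    print(request_vm_storage("silver", capacity_gb=100))
```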
Gardner:
 I suppose there are as many on-ramps to the software-defined data center as
 there are enterprises. So it's interesting that it can be done at that 
custom level, based on actual implementation, but also have a strategic 
vision or a strategic architectural direction. So, it's future-proof as 
well as supporting legacy.
How about some examples? Do 
we have either use-case scenarios or actual organizations that we can 
look to and say that they have deployed Virtual SAN, that they have 
benefited in certain ways, and that they are indicative of what others should 
expect?  
Farronato: Let me give you some 
statistics and some interesting facts. We can look at some of the early 
examples where, in the last three months since the product has become 
available, we've seen significant success already in the marketplace,
 with a great start in terms of adoption by our customers. 
We
 already have more than 300 paying customers in just one quarter. That 
follows the great success of the public beta that ran through the fall 
and the early winter with several thousand customers testing and taking a
 look at the product.  
We are finding that 
virtual desktop infrastructure (VDI)
 is the most popular use case for Virtual SAN right now. There are a 
number of reasons why Virtual SAN fits this use case, from its scale-out model
 to the fact that the hyper-converged storage architecture is 
particularly well suited to address the storage issues of a VDI deployment.
 
DevOps or, if you want, preproduction environments, 
loosely defined as test and dev, is another area. There are also disaster recovery targets in 
combination with 
vSphere Replication and 
Site Recovery Manager. And some of the more aggressive customers are also starting to deploy it in production use cases.
As
 I said, the 300 customers that we already have span the gamut in terms 
of size and names. We have large enterprises and banks, down to 
smaller accounts and companies, including education and smaller SMBs.  
There
 are a couple of interesting cases that we'll be showcasing at VMworld 2014 in late August. 
If you look at the session list, they're already available as actual use
 cases presented by our customers themselves. 
Adobe 
will be talking about their massive implementation of Virtual SAN 
for their production environment, on their data analytics platform. 
There will be another interesting use case with 
TeleTech, talking about how they have leveraged 
Cisco UCS to progress their VDI deployments. 
VDI equation
Gardner:
 I'd like to revisit the VDI equation for a moment, because one of the 
things that’s held people up is the impact on storage, and the costs 
associated with the storage to support VDI. But if you're able to bring 
down costs by 50 percent, in some cases, using software-defined storage,
 that radically changes the VDI equation. Isn't that the case, Christos,
 where you can now say that you can do VDI cheaper than almost any other
 approach to a virtualized desktop? 
Karamanolis:
 Absolutely, and the cost of storage is the main impediment for 
organizations implementing a VDI strategy. With Virtual SAN, as Alberto 
mentioned earlier, we provide a very compelling cost proposition, both 
in terms of the capacity of the storage, as well as the performance you 
gain out of the storage.
Alberto already touched on 
the cost of capacity, referring to the difference in the prices one can 
get from server vendors and from the open market, as opposed to the same 
hardware being procured as part of a traditional disk array. 
I'd
 like to touch on something that is an unsung hero of Virtual SAN and of
 VDI deployment especially, and that's performance. Virtual SAN, as 
should be clear by now, is a storage platform that is strongly 
integrated with our hypervisor. Specifically, the data path 
implementation and the distributed protocols that are implemented in 
Virtual SAN are part of the 
ESXi kernel. 
That
 means we can actually achieve very high 
performance goals, while we minimize the CPU cycles that are consumed to
 serve those high I/Os per second. What that means, especially for VDI, 
is that we use a small slice of the CPU and memory of every single ESXi 
host to implement this distributed software-driven storage 
controller. 
It
 doesn't adversely affect the other VMs that run on the same ESXi host. We have 
already published extensive and detailed performance evaluations, where 
we compare VDI deployments running on Virtual SAN versus using an external 
disk array. 
And even though Virtual SAN usage
 is capped at 10 percent of local CPU and memory on those hosts, the 
consolidation ratio -- the number of virtual desktops we run on those 
clusters -- is virtually unaffected, while we get the full performance that
 would be realized with an external, all-flash disk array. So this is the 
value of Virtual SAN in those environments. 
Essentially,
 you get both the capacity and the performance that your VDI workloads need, 
for a fraction of the cost you would pay with traditional disk 
array storage.
Gardner: We're only a few weeks from 
VMworld 2014 in San Francisco, and I know there's going to be a lot of interest in 
mobile and in desktop infrastructure for virtualized desktops and 
applications. 
Do you think that we can make some sort 
of a determination about 2014? Maybe this is the year that we turn the 
corner on VDI, and that becomes a bigger driver of some of these higher 
efficiencies. Any closing thoughts on the vision for the software-defined 
data center, VDI, and the timing with VMworld, Alberto?
Last barrier
Farronato:
 Certainly, one of the goals that we set for ourselves with this Virtual SAN 
release was solving the VDI use case -- eliminating probably the last 
barrier and enabling broader adoption of VDI across the enterprise -- 
and we hope that will materialize. We're very excited about what the 
early findings show.
With respect to VMworld and some 
of the other things that we'll be talking about at the conference with 
respect to storage, we'll continue to explain our vision of 
software-defined storage, talk about the Virtual SAN momentum, and cover some of 
the key initiatives that we are rolling out with our OEM partners around
 things such as Virtual SAN Ready Nodes. 
We're going 
to talk about how we will extend the concept of policy management and 
dynamic composition of storage services to external storage, with a 
technology called Virtual Volumes. 
There are many other things, and it's gearing up to be a very exciting VMworld Conference for storage-related issues.
Gardner:
 Last word to you, Christos. Do you have any thoughts about why 2014 is 
such a pivotal time in the software-defined storage evolution?
Karamanolis:
 I think that this is the year where the vision that we've been talking 
about, us and the industry at large, is going to become real in the eyes
 of some of the bigger, more conservative enterprise IT organizations. 
With
 Virtual SAN from VMware, we're going to make a very strong case at 
VMworld that this is a real enterprise-class storage system that's 
applicable across a very wide range of use cases and customers. 
With
 actual customers using the product in the field, I believe that it is 
going to be strong evidence for the rest of the industry that 
software-defined storage is real, it is solving real world problems, and
 it is here to stay. 
Together with opening up to third parties some of 
the management APIs that Virtual SAN uses in VMware products, 
through the Virtual Volumes technology that Alberto mentioned, 
we'll also be initiating an industry-wide effort to provide and offer 
software-defined storage solutions beyond just 
VMware and the early companies, mostly startups so far, that have been 
adopting this model. It's going to become a key industry direction.
Listen to the podcast. Find it on iTunes. Read a full transcript or download a copy. Sponsor: VMware.