Provisioning a VPC with CloudFormation in AWS • Mostly Cloudy • Joe McShea's Blog

This post is about a detailed examination of a CloudFormation template for provisioning a Virtual Private Cloud (VPC) in Amazon AWS. It is the first of what will be a series of posts that talk about Infrastructure as Code, revolving around trying to accomplish some particular thing. In particular, I’m moving my blog (this blog) to AWS from a shared hosting provider, and my experiences doing that is what this series will be about.

I have some specific goals in moving my blog, like my current service is a bit slower than I would like, and also less reliable (only 2 nines of availability), so I was looking to upgrade either way. Looking at the next step up in shared hosting, it would cost about twice as much as I’m currently spending (around $400/year), so I would ideally like to come up with something on AWS that isn’t much more than that, but with improved reliability and maybe speed. I have no idea starting out if I can actually meet any of these goals.

What I’m trying to build looks something like this:

Complete diagram of my VPC plus workload

And right off the bat, that looks fairly frightening to me. It’s not really, but it’s a lot to bite off at once if you’re just getting started with AWS.

In truth, however, I’m not really just starting off as I write this, in fact by the time you read this post my blog will already be running on AWS. If you happened to come to my blog sometime in August and it was down…sorry, there was a bit of a learning curve in the transition ;). But in either case, if you want to build something like this and you don’t know where to start, start small. So this post is only going to describe a template to deploy the networking backbone for a two tiered application with high availability. Basically, that’s what a VPC is. Think of it like a corporate network. But more specifically, like the portion of a corporate network at a single location, or at least a group of locations in close proximity that don’t have to reach out to the Internet to talk to one another.

And that looks like this:

That’s a wee bit better. What I’m after is the networking infrastructure for a two tier application. And for resiliency, I want to provision it across two different availability zones. Now I’ve seen some wonderful templates out there that will let me do two to four or even more availability zones, using conditional resources, and that’s great, but a thousand plus lines of JSON (possibly multiple thousands) and split across several files, and I got tired just perusing them. I’d like to start a bit simpler. There are also some other reasons I rejected these templates besides complexity, but that’s best left for another blog post (probably one of the next couple posts in this series, I think they call that foreshadowing).

So I’m going to start off with a template that builds out a two tier network in 2, and only 2, availability zones. That simplification allowed me to do this template in about 600 lines of JSON, which is quite a bit smaller than most of the similar sample templates I found. It’s actually only a bit more than 300 lines of YAML, which I guess is arguably a point in the favor of YAML, but the difference is mostly formatting and white space and white space is free and makes stuff easier to read in my opinion. Now having the smallest template isn’t exactly the most important thing, especially if you get there by leaving out important stuff, but it is pretty helpful when you’re writing a blog post that is going to attempt to describe that template in nauseating detail.

Note that nowhere in this post am I going to mention the shared hosting provider I was using, because this isn’t a hosting provider bashing blog. I was on an introductory level shared hosting plan. So when I say that neither speed nor reliability was quite as good as I would like, one option would have been to upgrade to a better plan with the same provider. When you’re on an introductory level shared hosting plan, you can’t expect 4 nines of availability or lightning fast page load times. You get what you pay for. And by using WP Super Cache, I was able to get the page load times for cached pages down to a respectable 2.5 seconds, so my hosting provider wasn’t terrible, but I was ready to upgrade.

The CloudFormation Template for my VPC

The basic format of a CloudFormation template is pretty simple and looks like this:

{
    "AWSTemplateFormatVersion": "2010-09-09",
    "Description": "VPC: public and private subnets in two availability zones.",
    "Metadata": {
    },
    "Parameters": {
    },
    "Resources": {
    },
    "Outputs": {
    }
}

{

"AWSTemplateFormatVersion": "2010-09-09",

"Description": "VPC: public and private subnets in two availability zones.",

"Metadata": {

"Parameters": {

"Resources": {

"Outputs": {

}

Now the devil’s in the details, and the details are in those 4 empty properties above, Metadata, Parameters, Resources, and Outputs. All of these sections are optional except the resources section, but you’ll actually use most of them in most templates. There are actually several other optional sections that are not used in this template. I’ll discuss them as I get to them in other blog posts in this series.

Also, now is a good time to mention that I’m a long time developer, so I’m very comfortable with JSON. And also fairly appalled by any language that treats white space as significant, so I’m not overly fond of YAML. Those are the two options for writing CloudFormation templates. And JSON appears to be an unpopular choice among CloudFormation practitioners these days, but all of my CloudFormation templates will be in JSON. If you prefer YAML, there’s no need for us to argue about the relative virtues of one over the other. The CloudFormation designer in the AWS console will seamlessly convert back and forth between the two in a split second, so use whichever syntax you like. Besides, I’ll probably just include both formats in the download to try and make everyone happy, but I’ll be showing the JSON version in this post.

VPC Parameters

I’ll start off with parameters, because it will control much of how users of the template will interact with a template and customize it to their particular use case. So well thought out parameters can be the difference between a good CloudFormation template and a bad one. Here is the parameters section of my template:

"Parameters": {
    "AvailabilityZones": {
        "Description": "List of Availability Zones to use for the subnets in the VPC. Note: The logical order is preserved and only 2 AZs are used for this deployment.",
        "Type": "List<AWS::EC2::AvailabilityZone::Name>"
    },
    "VPCCIDR": {
        "AllowedPattern": "^([0-9]{1,3}\\.){3}[0-9]{1,3}(\\/([0-9]|[1-2][0-9]|3[0-2]))?$",
        "ConstraintDescription": "CIDR block parameter must be in the form x.x.x.x/16-28",
        "Default": "172.24.0.0/16",
        "Description": "CIDR Block for the VPC",
        "Type": "String"
    },
    "PublicSubnetACIDR": {
        "AllowedPattern": "^([0-9]{1,3}\\.){3}[0-9]{1,3}(\\/([0-9]|[1-2][0-9]|3[0-2]))?$",
        "ConstraintDescription": "CIDR block parameter must be in the form x.x.x.x/16-28, and of course must fall within the VPC CIDR.",
        "Default": "172.24.42.0/24",
        "Description": "CIDR Block for the public DMZ subnet 1 located in Availability Zone 1",
        "Type": "String"
    },
    "PrivateSubnetACIDR": {
        "AllowedPattern": "^([0-9]{1,3}\\.){3}[0-9]{1,3}(\\/([0-9]|[1-2][0-9]|3[0-2]))?$",
        "ConstraintDescription": "CIDR block parameter must be in the form x.x.x.x/16-28, and of course must fall within the VPC CIDR.",
        "Default": "172.24.44.0/24",
        "Description": "CIDR block for private subnet 1 located in Availability Zone 1.",
        "Type": "String"
    },
    "PublicSubnetBCIDR": {
        "AllowedPattern": "^([0-9]{1,3}\\.){3}[0-9]{1,3}(\\/([0-9]|[1-2][0-9]|3[0-2]))?$",
        "ConstraintDescription": "CIDR block parameter must be in the form x.x.x.x/16-28, and of course must fall within the VPC CIDR.",
        "Default": "172.24.43.0/24",
        "Description": "CIDR Block for the public DMZ subnet 2 located in Availability Zone 2",
        "Type": "String"
    },
    "PrivateSubnetBCIDR": {
        "AllowedPattern": "^([0-9]{1,3}\\.){3}[0-9]{1,3}(\\/([0-9]|[1-2][0-9]|3[0-2]))?$",
        "ConstraintDescription": "CIDR block parameter must be in the form x.x.x.x/16-28, and of course must fall within the VPC CIDR.",
        "Default": "172.24.45.0/24",
        "Description": "CIDR block for private subnet 2 located in Availability Zone 2.",
        "Type": "String"
    }
},

"Parameters": {

"AvailabilityZones": {

"Description": "List of Availability Zones to use for the subnets in the VPC. Note: The logical order is preserved and only 2 AZs are used for this deployment.",

"Type": "List<AWS::EC2::AvailabilityZone::Name>"

"VPCCIDR": {