On-Premise Deployment
Warning
This section will discuss deploying Chef Automate HA on-premise machines or on existing VMs. The steps are as follows:
Install Chef Automate HA
Prerequisites
- All VMs or Machines are up and running.
- OS Root Volume (/) must be at least 40 GB.
- TMP space (/var/tmp) must be at least 5GB.
- Separate Hab volume (/hab) provisioned at least 100 GB for OpenSearch node
/hab
volume will be more based on the data retention policy. - A Common user has access to all machines.
- This common user should have sudo privileges.
- This common user uses the same SSH Private Key file to access all machines.
- Key-based SSH for the provisioning user for all the machines for HA-Deployment.
- We do not support passphrases for Private Key authentication.
- LoadBalancers are set up according to Chef Automate HA Architecture needs as explained in Load Balancer Configuration page.
- Network ports are opened as per Chef Automate Architecture needs as explained in Security and Firewall page.
- Make sure your Linux has the
sysctl
utility available in all Machines. - DNS is configured to redirect
chefautomate.example.com
to the Primary Load Balancer. - DNS is configured to redirect
chefinfraserver.example.com
to the Primary Load Balancer. - Certificates are created and added for the chefautomate.example.com, and
chefinfraserver.example.com
in the Load Balancers. - If DNS is not used, add the records to
/etc/hosts
in all the machines, including Bastion:
sudo sed '/127.0.0.1/a \\n<Primary_LoadBalancer_IP> chefautomate.example.com\n<Primary_LoadBalancer_IP> chefinfraserver.example.com\n' -i /etc/hosts
- If the instance is RedHat, set SElinux config
enforcing
topermissive
in all the nodes. SSH to each node, then run:
sudo sed -i 's/SELINUX=enforcing/SELINUX=permissive/g' /etc/selinux/config
Warning
- PLEASE DO NOT MODIFY THE WORKSPACE PATH; it should always be “/hab/a2_deploy_workspace”.
- We currently don’t support AD managed users in nodes. We only support local Linux users.
- If you have configured a sudo password for the user, you must create an environment variable
sudo_password
and set the password as the variable’s value. Example:export sudo_password=<password>
. And then, run all sudo commands with thesudo -E or --preserve-env
option. Example:sudo -E ./chef-automate deploy config.toml --airgap-bundle automate.aib
. This is required for thechef-automate
CLI to run the commands with sudo privileges. Please refer this for details.
Steps to run on Bastion Host Machine
Run the below commands to download the latest Automate CLI and Airgapped Bundle:
#Run commands as sudo. sudo -- sh -c " #Download Chef Automate CLI. curl https://packages.chef.io/files/current/latest/chef-automate-cli/chef-automate_linux_amd64.zip \ | gunzip - > chef-automate && chmod +x chef-automate \ | cp -f chef-automate /usr/bin/chef-automate #Download the latest Airgapped Bundle. #To download specific version bundle, example version: 4.2.59 then replace latest.aib with 4.2.59.aib curl https://packages.chef.io/airgap_bundle/current/automate/latest.aib -o automate.aib "
Note
Chef Automate bundles are available for 365 days from the release of a version. However, the milestone release bundles are available for download forever.Note
If the Airgapped Bastion machine differs, transfer the Bundle file (latest.aib
) and Chef Automate CLI binary (chef-automate
) to the Airgapped Bastion Machine using thescp
command.After transferring, in Airgapped Bastion, run the below commands:
#Run commands as sudo. sudo -- sh -c " #Move the Chef Automate CLI to `/usr/bin`. cp -f chef-automate /usr/bin/chef-automate "
Steps to generate config
Generate config using the below command:
sudo chef-automate config gen config.toml
Click here to know more about generating config
or
Generate the empty config and populate manually
sudo -- sh -c " chef-automate init-ha-config existing_infra "
Config Verify
We verify the above config using the below command :
sudo chef-automate verify -c config.toml
To know more about config verify you can check Config Verify Doc page.
Once the verification is succesfully completed, then proceed with deployment, In case of failure please fix the issue and re-run the verify command.
Steps to deploy
Continue with the deployment after generating the config:
#Run commands as sudo. sudo -- sh -c " #Print data in the config cat config.toml #Run deploy command to deploy `automate.aib` with set `config.toml` chef-automate deploy config.toml --airgap-bundle automate.aib "
Verify the deployment by checking status summary
#Run commands as sudo. sudo -- sh -c " #After Deployment is done successfully. Check the status of Chef Automate HA services chef-automate status summary "
Check if Chef Automate UI is accessible by going to (Domain used for Chef Automate) https://chefautomate.example.com. After successful deployment, proceed with following…
Sample Config
Note
- Assuming 10+1 nodes (1 bastion, 2 for automate UI, 2 for Chef-server, 3 for Postgresql, 3 for OpenSearch).
- The following config will, by default, leave the backup configuration empty.
- To provide multiline certificates use triple quotes like
""" multiline certificate contents"""
.
[architecture]
[architecture.existing_infra]
ssh_user = "ec2-user"
ssh_group_name = "ec2-user"
ssh_key_file = "~/.ssh/my-key.pem"
ssh_port = "22"
secrets_key_file = "/hab/a2_deploy_workspace/secrets.key"
secrets_store_file = "/hab/a2_deploy_workspace/secrets.json"
architecture = "existing_nodes"
workspace_path = "/hab/a2_deploy_workspace"
backup_mount = "/mnt/automate_backups"
backup_config = "file_system"
[automate]
[automate.config]
admin_password = "Progress@123"
fqdn = "chefautomate.example.com"
config_file = "configs/automate.toml"
root_ca = "-----BEGIN CERTIFICATE-----
<Certificates>
-----END CERTIFICATE-----"
instance_count = "2"
[chef_server]
[chef_server.config]
fqdn = "chefinfraserver.example.com"
lb_root_ca = "-----BEGIN CERTIFICATE-----
<Certificates>
-----END CERTIFICATE-----"
instance_count = "2"
[opensearch]
[opensearch.config]
instance_count = "3"
[postgresql]
[postgresql.config]
instance_count = "3"
[existing_infra]
[existing_infra.config]
automate_private_ips = ["192.0.0.1", "192.0.0.2"]
chef_server_private_ips = ["192.0.0.3", "192.0.0.4"]
opensearch_private_ips = ["192.0.0.5", "192.0.0.6", "192.0.0.7"]
postgresql_private_ips = ["192.0.0.8", "192.0.0.9", "192.0.0.10"]
On-Premise Setup with AWS Managed Services
Prerequisites
- Follow the Prerequisites for On-Premise deployment. Click here to know more.
- This deployment excludes the installation for Postgresql and OpenSearch as we are using the AWS Managed Services.
- Set up AWS RDS PostgreSQL 13.5-R1. Click here to know more. Open the required port in Security Groups while creating AWS RDS Postgresql.
- Set up AWS OpenSearch 1.3. Click here to know more.
- For Backup and Restore with Managed Service. Click here to know more.
- Create the Virtual Private Cloud (VPC) in AWS before starting or using default. Click here to learn more about VPC and CIDR creation.
- Get AWS credentials (
aws_access_key_id
andaws_secret_access_key
) with privileges like:AmazonS3FullAccess
andAdministratorAccess
. Click here to learn more about creating IAM Users. See the steps here to run on Bastion to download the latest Automate CLI and Airgapped Bundle.
Steps to generate config
Generate config using the below command:
sudo chef-automate config gen config.toml
Click here to know more about generating config
or
Generate the empty config and populate manually
sudo -- sh -c " chef-automate init-ha-config existing_infra "
Config Verify
We verify the above config using the below command:
sudo chef-automate verify -c config.toml
To know more about config verify you can check Config Verify Doc page.
Once the verification is succesfully completed, then proceed with deployment, In case of failure please fix the issue and re-run the verify command.
Steps to deploy
Continue with the deployment after generating the config:
#Run commands as sudo.
sudo -- sh -c "
#Verify the data in the config
cat config.toml
#Run deploy command to deploy `automate.aib` with set `config.toml`
chef-automate deploy config.toml --airgap-bundle automate.aib
"
Verify Deployment
Verify the deployment by checking status summary:
sudo -- sh -c "
#After Deployment is done successfully. Check the status of Chef Automate HA services
chef-automate status summary
"
Check if Chef Automate UI is accessible by going to (Domain used for Chef Automate) https://chefautomate.example.com.
After successful deployment, proceed with following…
- Create user and orgs, Click here to learn more about user and org creation
- Workstation setup, Click here to learn more about workstation setup
- Node bootstrapping, Click here to learn more about node bootstraping.
Sample Config to setup On-Premise Deployment with AWS Managed Services
[architecture]
[architecture.existing_infra]
ssh_user = "ec2-user"
ssh_group_name = "ec2-user"
ssh_key_file = "~/.ssh/my-key.pem"
ssh_port = "22"
secrets_key_file = "/hab/a2_deploy_workspace/secrets.key"
secrets_store_file = "/hab/a2_deploy_workspace/secrets.json"
architecture = "existing_nodes"
workspace_path = "/hab/a2_deploy_workspace"
backup_mount = "/mnt/automate_backups"
backup_config = "object_storage"
[object_storage]
[object_storage.config]
bucket_name = "fdjlfdsklfds"
access_key = "CCAI..............."
secret_key = "JVS................"
endpoint = "https://s3.amazonaws.com"
region = "us-east-2"
[automate]
[automate.config]
admin_password = "adminpassword"
fqdn = "chefautomate.example.com"
config_file = "configs/automate.toml"
root_ca = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
instance_count = "2"
[chef_server]
[chef_server.config]
fqdn = "chefinfraserver.example.com"
lb_root_ca = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
instance_count = "2"
[opensearch]
[opensearch.config]
instance_count = "0"
[postgresql]
[postgresql.config]
instance_count = "0"
[existing_infra]
[existing_infra.config]
automate_private_ips = ["192.0.0.1", "192.0.0.2"]
chef_server_private_ips = ["192.0.0.3", "192.0.0.4"]
[external]
[external.database]
type = "aws"
[external.database.postgre_sql]
instance_url = "pg.aws.com:5432"
superuser_username = "masteruser"
superuser_password = "masterpassword"
dbuser_username = "dbusername"
dbuser_password = "dbpassword"
[external.database.open_search]
opensearch_domain_name = "opensearchdomain"
opensearch_domain_url = "os.aws.com"
opensearch_username = "osuser"
opensearch_user_password = "opensearchpassowrd"
[external.database.open_search.aws]
aws_os_snapshot_role_arn = "arn:aws:acm:ap-southeast-2:112758395563:certificate/9b04-6513-4ac5-9332-2ce4e"
os_snapshot_user_access_key_id = "CCAI..............."
os_snapshot_user_access_key_secret = "JVS................"
On-Premise Setup with Self-Managed Services
Prerequisites
- Follow the Prerequisites for On-Premise deployment. Click here.
- This deployment excludes the installation for Postgresql and OpenSearch as we are using the Self Managed services. See the steps here to run on Bastion to download the latest Automate CLI and Airgapped Bundle.
Steps to generate config
Generate config using the below command:
sudo chef-automate config gen config.toml
Click here to know more about generating config
or
Generate the empty config and populate manually
sudo -- sh -c " chef-automate init-ha-config existing_infra "
Config Verify
We verify the above config using the below command :
sudo chef-automate verify -c config.toml
To know more about config verify you can check Config Verify Doc page.
Once the verification is succesfully completed, then proceed with deployment, In case of failure please fix the issue and re-run the verify command.
Steps to deploy
Continue with the deployment after generating the config:
#Run commands as sudo.
sudo -- sh -c "
#Verify the data in the config
cat config.toml
#Run deploy command to deploy `automate.aib` with set `config.toml`
chef-automate deploy config.toml --airgap-bundle automate.aib
"
Verify Deployment
Verify the deployment by checking status summary:
sudo -- sh -c "
#After Deployment is done successfully. Check the status of Chef Automate HA services
chef-automate status summary
"
Check if Chef Automate UI is accessible by going to (Domain used for Chef Automate) https://chefautomate.example.com.
After successful deployment, proceed with following…
- Create user and orgs, Click here to learn more about user and org creation
- Workstation setup, Click here to learn more about workstation setup
- Node bootstrapping, Click here to learn more about node bootstraping.
Sample Sonfig to setup On-Premise Deployment with Self Managed Services
[architecture]
[architecture.existing_infra]
ssh_user = "ec2-user"
ssh_group_name = "ec2-user"
ssh_key_file = "~/.ssh/my-key.pem"
ssh_port = "22"
secrets_key_file = "/hab/a2_deploy_workspace/secrets.key"
secrets_store_file = "/hab/a2_deploy_workspace/secrets.json"
architecture = "existing_nodes"
workspace_path = "/hab/a2_deploy_workspace"
backup_mount = "/mnt/automate_backups"
backup_config = "object_storage"
[object_storage]
[object_storage.config]
bucket_name = "example-bucket"
access_key = "JVS......."
secret_key = "VIK........"
endpoint = "https://objectstorage.example.com"
[automate]
[automate.config]
admin_password = "adminpassword"
fqdn = "chefautomate.example.com"
config_file = "configs/automate.toml"
root_ca = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
instance_count = "2"
[chef_server]
[chef_server.config]
fqdn = "chefinfraserver.example.com"
lb_root_ca = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
instance_count = "2"
[opensearch]
[opensearch.config]
instance_count = "0"
[postgresql]
[postgresql.config]
instance_count = "0"
[existing_infra]
[existing_infra.config]
automate_private_ips = ["192.0.0.1", "192.0.0.2"]
chef_server_private_ips = ["192.0.0.3", "192.0.0.4"]
[external]
[external.database]
type = "self-managed"
[external.database.postgre_sql]
instance_url = "pg.example.com:5432"
superuser_username = "superusername"
superuser_password = "superuserpassowrd"
dbuser_username = "databaseusername"
dbuser_password = "databaseuserpassword"
postgresql_root_cert = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
[external.database.open_search]
opensearch_domain_name = "opensearch-domain"
opensearch_domain_url = "opensearch.example.com:9200"
opensearch_username = "opensearchusername"
opensearch_user_password = "opensearchuserpassword;"
opensearch_root_cert = "-----BEGIN CERTIFICATE-----
-----END CERTIFICATE-----"
Add More Nodes to the OnPremises Deployment
The commands require some arguments so that it can determine which types of nodes you want to add to your HA setup from your bastion host. It needs the IP addresses of the nodes you want to add as comma-separate values with no spaces in between.
For example,
if you want to add nodes with IP 10.1.2.23 to automate, you have to run the:
chef-automate node add --automate-ips 10.1.2.23
If you want to add nodes with IP 10.1.2.23 and 10.0.1.42 to the chef-server, you have to run the:
chef-automate node add --chef-server-ips 10.1.2.23,10.0.1.42
If you want to add nodes with IP 10.1.2.23 and 10.0.1.42 to OpenSearch, you have to run the:
chef-automate node add --opensearch-ips 10.1.2.23,10.0.1.42
If you want to add nodes with IP 10.1.2.23, 10.0.1.54 and 10.0.1.42 to PostgreSQL you have to run:
chef-automate node add --postgresql-ips 10.1.2.23,10.0.1.42,10.0.1.54
You can mix and match different services to add nodes across various services.
If you want to add nodes with IP 10.1.2.23 to automate and nodes with IP 10.0.1.54 and 10.0.1.42 to PostgreSQL, you have to run:
chef-automate node add --automate-ips 10.1.2.23 --postgresql-ips 10.0.1.42,10.0.1.54
If you want to add nodes with IP 10.1.2.23 to automate, nodes with IP 10.1.0.36 and 10.0.1.233 to chef-server, and nodes with IP 10.0.1.54 and 10.0.1.42 to PostgreSQL you have to run:
chef-automate node add --automate-ips 10.1.2.23 --chef-server-ips 10.1.0.36,10.0.1.233 --postgresql-ips 10.0.1.42,10.0.1.54
Once the command executes, it will add the supplied nodes to your automate setup. The changes might take a while.
- Make sure to update your loadbalancer configuration with the IP address of the new node. For reference, check Load Balancer Configuration page
Note
- If you have patched some external config to any existing services, then apply the same on the new nodes. For example, if you have patched any external configurations like SAML or LDAP or any other done manually post-deployment in automate nodes, make sure to patch those configurations on the new automate nodes. The same must be followed for services like Chef-Server, Postgresql, and OpenSearch.
- The new node will be configured with the certificates already configured in your HA setup.
- If you had applied unique certificates per node, then the certificates of one of the nodes have been applied by default on the new nodes.
- If you want to change the certificates for the new nodes, you can manually run the
chef-automate cert-rotate [options]
command.
Warning
Remove Single Node From Cluster on OnPremises Deployment
Warning
- We do not recommend the removal of any node from the backend cluster, but replacing the node is recommended. For the replacement of a node, click here for reference.
- Removal of nodes for Postgresql or OpenSearch is at your own risk and may result in data loss. Consult your database administrator before trying to delete Postgresql or OpenSearch nodes.
- Below process can be done for
chef-server
andautomate
. - Only one node can be removed simultaneously, irrespective of node type.
The command requires some arguments to determine which types of nodes you want to remove from your HA setup from your bastion host. It needs the IP address of the node you want to remove. For example,
If you want to remove the node of automate, you have to run the:
chef-automate node remove --automate-ip "<automate-ip-address>"
If you want to remove the node of the chef-server, you have to run the:
chef-automate node remove --chef-server-ip "<chef-server-ip-address>"
If you want to remove the node of OpenSearch, you have to run the:
chef-automate node remove --opensearch-ip "<opensearch-ip-address>"
If you want to remove the node of PostgreSQL, you have to run:
chef-automate node remove --postgresql-ip "<postgresql-ip-address>"
Once the command executes, it will remove the supplied node from your HA setup. The changes might take a while.
- Make sure to remove the IP address of the deleted node from your loadbalancer configuration. For reference, check Load Balancer Configuration page
Note
- The IPs provided must be associated with a node in your HA setup.
- Automate instance count cannot be less than 1.
- Chef Server instance count cannot be less than 1.
- Open search instance count cannot be less than 3.
- Postgresql instance count cannot be less than 3.
Replace Node in Automate HA Cluster
- Add a New Node following this.
- Stop the Habitat Supervisor on the node .i.e; going to be removed, use the
systemctl stop hab-sup
command to stop the habitat supervisor. - Remove an Existing Node following this.
Uninstall Chef automate HA
Danger
To Uninstall On-Premise
To uninstall Chef Automate HA instances after unsuccessful deployment, run the below command in your bastion host.
chef-automate cleanup --onprem-deployment
Troubleshooting
Failure to Replace Nodes
Error: Upload failed: scp: /var/automate-ha: Permission denied
Resolution: Execute the below command.
cd /hab/a2_deploy_workspace/terraform for x in $(terraform state list -state=/hab/a2_deploy_workspace/terraform/terraform.tfstate | grep module); do terraform taint $x; done cd -
Run the
deploy
command again once the module’s tainted.chef-automate deploy config.toml --airgap-bundle <Path-to-the-airgap-bundle>