Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents

Introduction to Abiquo

Abiquo is a cloud computing platform that enables you to manage public clouds and private clouds. You can use your own physical infrastructure to provide infrastructure as a service. It is a hypervisor agnostic model (no vendor lock-in) and it provides a unified interface for managing:

  • VM templates

  • Compute

  • Network

  • Storage

  • Events

Abiquo datacenters

In private cloud, a datacenter consists of:

  • A set of hypervisors

    Storage servers (for optional managed storage)

  • Network File System (NFS)

  • DHCP

  • Abiquo remote services

...

Abiquo is designed to manage multiple datacenters and public cloud providers.

Hypervisors

Abiquo manages can manage multiple hypervisors and conversions between hypervisors.
Managed by For example, with the vendor-provided API:

  • VMware

  • Hyper-V

...

  • vCenter

With the Abiquo AIM agent using libvirt

  • KVM

And Abiquo also works with all the major public cloud providers including AWS, Azure, Google Cloud Platform (GCP), Oracle Cloud Infrastructure (OCI), and VCD.

Abiquo remote services

The Abiquo platform uses Remotes Services to manage the hypervisors. The Abiquo remote services found in each datacenter are:

  • Appliance manager

  • Business process manager

  • Discovery manager

  • Virtualization manager

  • Virtual system monitor

  • Remote access manager

  • Services manager

Appliance manager

Provides For the ability to upload and download of VM templates into repositories templates and to make self-service templates available to the datacenter. Administrators may use the AM API to manage templates.   

Business process manager

Provides For conversion of VM templates between disk formats supported by hypervisors and export to public cloud providers, thus eliminating vendor lock-in.

  • Conversion requests and responses are submitted via message queues

  • Requests are processed one at a time,

    because working at NFS level implies being careful of

    to avoid I/O conflicts at NFS level

  • Uses RabbitMQ as a message broker

Discovery manager

Provides hypervisor discovery of the following:

  • Hypervisor type

  • Deployed VMs

  • Physical machine capabilities and resources

Uses the API provided by the hypervisor vendor. Also manages external storage.

Virtualization manager (virtual factory remote service)

This is the common layer to unify and manage the virtualization capabilities of each hypervisor type. Manages the life cycle of the VMs and network and storage configuration for VMs. Job requests and responses are submitted via message queues (RabbitMQ) with an actor-based model using Akka.

...

The Abiquo Server manages an arbitrary number of datacenters and public cloud regions by accessing their remote services. Communications between modules and datacenters are performed via RabbitMQ. MySQL stores the system configuration and state. The platform also offers enterprise functionality: such as pricing, scheduling, security. The server exposes its data and operations via a REST API.

Architecture diagram

The architecture elements fit together as shown in the following diagram.Image Removed

...

Technology stack

The Abiquo technology stack includes the following projects.

  • Apache projects

    • Thrift 

    • Commons

  • Others

    • MySQL (MariaDB)

    • Redis 

    • RabbitMQ

    • Akka 

    • Libvirt

    • Jersey

Virtual machine operations

This section outlines the VM operations in the virtual factory. The implementation uses asynchronous communication and independent, concurrent operations. It offers improved scalability, traceability, performance, and stability.

This document assumes a good working knowledge of the Abiquo platform.

Advantages

  • Scalability

    • asynchronous VM operations

      • independent

      • concurrent

  • Traceability

    • simple, standardized VM definition for all hypervisors

    • error messages and logs

    • monitoring console

  • Performance

    • non-blocking operations

  • Stability

    • improved reconfigure

    • improved rollback

    • standardized and unified hypervisor communication

    • automated testing

Scalability and stability

...

Virtual machine and hypervisor actions

  • Hypervisor Connection

    • login

    • logout

  • Virtual Machine

    • configure

    • reconfigure

    • unconfigure

    • snapshot

    • get state

    • power on

    • power off

    • stop

    • resume

Virtual machine states diagram

The VMs and VApps can transition between states as shown in the following diagram.Image Removed

...

See VM and VApp states.Virtual appliance states and Virtual machine states table

Deployment scenario

To deploy a virtual appliance containing N virtual machines, we operate directly on each VM. We perform

  • N virtual machine configurations

  • N power-on requests

We wish to run these N configurations in parallel (as resources permit) and request a power-on of each VM after configuration. These operations are concurrent and independent.Image Removed

...

The virtual factory coordinates these tasks.

...

The key to the performance and scalability of the VM operations is asynchronous communication. Communications between the API/Server and the virtual factory are all "send and forget" requests, so the operations on a VM are independent. The platform achieves high performance and scalability because it does not wait for requests on other VMs in a virtual appliance. 

Each request to the virtual factory is a complete entity and includes the hypervisor connection to avoid reconnects. 

A request contains:

  • tasks and jobs

  • virtual machine definitions

  • hypervisor connection

...

The Server/API and virtual factory communicate using RabbitMQ which uses AMQP.

...

Each datacenter has its own IN and OUT queue. The queue name is configured with the Abiquo configuration property property named abiquo.datacenter.id for for the Remote Services (V2V/BPM and virtual factory). This property identifies the AMQP datacenter queues. It is needed for building routing keys and queue names, so that they are unique for each datacenter. Do not change this property! The value of this property is generated at installation, but you can recover the property through the API with a Get Datacenterdatacenter request.

RabbitMQ is shared by all datacenters on the Abiquo platform. RabbitMQ is a single message broker for all infrastructure.

Tasks and jobs

Tasks are performed on virtual machinesVMs. A task is a collection of jobs. 
Tasks are executed independently and concurrently. Within tasks, jobs can be dependent and sequential or concurrent.

For example, when deploying a virtual appliance. There are two independent and concurrent tasks: deploy VM 1 and deploy VM 2. Within each task there are two dependent jobs: Configure and Power on. 

...

The components of the virtual factory and their basic functions are as follows.

  • Queue manager: handles incoming queue messages

  • Task orchestrator: decides when to send a job

  • Virtualization worker: executes the VM operation

Many jobs can run concurrently with multiple instances of the virtualization worker, but the number of jobs in a specific installation is dependent on both the available resources and the configuration.

The number of concurrent sessions is limited by resources because you cannot have more open sessions than the number of cores in your hypervisor host. It can also be controlled at the platform level, and for hypervisors/providers and for VDCs using the Abiquo Configuration Properties. For example, for the platform, the abiquo.virtualfactory.openSession property controls the default number of simultaneous operations on a single hypervisor or provider, and by default it is set to 2. To configure the number of concurrent connections, see Control the number of concurrent operations.Image Removed

...

Traceability

...

There is an API query to provide the task state:
GET virtualmachine/1/tasks/

The virtual factory has a common step log, which means the same steps are logged by each hypervisor or provider. For example, all hypervisors log: start template copy and finish template copy. This information can be found in the virtual factory log and as INFO level logging in Catalina catalina.out. In the future, this information will also be accessible to the API.

...

The diagram below shows the interaction between the elements of the virtual factory in dealing with requests and responses.

  • Queue Manager: handles incoming queue messages

  • Task Orchestrator: decides when to send a job

  • Virtualization Worker: executes the VM operation

...

Queue manager

  • Receives task message from the API or Server using the RabbitMQ connection.

    • checks the message and sends the task to the task orchestrator

  • Receives task response (from task orchestrator) then acknowledges the AMQP message

    • the message is deleted from the queue

Task orchestrator

  • Receives task messages

    • saves the current task message

    • saves a map of task IDs to jobs IDs

    • if it's a sequential task, send the first job to a VW

    • it it's a concurrent task, send all the jobs to many VWs

    • if the job being sent is a task, send to itself

  • Receives job results

    • notify end of job to the outgoing AMQP queue

    • check if the task is completed, then check if it should notify itself or the Queue Manager

    • if not completed and a sequential task, send the next job

  • Receives task results

    • only intermediate task results, because task is also a job.

    • perform the same logic as a Job Result

Virtualization worker

  • Receives job messages

    • create a new instance of Hypervisor Connection and perform the VM operation

    • send back a Job Result when it's done