Abstracting RabbitMQ RPC with TaskCompletionSource

October 14, 2017 Gigi 13 Comments

I recently wrote about TaskCompletionSource, a little-known tool in .NET that is great for transforming arbitrary asynchrony into the Task-Based Asynchronous Pattern. That means you can hide the whole thing behind a simple and elegant async/await.

In this article, we’ll see this in practice as we implement the Remote Procedure Call (RPC) pattern in RabbitMQ. This is a fancy way of saying request/response, except that it all happens asynchronously! That’s right. No blocking.

The source code for this article is in the RabbitMqRpc folder at the Gigi Labs BitBucket Repository.

The RabbitMQ.Client NuGet package is necessary to make this code work. The client is written using an asynchronous Main() method, which requires at least C# 7.1 to compile.

RabbitMQ RPC Overview

You can think of RPC as request/response communication. We have a client asking a server to process some input and return the output in its response. However, this all happens asynchronously. The client sends the request on a request queue and forgets about it, rather than waiting for the response. Eventually, the server will (hopefully) process the request and send a response message back on a response queue.

The request and response can be matched on the client side by attaching a CorellationId to both the request and the response.

In this context, we don’t really talk about publishers and consumers, as is typical when talking about messaging frameworks. That’s because in order to make this work, both the client and the server must have both a publisher and a consumer.

Client: Main Program

For our client application, we’ll have the following main program code. We will implement an RpcClient that will hide the request/response plumbing behind a simple Task that we then await:

        static async Task Main(string[] args)
        {
            Console.Title = "RabbitMQ RPC Client";

            using (var rpcClient = new RpcClient())
            {
                Console.WriteLine("Press ENTER or Ctrl+C to exit.");

                while (true)
                {
                    string message = null;

                    Console.Write("Enter a message to send: ");
                    using (var colour = new ScopedConsoleColour(ConsoleColor.Blue))
                        message = Console.ReadLine();

                    if (string.IsNullOrWhiteSpace(message))
                        break;
                    else
                    {
                        var response = await rpcClient.SendAsync(message);

                        Console.Write("Response was: ");
                        using (var colour = new ScopedConsoleColour(ConsoleColor.Green))
                            Console.WriteLine(response);
                    }
                }
            }
        }

The program continuously asks for input, and sends that input as the request message. The server will process this message and return a response. Note that we are using the ScopedConsoleColour class from my “Scope Bound Resource Management in C#” article to colour certain sections of the output. Here is a taste of what it will look like:

While this console application only allows us to send one request at a time, the underlying approach is really powerful with APIs that can concurrently serve a multitude of clients. It is asynchronous and can scale pretty well, yet the consuming code sees none of the underlying complexity.

Client: Request Sending

The heart of this abstraction is the RpcClient class. In the constructor, we set up the typical plumbing: create a connection, channel, queues, and a consumer.

    public class RpcClient : IDisposable
    {
        private bool disposed = false;
        private IConnection connection;
        private IModel channel;
        private EventingBasicConsumer consumer;
        private ConcurrentDictionary<string,
            TaskCompletionSource<string>> pendingMessages;

        private const string requestQueueName = "requestqueue";
        private const string responseQueueName = "responsequeue";
        private const string exchangeName = ""; // default exchange

        public RpcClient()
        {
            var factory = new ConnectionFactory() { HostName = "localhost" };

            this.connection = factory.CreateConnection();
            this.channel = connection.CreateModel();

            this.channel.QueueDeclare(requestQueueName, true, false, false, null);
            this.channel.QueueDeclare(responseQueueName, true, false, false, null);

            this.consumer = new EventingBasicConsumer(this.channel);
            this.consumer.Received += Consumer_Received;
            this.channel.BasicConsume(responseQueueName, true, consumer);

            this.pendingMessages = new ConcurrentDictionary<string,
                TaskCompletionSource<string>>();
        }

        // ...
    }

A few other things to notice here:

We are keeping a dictionary that allow us to match responses with the requests that generated them, based on a CorrelationId. We have already seen this approach in “TaskCompletionSource by Example“.
This class implements IDisposable, as it has several resources that need to be cleaned up. While I don’t show the code for this for brevity’s sake, you can find it in the source code.
We are not using exchanges here, so using an empty string for the exchange name allows us to use the default exchange and publish directly to the queue.

The SendAsync() method, which we saw being used in the main program, is implemented as follows:

        public Task<string> SendAsync(string message)
        {
            var tcs = new TaskCompletionSource<string>();
            var correlationId = Guid.NewGuid().ToString();

            this.pendingMessages[correlationId] = tcs;

            this.Publish(message, correlationId);

            return tcs.Task;
        }

Here, we are generating GUID to use as a CorrelationId, and we are adding an entry in the dictionary for this request. This dictionary maps the CorrelationId to a corresponding TaskCompletionSource. When the response arrives, it will set the result on this TaskCompletionSource, which enables the underlying task to complete. We return this underlying task, and that’s what the main program awaits. The main program will not be able to continue until the response is received.

In this method, we are also calling a private Publish() method, which takes care of the details of publishing to the request queue on RabbitMQ:

        private void Publish(string message, string correlationId)
        {
            var props = this.channel.CreateBasicProperties();
            props.CorrelationId = correlationId;
            props.ReplyTo = responseQueueName;

            byte[] messageBytes = Encoding.UTF8.GetBytes(message);
            this.channel.BasicPublish(exchangeName, requestQueueName, props, messageBytes);

            using (var colour = new ScopedConsoleColour(ConsoleColor.Yellow))
                Console.WriteLine($"Sent: {message} with CorrelationId {correlationId}");
        }

While this publishing code is for the most part pretty standard, we are using two particular properties that are especially suited for the RPC pattern. The first is CorrelationId, where we store the CorrelationId we generated earlier, and which the server will copy and send back as part of the response, enabling this whole orchestration. The second is the ReplyTo property, which is used to indicate to the server on which queue it should send the response. We don’t need it for this simple example since we are always using the same response queue, but this property enables the server to dynamically route responses where they are needed.

Server

The request eventually reaches a server which has a consumer waiting on the request queue. Its Main() method is mostly plumbing that enables this consumer to work:

        private static IModel channel;

        static void Main(string[] args)
        {
            Console.Title = "RabbitMQ RPC Server";

            var factory = new ConnectionFactory() { HostName = "localhost" };

            using (var connection = factory.CreateConnection())
            {
                using (channel = connection.CreateModel())
                {
                    const string requestQueueName = "requestqueue";
                    channel.QueueDeclare(requestQueueName, true, false, false, null);

                    // consumer

                    var consumer = new EventingBasicConsumer(channel);
                    consumer.Received += Consumer_Received;
                    channel.BasicConsume(requestQueueName, true, consumer);

                    Console.WriteLine("Waiting for messages...");
                    Console.WriteLine("Press ENTER to exit.");
                    Console.WriteLine();
                    Console.ReadLine();
                }
            }
        }

When a message is received, the Consumer_Received event handler processes the message:

        private static void Consumer_Received(object sender, BasicDeliverEventArgs e)
        {
            var requestMessage = Encoding.UTF8.GetString(e.Body);
            var correlationId = e.BasicProperties.CorrelationId;
            string responseQueueName = e.BasicProperties.ReplyTo;

            Console.WriteLine($"Received: {requestMessage} with CorrelationId {correlationId}");

            var responseMessage = Reverse(requestMessage);
            Publish(responseMessage, correlationId, responseQueueName);
        }

In this example, the server’s job is to reverse whatever messages it receives. Thus, each response will contain the same message as in the corresponding request, but backwards. This reversal code is taken from this Stack Overflow answer. Although trivial to implement, this serves as a reminder that there’s no need to reinvent the wheel if somebody already implemented the same thing (and quite well, at that) before you.

        public static string Reverse(string s)
        {
            char[] charArray = s.ToCharArray();
            Array.Reverse(charArray);
            return new string(charArray);
        }

Having computed the reverse of the request message, and extracted both the CorrelationId and ReplyTo properties, these are all passed to the Publish() method which sends back the response:

        private static void Publish(string responseMessage, string correlationId,
            string responseQueueName)
        {
            byte[] responseMessageBytes = Encoding.UTF8.GetBytes(responseMessage);

            const string exchangeName = ""; // default exchange
            var responseProps = channel.CreateBasicProperties();
            responseProps.CorrelationId = correlationId;

            channel.BasicPublish(exchangeName, responseQueueName, responseProps, responseMessageBytes);

            Console.WriteLine($"Sent: {responseMessage} with CorrelationId {correlationId}");
            Console.WriteLine();
        }

The response is sent back on the queue specified in the ReplyTo property of the request message. The response is also given the same CorrelationId as the request; that way the client will know that this response is for that particular request.

Client: Response Handling

When the response arrives, the client’s own consumer event handler will run to process it:

        private void Consumer_Received(object sender, BasicDeliverEventArgs e)
        {
            var correlationId = e.BasicProperties.CorrelationId;
            var message = Encoding.UTF8.GetString(e.Body);

            using (var colour = new ScopedConsoleColour(ConsoleColor.Yellow))
                Console.WriteLine($"Received: {message} with CorrelationId {correlationId}");

            this.pendingMessages.TryRemove(correlationId, out var tcs);
            if (tcs != null)
                tcs.SetResult(message);
        }

The client extracts the CorrelationId from the response, and uses it to get the TaskCompletionSource for the corresponding request. If the TaskCompletionSource is found, then its result is set to the content of the response. This causes the underlying task to complete, and thus the caller awaiting that task will be able to resume and work with the result.

If the TaskCompletionSource is not found, then we ignore the response, and there is a reason for this:

“You may ask, why should we ignore unknown messages in the callback queue, rather than failing with an error? It’s due to a possibility of a race condition on the server side. Although unlikely, it is possible that the RPC server will die just after sending us the answer, but before sending an acknowledgment message for the request. If that happens, the restarted RPC server will process the request again. That’s why on the client we must handle the duplicate responses gracefully, and the RPC should ideally be idempotent.” — RabbitMQ RPC tutorial

Demo

If we run both the client and server, we can enter messages in the client, one by one. The client publishes each message on the request queue and waits for the response, at which point it allows the main program to continue by setting the result of that request’s TaskCompletionSource.

Summary

What we have seen in this article is the same material I had explained in “TaskCompletionSource by Example“, but with a real application to RabbitMQ.

A TaskCompletionSource has an underlying Task that can represent a pending request. By giving each request an ID, you can keep track of it as the corresponding response should carry the same ID. A mapping between request IDs and TaskCompletionSource can easily be kept in a dictionary. When a response arrives, its corresponding entry in the dictionary can be found, and the Task can be completed. Any client code awaiting this Task may then resume.

13 thoughts on “Abstracting RabbitMQ RPC with TaskCompletionSource”

Joshua Ryder says:

October 18, 2017 at 8:33 am

Hi.
I don’t really get the use case?
Why would I await a message queue answer?
Do you have any good examples of why I might do this?

The strength of message queues is fire and forget and then having some event handler to handle the event when the message returns.

If you are waiting for an answer, why not use HTTP calls?

Reply
1. Gigi says:
  
  October 18, 2017 at 9:11 am
  
  Actually, the strength of message queues is… queueing. It allows you to send a certain amount of work and let it be buffered on the way to the worker. It also guarantees ordering if used in a certain way. Fire and forget is part of what makes it asynchronous, but it does not exclude patterns such as RPC (which is request/response yet at the same time asynchronous).
  
  HTTP is also request/response, but you are buffering requests within the public-facing API itself. That’s okay if you’re doing it asynchronously. For this kind of thing, I don’t see a very big difference in the fundamental interactions, but you might find message queues more useful if, for example, the work you send must arrive in FIFO order.
  
  Reply
2. Kyle Estes says:
  
  December 28, 2022 at 6:20 pm
  
  By using queues you loosen the coupling between your different services. An HTTP call is a fairly tight coupling, you are asking for a specific service to handle your request directly. By swapping this out for a queue, you are asking for “SOME” service to handle your request and give you a response based on a contract between the services. This would allow you to completely swap your implementation of services out over night and as long as you have something in your pipeline handling that message you wont have to change anything on the calling services.
  
  Reply
David Benavides says:

October 18, 2017 at 10:08 am

Nice approach. I see a lot of value in hiding the received event handling away. This desing converts the interaction in a “future” coding style.
Some of the use cases this would cove:
1. Creating a sequence of request/replies to multple messages queue servers. This would allow to implement a clean continue-with sequence of interactions where call order matters (i.e: you get an id on the first call that needs to be decoded on a second call)
2. This helps tracking state progress. Specially when a processing instance goes thru multiple requests over its lifecycle. Given that the originating instance is in scope of the processing sequence, accesing additional metadata is clean and simple

In general, this would prevent the classic design pitfall of handling the first response on the event handler, later send a new request that cause a notification triggered on another received event handler method, wich ends up on a poor spaguetti code.

Reply
Nicolas says:

December 12, 2017 at 1:36 pm

Hi.
I did something similar before seeing your post (https://github.com/grandchamp/CommandScaler), but i’m trying to make some scenarios scaling with Docker but i can’t see any performance improvement when spawning more servers to handle the messages.
I ended up in your post looking for TaskCompletionSource and seeing if i’m doing something wrong.
Did you try to do requests in Parallel (Parallel.For) and check performance?

Reply
1. Gigi says:
  
  December 13, 2017 at 1:35 am
  
  No, because it makes no sense to use Parallel.For for I/O-bound tasks. See Common Mistakes in Asynchronous Programming with .NET.
  
  Reply
  1. Nicolas says:
    
    December 13, 2017 at 3:34 pm
    
    Maybe i wasn’t clear enough.
    Imagine a scenario where you have an API that send commands to be processed by handlers.
    Your API send a RPC to handler.
    To simulate a multi-user environment, i did use Parallel.For to spawn requests in parallel.
    
    In theory, if you have more handlers to Rabbit send messages, you’ll have more processing power and if with one handler you process in 100 requests in 10s, with 2 handlers you’ll process 100 requests in 5s.
    
    The point is even if i spawn more handlers performance don’t improve.
    
    Reply
    1. Gigi says:
      
      December 14, 2017 at 12:27 am
      
      I see. It’s hard to tell what’s the issue without seeing what your test does. It is possible, for instance, that at high levels of concurrency you’re getting a lot of contention in the load generation (Parallel.* does incur overheads) but without anything to go by, this is just one of many possible wild guesses.
      
      Reply
Dipak katve says:

February 24, 2018 at 7:45 am

hey, i found this will helpful to me, i just want to know that can i use this with multi thread application such like a order processing with queue management,
thanks

Reply
1. Gigi says:
  
  February 24, 2018 at 12:37 pm
  
  I don’t see why you couldn’t.
  
  Reply
Dariush Tasdighi says:

December 30, 2020 at 7:45 pm

Hi dear

I was searching (exactly) this solution for a long time! And suddenly I found this article! It’s perfect and useful… Thanks a lot dear Gigi… I do not understand some other questions! Your solution is perfect and in a lot of scenario we need this solution… for example in Banking (Transactions) applications.
Regards
Dariush Tasdighi

Reply
Richard Davison says:

October 11, 2021 at 3:16 am

when multiple clients are running.. why do they receive the messages of other clients?

Reply
1. Gigi says:
  
  October 11, 2021 at 8:26 pm
  
  If you have multiple clients consuming from the same queue, they will typically take it in turns to process the messages, so there’s nothing to ensure that a particular message is sent to a particular client. You’d likely have to set up some kind of routing for that. Which I think isn’t a great idea, because ideally clients should be stateless and able to take over from each other if one of them crashes.
  
  Reply

Gigi Labs