Releases: OrleansContrib/Orleans.SyncWork
v1.0.4
What's Changed
- Update README.md by @Kritner in #1
- Create LICENSE by @Kritner in #2
- Adds additional XML documentation to package classes/methods. by @Kritner in #3
- Create ci.yml by @Kritner in #7
- Trying out CICD w/ https://github.com/brandedoutcast/publish-nuget by @Kritner in #8
New Contributors
Full Changelog: v0.0.1...v1.0.4
v0.0.1
Yeah, this is just mostly a copy of the README.md at the time of the first release. Don't judge me.
Orleans.SyncWork
The meat and potatoes of the project. This project contains the abstraction of "Long Running CPU bound Synchronous work" in the form of an abstract base class SyncWorker; which implements an interface ISyncWorker.
When long running work is identified, you can extend the base class SyncWorker
, providing a TRequest
and TResponse
unique to the long running work. This allows you to create as many ISyncWork<TRequest, TResponse>
implementations as necessary, for all your long running CPU bound needs! (At least that is the hope.)
Basic "flow" of the SyncWork:
Start
- Poll
GetStatus
until aCompleted
orFaulted
status is received GetResult
orGetException
depending on theGetStatus
This package introduces a few "requirements" against Orleans:
-
In order to not overload Orleans, a LimitedConcurrencyLevelTaskScheduler is introduced. This task scheduler is registered (either manually or through the provided extension method) with a maximum level of concurrency for the silo being set up. This maximum concurrency MUST allow for idle threads, lest the Orleans server be overloaded. In testing, the general rule of thumb was
Environment.ProcessorCount - 2
max concurrency. The important part is that the CPU is not fully "tapped out" such that the normal Orleans asynchronous messaging can't make it through due to the blocking sync work - this will make things start timing out. -
Blocking grains are stateful, and are currently keyed on a Guid. If in a situation where multiple grains of long running work is needed, each grain should be initialized with its own unique identity.
-
Blocking grains likely CAN NOT dispatch further blocking grains. This is not yet tested under the repository, but it stands to reason that with a limited concurrency scheduler, the following scenario would lead to a deadlock:
- Grain A is long running
- Grain B is long running
- Grain A initializes and fires off Grain B
- Grain A cannot complete its work until it gets the results of Grain B
In the following scenario, if "Grain A" is "actively being worked" and it fires off a "Grain B", but "Grain A" cannot complete its work until "Grain B" finishes is, but "Grain B" cannot start its work until "Grain A" finishes its work, you've run into a situation where the limited concurrency task scheduler can never finish the work of "Grain A".
There may be a way to avoid the above scenario, but I have not yet deeply explored it.
Usage
Extend the base class to implement a long running grain (example: PasswordVerifier).
public class PasswordVerifier : SyncWorker<PasswordVerifierRequest, PasswordVerifierResponse>, IGrain
{
private readonly IPasswordVerifier _passwordVerifier;
public PasswordVerifier(
ILogger<PasswordVerifier> logger,
LimitedConcurrencyLevelTaskScheduler limitedConcurrencyLevelTaskScheduler,
IPasswordVerifier passwordVerifier) : base(logger, limitedConcurrencyLevelTaskScheduler)
{
_passwordVerifier = passwordVerifier;
}
protected override async Task<PasswordVerifierResponse> PerformWork(PasswordVerifierRequest request)
{
var verifyResult = await _passwordVerifier.VerifyPassword(request.PasswordHash, request.Password);
return new PasswordVerifierResponse()
{
IsValid = verifyResult
};
}
}
public class PasswordVerifierRequest
{
public string Password { get; set; }
public string PasswordHash { get; set; }
}
public class PasswordVerifierResponse
{
public bool IsValid { get; set; }
}
Run the grain:
var request = new PasswordVerifierRequest()
{
Password = "my super neat password that's totally secure because it's super long",
PasswordHash = "$2a$11$vBzJ4Ewx28C127AG5x3kT.QCCS8ai0l4JLX3VOX3MzHRkF4/A5twy"
}
var passwordVerifyGrain = grainFactory.GetGrain<ISyncWorker<PasswordVerifierRequest, PasswordVerifierResponse>>(Guid.NewGuid());
var result = await passwordVerifyGrain.StartWorkAndPollUntilResult(request);
The above StartWorkAndPollUntilResult
is an extension method defined in the package (SyncWorkerExtensions) that Start
s, Poll
s, and finally GetResult
or GetException
upon completed work. There would seemingly be place for improvement here as it relates to testing unexpected scenarios, configuration based polling, etc.
Orleans.SyncWork.Tests
Unit testing project for the work in Orleans.SyncWork. These tests bring up a "TestCluster" which is used for the full duration of the tests against the grains.
One of the tests in particular throws 10k grains onto the cluster at once, all of which are long running (~200ms each) on my machine - more than enough time to overload the cluster if the limited concurrency task scheduler is not working along side the SyncWork base class correctly.
TODO: still could use a few more unit tests here to if nothing else, document behavior.
Orleans.SyncWork.Demo.Api
This is a demo of the ISyncWork<TRequest, TResult>
in action. This project is being used as both a Orleans Silo, and client. In a more real world scenario, the grains and silo would be defined and stood up separately from the consumption of the grains, at least in the situations where I've used this design.
The OrleansDashboard is also brought up with the API.
Swagger UI is also made available to the API for testing out the endpoints for demo purposes.
Orleans.SyncWork.Demo.Api.Benchmark
Utilizing Benchmark DotNet, a benchmarking class was created to both test that the cluster wasn't falling over, and see what sort of timing situation we're dealing with.
Following is the benchmark used at the time of writing:
public class Benchy
{
const int TotalNumberPerBenchmark = 100;
private readonly IPasswordVerifier _passwordVerifier = new Services.PasswordVerifier();
private readonly PasswordVerifierRequest _request = new PasswordVerifierRequest()
{
Password = IPasswordVerifier.Password,
PasswordHash = IPasswordVerifier.PasswordHash
};
[Benchmark]
public void Serial()
{
for (var i = 0; i < TotalNumberPerBenchmark; i++)
{
_passwordVerifier.VerifyPassword(IPasswordVerifier.PasswordHash, IPasswordVerifier.Password);
}
}
[Benchmark]
public async Task MultipleTasks()
{
var tasks = new List<Task>();
for (var i = 0; i < TotalNumberPerBenchmark; i++)
{
tasks.Add(_passwordVerifier.VerifyPassword(IPasswordVerifier.PasswordHash, IPasswordVerifier.Password));
}
await Task.WhenAll(tasks);
}
[Benchmark]
public async Task MultipleParallelTasks()
{
var tasks = new List<Task>();
Parallel.For(0, TotalNumberPerBenchmark, i =>
{
tasks.Add(_passwordVerifier.VerifyPassword(IPasswordVerifier.PasswordHash, IPasswordVerifier.Password));
});
await Task.WhenAll(tasks);
}
[Benchmark]
public async Task OrleansTasks()
{
var siloHost = await BenchmarkingSIloHost.GetSiloHost();
var grainFactory = siloHost.Services.GetRequiredService<IGrainFactory>();
var grain = grainFactory.GetGrain<ISyncWorker<PasswordVerifierRequest, PasswordVerifierResponse>>(Guid.NewGuid());
var tasks = new List<Task>();
for (var i = 0; i < TotalNumberPerBenchmark; i++)
{
tasks.Add(grain.StartWorkAndPollUntilResult(_request));
}
await Task.WhenAll(tasks);
}
}
And here are the results:
Method | Mean | Error | StdDev |
---|---|---|---|
Serial | 12.284 s | 0.0145 s | 0.0135 s |
MultipleTasks | 12.274 s | 0.0073 s | 0.0065 s |
MultipleParallelTasks | 1.723 s | 0.0185 s | 0.0144 s |
OrleansTasks | 1.118 s | 0.0080 s | 0.0074 s |
And of course note, that in the above the Orleans tasks are limited to my local cluster. In a more real situation where you have multiple nodes to the cluster, you could theoretically expect to get better timing, though you'd probably have to deal more with network latency.