The coolest rebalancing policy offered here uses 

In this framework, the actor aims to maximize expected reward while also maximizing entropy. That is, to succeed at the task while acting as randomly as possible.

bitcoin

Payment channel rebalancing: A SimPy-based Discrete Event Simulator

Rsync25

The coolest rebalancing policy offered here uses [deep reinforcement learning](https://en.wikipedia.org/wiki/Deep_reinforcement_learning) using a [Soft Actor-Critic](https://browse.arxiv.org/pdf/1801.01290.pdf) implementation:

> In this framework, the actor aims to maximize expected reward while also maximizing entropy. That is, to succeed at the task while acting as randomly as possible.