Combinatorial Multi-Armed Bandit

A method for selecting a subset of best items w.r.t. a black-box reward function: Given 𝑁 items, the algorithm searches for an optimal 𝑠𝑢𝑏𝑠𝑒𝑡 of a maximum number of 𝑘 items, which maximizes a blackbox reward function 𝑓(𝑠𝑢𝑏𝑠𝑒𝑡).

The algorithm generates new subset candidates based on the novelty of the items while also considering what is the ratio of presence of an item that exist among the top previously tried subsets (by sorting them wrt rewards achieved).

The algorithm is not yet perfect but just takes few thousand trials to find 4 out of 5 best items from a set of 100 items where one would need to make up to few millions of trials with random search to achieve the same local maximum.

Example Use-case: Select best combination of k stocks out of all US stocks for training an asset allocation model.

Name	Name	Last commit message	Last commit date
Latest commit kayuksel Update README.md Jun 7, 2019 6222821 · Jun 7, 2019 History 38 Commits
LICENSE	LICENSE	Initial commit	Feb 12, 2019
README.md	README.md	Update README.md	Jun 7, 2019
comb_bandit.py	comb_bandit.py	Update comb_bandit.py	Feb 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Combinatorial Multi-Armed Bandit

About

Releases

Packages

Languages

License

yigaza/combinatorial-bandit

Folders and files

Latest commit

History

Repository files navigation

Combinatorial Multi-Armed Bandit

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages