README.md (+1 -1)
@@ -1,6 +1,6 @@
# Combinatorial Multi-Armed Bandit

- A method for selecting a subset of best items w.r.t. black-box reward function: Given 𝑁 items, the algorithm searches for an optimal 𝑠𝑢𝑏𝑠𝑒𝑡 of a maximum number of 𝑘 items, which maximizes a blackbox reward function 𝑓(𝑠𝑢𝑏𝑠𝑒𝑡).
+ A method for selecting a subset of best items w.r.t. a black-box reward function: Given 𝑁 items, the algorithm searches for an optimal 𝑠𝑢𝑏𝑠𝑒𝑡 of a maximum number of 𝑘 items, which maximizes a blackbox reward function 𝑓(𝑠𝑢𝑏𝑠𝑒𝑡).
The algorithm generates new subset candidates based on the novelty of the items, while also considering how often each item appears among the top previously tried subsets (ranked by the rewards they achieved).
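
The candidate-generation idea in the paragraph above can be illustrated with a minimal sketch. This is not the repository's implementation: the function names (`item_scores`, `propose_subset`, `search`), the parameters `top_m` and `novelty_weight`, the `1 / (1 + times_tried)` novelty measure, and the weighted-sum scoring are all assumptions made for illustration. The sketch also always proposes exactly 𝑘 items, whereas the README allows subsets of up to 𝑘 items.

```python
import random
from collections import Counter


def item_scores(n_items, history, top_m=5, novelty_weight=0.5):
    """Score items by novelty plus presence ratio among the top subsets.

    history is a list of (subset, reward) pairs tried so far.
    top_m and novelty_weight are hypothetical tuning knobs.
    """
    # How often each item has been tried, over all previous subsets.
    tried = Counter(i for subset, _ in history for i in subset)
    # The top_m previously tried subsets, sorted by reward (highest first).
    top = sorted(history, key=lambda h: h[1], reverse=True)[:top_m]
    in_top = Counter(i for subset, _ in top for i in subset)
    scores = []
    for i in range(n_items):
        novelty = 1.0 / (1 + tried[i])  # 1.0 for an item never tried
        presence = in_top[i] / len(top) if top else 0.0
        scores.append(novelty_weight * novelty + (1 - novelty_weight) * presence)
    return scores


def propose_subset(n_items, k, history, rng, **kwargs):
    """Sample k distinct items with probability proportional to their scores."""
    scores = item_scores(n_items, history, **kwargs)
    remaining = list(range(n_items))
    chosen = []
    for _ in range(min(k, n_items)):
        # A tiny epsilon keeps the total weight positive even if all scores are 0.
        weights = [scores[i] + 1e-9 for i in remaining]
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(pick)
        remaining.remove(pick)
    return frozenset(chosen)


def search(n_items, k, reward_fn, n_rounds=50, seed=0):
    """Repeatedly propose a subset, query the black-box reward, keep the best."""
    rng = random.Random(seed)
    history = []
    for _ in range(n_rounds):
        subset = propose_subset(n_items, k, history, rng)
        history.append((subset, reward_fn(subset)))
    return max(history, key=lambda h: h[1])
```

Calling `search(10, 3, reward_fn)` with any function that maps a `frozenset` of item indices to a number returns the best `(subset, reward)` pair found. Raising `novelty_weight` pushes the sketch toward exploring rarely tried items, while lowering it favors items that recur in high-reward subsets.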