You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The main novelty of the paper is defining a fitness score for the outputs from a transformer instructed to insert API calls. The score is used to filter the sampled outputs for finetuning the transformer to make API calls that decreases perplexity of the text that follows it.
-[x] create custom generate function for palm that can do external API calls
100
159
-[x] allow for generating tokens at different cursor indices
160
+
-[x] api token (which was left and right brackets in paper) needs to be customizable
101
161
-[ ] allow for customizing how to fine handling errors in function name, parameters, or execution and output
102
-
-[ ] api token (which was left and right brackets in paper) needs to be customizable
103
162
-[ ] Toolformer should eventually calculate all statistics (how many properly sampled, filtered out by different criterias, the distribution of scores as well as how many were rejected) before the final fine-tuning
104
163
-[ ] do end-to-end training in `Toolformer`
164
+
-[x] doing the prompting and bootstrapping the data
165
+
-[ ] prefiltering of bootstrapped data followed by api calls and then another round of filtering
166
+
-[ ] keep track of all stats
167
+
-[ ] take care of fine-tuning, with the interleaving of datasets + optimizer hyperparams
0 commit comments