Skip to content

Commit

Permalink
TogetherSolver (#1502)
Browse files Browse the repository at this point in the history
This PR contributes a `TogetherSolver` class, a solver for using models
served by the [Together AI
API](https://docs.together.ai/docs/quickstart)

Because [Together supports the OpenAI python
sdk](https://docs.together.ai/docs/openai-api-compatibility), we simply
create a subclass of the `OpenAISolver`, overriding some functionality.
There is therefore some refactoring of the `OpenAISolver` included in
this PR to facilitate this code sharing.

At the moment, we support the models specified in
`evals/registry/solvers/together.yaml`, but in principle most models
offered from the Together AI API can easily be added

Notes:

- logit biasing not supported by the Together API due to a lack of a
unified tokenizer a la [tiktoken](https://github.com/openai/tiktoken)
from openai
- For the same reason, checking for context length limits not supported

Co-authored-by: Chan Jun Shern <[email protected]>
Co-authored-by: Ian McKenzie <[email protected]>
  • Loading branch information
3 people authored Mar 22, 2024
1 parent e30e141 commit 5805c20
Show file tree
Hide file tree
Showing 4 changed files with 554 additions and 121 deletions.
92 changes: 92 additions & 0 deletions evals/registry/solvers/together.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,92 @@
# --- Direct Solvers ---
generation/direct/llama-2-13b-chat:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-13b-chat-hf
extra_options:
temperature: 1
max_tokens: 512
postprocessors: &postprocessors
- evals.solvers.postprocessors.postprocessors:Strip

generation/direct/llama-2-70b-chat:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-70b-chat-hf
extra_options:
temperature: 1
max_tokens: 512
postprocessors: *postprocessors

generation/direct/mixtral-8x7b-instruct:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: mistralai/Mixtral-8x7B-Instruct-v0.1
extra_options:
temperature: 1
max_tokens: 512
postprocessors: *postprocessors
# --- COT Solvers ---

generation/cot/llama-2-13b-chat:
class: evals.solvers.nested.cot_solver:CoTSolver
args:
cot_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-13b-chat-hf
extra_options:
temperature: 1
max_tokens: 512
extract_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-13b-chat-hf
extra_options:
temperature: 1
max_tokens: 512

generation/cot/llama-2-70b-chat:
class: evals.solvers.nested.cot_solver:CoTSolver
args:
cot_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-70b-chat-hf
extra_options:
temperature: 1
max_tokens: 512
extract_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: meta-llama/Llama-2-70b-chat-hf
extra_options:
temperature: 1
max_tokens: 512

generation/cot/mixtral-8x7b-instruct:
class: evals.solvers.nested.cot_solver:CoTSolver
args:
cot_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: mistralai/Mixtral-8x7B-Instruct-v0.1
extra_options:
temperature: 1
max_tokens: 512
extract_solver:
class: evals.solvers.together_solver:TogetherSolver
args:
completion_fn_options:
model: mistralai/Mixtral-8x7B-Instruct-v0.1
extra_options:
temperature: 1
max_tokens: 512
Loading

0 comments on commit 5805c20

Please sign in to comment.