Add support for cohere/titan embedding models #361

QuinnGT · 2024-02-05T18:00:09Z

Issue #, if available:

Issues open that this resolves, #218

Description of changes:

Brought over changes from spugachev 3 month old branch on cohere. Added changes for new schema. Deployed and tested in fresh environment last night but I see massi-ang opened #359 this morning. However cohere embedding needs additional changes for how it handles inputs. Documented here.

search_document type and search_query type are used. Classification and clustering are not needed at this time.

embedding_types was not defined as float is appropriate for this implementation.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

bigadsoleiman

Thanks for this submission!

Just added a couple of feedbacks

lib/shared/layers/python-sdk/python/genai_core/embeddings.py

bin/config.ts

massi-ang

Thanks for the PR.

lib/chatbot-api/functions/api-handler/routes/embeddings.py

lib/shared/layers/python-sdk/python/genai_core/opensearch/query.py

lib/shared/layers/python-sdk/python/genai_core/aurora/query.py

lib/shared/layers/python-sdk/python/genai_core/chunks.py

lib/shared/layers/python-sdk/python/genai_core/workspaces.py

QuinnGT · 2024-02-13T08:37:18Z

Quick update. I pushed updates for everything except for titan multimodal. Currently deploying those changes and testing before taking on titan multimodal.

In regards to Titan Multimodal Embeddings, it will need a bit more than just calling out the additional model. It will need support for it's API payload like max input text tokens and max input image size. However we are not capturing images in our use of Unstructured or rather lack of use of unstructured, like the partition_image function. Also I see unstructured is poorly used as we pull the full container yet only use the loader.

As soon as the cohere changes finish and tests pass, I'll circle back on titan multimodal and unstructured. Feel free to post thoughts or feedback.

massi-ang · 2024-02-13T09:50:15Z

I uderstand your concerns, but I think they are out of scope to just adding this embedding model for text embeddings. maxInputTokens is optional and we currently only need to support text embeddings. This was already tested in #359 and working fine.

We should then work to better leverage the multimodal embeddings, but that is an unrelated feature to this one.

QuinnGT · 2024-02-13T22:48:31Z

Alright, all requested changes have been made. I also fixed the embedding prompt to show when enableRag was true vs just being skipped no matter what the use selected, in the magic-config.

New environment has been deployed. Bedrock and Sagemaker embedding models tested as functional.

QuinnGT · 2024-02-14T01:30:58Z

Found issue with creating workspace. Not sure if it's from me or changes merged into this PR. Need to test and see.

cli/magic-config.ts

QuinnGT · 2024-02-14T06:05:25Z

Resolved the latest issues. Feel free to test and let me know if anything else is needed.

lib/shared/layers/python-sdk/python/genai_core/embeddings.py

QuinnGT · 2024-02-15T20:44:06Z

@bigadsoleiman I noticed an error in deployment that's happening for log groups due to size of name. I tested it here and it works. Do you want me to bring that into this PR or wait till this is approved?

massi-ang · 2024-02-22T08:54:56Z

@bigadsoleiman I noticed an error in deployment that's happening for log groups due to size of name. I tested it here and it works. Do you want me to bring that into this PR or wait till this is approved?

Hi Quinn, I checked your change and my concern is that if you deploy two stacks, they will share the same log group, since the value of id will be the same in for both.

QuinnGT · 2024-02-22T23:46:19Z

@bigadsoleiman I noticed an error in deployment that's happening for log groups due to size of name. I tested it here and it works. Do you want me to bring that into this PR or wait till this is approved?

Hi Quinn, I checked your change and my concern is that if you deploy two stacks, they will share the same log group, since the value of id will be the same in for both.

Great point @massi-ang . How about I use the config.prefix with the id so it looks something like logGroupName: /aws/vendedlogs/states/${config.prefix}-${id}.

Before: /aws/vendedlogs/states/CreateAuroraWorkspace-CreateAuroraWorkspaceSMLogGroup
After: /aws/vendedlogs/states/dev-CreateAuroraWorkspaceSMLogGroup

massi-ang · 2024-02-23T08:33:50Z

I would suggest using /aws/vendedlogs/states/${cdk.Stack.of(this).name}-${id}, but there could still be issues in case we apply the same solution to other constructs which have the same id (since id in only unique in the scope of the parent construct).

QuinnGT · 2024-02-26T05:15:53Z

@bigadsoleiman is there anything else pending that I missed? If not, can you approve this PR? Thank you!

Cerrix · 2024-02-27T17:33:20Z

Hi @QuinnGT,
I tested the pull request because I was looking for the Cohere Multilingual implementation. When I selected that model for embeddings, I received the following error response from AppSync:
{ "data": null, "errors": [ { "path": [ "calculateEmbeddings" ], "data": null, "errorType": "TypeError", "errorInfo": null, "locations": [ { "line": 2, "column": 3, "sourceName": null } ], "message": "Object of type Task is not JSON serializable" } ] }
both creating the workspace or testing the embedding.
With Titan everything is fine.

Thank you!

QuinnGT · 2024-02-28T03:26:34Z

Hi @QuinnGT, I tested the pull request because I was looking for the Cohere Multilingual implementation. When I selected that model for embeddings, I received the following error response from AppSync: { "data": null, "errors": [ { "path": [ "calculateEmbeddings" ], "data": null, "errorType": "TypeError", "errorInfo": null, "locations": [ { "line": 2, "column": 3, "sourceName": null } ], "message": "Object of type Task is not JSON serializable" } ] } both creating the workspace or testing the embedding. With Titan everything is fine.

Thank you!

@Cerrix thank you for reporting this. Can you share what RAG you were testing with?

Cerrix · 2024-02-28T07:11:32Z

Yep: I’m testing in with Open Search (and just that). I get the same error even if testing cohere through the “embeddings” page (not only creating a new workspace).

thank you

QuinnGT · 2024-02-29T08:04:12Z

Yep: I’m testing in with Open Search (and just that). I get the same error even if testing cohere through the “embeddings” page (not only creating a new workspace).

thank you

Hey @Cerrix I enabled and deployed an opensearch instance and I'm not seeing any errors with workspace creation or testing embeddings. I'm not sure if it's a new or existing deployment you are testing in but try doing a fresh deployment from the latest, along with a new npm install and build. Feel free to report back with more information to troubleshoot, if needed.

QuinnGT · 2024-02-29T08:25:17Z

Comparison of embedding models from demo data.

Cerrix · 2024-02-29T09:17:07Z

It is a week old deploy. But let me try again from scratch! I'll keep you posted!

Cerrix · 2024-03-03T23:06:49Z

It is so strange. I re-deployed from scratch the repository branching your PR in Oregon (us-west-2).

Here is my config file:

{
  "prefix": "llmchatbot-demo",
  "privateWebsite": false,
  "certificate": "",
  "domain": "",
  "bedrock": {
    "enabled": true,
    "region": "us-west-2"
  },
  "llms": {
    "sagemaker": []
  },
  "rag": {
    "enabled": true,
    "engines": {
      "aurora": {
        "enabled": false
      },
      "opensearch": {
        "enabled": true
      },
      "kendra": {
        "enabled": false,
        "createIndex": false,
        "external": [],
        "enterprise": false
      }
    },
    "embeddingsModels": [
      {
        "provider": "sagemaker",
        "name": "intfloat/multilingual-e5-large",
        "dimensions": 1024
      },
      {
        "provider": "sagemaker",
        "name": "sentence-transformers/all-MiniLM-L6-v2",
        "dimensions": 384
      },
      {
        "provider": "bedrock",
        "name": "amazon.titan-embed-text-v1",
        "dimensions": 1536
      },
      {
        "provider": "bedrock",
        "name": "amazon.titan-embed-image-v1",
        "dimensions": 1024
      },
      {
        "provider": "bedrock",
        "name": "cohere.embed-english-v3",
        "dimensions": 1024
      },
      {
        "provider": "bedrock",
        "name": "cohere.embed-multilingual-v3",
        "dimensions": 1024,
        "default": true
      },
      {
        "provider": "openai",
        "name": "text-embedding-ada-002",
        "dimensions": 1536
      }
    ],
    "crossEncoderModels": [
      {
        "provider": "sagemaker",
        "name": "cross-encoder/ms-marco-MiniLM-L-12-v2",
        "default": true
      }
    ]
  }
}

Still I got the same error:

QuinnGT · 2024-03-05T22:23:18Z

@Cerrix Interesting, I'm not sure. It looks like it's an error from the appsync function. Almost like the old version of app sync functions are trying to be used with the newer genai_core python functions. Did you purge your cdk.out when doing a fresh deployment? Also double check cdk version is updated on your local as there seems to be some rebuild bug introduced on each release.

@massi-ang I see Bigad approved the PR but we're still waiting on you or another repo owner for approval. What else do you need to from me?

Cerrix · 2024-03-05T22:54:24Z

I'm deploying following the Cloud9 instructions of the sample documentation: https://aws-samples.github.io/aws-genai-llm-chatbot/guide/deploy.html

If we need to update some dependencies I think it must be documented before merging the PR in main.

I can try with a vanilla cloud9 instance but we need to double check the issue. I don't know if someone from the team would like to try the deploy!

massi-ang

Please fix the Provider type with the right semantics

cli/magic-config.ts

bin/config.ts

lib/shared/layers/python-sdk/python/genai_core/types.py

Add support for cohere embedding models

1fcb5d1

bigadsoleiman requested changes Feb 8, 2024

View reviewed changes

lib/shared/layers/python-sdk/python/genai_core/embeddings.py Outdated Show resolved Hide resolved

lib/shared/layers/python-sdk/python/genai_core/embeddings.py Outdated Show resolved Hide resolved

bigadsoleiman mentioned this pull request Feb 8, 2024

feat(embeddings): adding Cohere and Titan Multimodal #359

Closed

bigadsoleiman changed the title ~~Add support for cohere embedding models~~ Add support for cohere/titan embedding models Feb 8, 2024

bigadsoleiman requested changes Feb 8, 2024

View reviewed changes

bin/config.ts Show resolved Hide resolved

Merge remote-tracking branch 'upstream/main'

50127ff

massi-ang requested changes Feb 9, 2024

View reviewed changes

lib/chatbot-api/functions/api-handler/routes/embeddings.py Outdated Show resolved Hide resolved

Move static values to Enum. Create new provider functions.

9a45e7a

bigadsoleiman requested changes Feb 12, 2024

View reviewed changes

Add Enum to query, chunks and workspaces

f7132eb

bigadsoleiman and others added 5 commits February 13, 2024 17:30

Merge branch 'main' into main

79af19f

Fix error with API calls to provider and model

3b00af2

Merge branch 'main' of https://github.com/QuinnGT/aws-genai-llm-chatbot

88f8e67

Add Titan Multimodal Embeddings-No image support

8ec74fa

Fix default embedding model prompt on create

8994b64

QuinnGT marked this pull request as draft February 14, 2024 01:29

Rob-Powell reviewed Feb 14, 2024

View reviewed changes

cli/magic-config.ts Outdated Show resolved Hide resolved

QuinnGT added 2 commits February 13, 2024 21:57

Removed mistake on hardcoded enum values

efedfd9

Add condition for embedding to only show on aurora or opensearch

7ed298b

QuinnGT marked this pull request as ready for review February 14, 2024 06:04

bigadsoleiman requested changes Feb 15, 2024

View reviewed changes

lib/shared/layers/python-sdk/python/genai_core/embeddings.py Outdated Show resolved Hide resolved

Update task with enum

26a7c70

QuinnGT requested a review from bigadsoleiman February 21, 2024 17:31

Merge branch 'aws-samples:main' into main

0dd6b65

bigadsoleiman approved these changes Feb 26, 2024

View reviewed changes

Merge branch 'aws-samples:main' into main

935666e

massi-ang requested changes Mar 6, 2024

View reviewed changes

cli/magic-config.ts Show resolved Hide resolved

bin/config.ts Show resolved Hide resolved

lib/shared/layers/python-sdk/python/genai_core/types.py Show resolved Hide resolved

QuinnGT and others added 4 commits March 7, 2024 10:13

Merge branch 'aws-samples:main' into main

74a8318

Merge branch 'aws-samples:main' into main

a6c929b

Merge branch 'main' into main

66482bb

Merge branch 'main' into main

b066b0d

bigadsoleiman approved these changes Mar 14, 2024

View reviewed changes

bigadsoleiman merged commit 8838856 into aws-samples:main Mar 14, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for cohere/titan embedding models #361

Add support for cohere/titan embedding models #361

QuinnGT commented Feb 5, 2024

bigadsoleiman left a comment

massi-ang left a comment

QuinnGT commented Feb 13, 2024 •

edited

Loading

massi-ang commented Feb 13, 2024 •

edited

Loading

QuinnGT commented Feb 13, 2024 •

edited

Loading

QuinnGT commented Feb 14, 2024

QuinnGT commented Feb 14, 2024

QuinnGT commented Feb 15, 2024

massi-ang commented Feb 22, 2024

QuinnGT commented Feb 22, 2024

massi-ang commented Feb 23, 2024

QuinnGT commented Feb 26, 2024

Cerrix commented Feb 27, 2024

QuinnGT commented Feb 28, 2024

Cerrix commented Feb 28, 2024

QuinnGT commented Feb 29, 2024

QuinnGT commented Feb 29, 2024

Cerrix commented Feb 29, 2024 •

edited

Loading

Cerrix commented Mar 3, 2024

QuinnGT commented Mar 5, 2024

Cerrix commented Mar 5, 2024

massi-ang left a comment

Add support for cohere/titan embedding models #361

Add support for cohere/titan embedding models #361

Conversation

QuinnGT commented Feb 5, 2024

bigadsoleiman left a comment

Choose a reason for hiding this comment

massi-ang left a comment

Choose a reason for hiding this comment

QuinnGT commented Feb 13, 2024 • edited Loading

massi-ang commented Feb 13, 2024 • edited Loading

QuinnGT commented Feb 13, 2024 • edited Loading

QuinnGT commented Feb 14, 2024

QuinnGT commented Feb 14, 2024

QuinnGT commented Feb 15, 2024

massi-ang commented Feb 22, 2024

QuinnGT commented Feb 22, 2024

massi-ang commented Feb 23, 2024

QuinnGT commented Feb 26, 2024

Cerrix commented Feb 27, 2024

QuinnGT commented Feb 28, 2024

Cerrix commented Feb 28, 2024

QuinnGT commented Feb 29, 2024

QuinnGT commented Feb 29, 2024

Cerrix commented Feb 29, 2024 • edited Loading

Cerrix commented Mar 3, 2024

QuinnGT commented Mar 5, 2024

Cerrix commented Mar 5, 2024

massi-ang left a comment

Choose a reason for hiding this comment

QuinnGT commented Feb 13, 2024 •

edited

Loading

massi-ang commented Feb 13, 2024 •

edited

Loading

QuinnGT commented Feb 13, 2024 •

edited

Loading

Cerrix commented Feb 29, 2024 •

edited

Loading