
Added Llama-2-13b-Chat plugin and modified README.md #5

Merged: 12 commits into aws-samples:develop on Sep 21, 2023

Conversation

aaron-sim

Hi,

I have created a plugin that uses the Llama-2-13b-chat model (it requires the model to be deployed via SageMaker JumpStart).

I added a folder for the llama-2-13b-chat-llm plugin and updated README.md with the Llama-related information.

@rstrahan (Contributor) left a comment:

This is great. I left a few comments; if you can address them, we can get this merged quickly. Thanks so much!

README.md Outdated
@@ -10,6 +10,7 @@ The directions below explain how to build and deploy the plugins. For more infor
1. AI21 LLM: Uses AI21's Jurassic model API - requires an AI21 account with an API Key
2. Anthropic LLM: Uses Anthropic's Claude model API - requires an Anthropic account with an API Key
3. Amazon Bedrock Embeddings and LLM: Uses Amazon Bedrock service API (preview) - requires access to Amazon Bedrock service (currently in private preview)
4. Llama 2 13b Chat LLM: Uses the Llama 2 13b Chat model - requires the Llama-2-chat model to be deployed via SageMaker JumpStart.
@rstrahan (Contributor):

Add a reference link pointing users to a guide on how to install the model in SageMaker JumpStart.

@aaron-sim (Author):

Addressed in the 2nd commit.

import io
from typing import Dict

# TEMPERATURE = os.environ.get("TEMPERATURE", 1e-10)
@rstrahan (Contributor):

Remove commented-out code throughout, if unused, for cleanliness.
Presumably these settings are supplied via model parameters in the event, not local function environment variables.
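For illustration, reading the settings from the event's model parameters might look like the sketch below (the key names and defaults here are assumptions, not taken from the plugin code):

# Minimal sketch: take inference settings from the event payload,
# falling back to defaults, instead of Lambda environment variables.
DEFAULTS = {"temperature": 1e-10, "max_new_tokens": 1024}

def get_model_params(event: dict) -> dict:
    params = dict(DEFAULTS)
    # "parameters" is a hypothetical event key for this sketch.
    params.update(event.get("parameters") or {})
    return params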

@aaron-sim (Author):

Addressed in the 2nd commit.

# MAX_NEW_TOKENS = os.environ.get("MAX_NEW_TOKENS", 1024) # max number of tokens to generate in the output

# grab environment variables
ENDPOINT_NAME = os.environ['ENDPOINT_NAME']
@rstrahan (Contributor):

Can you rename this to SAGEMAKER_ENDPOINT_NAME so it doesn't get confused with the ENDPOINT_URL used in the other functions?
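With the rename applied, the handler's invocation code might look roughly like this sketch (the request payload schema is an assumption for illustration; JumpStart Llama 2 chat endpoints expect their own specific format):

import json
import os

import boto3

# Renamed from ENDPOINT_NAME per the review, to avoid confusion with ENDPOINT_URL.
SAGEMAKER_ENDPOINT_NAME = os.environ["SAGEMAKER_ENDPOINT_NAME"]

sagemaker_runtime = boto3.client("sagemaker-runtime")

def invoke_llama(prompt: str, parameters: dict) -> str:
    response = sagemaker_runtime.invoke_endpoint(
        EndpointName=SAGEMAKER_ENDPOINT_NAME,
        ContentType="application/json",
        # JumpStart Llama 2 endpoints require explicit EULA acceptance.
        CustomAttributes="accept_eula=true",
        Body=json.dumps({"inputs": prompt, "parameters": parameters}),
    )
    return response["Body"].read().decode("utf-8")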

@aaron-sim (Author):

Addressed in the 2nd commit.

Description: QnABot on AWS LLM Plugin for Llama-2-13b-chat

Parameters:

@rstrahan (Contributor):

I think we need parameters to identify the (existing)

  • SageMakerEndpointName, and
  • SageMakerEndpointArn

and more helpful text in the parameter descriptions to make it very clear how users should provision the endpoint (e.g. the JumpStart docs) and obtain the Name and ARN. A rough sketch follows this comment.
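For illustration only, the suggested parameters could look like this (the descriptions are assumptions, not the merged wording):

Parameters:
  SageMakerEndpointName:
    Type: String
    Description: >-
      Name of your existing SageMaker inference endpoint running the
      Llama-2-13b-chat model. Deploy the model via SageMaker JumpStart
      first, then copy the endpoint name from the SageMaker console.
  SageMakerEndpointArn:
    Type: String
    Description: >-
      ARN of the same endpoint, used to scope the Lambda invoke
      permission to least privilege.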

@aaron-sim (Author):

Addressed in the 2nd commit.

Action:
- "sagemaker:InvokeEndpoint"
Resource:
- "*"
@rstrahan (Contributor):

Scope this to least privilege. The Lambda only needs to invoke the specific endpoint supplied as a parameter, e.g. SageMakerEndpointArn.
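A sketch of the scoped statement, assuming a SageMakerEndpointArn parameter as suggested above:

Action:
  - "sagemaker:InvokeEndpoint"
Resource:
  - !Ref SageMakerEndpointArn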

@aaron-sim (Author):

Addressed in the 2nd commit.


Parameters:

LLMModel:
@rstrahan (Contributor):

Is this meant to be the SageMaker endpoint name? If so, let's rename it to SageMakerEndpointName and provide a more complete description.
Also add one for the ARN, so you can scope the invoke policy properly.

@aaron-sim (Author):

Addressed in the 2nd commit.

Runtime: python3.10
Environment:
Variables:
ENDPOINT_NAME: !Ref LLMModel
@rstrahan (Contributor):

As noted above, the parameter name is confusing, since you want the endpoint name, not the model. Call it SageMakerEndpointName, as suggested earlier.
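With both renames applied, the wiring would look roughly like this sketch:

Runtime: python3.10
Environment:
  Variables:
    SAGEMAKER_ENDPOINT_NAME: !Ref SageMakerEndpointName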

@aaron-sim (Author):

Addressed in the 2nd commit.

@@ -0,0 +1,112 @@
AWSTemplateFormatVersion: "2010-09-09"
Description: QnABot on AWS LLM Plugin for Llama-2-13b-chat
@rstrahan (Contributor):

Mention in the description that the model is deployed using SageMaker JumpStart (and add a link).

@aaron-sim (Author):

Addressed in the 2nd commit.

… construct the ARN using the SageMakerEndpointName parameter
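Per that commit message, the invoke permission presumably ends up scoped to an ARN built from the endpoint name. A sketch of that pattern (not the exact merged template; it assumes the endpoint lives in the deploying account and region):

Action:
  - "sagemaker:InvokeEndpoint"
Resource:
  - !Sub "arn:${AWS::Partition}:sagemaker:${AWS::Region}:${AWS::AccountId}:endpoint/${SageMakerEndpointName}"
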
@rstrahan changed the base branch from main to develop on Sep 21, 2023
@rstrahan merged commit aaaf41b into aws-samples:develop on Sep 21, 2023
@aaron-sim deleted the llama2_plugin branch on Oct 10, 2023