Skip to content

Releases: epam/ai-dial-adapter-bedrock

0.14.0

12 Sep 14:49
Compare
Choose a tag to compare

Features

  • supported multiple AWS accounts and regions within a single bedrock adapter deployment (#141)
  • added Llama 3.1 8B/70B/405B models (#136)
  • added Cohere embedding models (Embed English and Embed Multilingual) (#132)
  • added code=truncate_prompt_error to errors raised by prompt truncation algorithm (#134)

Fixes

  • supported zero-argument tools/functions (#134)
  • Claude 3 fails when request.max_prompt_tokens is provided instead of ignoring it (#144)
  • improved reporting of content filter errors for Stability (#143)
  • supported passing through of 429 errors reported by Bedrock (#138)
  • image tokens are accounted in token usage computation for Titan Multimodal embeddings (f5d734f)

Other

  • bump aidial-sdk from 0.8.0 to 0.13.0 (#127, #134)
  • bump aiohttp from 3.9.4 to 3.10.2 (#137)

0.13.3

24 Jul 13:35
Compare
Choose a tag to compare

Fixed

  • fixed incorrect usage of log.exception (#125)

Other

  • bump setuptools from 68.2.2 to 70.0.0 (#124)
  • bump epam/ai-dial-ci from 1.8.3 to 1.9.0 (#129)

0.13.2

15 Jul 09:52
Compare
Choose a tag to compare

Fixes

  • added model field to the embeddings request (#122) (#123)

0.13.1

11 Jul 08:52
Compare
Choose a tag to compare

Other

  • bump epam/ai-dial-ci from 1.8.1 to 1.8.3 (#118)
  • bump certifi from 2023.7.22 to 2024.7.4 (#119)
  • bump zipp from 3.17.0 to 3.19.1 (#120)

0.13.0

25 Jun 12:59
Compare
Choose a tag to compare

Features

  • supported native tool use in Claude V3 chat model (#116)
  • added Claude 3.5 Sonnet chat model (#117)
  • added Titan Multimodal Embeddings Generation 1 (G1) (#114)
  • added Amazon Titan Text Embeddings V2 (#112)
  • added Titan Embeddings G1 – Text v1.2 (#113)

Other

  • bump urllib3 from 1.26.18 to 1.26.19 (#115)

0.12.0

13 Jun 11:55
Compare
Choose a tag to compare

Other

  • bump epam/ai-dial-ci from 1.8.0 to 1.8.1 (#109)
  • bump requests from 2.31.0 to 2.32.0 (#108)

0.11.0

16 May 12:01
Compare
Choose a tag to compare

Features

  • supported Llama 3 8B and Llama 3 70B models (#102)
  • supported legacy tool use for Claude 3 models (#98)
  • added tests for function/tool usage; migrated tests from langchain to openai; removed langchain dependency (#100)
  • reporting user-facing error messages via display_message (#89)

Fixes

  • tool call ids made unique (#107)

Other

  • bump tqdm from 4.66.1 to 4.66.3 (#104)
  • removed redundant chat app code (#101)

0.10.0

02 May 08:34
Compare
Choose a tag to compare

Features

  • redirected stability.stable-diffusion-xl to stability.stable-diffusion-xl-v1, since stability.stable-diffusion-xl-v0 is deprecated (#92)
  • supported Claude 3 Opus (#90)

Fixes

  • fixed bug with ignored max_tokens in Claude 3 (#99)

Other

  • bump aiohttp from 3.9.2 to 3.9.4 (#88)
  • bump pydantic from 1.10.12 to 1.10.13 (#95)

0.9.0

19 Apr 08:49
Compare
Choose a tag to compare

Features

  • migrated to the latest SDK and thus supported new way of OTEL configuration (#85)
  • removed deprecated claude-v1 model (#68)
  • supported Claude 3 Haiku model with deployment name anthropic.claude-3-haiku-20240307-v1:0 (#84)

Fixes

  • deprecated DIAL_USE_FILE_STORAGE env var (breaking changes, see the issue for a migration guide: #73)

Other

  • bump idna from 3.4 to 3.7 (#87)

0.8.0

04 Apr 14:36
Compare
Choose a tag to compare

Features

  • supported Claude 3 Sonnet (#77)
  • supported tokenize and truncate_prompt endpoints (#82)

Fixes

  • DEFAULT_REGION env var is marked as deprecated in favour of AWS_DEFAULT_REGION env var (#80)

Other

  • bump black from 23.3.0 to 24.3.0 (#79)
  • bump epam/ai-dial-ci from 1.7.0 to 1.8.0 (#78)
  • bump langchain-core from 0.1.29 to 0.1.30 (#76)
  • bump langchain-core from 0.1.30 to 0.1.35 (#83)
  • bump peter-evans/slash-command-dispatch from 3.0.1 to 4.0.0 (#75)
  • update github workflows (#74)