Commit Graph

293 Commits

Author SHA1 Message Date
Pan, Wen-Ming
4b54843ed7
fix: run agent with Vertex AI Gemini models (#5260)
Co-authored-by: Wenming Pan <pwm@google.com>
2024-06-16 09:36:31 +08:00
kurokobo
2e842333b1
fix: correct typos in the icons for microsoft (#5243) 2024-06-15 21:02:47 +08:00
Masashi Tomooka
d9bee03ff6
fix: embedding job fails using IAM role (#5252) 2024-06-15 18:57:54 +08:00
Jyong
ba5f8afaa8
Feat/firecrawl data source (#5232)
Co-authored-by: Nicolas <nicolascamara29@gmail.com>
Co-authored-by: chenhe <guchenhe@gmail.com>
Co-authored-by: takatost <takatost@gmail.com>
2024-06-15 02:46:02 +08:00
Bin
0f35d07052
support ERNIE-4.0-8K-Latest (#5216) 2024-06-14 18:45:24 +08:00
-LAN-
7f44e88eda
fix(model_providers/ollama): Fix OllamaLargeLanguageModel to correctly set the stop option (#5217) 2024-06-14 18:26:14 +08:00
Jason
b7ff765d8d
Add novita.ai as model provider (#4961) 2024-06-14 18:23:06 +08:00
Masashi Tomooka
0633aae7dc
feat: allow to use IAM Role for Bedrock (#5188) 2024-06-14 15:18:42 +08:00
takatost
415022aa14
fix: pydantic2 error (#5172) 2024-06-14 03:05:04 +08:00
rerorero
b85ae146a7
fix: JSON mode with an image doesn't work for Gemini (#5169) 2024-06-14 02:32:09 +08:00
Pan, Wen-Ming
f13af5a811
fix(model_providers/vertex_ai): Vertex AI Anthropic models authentication failed (#4971) 2024-06-14 01:34:31 +08:00
Bowen Liang
f976740b57
improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 2024-06-14 01:05:37 +08:00
kurokobo
e61f5d029a
chore(docs): fix minor small typos (#5124) 2024-06-13 17:36:01 +08:00
sino
8210637bc5
feat: support jina-clip-v1 embedding model (#5146) 2024-06-13 16:31:18 +08:00
呆萌闷油瓶
790543131a
chore:add some new api version for azure openai (#5142) 2024-06-13 16:30:47 +08:00
yanghx
adc948e87c
fix(api/core/model_runtime/model_providers/baichuan,localai): Parse ToolPromptMessage. #4943 (#5138)
Co-authored-by: -LAN- <laipz8200@outlook.com>
2024-06-13 13:08:30 +08:00
orangeclk
79e8489942
feat: support siliconflow (#5129) 2024-06-13 12:59:41 +08:00
xielong
ea69dc2a7e
feat: support hunyuan llm models (#5013)
Co-authored-by: takatost <takatost@users.noreply.github.com>
Co-authored-by: Bowen Liang <bowenliang@apache.org>
2024-06-12 17:24:23 +08:00
Pika
ecc7f130b4
fix(typo): misspelling (#5094) 2024-06-12 17:01:21 +08:00
sino
0ce97e6315
feat: support doubao llm function calling (#5100) 2024-06-12 15:43:50 +08:00
rerorero
28997772a5
fix: remote_url doesn't work for gemini (#5090) 2024-06-12 13:14:53 +08:00
orangeclk
2050a8b8f0
feat: add glm4 new models and zhipu embedding-2 (#5089) 2024-06-12 08:22:17 +08:00
sino
5f870ac950
chore: update maas model provider description (#5056) 2024-06-11 11:22:22 +08:00
Jaxon Ley
2573b138bf
fix: update presence_penalty configuration for wenxin AI ernie-4.0-8k and ernie-3.5-8k models (#5039) 2024-06-09 14:44:11 +08:00
takatost
3929d289e0
feat: set default memory messages limit to infinite (#5002) 2024-06-06 17:39:44 +08:00
Joe
5cdb95be1f
fix: gemini timeout error (#4955) 2024-06-06 10:19:03 +08:00
Bowen Liang
f32b440c4a
chore: fix indention violations by applying E111 to E117 ruff rules (#4925) 2024-06-05 14:05:15 +08:00
takatost
f44d1e62d2
fix: bedrock get_num_tokens prompt_messages parameter name err (#4932) 2024-06-05 01:53:05 +08:00
takatost
d1dbbc1e33
feat: backend model load balancing support (#4927) 2024-06-05 00:13:04 +08:00
Pan, Wen-Ming
b98a1a3303
feat: added Anthropic Claude3 models to Google Cloud Vertex AI (#4870)
Co-authored-by: pwm <pwm@google.com>
2024-06-04 02:52:46 +08:00
takatost
696c5308a9
chore: optimize nvidia nim credential schema and info (#4898) 2024-06-04 02:26:26 +08:00
Joshua
3c8a120e51
add-nvidia-mim (#4882) 2024-06-03 21:10:18 +08:00
Pan, Wen-Ming
cdbc260571
Bugfix: Vertex AI vision model not support image (#4853) 2024-06-02 11:11:09 +08:00
Yash Parmar
e0da0744b5
add: ollama keep alive parameter added. issue #4024 (#4655) 2024-05-31 12:22:02 +08:00
Weaxs
b189faca52
feat: update ernie model (#4756) 2024-05-29 14:57:23 +08:00
xielong
e1cd9aef8f
feat: support baichuan3 turbo, baichuan3 turbo 128k, and baichuan4 (#4762) 2024-05-29 14:46:04 +08:00
crazywoola
705a6e3a8e
Fix/4742 ollama num gpu option not consistent with allowed values (#4751) 2024-05-29 13:33:35 +08:00
xielong
793f0c1dd6
fix: Corrected schema link in model_runtime's README.md (#4757) 2024-05-29 13:03:21 +08:00
xielong
88b4d69278
fix: Correct context size for banchuan2-53b and banchuan2-turbo (#4721) 2024-05-28 16:37:44 +08:00
crazywoola
27dae156db
fix: colon in file mistral.mistral-small-2402-v1:0 (#4673) 2024-05-27 13:15:20 +08:00
Giovanny Gutiérrez
2deb23e00e
fix: Show rerank in system for localai (#4652) 2024-05-27 12:09:51 +08:00
longzhihun
fe9bf5fc4a
[seanguo] add support of amazon titan v2 and modify the price of amazon titan v1 (#4643)
Co-authored-by: Chenhe Gu <guchenhe@gmail.com>
2024-05-26 23:30:22 +08:00
miendinh
f804adbff3
feat: Support for Vertex AI - load Default Application Configuration (#4641)
Co-authored-by: miendinh <miendinh@users.noreply.github.com>
Co-authored-by: crazywoola <427733928@qq.com>
2024-05-25 13:40:25 +08:00
Krasus.Chen
f156014daa
update lite8k/speed8k/128k max_token to newest (#4636)
Co-authored-by: Your Name <chen@krasus.red>
2024-05-24 19:33:42 +08:00
Bowen Liang
3fda2245a4
improve: extract method for safe loading yaml file and avoid using PyYaml's FullLoader (#4031) 2024-05-24 12:08:12 +08:00
Patryk Garstecki
296887754f
Support for Vertex AI (#4586) 2024-05-24 12:01:40 +08:00
QuietRocket
9ae72cdcf4
feat: Add Gemini Flash (#4616) 2024-05-24 11:43:06 +08:00
takatost
11642192d1
chore: add https://api.openai.com placeholder in OpenAI api base (#4604) 2024-05-23 12:56:05 +08:00
呆萌闷油瓶
e57bdd4e58
chore:update gpt-3.5-turbo and gpt-4-turbo parameter for azure (#4596) 2024-05-23 11:51:38 +08:00
somethingwentwell
461488e9bf
Add Azure OpenAI API version for GPT4o support (#4569)
Co-authored-by: wwwc <wwwc@outlook.com>
2024-05-22 17:43:16 +08:00