Releases: xorbitsai/inference

v0.14.0

02 Aug 08:42
dd85cfe

What's new in 0.14.0 (2024-08-02)

These are the changes in inference v0.14.0.

New features

  • FEAT: Supports model_path input when launching models by @Valdanitooooo in #1918
  • FEAT: Support gte-Qwen2-7B-instruct and multi gpu deploy by @amumu96 in #1994

Enhancements

Bug fixes

Documentation

New Contributors

Full Changelog: v0.13.3...v0.14.0

v0.13.3

26 Jul 10:59
aa51ff2

What's new in 0.13.3 (2024-07-26)

These are the changes in inference v0.13.3.

New features

Enhancements

  • ENH: added MLX for llama-3-instruct, codestral, Yi-1.5-chat, internlm2.5-chat by @qinxuye in #1908
  • ENH: add gptq for llama-3-instruct by @Phoenix500526 in #1934

Bug fixes

  • BUG: fix inpainting and flexible infer due to inner API change by @qinxuye in #1907

Documentation

New Contributors

Full Changelog: v0.13.2...v0.13.3

v0.13.2

19 Jul 11:26
880929c

What's new in 0.13.2 (2024-07-19)

These are the changes in inference v0.13.2.

New features

Enhancements

Bug fixes

  • BUG: Fix stream unicode issue for Chinese characters when using vllm backend by @ChengjieLi28 in #1865
  • BUG: sglang stream error when stream_option is not set by @wxiwnd in #1901
  • BUG: fix client import by @amumu96 in #1905

Full Changelog: v0.13.1...v0.13.2

v0.13.1

12 Jul 11:10
5e3f254

What's new in 0.13.1 (2024-07-12)

These are the changes in inference v0.13.1.

New features

Enhancements

Bug fixes

  • BUG: cache status missing for model id with quantization placeholder by @Zihann73 in #1849

Documentation

Others

New Contributors

Full Changelog: v0.13.0...v0.13.1

v0.13.0

05 Jul 10:33
007408c

What's new in 0.13.0 (2024-07-05)

These are the changes in inference v0.13.0.

New features

Enhancements

Bug fixes

Tests

Documentation

Full Changelog: v0.12.3...v0.13.0

v0.12.3

28 Jun 07:36
3d9c261

What's new in 0.12.3 (2024-06-28)

These are the changes in inference v0.12.3.

New features

Enhancements

Bug fixes

Others

  • CHORE: upgrade version to fix security vulnerability by @rickywu in #1674

New Contributors

Full Changelog: v0.12.2...v0.12.3

v0.12.2.post1

22 Jun 17:37
7705d4a

What's new in 0.12.2.post1 (2024-06-22)

These are the changes in inference v0.12.2.post1.

Enhancements

Full Changelog: v0.12.2...v0.12.2.post1

v0.12.2

21 Jun 09:14
5cef7c3

What's new in 0.12.2 (2024-06-21)

These are the changes in inference v0.12.2.

New features

  • FEAT: Add Tools Support for Qwen Series MOE Models by @zhanghx0905 in #1642
  • FEAT: [UI] Modify the deletion function of a custom model by @yiboyasss in #1656
  • FEAT: [UI] Custom model presents JSON data and allows modifying it by @yiboyasss in #1670
  • FEAT: Add Rerank model token input/output usage by @wxiwnd in #1657

Enhancements

  • ENH: Continuous batching supports all the models with transformers backend by @ChengjieLi28 in #1659

Bug fixes

  • BUG: show an error when the user launches a quantized model without a supported device by @Minamiyama in #1645
  • BUG: Fix default rerank type by @codingl2k1 in #1649
  • BUG: chat_completion does not respond after more than 100 errors occur by @liuzhenghua in #1663

Tests

Others

Full Changelog: v0.12.1...v0.12.2

v0.12.1

14 Jun 09:31
34a57df

What's new in 0.12.1 (2024-06-14)

These are the changes in inference v0.12.1.

New features

Enhancements

Bug fixes

Others

New Contributors

Full Changelog: v0.12.0...v0.12.1

v0.12.0

07 Jun 07:27
55c5636

What's new in 0.12.0 (2024-06-07)

These are the changes in inference v0.12.0.

New features

Enhancements

  • ENH: make CogVLM2 support stream output by @Minamiyama in #1572
  • BLD: Docker cleans all images after building an image on a self-hosted machine by @ChengjieLi28 in #1595
  • BLD: Fix pip looking at multiple versions of some packages while installing by @ChengjieLi28 in #1603

Bug fixes

Documentation

New Contributors

Full Changelog: v0.11.3...v0.12.0