RSS Bot@lemmy.bestiver.se to Hacker News@lemmy.bestiver.se · English · 2 months ago

Consistency diffusion language models: Up to 14x faster, no quality loss

www.together.ai

Consistency diffusion language models: Up to 14x faster inference without sacrificing quality

Standard diffusion language models can't use KV caching and need too many refinement steps to be practical. CDLM fixes both with a post-training recipe that enables exact block-wise KV caching and trajectory-consistent step reduction, delivering up to 14.5x lower latency.
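
To make the two claimed savings concrete, here is a toy cost model, not CDLM's actual implementation. It only counts how many token positions pass through the model. It assumes that an uncached diffusion LM recomputes the whole sequence at every refinement step, and that with block-wise caching only the current block is recomputed while the prompt and finished blocks are read from cache. The function name, parameters, and example numbers are all hypothetical and are not taken from the article.

```python
def positions_processed(prompt_len, gen_len, block_size, steps_per_block, cached):
    """Toy cost model: count token positions run through the model.

    Assumptions (illustrative only, not from the article):
    - uncached: every refinement step recomputes prompt + all generated slots
    - cached: the prompt is prefilled once, then each step recomputes only
      the current block and reads earlier positions from a KV cache
    """
    if gen_len % block_size != 0:
        raise ValueError("gen_len must be a multiple of block_size")

    num_blocks = gen_len // block_size
    total_steps = num_blocks * steps_per_block

    if not cached:
        # Full sequence goes through the model on every step.
        return total_steps * (prompt_len + gen_len)

    # One prefill pass over the prompt, then block-sized passes only.
    return prompt_len + total_steps * block_size


# Hypothetical sizes: 128-token prompt, 256 generated tokens, blocks of 32.
baseline = positions_processed(128, 256, 32, steps_per_block=32, cached=False)
with_cache = positions_processed(128, 256, 32, steps_per_block=32, cached=True)
fewer_steps = positions_processed(128, 256, 32, steps_per_block=8, cached=True)

print(baseline, with_cache, fewer_steps)
```

Position counts are only a rough proxy for latency, so the ratios this prints should not be read as reproducing the 14.5x figure. The point is that caching and step reduction cut work along different axes, which is why the post lists them separately.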

ChaoticNeutralCzech@feddit.org · 2 months ago
ML or LM?

Hacker News@lemmy.bestiver.se

hackernews@lemmy.bestiver.se

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.
