Slow generation spe…
 
Notifications
Clear all

Slow generation speed


Andrew Day
(@Andrew)
Eminent Member Registered
Joined: 2 years ago
Posts: 14
Topic starter  

Slow generation speed makes the product feel heavier than users expect. Even if the answer is accurate, long waiting times create the impression that the system is struggling.

Generation speed is often affected by model size, token output length, prompt size, and the amount of extra context fed into the request. Many of these factors can be improved without changing the core product idea.

Users usually care more about responsiveness than perfection. Faster generation often improves satisfaction more than one extra round of polishing the answer.



   
ReplyQuote
Share: