Hi friends,
I have just found that the inference speed dropped to only 1/10 of the original model.
Had anyone encountered this?
Thank you.
Hello @wild-bee,
Please file a bug report for this issue using Feedback Assistant. It is unexpected that model encryption would affect inference time.
-- Greg