Commit d22bf15
[Cria][Llama runner] Use caching temp allocator
Pull Request resolved: #16081
Using the caching temp allocator improves TITO model performance by more than 6%.
Repro instructions will be added here, but the impact is only visible with the next diff applied.
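
For context, a caching temp allocator generally keeps scratch buffers alive between runs and hands them back out instead of calling the system allocator each time. The sketch below is only an illustration of that idea under stated assumptions: the class name `CachingTempAllocator` and its `allocate`, `reset`, and `cache_hits` methods are hypothetical and are not taken from this commit or from the runner's actual allocator.

```cpp
#include <cstddef>
#include <cstdlib>
#include <unordered_map>
#include <vector>

// Hypothetical illustration of a caching temp allocator.
// This is NOT the allocator used by this commit; all names are assumptions.
class CachingTempAllocator {
 public:
  CachingTempAllocator() = default;
  CachingTempAllocator(const CachingTempAllocator&) = delete;
  CachingTempAllocator& operator=(const CachingTempAllocator&) = delete;

  // Releases every buffer this allocator ever obtained from the system.
  ~CachingTempAllocator() {
    for (auto& entry : sizes_) {
      std::free(entry.first);
    }
  }

  // Returns a buffer of `size` bytes, reusing a cached buffer of the
  // same size when one is available.
  void* allocate(std::size_t size) {
    auto it = free_.find(size);
    if (it != free_.end() && !it->second.empty()) {
      void* ptr = it->second.back();
      it->second.pop_back();
      in_use_.push_back(ptr);
      ++cache_hits_;
      return ptr;
    }
    void* ptr = std::malloc(size);
    if (ptr == nullptr) {
      return nullptr;
    }
    sizes_[ptr] = size;
    in_use_.push_back(ptr);
    return ptr;
  }

  // Marks all outstanding buffers as reusable without freeing them,
  // e.g. at the end of one inference step.
  void reset() {
    for (void* ptr : in_use_) {
      free_[sizes_[ptr]].push_back(ptr);
    }
    in_use_.clear();
  }

  // Number of allocations served from the cache.
  std::size_t cache_hits() const { return cache_hits_; }

 private:
  std::unordered_map<std::size_t, std::vector<void*>> free_;  // size -> idle buffers
  std::unordered_map<void*, std::size_t> sizes_;              // buffer -> size
  std::vector<void*> in_use_;                                 // handed out since last reset
  std::size_t cache_hits_ = 0;
};
```

In this sketch, a caller would invoke `reset()` after each step so that the next step's same-sized temporary requests are served from the cache. Whether the real allocator matches sizes exactly or uses a different reuse policy is not stated in the commit message.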
ghstack-source-id: 327106879
Differential Revision: [D85532078](https://our.internmc.facebook.com/intern/diff/D85532078/)

1 parent 161c5c5 commit d22bf15
1 file changed: +0 -1 lines changed

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 225 | 225 | |
| 226 | 226 | |
| 227 | 227 | |
| 228 | | - |
| 229 | 228 | |
| 230 | 229 | |
| 231 | 230 | |