Commit a7c4a16
Enable TF32 mode in GRU ops (#2512)
TF32 mode was not enabled for the GRU op. Enabling it gives a ~1.72x speedup on Molan.
Co-authored-by: Janghaeng Lee <janghaeng.lee@intel.com>

1 parent 5c7bfad
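For context on what TF32 mode trades for that speedup: TF32 keeps float32's 8-bit exponent but only 10 explicit mantissa bits, so matrix inputs lose precision while keeping their range. The following is a minimal Python sketch of that rounding, not code from this commit. The helpers `to_tf32` and `dot_tf32` are hypothetical names, round-to-nearest-even is an assumed rounding mode (hardware may round differently), and the accumulator here is a Python double rather than the FP32 accumulator typical of real kernels.

```python
import math
import struct


def to_tf32(x: float) -> float:
    """Emulate TF32 precision: float32 range, 10 explicit mantissa bits."""
    if math.isnan(x) or math.isinf(x):
        return x
    # Reinterpret the float32 encoding of x as a 32-bit unsigned integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round to nearest, ties to even, on the 13 low mantissa bits (assumed mode).
    bits += 0x0FFF + ((bits >> 13) & 1)
    # Clear the 13 low bits so only 10 mantissa bits remain.
    bits &= 0xFFFFE000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def dot_tf32(a, b):
    """Dot product with TF32-rounded inputs and a wider accumulator."""
    return sum(to_tf32(x) * to_tf32(y) for x, y in zip(a, b))
```

With this sketch, `to_tf32(1.0 + 2**-12)` returns `1.0` because the increment is below half of the TF32 spacing near 1.0 (`2**-10`), while `to_tf32(1.0 + 2**-10)` is preserved exactly.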
1 file changed, 8 additions and 0 deletions

@@ -80,6 +80,10 @@
  (4 lines added at new lines 83-86; line contents missing from this capture)
@@ -323,6 +327,10 @@
  (4 lines added at new lines 330-333; line contents missing from this capture)