User contributions for Fall2024 Wiki Team6
Jump to navigation
Jump to search
15 December 2024
- 21:4621:46, 15 December 2024 diff hist −12 Adafactor →Numerical Examples current Tag: Visual edit
- 21:4321:43, 15 December 2024 diff hist −58 Adafactor →Step 2: Compute G t 2 {\displaystyle G_{t}^{2}} (Element-wise Square of Gradient) Tag: Visual edit
- 21:0221:02, 15 December 2024 diff hist +495 Adafactor →Problem setup Tag: Visual edit
14 December 2024
- 22:1222:12, 14 December 2024 diff hist −3 Adafactor →Numerical Examples Tag: Visual edit
- 22:0622:06, 14 December 2024 diff hist −34 Adafactor No edit summary Tag: Visual edit
- 22:0422:04, 14 December 2024 diff hist −54 Adafactor No edit summary Tag: Visual edit
- 21:4321:43, 14 December 2024 diff hist +4 Adafactor →Numerical Examples Tag: Visual edit
13 December 2024
- 16:5416:54, 13 December 2024 diff hist +4 Adafactor →Software Tools and Platforms Tag: Visual edit
- 16:5116:51, 13 December 2024 diff hist +1,557 Adafactor →Conclusion Tag: Visual edit
- 16:4816:48, 13 December 2024 diff hist −34 Adafactor →Applications: change tensorflow Tag: Visual edit
12 December 2024
- 11:5811:58, 12 December 2024 diff hist −18 Adafactor →Proposed Hyperparameters for Adafactor
- 11:5711:57, 12 December 2024 diff hist +123 Adafactor →4. Proposed Hyperparameters for Adafactor
- 11:5611:56, 12 December 2024 diff hist +1,096 Adafactor →4. Proposed Hyperparameters for Adafactor
- 11:5411:54, 12 December 2024 diff hist +29 Adafactor →Problem formulation
- 11:5311:53, 12 December 2024 diff hist +507 Adafactor Undo revision 6932 by Fall2024 Wiki Team6 (talk) Tag: Undo
- 11:4911:49, 12 December 2024 diff hist +1,013 Adafactor Undo revision 6933 by Fall2024 Wiki Team6 (talk) Tag: Undo
11 December 2024
- 20:0720:07, 11 December 2024 diff hist +1,665 Adafactor →Applications Tag: Visual edit: Switched
- 20:0220:02, 11 December 2024 diff hist +2,024 Adafactor →Introduction Tag: Visual edit: Switched
- 19:5119:51, 11 December 2024 diff hist +21 Adafactor →Introduction
- 19:4919:49, 11 December 2024 diff hist +1,249 Adafactor →Introduction Tag: Visual edit: Switched
- 17:0217:02, 11 December 2024 diff hist −3 Adafactor →Numerical Examples Tag: Visual edit
- 16:5716:57, 11 December 2024 diff hist +329 Adafactor →Numerical Examples Tag: Visual edit
- 16:4416:44, 11 December 2024 diff hist +126 Adafactor →Numerical Examples Tag: Visual edit
- 16:2316:23, 11 December 2024 diff hist +615 Adafactor →Numerical Examples Tag: Visual edit
- 12:1012:10, 11 December 2024 diff hist +19 Adafactor →Numerical Examples Tag: Visual edit
- 02:0002:00, 11 December 2024 diff hist −11 Adafactor →Numerical Examples Tag: Visual edit
- 01:5801:58, 11 December 2024 diff hist +4,080 Adafactor →Numerical Examples Tag: Visual edit
10 December 2024
- 23:2623:26, 10 December 2024 diff hist −1,314 Adafactor →Numerical Examples Tag: Visual edit
- 23:2323:23, 10 December 2024 diff hist +2 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:2323:23, 10 December 2024 diff hist +2 Adafactor →Why Clipping
- 23:2323:23, 10 December 2024 diff hist +1 Adafactor →5.Discussion
- 23:2323:23, 10 December 2024 diff hist −1,013 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:2323:23, 10 December 2024 diff hist −507 Adafactor →Why Clipping
- 23:2223:22, 10 December 2024 diff hist +1,537 Adafactor →Problem formulation
- 23:2123:21, 10 December 2024 diff hist −16 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:2023:20, 10 December 2024 diff hist +2 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:1923:19, 10 December 2024 diff hist +117 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:1823:18, 10 December 2024 diff hist +7 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:1723:17, 10 December 2024 diff hist +97 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:1523:15, 10 December 2024 diff hist +281 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0823:08, 10 December 2024 diff hist +54 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0723:07, 10 December 2024 diff hist +13 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0623:06, 10 December 2024 diff hist +180 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0423:04, 10 December 2024 diff hist +2 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0423:04, 10 December 2024 diff hist +197 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 23:0323:03, 10 December 2024 diff hist +13 Adafactor →Why Adafactor is more memory efficient, compared to Adam
- 22:5922:59, 10 December 2024 diff hist +68 Adafactor →Why Clipping
- 22:5422:54, 10 December 2024 diff hist +130 Adafactor →Why Clipping
- 22:5222:52, 10 December 2024 diff hist +379 Adafactor →Adafactor for Weighted Matrices
- 22:5222:52, 10 December 2024 diff hist −379 Adafactor →Why Clipping