File tree
274 files changed
+23736
-854
lines changed- assets
- csrc
- flash_attn
- src
- fmha
- ft_attention
- fused_dense_lib
- layer_norm
- rotary
- xentropy
- flash_attn
- layers
- losses
- models
- modules
- ops
- triton
- utils
- tests
- losses
- models
- modules
- ops
- training
- configs
- callbacks
- datamodule
- experiment
- owt
- pile
- logger
- metrics
- mode
- model
- gpt2model
- optimizer
- scheduler
- task
- trainer
- src
- callbacks
- datamodules
- datasets
- distributed
- metrics
- models/modules
- optim
- tasks
- utils
- tests/datamodules
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
274 files changed
+23736
-854
lines changed+21
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + |
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + |
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + |
+33-8
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
11 |
| - | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
12 | 32 |
| |
13 | 33 |
| |
14 | 34 |
| |
| |||
18 | 38 |
| |
19 | 39 |
| |
20 | 40 |
| |
21 |
| - | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
22 | 47 |
| |
23 |
| - | |
| 48 | + | |
24 | 49 |
| |
25 | 50 |
| |
26 | 51 |
| |
| |||
38 | 63 |
| |
39 | 64 |
| |
40 | 65 |
| |
41 |
| - | |
| 66 | + | |
42 | 67 |
| |
43 | 68 |
| |
44 | 69 |
| |
45 | 70 |
| |
46 | 71 |
| |
47 | 72 |
| |
48 | 73 |
| |
49 |
| - | |
| 74 | + | |
50 | 75 |
| |
51 | 76 |
| |
52 | 77 |
| |
| |||
148 | 173 |
| |
149 | 174 |
| |
150 | 175 |
| |
151 |
| - | |
152 |
| - | |
| 176 | + | |
| 177 | + | |
153 | 178 |
| |
154 |
| - | |
| 179 | + | |
155 | 180 |
| |
156 | 181 |
| |
157 | 182 |
|
Loading
Loading
Loading
Loading
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
176 | 176 |
| |
177 | 177 |
| |
178 | 178 |
| |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
179 | 189 |
| |
180 | 190 |
| |
181 | 191 |
| |
| |||
299 | 309 |
| |
300 | 310 |
| |
301 | 311 |
| |
302 |
| - | |
303 | 312 |
| |
304 | 313 |
| |
305 | 314 |
| |
306 | 315 |
| |
307 | 316 |
| |
308 | 317 |
| |
309 | 318 |
| |
310 |
| - | |
| 319 | + | |
311 | 320 |
| |
312 | 321 |
| |
313 | 322 |
| |
314 | 323 |
| |
315 | 324 |
| |
316 | 325 |
| |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
317 | 335 |
| |
318 | 336 |
| |
319 | 337 |
| |
| |||
341 | 359 |
| |
342 | 360 |
| |
343 | 361 |
| |
344 |
| - | |
| 362 | + | |
345 | 363 |
| |
346 | 364 |
| |
347 | 365 |
| |
| |||
454 | 472 |
| |
455 | 473 |
| |
456 | 474 |
| |
457 |
| - | |
458 | 475 |
| |
459 |
| - | |
460 |
| - | |
461 |
| - | |
462 |
| - | |
463 |
| - | |
464 |
| - | |
465 |
| - | |
466 |
| - | |
467 |
| - | |
| 476 | + | |
| 477 | + | |
| 478 | + | |
| 479 | + | |
| 480 | + | |
| 481 | + | |
468 | 482 |
| |
469 | 483 |
| |
470 | 484 |
| |
| |||
481 | 495 |
| |
482 | 496 |
| |
483 | 497 |
| |
484 |
| - | |
485 |
| - | |
486 |
| - | |
487 |
| - | |
| 498 | + | |
| 499 | + | |
| 500 | + | |
| 501 | + | |
488 | 502 |
| |
489 | 503 |
| |
490 | 504 |
| |
| |||
597 | 611 |
| |
598 | 612 |
| |
599 | 613 |
| |
600 |
| - | |
601 | 614 |
| |
602 | 615 |
| |
603 | 616 |
| |
|
Binary file not shown.
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
36 | 36 |
| |
37 | 37 |
| |
38 | 38 |
| |
39 |
| - | |
| 39 | + | |
| 40 | + | |
40 | 41 |
| |
41 | 42 |
| |
42 | 43 |
| |
| |||
195 | 196 |
| |
196 | 197 |
| |
197 | 198 |
| |
198 |
| - | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
199 | 202 |
| |
200 |
| - | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
201 | 206 |
| |
202 | 207 |
| |
203 | 208 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
34 | 34 |
| |
35 | 35 |
| |
36 | 36 |
| |
37 |
| - | |
38 |
| - | |
39 |
| - | |
40 |
| - | |
41 |
| - | |
42 |
| - | |
43 |
| - | |
44 |
| - | |
45 |
| - | |
46 |
| - | |
47 |
| - | |
48 |
| - | |
49 |
| - | |
50 |
| - | |
51 | 37 |
| |
52 | 38 |
| |
53 | 39 |
| |
| |||
148 | 134 |
| |
149 | 135 |
| |
150 | 136 |
| |
151 |
| - | |
152 |
| - | |
153 |
| - | |
154 |
| - | |
155 |
| - | |
156 |
| - | |
157 |
| - | |
158 |
| - | |
159 |
| - | |
160 |
| - | |
161 |
| - | |
162 |
| - | |
163 |
| - | |
164 |
| - | |
165 |
| - | |
166 |
| - | |
167 |
| - | |
168 |
| - | |
169 |
| - | |
170 |
| - | |
171 |
| - | |
172 |
| - | |
173 |
| - | |
174 |
| - | |
175 |
| - | |
176 |
| - | |
177 |
| - | |
178 |
| - | |
179 |
| - | |
180 |
| - | |
181 |
| - | |
182 |
| - | |
183 |
| - | |
184 |
| - | |
185 |
| - | |
186 |
| - | |
187 |
| - | |
188 | 137 |
| |
189 | 138 |
| |
190 | 139 |
| |
| |||
306 | 255 |
| |
307 | 256 |
| |
308 | 257 |
| |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
309 | 279 |
| |
310 | 280 |
| |
311 | 281 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + |
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + |
0 commit comments