| Batch size | Val_freq |
|---|---|
| 1 | 230 |
| 2 | 110 |
| 4 | 50 |
| 8 | 20 |
| 16 | 5 |

Batch size consideration: the choice of batch size also affects the effective learning rate. As a common rule of thumb, larger batch sizes tolerate (and often need) a higher learning rate, because each gradient estimate is less noisy, while smaller batch sizes usually call for a lower learning rate to stay stable.
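One common heuristic for relating batch size and learning rate is the linear scaling rule, which keeps the ratio of learning rate to batch size constant. The sketch below is only an illustration of that heuristic, not the procedure used in these runs: the function name `scale_learning_rate` and the reference point (learning rate 1e-4 at batch size 4) are assumptions chosen for the example, and the batch sizes are the ones listed in the table.

```python
def scale_learning_rate(base_lr: float, base_batch_size: int, batch_size: int) -> float:
    """Linear scaling heuristic: keep lr / batch_size constant."""
    if base_batch_size <= 0 or batch_size <= 0:
        raise ValueError("batch sizes must be positive")
    return base_lr * batch_size / base_batch_size


# Hypothetical reference point (an assumption, not taken from the runs).
BASE_LR = 1e-4
BASE_BATCH_SIZE = 4

# Batch sizes from the table; learning rates derived from the reference point.
scaled = {
    bs: scale_learning_rate(BASE_LR, BASE_BATCH_SIZE, bs)
    for bs in (1, 2, 4, 8, 16)
}

for bs, lr in scaled.items():
    print(f"batch_size={bs:>2}  lr={lr:.2e}")
```

The rule is only a starting point: it tends to break down at very large batch sizes and with adaptive optimizers, so the scaled values are best treated as the centre of a small sweep, not as final settings.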
Learning rate:

- 1e-4 (c1)
- 1e-5 (c2)
- 1e-6 (c3)
- 10e-4 (c4)
- 10e-5 (c5)
- 10e-6 (c6), 1 epoch