1. Model Train Layout Update for 2/6/2022

    Model Train Layout Update for 2/6/2022

    108
    31
    261
    4
  2. An Extreme G4 Solar Storm Train, Eight Storms Race to Earth | Space Weather Spotlight 10 May 2024

    An Extreme G4 Solar Storm Train, Eight Storms Race to Earth | Space Weather Spotlight 10 May 2024

    12
    1
    1.1K
  3. Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)

    Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)

    524
  4. All Aboard! Trains, Trains and More Trains with Mark Fenbers!

    All Aboard! Trains, Trains and More Trains with Mark Fenbers!

    71
  5. How to Optimize Fat Loss | Alan Aragon & Shawn Stevenson

    How to Optimize Fat Loss | Alan Aragon & Shawn Stevenson

    40
  6. Model Train Layout Update 01-30-2023

    Model Train Layout Update 01-30-2023

    2
    0
    73
    2
  7. ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation

    ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation

    25
    8
    19
  8. Ho scale Model Train layout Update 12/24/2021

    Ho scale Model Train layout Update 12/24/2021

    5
    0
    54
    4
  9. CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)

    CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)

    60
  10. Model Trains Running Complation 2

    Model Trains Running Complation 2

    51
    17
    218
    1
  11. Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)

    Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)

    19
  12. [ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable

    [ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable

    110
    30
    111
  13. Training a NLP Translation Model on Custom Data

    Training a NLP Translation Model on Custom Data

    4
  14. LLaMA: Open and Efficient Foundation Language Models (Paper Explained)

    LLaMA: Open and Efficient Foundation Language Models (Paper Explained)

    28
  15. Autoregressive Diffusion Models (Machine Learning Research Paper Explained)

    Autoregressive Diffusion Models (Machine Learning Research Paper Explained)

    11
    5
    127
  16. Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)

    Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)

    20
  17. Top diy tractor making mini train transporting gasoline for petrol pump | train rescue

    Top diy tractor making mini train transporting gasoline for petrol pump | train rescue

    22