ADHD people: switching between things very quickly.
also ADHD people: tasks must be challenging enough to be engaging, but not so challenging that it triggers frustration intolerance.
... this is literally impedance matching.
按照 https://adityassrana.github.io/blog/theory/2020/08/26/Weight-Init.html 一文的说法,用“正确”的方式初始化了模型参数,结果甚至 train 不起来。想起以前不知在哪看到的「大家都在 overfit Adam」,感觉多少也是异曲同工。或许无形中已经 overfit 了 PyTorch 默认的不科学的 init。
Just some nonsense from GitHub CEOS. You are taking others' code without giving any credit and pretending to be for the betterment of humanity. Be honest, man. It is all about money.
When inventor Frederick Banting discovered insulin in 1923, he refused to put his name on the patent. He felt it was unethical for a doctor to profit from a discovery that would save lives. Now that is what I call the *betterment of humanity* and not this your bullshit generative AI.
Creator of Mortal & https://mjai.ekyu.moe | INTP-T | Novice producer & DJ | Rustacean | Arch Linux