Jump to main content Jump to sidebar
Home Postmill
  • Forums
  • Wiki
  • Log in
  • Sign up
  • Submissions
  • Comments
    • Featured
    • All
    • Hot
    • New
    • Active
    • Top
    • Controversial
    • Most commented

Do we really need 100B+ parameters in a large language model?

Submitted by Vegetable-Skill-9700 t3_121agx4 on March 25, 2023 at 4:24 AM in deeplearning

  • 54 comments
43 loading

Has anyone tried to use deep learning with CNNs on satellite images to predict malaria risk (or other similar diseases)?

Submitted by R_K_J-DK t3_120urzh on March 24, 2023 at 7:15 PM in deeplearning

  • 19 comments
13 loading

Cuda out of memory error

Submitted by Rishh3112 t3_120gvgw on March 24, 2023 at 11:01 AM in deeplearning

  • 24 comments
1 loading

Question for use of ML in adaptive authentication

Submitted by geoffroy_lesage t3_11zcc1a on March 23, 2023 at 7:17 AM in deeplearning

  • 20 comments
1 loading

Anyone have any good alternatives to Paperspace? My account got closed for unauthorized access.

Submitted by FermatsLastAccount t3_11ys19h on March 22, 2023 at 6:31 PM in deeplearning

  • 9 comments
0 loading

Alpaca Turbo : A chat interface to interact with alpaca models with history and context

Submitted by viperx7 t3_11xdx3w on March 21, 2023 at 11:21 AM in deeplearning

  • 21 comments
27 loading

Alpaca-7B and Dalai, how can I get coherent results?

Submitted by Haghiri75 t3_11wdi8m on March 20, 2023 at 9:01 AM in deeplearning

  • 12 comments
15 loading

How noticeable is the difference training a model 4080 vs 4090

Submitted by Numerous_Talk7940 t3_11w9hkj on March 20, 2023 at 5:18 AM in deeplearning

  • 20 comments
16 loading

Best GPUs for pretraining roBERTa-size LLMs with a $50K budget, 4x RTX A6000 v.s. 4x A6000 ADA v.s. 2x A100 80GB

Submitted by AngrEvv t3_11vb220 on March 19, 2023 at 4:17 AM in deeplearning

  • 7 comments
18 loading

Seeking Career Advice to go from general CS background to a career in AI/Machine Learning

Submitted by brown_ja t3_11urpbb on March 18, 2023 at 3:41 PM in deeplearning

  • 11 comments
13 loading

5100+ Chat GPT Prompts Excel Sheet

Submitted by Ok-Demand-7347 t3_11un0j6 on March 18, 2023 at 12:22 PM in deeplearning

  • 6 comments
0 loading

Question on Attention

Submitted by FunQuarter3511 t3_11ugj0f on March 18, 2023 at 6:31 AM in deeplearning

  • 7 comments
6 loading

Choose wisely

Submitted by nickpngc t3_11t4c9c on March 16, 2023 at 7:52 PM in deeplearning

  • 7 comments
2 loading

How To Fine-tune LLaMA Models, Smaller Models With Performance Of GPT3

Submitted by l33thaxman t3_11ryc3s on March 15, 2023 at 2:38 PM in deeplearning

  • 10 comments
40 loading

image to image

Submitted by Marius1235 t3_11rs817 on March 15, 2023 at 10:17 AM in deeplearning

  • 7 comments
2 loading

How To Scale Transformers’ Memory up to 262K Tokens With a Minor Change?

Submitted by rezayazdanfar t3_11qfl2o on March 13, 2023 at 5:19 PM in deeplearning

  • 7 comments
15 loading

Image reconstruction

Submitted by grid_world t3_11qazip on March 13, 2023 at 2:18 PM in deeplearning

  • 6 comments
1 loading

Using GANs to generate defective data

Submitted by Tekno-12345 t3_11qanvr on March 13, 2023 at 2:06 PM in deeplearning

  • 18 comments
11 loading

Which topic in deep learning do you think will become relevant or popular in the future?

Submitted by gokulPRO t3_11pyvb3 on March 13, 2023 at 3:30 AM in deeplearning

  • 15 comments
12 loading

Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers?

Submitted by nirnamous t3_11pq968 on March 12, 2023 at 9:22 PM in deeplearning

  • 9 comments
16 loading

How to code a PPO neural network in java

Submitted by SpigotNerd t3_11oo58v on March 11, 2023 at 4:20 PM in deeplearning

  • 7 comments
0 loading

Desktop Computer or some other way to train neural networks?

Submitted by ScottHameed t3_11o8xjx on March 11, 2023 at 2:58 AM in deeplearning

  • 21 comments
11 loading

AskReddit: which MBP Pro is future proof for AI apps development?

Submitted by General-Jaguar-8164 t3_11n8xrc on March 10, 2023 at 12:20 AM in deeplearning

  • 9 comments
0 loading

Can feature engineering avoid overfitting?

Submitted by Constant-Cranberry29 t3_11mokqu on March 9, 2023 at 10:12 AM in deeplearning

  • 15 comments
6 loading

this is reality.

Submitted by Genius_feed t3_11mnw1m on March 9, 2023 at 9:28 AM in deeplearning

  • 6 comments
59 loading
  • More

Running Postmill