Wednesday Jul 26, 2023

Large data sources with bad data. Smaller data sources with good data. Which is better?

Large bad data.  Obviously not good and some have recognized this as “model collapse” – the bad data causes more bad data to be generated.  Small bad data – nothing needs to be said here.  Small (vetted, provenance-known) data – perhaps within your enterprise walls – perhaps the way to go, until large, good data that is used for training appears.  When will this happen?  I do not think this is on the horizon.

Comments (0)

To leave or reply to comments, please download free Podbean or

No Comments

Copyright 2023 All rights reserved.

Podcast Powered By Podbean

Version: 20240731