(98)
|
Whereas the generality of a model could, inter alia, also be determined
by a number of parameters, models with at least a billion parameters and trained
with a large amount of data using self-supervision at scale should be considered to display
significant generality and to competently perform a wide range of distinctive tasks.
|