• boonhet@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    1
    ·
    18 hours ago

    The best advancements in usability of AI haven’t all come from training larger models. Tool usage is super useful and doesn’t require new training for new tools, etc.

    Deepseek for example doesn’t have a monthly release schedule, they’ve only released one version so far this year.

    Plus for new knowledge, there’s web search. It’s no longer strictly true that AI output is restricted to information available before the cutoff date.

    At this point you only really need to train a new model if you’re trying out architectural changes, you don’t need to crank out constant updates.