• shoo@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    2
    ·
    13 hours ago

    Strange to not qualify the last one as theft. If it’s out putting code, it’s from the same kind of training set. If it’s out putting character responses, they’re from that same literary training data.

    • Kay Ohtie@pawb.social
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      4
      ·
      12 hours ago

      Open-source training texts intended for pairing with your intended style of output have been around for far longer than OpenAI has been grifting data from the entire Internet and collected book works. It came across like that’s what they’re using, not some shit off HuffingFarce that was built off of AO3 and Harry Potter.