• traxex@lemmy.dbzer0.com
      link
      fedilink
      arrow-up
      5
      ·
      8 hours ago

      Hell yeah, I hope I contributed to some bot somewhere absolutely flailing to provide a good python snippet.

  • Quicky@piefed.social
    link
    fedilink
    English
    arrow-up
    90
    ·
    14 hours ago

    I’m torn between wanting to opt-out because it’s morally correct, or remaining opted-in so I can poison AI models with my terrible code.

    • Flipper@feddit.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 hours ago

      Step one: Download a C or CPP repository.

      Step two: Replace all semicolons with a greek comma.

      Step three: ??

      Step four: Poison Copilot, so that it randomly insert greek comas that the compilers totally choke on.

    • Cevilia (they/she/…)@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      20
      ·
      12 hours ago

      I signed up to github purely to opt in and upload terrible python code.

      If they desperately want to train the idiot machine on my awful self-taught code, that’s on them.

    • bobo@lemmy.ml
      link
      fedilink
      arrow-up
      33
      ·
      14 hours ago

      so I can poison AI models with my terrible code.

      Don’t forget to teach it obscenities and yell at it whenever it fucks something up!

      • Madrigal@lemmy.world
        link
        fedilink
        English
        arrow-up
        27
        ·
        13 hours ago

        Nah, guarantee the models have rules built in to deal with obvious stuff like that.

        You need to be more subtle. Give them information that is slightly wrong.

        • 4am@lemmy.zip
          link
          fedilink
          arrow-up
          13
          ·
          12 hours ago

          Yeah all you have to do is commit anything to GitHub

          They’re scraping all the code regardless of your preferences. I guarantee it.

      • communism@lemmy.ml
        link
        fedilink
        arrow-up
        5
        ·
        10 hours ago

        It’s great. I also self-host my own Forgejo (that’s the software Codeberg runs on) instance for private repos, to avoid using up space on Codeberg’s servers.

        Main problem is the lack of federation, leading to splintering across Codeberg/GitLab/sourcehut/self-hosted forges. I know there’s Radicle, and Forgejo is working on ActivityPub integration, but it’s slow-moving to get what should be inherently federated by design (git) to actually be federated. In practice you need accounts on a dozen different websites if you want to regularly contribute to foss.

  • smeg@feddit.uk
    link
    fedilink
    English
    arrow-up
    33
    arrow-down
    1
    ·
    13 hours ago

    Not to be too snarky, but was there ever an assumption that stuff you put in wasn’t being used to train it? Safe to assume that any online service you’re using is making use of the data you’re giving it.

    • nogooduser@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      13 hours ago

      If you’re a business with a contract with them it should state that they won’t use your data to train their models.

      If you’re using the free service then you’re right that it’s safe to assume that your data was already being used.

      • MNByChoice@midwest.social
        link
        fedilink
        arrow-up
        8
        arrow-down
        1
        ·
        13 hours ago

        business with a contract

        I always wonder at this and have cautioned my managers repeatedly. Yes, we have a contract, but they have a literal army of lawyers and we have less (one lawyer one retainer for hourly work or a small grouping focused on taxes and employment law). As if our ownership won’t bend over backwards to avoid suing a large company like Google, AWS, Microsoft, or Oracle. (Maybe OpenAI and Anthropic are sue-able by a $100 million corp?)

        As proof I offer the lawsuits between businesses that have proceeded far enough the general public has heard about them. Not a specific one, just all of them.

        • nogooduser@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          13 hours ago

          You have to trust the contract.

          If you use Microsoft 365 or Google Workspace etc then they already have all your data anyway. Most businesses have to trust other companies and the contract at some point.

          The only other option is to use Open Source self hosted everything which is beyond most people’s ability.

  • lime!@feddit.nu
    link
    fedilink
    arrow-up
    21
    ·
    14 hours ago

    fun fact, if you’ve ever accidentally clicked the “enable” button on copilot because you’re a dumbass who can’t read, you get a shitton of more settings, most of which are locked to “enabled”.

  • Captain_Faraday@programming.dev
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    1
    ·
    13 hours ago

    Got this email last night and felt validated for never uploading any code to GitHub because I don’t trust Microsoft. lol I don’t have any big coding projects, but I self-host a ForgeJo server in my mini rack at home behind a Twingate VPN.

    • Hawke@lemmy.world
      link
      fedilink
      arrow-up
      7
      ·
      11 hours ago

      FYI: it is not “ForgeJo”

      Forgejo is derived from Esperanto where the “ejo” suffix means “place”. The J is pronounced like y is in English.

      It’s “forge-ejo” not “forge-joe”