other.li Forum

Working on some poison-as-a-service (PaaS).

25 posts, 10 commenters, 0 views

    Working on some poison-as-a-service (PaaS). Looking to launch in the next few days.

    #AI #enjoythinking

    #5

    @JulianOliver solid gold

      #6

      @JulianOliver

      Looking forward to it!! Also, you might find this great list helpful: https://algorithmic-sabotage.gitlab.io/asrg/sabot-in-the-age-of-ai/ via @asrg. It brings together a range of related approaches and could be useful as you refine things before launch.

        @JulianOliver Nepenthes GeoCities Edition? 😄

        #7

        @alxndr 100%

          #8

          @anaiscrosby @asrg A great resource!

            #9

            Also working on a zip bomb, to randomly scatter in among the links.

            Thanks to @anaiscrosby I came across this excellent method, using LZ77:

            https://natechoe.dev/blog/2025-08-04.html

            TBH I was just going to `dd if=/dev/urandom` my way to a titanic RAM-flooding *.gz, but I am getting great results with the above, with bonus site-data honey inside to keep bots on the chase.

              #10

              @JulianOliver

              This '#antiAI content' reads exactly like bad #LLM output, on a gradient that looks like a 90s sticker book had a stroke. The snake isn't just eating its own tail, it's leaving a five-star review of the experience.

              It won't work, because scrapers don't care about your CSS. The text is still plaintext in the HTML. The gibberish doesn't poison anything: models already train on billions of tokens of garbage and route around it. And if your adversarial content is indistinguishable from the thing you're fighting, you're just contributing to the slop pile for free.

              #adversarialAi not

                #11

                @n_dimension Oh the CSS is only there for me and any other human that likes to look at it.

                I am already seeing bots chewing into my tarpit, & spending time++ getting lost, wasting cycles, and soon I'll be flooding their RAM. While they keep coming back (they are, over and over), my little swamp will be waiting.

                If I & others are managing to contribute to the "slop pile", that's great. We're helping keep genAI text distinguishable from human-generated content in an era of unregulated deception.

                  #12

                  @JulianOliver

                  How do you differentiate AI bots from search engine bots and from sploit scrapers?

                  I've seen sploit scrapers in my logs, but I wouldn't be able to tell an AI scraper from an indexer.

                    #13

                    @n_dimension I do not see a way to differentiate, and suspect there is no way.

                      #14

                      @anaiscrosby After seeing ChatGPTBot blow 123 seconds on my drip-feed poison tarpit and then never come back, I started reading up on how modern LLM scrapers might detect tarpits and blacklist them.

                      During that research I came across this tarpit-evading scraper, which offers some interesting insights into how they might do this.

                      https://github.com/Draconiator/Ipema

                      This gives me pause and has me looking at other solutions for counter-detection.

                      The GeoCities CSS is going nowhere.

                        #15

                        @JulianOliver @anaiscrosby thank you

                          #16

                          @JulianOliver

                          Interesting!! Based on my limited experience implementing a similar tarpit using spigot (https://github.com/gw1urf/spigot) via @pengfold, I’ve noticed something pretty similar: bursts of activity (millions of hits/day) followed by long stretches of silence. From the intensity and patterns, it does seem like many scrapers aren’t consistently avoiding the tarpit, at least initially.

                          That said, I’d be a bit cautious about that conclusion. What you might be seeing isn’t necessarily "they can’t avoid it," but more like:

                          - some scrapers don’t try to detect tarpits (they just brute-force crawl and eat the cost)
                          - others probe once, flag it, and then blacklist it, hence the sudden silence
                          - and some operate in waves (rotating IPs / infrastructure), which can look like on/off behavior
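
The burst-then-silence pattern described in this post can be made visible by bucketing request timestamps per day and listing the quiet stretches. The following is only a rough sketch: the function names and the sample timestamps are hypothetical, and the output cannot tell the three explanations above apart. It only surfaces the on/off pattern itself.

```python
from collections import Counter
from datetime import datetime, timedelta


def daily_hits(timestamps):
    """Count requests per calendar day."""
    return Counter(ts.date() for ts in timestamps)


def silent_stretches(counts, min_days=2):
    """Return (first_quiet_day, last_quiet_day) for every run of at
    least min_days days without hits between two active days."""
    days = sorted(counts)
    gaps = []
    for prev, nxt in zip(days, days[1:]):
        quiet = (nxt - prev).days - 1
        if quiet >= min_days:
            gaps.append((prev + timedelta(days=1), nxt - timedelta(days=1)))
    return gaps


# Hypothetical timestamps, for illustration only.
hits = [
    datetime(2026, 4, 1, 3, 0),
    datetime(2026, 4, 1, 3, 1),
    datetime(2026, 4, 2, 9, 0),
    datetime(2026, 4, 7, 3, 28),
]
counts = daily_hits(hits)
print(silent_stretches(counts))  # one gap: 3 April to 6 April 2026
```

Running the same bucketing per IP range or per user agent, rather than over all traffic, might help separate "blacklisted us" from "rotated infrastructure", but that is a guess rather than something established in this thread.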

                            #17

                            @JulianOliver Did you see this paper by Anthropic researchers? https://arxiv.org/abs/2510.07192

                            It reports that as few as 250 poisoned documents were enough to backdoor every model size tested (up to 13B parameters). That’s one webring! Even if detectable, it might be a good way to avoid getting scraped?

                              #18

                              @JulianOliver @anaiscrosby I haven't looked into tarpits but it smells to me very much like an "arms race" situation and there's no reason to think your side could prevail.

                                #19

                                @danstowell @anaiscrosby Winning would be nice, but I don't think it's always about prevailing, just as a likelihood of failing need not undermine the will to act. It's about resistance: doing something, standing ground, rather than letting this predatory broligarchy have their way.

                                Much of the time it's just about pushing back. If concerted, and at scale, it can indeed bring about tangible change.

                                  #20

                                  @danstowell @JulianOliver It’s about pushing back, not prevailing. “Tarpitting” has already emerged as a widely adopted response to AI, both a strategic approach and a meaningful act of resistance.

                                    #21

                                    @anaiscrosby @JulianOliver Thanks. I see that it's been adopted. My concern is that it might cost us a lot to develop these tarpits, which may have very little strategic effect if they become outmoded very quickly. But I really don't know; it's a very murky phase right now.

                                      #22

                                      @anaiscrosby Running a non-Markov tarpit for half an hour on one public link, and already have Claude lost in my swamp. Waiting to see if it runs into my ZIP bomb

                                      ---
                                      216.73.216.124 - - [07/Apr/2026:03:28:49 +0200] "GET /tarpit/until/same/drive/harmattan_leftmost_intranscalency_few_ministries_few_between HTTP/2.0" 200 10132 "-" "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" "-"
                                      ---
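
For anyone wanting to sift entries like the one above, here is a minimal parsing sketch. It assumes the log follows the common nginx/Apache "combined" layout with one extra quoted field at the end, which is what the quoted line looks like; the regular expression and the `parse_line` name are illustrative, not taken from the poster's setup. Keep in mind that the user agent is self-reported, and that the thread itself notes UA strings are readily forged.

```python
import re
from datetime import datetime

# Assumed layout: ip - - [time] "method path proto" status size "referer" "agent" ...
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<proto>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def parse_line(line):
    """Parse one access-log line into a dict, or return None if it does not match."""
    m = LOG_RE.match(line.strip())
    if m is None:
        return None
    rec = m.groupdict()
    rec["time"] = datetime.strptime(rec["time"], "%d/%b/%Y:%H:%M:%S %z")
    rec["status"] = int(rec["status"])
    rec["size"] = 0 if rec["size"] == "-" else int(rec["size"])
    return rec


# The line quoted in the post above.
SAMPLE = (
    '216.73.216.124 - - [07/Apr/2026:03:28:49 +0200] '
    '"GET /tarpit/until/same/drive/harmattan_leftmost_intranscalency_few_ministries_few_between HTTP/2.0" '
    '200 10132 "-" "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; '
    'ClaudeBot/1.0; +claudebot@anthropic.com)" "-"'
)

rec = parse_line(SAMPLE)
print(rec["ip"], rec["status"], rec["size"], rec["path"].startswith("/tarpit/"))
```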

                                        #23

                                        @anaiscrosby It hit it, but I guess it decompressed in a separate thread. It's a 127M archive that decompresses to 128GB. The bot kept scraping for a bit and then dropped off. Difficult to know if that discouraged it.

                                        What's strange is that soon afterwards other IPs were reaching statistically non-guessable, randomly generated URL paths without having touched the webroot or any other tarpit URL first. They all had iOS UA strings (readily forged).
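
As a quick sanity check on the sizes quoted in this post, the arithmetic below works out the expansion ratio. It assumes "127M" and "128GB" are binary units (MiB and GiB) and that the archive is a gzip/DEFLATE stream, which the earlier mention of a *.gz suggests but does not confirm. Under those assumptions the ratio lands right at the DEFLATE ceiling of roughly 1032:1, which follows from one length/distance pair costing at least 2 bits and emitting at most 258 bytes.

```python
MIB = 1024 ** 2
GIB = 1024 ** 3


def expansion_ratio(compressed_bytes, expanded_bytes):
    """How many output bytes each compressed byte turns into."""
    return expanded_bytes / compressed_bytes


# Sizes from the post, read as binary units (an assumption).
ratio = expansion_ratio(127 * MIB, 128 * GIB)

# DEFLATE ceiling: 2 bits in can give at most 258 bytes out,
# so 8 bits (one byte) in can give at most 1032 bytes out.
deflate_limit = 258 * 8 // 2

print(round(ratio), deflate_limit)  # 1032 1032
```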

                                          #24

                                          @JulianOliver @anaiscrosby
                                          I've read about poison fountains and zip bombs before but never deployed any. It's interesting to read about your results.
                                          Can I ask why you use a Markov chain? Wouldn't it be enough, and simpler, to spit out words at random? Is it because totally random text would be easier to detect as a poison fountain? Also, have you considered adding images (just coherent noise with irrelevant alt text) to the text?
                                          About the zip bomb: how do you filter to avoid accidentally bombing an innocent visitor? Checking the user agent isn't an option, right? Bad bots forge it anyway.
                                          What about the legal side of these two methods, do you know?
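
The Markov-versus-random question is not answered in the thread, and an earlier post mentions running a "non-Markov" tarpit, so the generator actually in use is unknown. The sketch below only illustrates the difference the question points at: a first-order Markov chain reuses word pairs that occurred in its source text, while uniformly random words mostly do not. The toy corpus and all function names are hypothetical.

```python
import random
from collections import defaultdict


def build_chain(words):
    """Map each word to the list of words that followed it in the corpus."""
    chain = defaultdict(list)
    for cur, nxt in zip(words, words[1:]):
        chain[cur].append(nxt)
    return chain


def markov_text(chain, start, n, rng):
    """Generate n words, each drawn from the followers of the previous word."""
    out = [start]
    for _ in range(n - 1):
        followers = chain.get(out[-1])
        if not followers:  # dead end: fall back to any known word
            followers = list(chain)
        out.append(rng.choice(followers))
    return out


def random_text(vocab, n, rng):
    """Generate n words drawn uniformly from the vocabulary."""
    return [rng.choice(vocab) for _ in range(n)]


def seen_pair_fraction(text, words):
    """Share of adjacent word pairs in text that also occur in the corpus."""
    seen = set(zip(words, words[1:]))
    pairs = list(zip(text, text[1:]))
    return sum(p in seen for p in pairs) / len(pairs)


corpus = "the bots crawl the swamp and the swamp waits for the bots".split()
rng = random.Random(0)
chain = build_chain(corpus)

markov = markov_text(chain, "the", 20, rng)
shuffled = random_text(sorted(set(corpus)), 20, rng)

print(seen_pair_fraction(markov, corpus))    # 1.0 for this corpus (no dead ends)
print(seen_pair_fraction(shuffled, corpus))  # typically much lower
```

Whether a real scraper or filter actually measures anything like this is not something the thread establishes; the point is only that purely random word salad is statistically easy to tell apart from text with natural-looking local structure.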
