Andromxda 🇺🇦🇵🇸🇹🇼@lemmy.dbzer0.com (mod) to Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ@lemmy.dbzer0.com · English · 1 year ago

Anna's Archive is looking for volunteers to run mirrors

annas-archive.org

  • maxprime@lemmy.ml · English · ↑62 · 1 year ago

    For anyone wanting to contribute but on a smaller and more feasible scale, you can help distribute their database using torrents.

    https://annas-archive.org/torrents

    • empireOfLove2@lemmy.dbzer0.com · English · ↑47 · 1 year ago (edited)

      I know the last time this came up there was a lot of user resistance to the torrent scheme. I’d be willing to seed 200-500 GB, but minimum torrent sizes of 1.5 TB and larger really limit the number of people willing to give up that storage, and they defeat a lot of the resiliency of torrents because it takes so bloody long to get a complete copy. I know that 1.5 TB takes a massive chunk out of my already pretty full NAS, and I passed on seeding the first time for that reason.

      It feels like they didn’t really subdivide the database as much as they should have…

      • maxprime@lemmy.ml · English · ↑26 · 1 year ago

        There are plenty of small torrents. Use the torrent generator and tell the script how much space you have, and it will give you the “best” (least seeded) torrents whose sum is the size you give it. It doesn’t have to be big; even a few GB is enough for some of the smaller torrents.
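
A minimal sketch of the kind of selection described above, assuming a simple greedy rule: sort torrents by seeder count and take each one that still fits in the space budget. This is not the actual Anna's Archive generator; the `Torrent` fields, the `pick_torrents` name and the sample catalog are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Torrent:
    # Hypothetical record; the real torrent list may expose different fields.
    name: str
    size_gb: float
    seeders: int


def pick_torrents(torrents, budget_gb):
    """Greedy pick: least-seeded first, skipping anything that no longer fits."""
    chosen = []
    remaining = budget_gb
    # Ties on seeder count are broken by taking the smaller torrent first.
    for t in sorted(torrents, key=lambda t: (t.seeders, t.size_gb)):
        if t.size_gb <= remaining:
            chosen.append(t)
            remaining -= t.size_gb
    return chosen


# Made-up catalog, purely for illustration.
catalog = [
    Torrent("a", 1500, 2),
    Torrent("b", 300, 3),
    Torrent("c", 40, 10),
    Torrent("d", 120, 4),
    Torrent("e", 5, 1),
]

picked = pick_torrents(catalog, budget_gb=500)
for t in picked:
    print(f"{t.name}: {t.size_gb} GB, {t.seeders} seeders")
```

With a 500 GB budget, the 1.5 TB torrent is skipped even though it is poorly seeded, which is the tension the replies below point out: a greedy fit cannot help the largest torrents unless someone has that much space.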

        • empireOfLove2@lemmy.dbzer0.com · English · ↑22 · 1 year ago (edited)

          Almost all the small torrents that I see pop up are already seeded relatively well (~10 seeders) though, which reinforces the fact that A. the torrents most desperately needing seeders are the older, largest ones and B. large torrents don’t attract seeders because of unreasonable space requirements.

          Admittedly, newer torrents seem to be split into pieces of 300 GB or less, which is good, but there are still a lot of monster torrents in that list.

    • GravitySpoiled@lemmy.ml · English · ↑7 · 1 year ago

      Thx.

      Do you know how useful it is to host such a torrent? Who is accessing the content via that torrent?

      • maxprime@lemmy.ml · English · ↑7 · 1 year ago

        Anyone who wants to. I think a lot of LLM trainers access them.

        • GravitySpoiled@lemmy.ml · English · ↑1 · 1 year ago

          Doesn’t sound like I should host some of it. I’d be more down to host it for end users.

  • HeartyOfGlass@lemm.ee · English · ↑26 · 1 year ago

    Could anyone broad-stroke the security requirements for something like this? Looks like they’ll pay for hosting up to a certain amount, and between that and a pipeline to keep the mirror updated I’d think it wouldn’t be tough to get one up and running.

    Just looking for theory - what are the logistics behind keeping a mirror like this secure?

    • thanksforallthefish@literature.cafe · English · ↑21 · 1 year ago (edited)

      Could be worth asking on selfhosted (how do I link a sub on Lemmy?). They probably have more relevant experience with this sort of thing.

      Edit: Does this work?

      https://lemmy.world/c/selfhosted

      • can@sh.itjust.works · English · ↑19 · 1 year ago

        !selfhosted@lemmy.world might work for more people.

      • rufus@discuss.tchncs.de · English · ↑12 · 1 year ago (edited)

        [email protected]

        Is probably more suitable. I’d be interested in the total size, though.

        • catloaf@lemm.ee · English · ↑4 · 1 year ago

          900 TB, according to other comments here.

          • Illecors@lemmy.cafe · English · ↑1 · 1 year ago

            Is it an all-or-nothing sort of deal?

            • catloaf@lemm.ee · English · ↑2 · 1 year ago

              There are partial torrents, also according to the other comments.

      • Spunky Monkey@lemm.ee · English · ↑5 · 1 year ago

        It does. 😉

    • obviouspornalt · English · ↑15 · 1 year ago

      They outline it pretty well here:

      https://annas-blog.org/how-to-run-a-shadow-library.html

      • tsonfeir@lemm.ee · English · ↑5 · 1 year ago

        This is a fascinating read

  • ☂️-@lemmy.ml · English · ↑25 · 2 days ago (edited)

    deleted by creator

    • xrtxn@lemmy.sdf.org · English · ↑48 · 1 year ago

      The selection is literally all books that can be found on the internet.

      • tsonfeir@lemm.ee · English · ↑13 · 1 year ago

        So how big is that?

        • Index@feddit.nl · English · ↑32 · 1 year ago

          According to their total dataset size excluding duplicates, over 900 TB

          • rufus@discuss.tchncs.de · English · ↑17 · 1 year ago

            Sure, that’s a bit more than $65,000 per year with Backblaze.
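
As a rough check of that figure, the arithmetic can be sketched as follows. It assumes a flat storage price of $6 per TB per month, which is an assumption about Backblaze B2 list pricing at the time and ignores egress and transaction fees, and it uses the 913.1 TB total quoted elsewhere in this thread.

```python
# Rough yearly storage cost for the full dataset.
DATASET_TB = 913.1             # total excluding duplicates, as quoted in the thread
PRICE_PER_TB_MONTH_USD = 6.0   # assumed flat list price, not a confirmed quote
MONTHS_PER_YEAR = 12

annual_cost_usd = DATASET_TB * PRICE_PER_TB_MONTH_USD * MONTHS_PER_YEAR
print(f"Estimated storage cost: ${annual_cost_usd:,.0f} per year")
```

Under those assumptions the result lands a little above $65,000 per year, in line with the estimate above. Bandwidth for actually serving downloads would come on top of that.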

          • tsonfeir@lemm.ee · English · ↑12 · 1 year ago

            Shit, my synology has more than that… alas, it is full of movie “archives”

            • state_electrician@discuss.tchncs.de · English · ↑18 · 1 year ago

              You run a petabyte Synology at home?

              • tsonfeir@lemm.ee · English · ↑7 · 1 year ago

                Well, it’s not just a single synology, it’s got a bunch of expansion units, and there are multiple host machines.

            • dutchkimble@lemy.lol · English · ↑6 · 1 year ago

              I’m guessing you’re talking GBs?

              • tsonfeir@lemm.ee · English · ↑8 · 1 year ago

                Nope.

                • dutchkimble@lemy.lol · English · ↑2 · 1 year ago

                  That’s awesome. How many drives do you have, and of what sizes? Also, why Synology instead of an enterprise-grade solution at this point?

              • FigMcLargeHuge@sh.itjust.works · English · ↑3 · 1 year ago (edited)

                They put a link in with the total…

                Total (excluding duplicates): 133,708,037 files, 913.1 TB

            • ☂️-@lemmy.ml · English · ↑0 · 1 year ago (edited)

              deleted by creator

              • tsonfeir@lemm.ee · English · ↑7 · 1 year ago

                It’s an investment. It’s like the price of a small car. But it was built over time, so not like one lump sum.

                Originally, it was to have easier access to my already insane Blu-ray collection. But I started getting discs from Redbox, rental stores, libraries, etc. They are full rips, not that compressed PB stuff. Now there are like 3000 movies and fuck knows how many TV shows.

                A lot of my effort was to have the best release available. Or, have things that got canceled. Like the Simpsons episode with MJ, which is unavailable to stream.

                Snags… well, Synology is sooo easy. Once you figure out how you want your drives set up, there’s nothing to it.

                Whatever you do, always have redundant drives. Yes, you lose space, but eventually one of them is gonna die and you don’t want to lose data.

                • redcalcium@lemmy.institute · English · ↑11 · 1 year ago

                  You should write a will instructing your family to send those disks to the Internet Archive for preservation if something happens to you.

          • AmbiguousProps@lemmy.today · English · ↑5 · 1 year ago

            Correct me if I’m wrong, but they only index shadow libraries and do not host any files themselves (unless you count the torrents). So, you don’t need 900+ TB of storage to create a mirror.

        • FreudianCafe@lemmy.ml · English · ↑6 · 1 year ago

          I guess more than 5?

        • Pussista@sh.itjust.works · English · ↑4 · 1 year ago

          I imagine a couple of terabytes at the very least, though I could be underestimating how many books have been de-DRMed so far.

          • tsonfeir@lemm.ee · English · ↑4 · 1 year ago

            Apparently it’s 900TB

            • Pussista@sh.itjust.works · English · ↑9 · 1 year ago

              Girl, what? No wonder they’re having trouble hosting their archive. Does Anna’s Archive host copyrighted content as well or is all that copyleft?

              • redcalcium@lemmy.institute · English · ↑12 · 1 year ago

                They host academic papers and books, most of which are copyrighted. They recently got in trouble for scraping a book metadata service to generate a list of books that haven’t been archived yet: https://torrentfreak.com/lawsuit-accuses-annas-archive-of-hacking-worldcat-stealing-2-2-tb-data-240207/

                • Pussista@sh.itjust.works · English · ↑2 · 1 year ago

                  Is hosting all that stuff even legal? I mean, they’re not making any money off of it, but they’re still a “piracy” hub. How have they survived this long?

                • AmbiguousProps@lemmy.today · English · ↑2 · 1 year ago (edited)

                  They index, not host, no? (Unless you count the torrents, which are distributed)

              • smnwcj@fedia.io · ↑4 · 1 year ago

                The archive includes copyrighted works. Often multiple copies of each work, across different formats.

      • spiderman@ani.social · English · ↑2 · 1 year ago (edited)

        Bigger than zlib or Project Gutenberg?

    • redcalcium@lemmy.institute · English · ↑14 · 1 year ago

      It is huge! They claimed to have preserved about 5% of the world’s books.

      • ☂️-@lemmy.ml · English · ↑1 · 1 year ago

        deleted by creator

  • Vigilante@lemmy.today · English · ↑15 · 1 year ago

    Also link any ways to donate if they’re accepting that.

    • Andromxda 🇺🇦🇵🇸🇹🇼@lemmy.dbzer0.com (OP, mod) · English · ↑16 · 1 year ago

      https://annas-archive.org/donate

  • matcha_addict@lemy.lol · English · ↑8 · 1 year ago

    I had no idea about this project. Is it like a better search engine for libgen etc?

    • weirdo_from_space@sh.itjust.works · English · ↑19 · 1 year ago

      It searches through the Libgen mirrors and Z-Library, and has its own mirrors of the files they serve on top of that. I think it was created as a response to Z-Library’s domain getting seized, but I could be wrong.

    • Andromxda 🇺🇦🇵🇸🇹🇼@lemmy.dbzer0.com (OP, mod) · English · ↑12 · 1 year ago

      It has way more content than Libgen
