• @[email protected]
    link
    fedilink
    English
    -29 months ago

    You can anonymize anything, you just strip the identifying data out and do not include it when you give the info out. Like scrubbing the metadata off a pic before you post it online.

    The level of discourse in this sub has really fallen off a cliff recently…

      • ares35
        link
        fedilink
        79 months ago

        and, given enough samples, even those NOT in the database, are anyway by genetic relation.

      • @[email protected]
        link
        fedilink
        English
        19 months ago

        By that logic you cannot anonymize a pic either. Yet everyone who has their photo taken cannot necessarily be identified in it.

        • Bipta
          link
          fedilink
          29 months ago

          Anonymized data has long been problematic and you definitely cannot meaningfully anonymize a picture in the truest sense of the word.

        • @[email protected]
          link
          fedilink
          1
          edit-2
          9 months ago

          Can’t show a proof without doxxing me but I’ve written a patent to anonymize medical data (not genetic) and I’m a bioinformatician working with sequencing data.

          While you could probably achieve reasonable privacy levels by altering genetic data, we shouldn’t play with that under fallacious pretenses.

          You can use that data for medical research, of course… but also population profiling or stratification of customers if you are an insurance company.

    • @[email protected]
      link
      fedilink
      59 months ago

      So, paternity tests exist. Genetics, once enough data are collected, can absolutely be used to identify people.

      • @[email protected]
        link
        fedilink
        English
        39 months ago

        Correct. Much like with a fingerprint, with both examples, you can determine with good accuracy if they are the same.

        In addition, as tools are advancing, we can extrapolate with very large data sets to identify you even without having seen your specific code, by tracing commonalities through any relatives of yours that voluntarily or involuntarily submitted their codes. However, this does still require those codes to be identifiable. A randomized set of random people’s codes could not be used for this, anymore than a database of fingerprints where all the labels were deleted could be used. It’s just a bunch of random fingerprints, with no names attached anywhere at all, its just a bunch of bleh.

        So, big concern in the hands of anyone who has not scrubbed the labels all off, which would overall render it much less useful.

    • @cactus
      link
      English
      49 months ago

      And a lot of “anonymized” data can actually be deanonymized again

      • @[email protected]
        link
        fedilink
        English
        2
        edit-2
        9 months ago

        True. Another guy also pointed out a pic cannot be truly, fully anonymized, and this is also true. Plenty of nuance in the issue for sure. But it is still, nonetheless, possible to render a large genetic database harmless, if proper precautions are taken.

        edit: Harmless was a poor word. I really just meant it can be rendered useless for identifying specific individuals, and only really able to provide info on broad population trends. If proper precautions are taken, which they are not currently being taken. So, that’s not good.