Noblis Awarded Patent for Secure Communication of Sensitive Genomic Information Using Probabilistic Data Structures

PATENT NO: 11,676,683

There is a need for methods of processing, storing, and transferring genomic information such that the information may be secured and anonymized while still being able to be meaningfully processed and analyzed for genomic data analytics. Our patented technique addresses this need.

In some embodiments, genomic information in a secure computing environment may be encoded and/or anonymized by building a probabilistic data structure that represents sub-strings of the genomic information as members of a set; the probabilistic data structure may then be securely transmitted outside the secure computing environment. In some embodiments, a probabilistic data structure representing sub-strings of sensitive genomic information as members of a set may be received in an unsecure computing environment and may be queried to generate output data indicating whether reference sub-strings are probable members of the set. In some embodiments, querying the probabilistic data structure, and other techniques of analyzing the probabilistic data structure, may be used to determine whether the sensitive genomic information corresponds to an organism associated with the reference genomic information.