I'm thinking about the combination of parallel construction and Bloom filters.
Parallel construction: the methods we used to get this data were completely illegal, but now we have an answer and we can use the answer to go find a chain of evidence that we can use in court. So much easier!
Bloom filters: I'm not allowed to store this data permanently, but I can look at it in passing and can record a little metadata. A Bloom filter stores a factoid about a piece of data having been seen, being able to give definite No answers but merely high probabilities of Yes answers.
Oh, and "machine learning models are reifications of bias, known and unknown".
no subject
Parallel construction: the methods we used to get this data were completely illegal, but now we have an answer and we can use the answer to go find a chain of evidence that we can use in court. So much easier!
Bloom filters: I'm not allowed to store this data permanently, but I can look at it in passing and can record a little metadata. A Bloom filter stores a factoid about a piece of data having been seen, being able to give definite No answers but merely high probabilities of Yes answers.
Oh, and "machine learning models are reifications of bias, known and unknown".
None of it is especially reassuring.