
Conversation

@itrajanovska (Contributor)

We need group-wise risks to be able to assess the fairness of the assigned risks within groups.
Successor of #48, implementing the computation of the group-wise risks (as well as passing a custom ML model).
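
A rough usage sketch of the new group-wise API, pieced together from the code under review (ori, syn, control, aux_cols and secret are placeholders assumed to be prepared elsewhere; the argument values are made up):

evaluator = InferenceEvaluator(ori=ori, syn=syn, control=control,
                               aux_cols=aux_cols, secret=secret, n_attacks=500)
evaluator.evaluate(n_jobs=-2)

# New in this PR: per-group risks, one entry per group of targets sharing the
# same value of the secret attribute.
for group, (results, risk) in evaluator.risk_for_groups(confidence_level=0.95).items():
    print(group, risk)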

@itrajanovska (Contributor, Author)

@MatteoGiomi After rebasing I deleted the branch because of some conflicts, and #52 was automatically closed.

@MatteoGiomi (Member) left a comment

Thanks for this second contribution @itrajanovska! I left a few comments for your consideration.


if naive:
    guesses = syn.sample(n_attacks)[secret]
    guesses.index = targets.index
@MatteoGiomi (Member)

NIT: use reindex_like:

Suggested change
-    guesses.index = targets.index
+    guesses = guesses.reindex_like(targets)

(untested)
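
For reference, a toy pandas illustration of the difference between overwriting the index and reindex_like, which aligns by index labels; whether the two behave the same here depends on the index returned by syn.sample:

import pandas as pd

targets = pd.Series([0, 1, 2], index=[10, 11, 12])
guesses = pd.Series(["a", "b", "c"], index=[0, 1, 2])

# Overwriting the index keeps the sampled values and just relabels them.
relabelled = guesses.copy()
relabelled.index = targets.index         # "a", "b", "c" at labels 10, 11, 12

# reindex_like aligns by labels; labels absent from `guesses` become NaN.
aligned = guesses.reindex_like(targets)  # NaN at labels 10, 11, 12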

Comment on lines 42 to 43
if not guesses.index.equals(targets.index):
    raise RuntimeError("The predictions indices do not match the target indices. Check your inference model.")
@MatteoGiomi (Member)

I suggest moving this check inside evaluate_inference_guesses, since it tests for a condition that is necessary for the evaluation. This way we can also add a test in test_inference_evaluator to check that the exception is correctly raised.
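
A minimal sketch of what that could look like; the exact signature of evaluate_inference_guesses and the test layout are assumptions, and the returned comparison is only a placeholder for the real evaluation logic:

import pandas as pd
import pytest

def evaluate_inference_guesses(guesses: pd.Series, targets: pd.Series, regression: bool = False) -> pd.Series:
    # The guesses must share the targets' index, otherwise the element-wise
    # comparison performed by the evaluation is meaningless.
    if not guesses.index.equals(targets.index):
        raise RuntimeError("The predictions indices do not match the target indices. "
                           "Check your inference model.")
    # ... real evaluation logic goes here ...
    return guesses == targets  # placeholder

# In test_inference_evaluator:
def test_evaluate_inference_guesses_raises_on_index_mismatch():
    targets = pd.Series([1, 2, 3], index=[0, 1, 2])
    guesses = pd.Series([1, 2, 3], index=[10, 11, 12])
    with pytest.raises(RuntimeError):
        evaluate_inference_guesses(guesses, targets)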

    aux_cols: list[str],
    secret: str,
-   regression: Optional[bool] = None,
+   regression: bool = False,
@MatteoGiomi (Member)

💯

None if self._control is None else self._attack(target=self._control, naive=False, n_jobs=n_jobs,
                                                n_attacks=self._n_attacks_control)
)
# n_attacks is effective here
@MatteoGiomi (Member)

This and the following comments are a bit cryptic; it would be good to flesh them out a little to make them clearer.

@itrajanovska (Contributor, Author)

This is an old debugging comment; it is no longer relevant with the new logic.

    return results.risk(baseline=baseline)

def risk_for_groups(self, confidence_level: float = 0.95) -> dict[str, tuple[EvaluationResults, PrivacyRisk]]:
    """Compute the attack risks on a group level, for every unique value of `self._data_groups`.
@MatteoGiomi (Member)

Suggested change
"""Compute the attack risks on a group level, for every unique value of `self._data_groups`.
"""Compute the inference risk for each group of targets with the same value of the secret attribute.

or similar, since self._data_groups does not exist anymore.

Comment on lines 300 to 301
if not self._evaluated:
    self.evaluate(n_jobs=-2)
@MatteoGiomi (Member)

I would raise here, just as we do for the non-grouped case.
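
A minimal sketch of what I mean, as a replacement for the two lines quoted above (the exception type and the message wording are just suggestions):

if not self._evaluated:
    raise RuntimeError("The attack has not been evaluated yet. "
                       "Call evaluate() before computing the group-wise risks.")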

n_attacks_ori = len(data_ori)

# Count the number of successful attacks
n_success = evaluate_inference_guesses(
@MatteoGiomi (Member)

Re the other comment: here you are not checking for consistent indexes. That check would come for free if it is moved into evaluate_inference_guesses.

# Get the targets for the current group
common_indices = data_ori.index.intersection(self._guesses_success.index)
# Get the guesses for the current group
data_ori = data_ori.loc[common_indices]
@MatteoGiomi (Member)

Here you are overwriting an object. Better to use a different name:

Suggested change
-data_ori = data_ori.loc[common_indices]
+target_group = data_ori.loc[common_indices]

or something.

n_attacks_control = -1

# Recreate the EvaluationResults for the current group
assert n_attacks_ori == n_success
@MatteoGiomi (Member)

Why this check here? Is this assertion always passing? n_success is the sum of a boolean array of length n_attacks_ori, so the assertion should pass only if all the elements of the array (the result of evaluate_inference_guesses) are true. I don't understand; maybe I am missing something.
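
A tiny numeric illustration of this point (the values are made up):

import numpy as np

correct = np.array([True, True, False, True])  # hypothetical output of evaluate_inference_guesses
n_attacks_ori = len(correct)                   # 4
n_success = int(correct.sum())                 # 3
print(n_attacks_ori == n_success)              # False: equal only when every guess is correct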

@itrajanovska (Contributor, Author)

I think this is an incorrect assumption. I don't remember why I tested for this, but it looks more like a buggy line than an intentional check.

def test_inference_evaluator_leaks(aux_cols, secret):
    ori = get_adult("ori", n_samples=10)
    evaluator = InferenceEvaluator(ori=ori, syn=ori, control=ori, aux_cols=aux_cols, secret=secret, n_attacks=10)
    ori = ori.drop_duplicates(subset=aux_cols)
@MatteoGiomi (Member)

As a bonus, it would be good to have this ability built into the get_adult function, e.g. by adding a deduplicate_on parameter which accepts a list of columns.
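
A rough sketch of that idea; the existing get_adult signature is guessed from the test above, and _load_adult_split is a hypothetical stand-in for however the data is actually read:

from typing import Optional
import pandas as pd

def get_adult(split: str, n_samples: Optional[int] = None,
              deduplicate_on: Optional[list[str]] = None) -> pd.DataFrame:
    # Existing behaviour (sketched): load the requested split and optionally
    # subsample it. _load_adult_split is a placeholder, not the real loader.
    df = _load_adult_split(split)
    if n_samples is not None:
        df = df.head(n_samples)
    # Proposed addition: drop rows duplicated on the given columns, so tests
    # like the one above don't have to deduplicate by hand.
    if deduplicate_on is not None:
        df = df.drop_duplicates(subset=deduplicate_on)
    return df

The test above could then simply call get_adult("ori", n_samples=10, deduplicate_on=aux_cols).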
