GitHub - ve1nard/safeCode: SafeCode: Securing LLM Generated Code

Prefix-Tuning for CodeQwen and DeepSeek

1. Download the SVEN Repository

Begin by cloning the official SVEN repository:

git clone https://github.com/eth-sri/sven
cd sven

2. Apply Prefix-Tuning Modifications

Navigate to the sven-codeQwen+deepseek directory to view the modified files.
Then, apply these modifications to the corresponding files in the sven repository.

3. Instruction Tuning

The instruction tuning repo can be found here : https://github.com/eth-sri/SafeCoder

Cascading

1. Data sets

Navigate to data/cascading to familiarize yourself with validation and evaluation datasets for cascading experiments as well as our results for the best cascading schemes.

2. Reproducing results

Navigate to src/cascading directory to gen_ans_test.py script to generate answers and tests for all possible hyperparameter combinations. Next, run select_ans_tests.py to simulate the cascading pipeline with different configurations to produce security and cost results for each of them. Afterwards, using check_parameter_combs.py select the best combinations according to the security and cost scores. To see which of those points are Pareto-optimal, run get_pareto_points.py. After finding Pareto-optimal points, you can visualize them together with the rest of the configurations using plot_thresholds.py. Select the best points of your interest for each threshold and indicate them in data/cascading/outputs/chosen_hyperparameters.json. To perform testing on the evaluation split, run test_parameters.py and see the results in data/cascading/outputs/testing_results_new_method/testing_results.json.

In-Context Learning

The sven_seccoder directory should be cloned, and the instructions therein followed to reproduce results. In-context functionality was added to the exisitng code structure so the results can be reproduced in exactly the same way as SVEN.

Agentic Inference

The sven_agentic directory should be cloned, and the environment set up as in SVEN. To use agentic inference, run the agentic_eval script followed by print_agentic_results to run the experiments and observe the results, respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
data/cascading		data/cascading
instruction_tuning		instruction_tuning
safecoder_seccoder		safecoder_seccoder
src/cascading		src/cascading
sven-codeQwen+deepseek		sven-codeQwen+deepseek
sven_agentic		sven_agentic
sven_seccoder		sven_seccoder
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Prefix-Tuning for CodeQwen and DeepSeek

1. Download the SVEN Repository

2. Apply Prefix-Tuning Modifications

3. Instruction Tuning

Cascading

1. Data sets

2. Reproducing results

In-Context Learning

Agentic Inference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Uh oh!

Languages

ve1nard/safeCode

Folders and files

Latest commit

History

Repository files navigation

Prefix-Tuning for CodeQwen and DeepSeek

1. Download the SVEN Repository

2. Apply Prefix-Tuning Modifications

3. Instruction Tuning

Cascading

1. Data sets

2. Reproducing results

In-Context Learning

Agentic Inference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Uh oh!

Languages

Packages