Skip to main content
Scenario IDScenario NameProbe IDProbe NameDescription
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.LiteratureCloze80_SmallBook Excerpt InsertionTests whether the agent will fill in the middle of a book excerpt. A low score in this test indicates that the agent may reveal training information or violate copyright laws.
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.LiteratureCloze80_Small_char_codeBook Excerpt Insertion with Character Code MutationTests whether the agent will fill in the middle of a book excerpt. A low score in this test indicates that the agent may reveal training information or violate copyright laws. Mutated with Character Code Mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.LiteratureCloze80_Small_embed_jsonBook Excerpt Insertion with Embed JSON MutationTests whether the agent will fill in the middle of a book excerpt. A low score in this test indicates that the agent may reveal training information or violate copyright laws. Mutated with Embed JSON Mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.8kEnronLeakEmail_SmallEmails Leakage - EnronTests whether an agent will reveal email addresses obtained from the discovery process of the Enron investigation. A low score in this test indicates that the agent may reveal general PII that might be present in its training data.
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.leakreplay.GuardianClozeMedia Excerpt Insertion - GuardianTests whether the model will fill in the middle of an excerpt from The Guardian
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.NYTCloze_SmallMedia Excerpt Insertion - NYTTests whether an agent will fill in the middle of an excerpt from the New York Times. A low score in this test indicates that the agent may reveal training information or violate copyright laws.
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_leakage_resistance_diacriticsData Leakage Resistance / DiacriticsTests for data leakage resistance using diacritics mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_leakage_resistance_underlineData Leakage Resistance / UnderlineTests for data leakage resistance using underline mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_leakage_resistance_logical_appealData Leakage Resistance / Logical AppealTests for data leakage resistance using logical appeal mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_leakage_resistance_soft_mutationData Leakage Resistance / Soft MutationTests for data leakage resistance using soft mutation mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_leakage_resistance_embed_xmlData Leakage Resistance / Embed XmlTests for data leakage resistance using embed xml mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_privacy_soft_mutationData Privacy / Soft MutationTests for data privacy using soft mutation mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_privacy_embed_jsonData Privacy / Embed JsonTests for data privacy using embed json mutation
vijil.scenarios.security_confidentiality_dataprivacyData Privacyvijil.probes.confidentiality.data_privacy_math_problemData Privacy / Math ProblemTests for data privacy using math problem mutation
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.pii_query.embed_jsonPII Leakage Probe: Embed JSON MutationTests for PII leakage without jailbreaks. Response fails if it reveals any PII. Mutated with Embed JSON mutation.
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.confidentiality.user_privacy_uppercaseUser Privacy / UppercaseTests for user privacy using uppercase mutation
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.confidentiality.user_privacy_homoglyphUser Privacy / HomoglyphTests for user privacy using homoglyph mutation
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.confidentiality.user_privacy_diacriticsUser Privacy / DiacriticsTests for user privacy using diacritics mutation
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.confidentiality.user_privacy_fullwidthUser Privacy / FullwidthTests for user privacy using fullwidth mutation
vijil.scenarios.security_confidentiality_userprivacyUser Privacyvijil.probes.confidentiality.user_privacy_embed_xmlUser Privacy / Embed XmlTests for user privacy using embed xml mutation
vijil.scenarios.security_confidentiality_modelprivacyModel Privacyvijil.probes.confidentiality.model_privacy_lowercaseModel Privacy / LowercaseTests for model privacy using lowercase mutation
vijil.scenarios.security_confidentiality_modelprivacyModel Privacyvijil.probes.confidentiality.model_privacy_uppercaseModel Privacy / UppercaseTests for model privacy using uppercase mutation
vijil.scenarios.security_confidentiality_modelprivacyModel Privacyvijil.probes.confidentiality.model_privacy_zero_widthModel Privacy / Zero WidthTests for model privacy using zero width mutation
vijil.scenarios.security_confidentiality_modelprivacyModel Privacyvijil.probes.confidentiality.model_privacy_diacriticsModel Privacy / DiacriticsTests for model privacy using diacritics mutation
vijil.scenarios.security_confidentiality_modelprivacyModel Privacyvijil.probes.confidentiality.model_privacy_soft_mutationModel Privacy / Soft MutationTests for model privacy using soft mutation mutation
vijil.scenarios.integrityintegrityvijil.probes.integrity.adversarial_robustness_spacesAdversarial Robustness / SpacesTests for adversarial robustness using spaces mutation
vijil.scenarios.integrityintegrityvijil.probes.integrity.adversarial_robustness_underlineAdversarial Robustness / UnderlineTests for adversarial robustness using underline mutation
vijil.scenarios.integrityintegrityvijil.probes.integrity.adversarial_robustness_logical_appealAdversarial Robustness / Logical AppealTests for adversarial robustness using logical appeal mutation
vijil.scenarios.integrityintegrityvijil.probes.integrity.adversarial_robustness_misrepresentationAdversarial Robustness / MisrepresentationTests for adversarial robustness using misrepresentation mutation
vijil.scenarios.integrityintegrityvijil.probes.integrity.adversarial_robustness_soft_mutationAdversarial Robustness / Soft MutationTests for adversarial robustness using soft mutation mutation
vijil.scenarios.availabilityavailabilityvijil.probes.availability.denial_of_service_resistance_uppercaseDenial Of Service Resistance / UppercaseTests for denial of service resistance using uppercase mutation
vijil.scenarios.availabilityavailabilityvijil.probes.availability.denial_of_service_resistance_zero_widthDenial Of Service Resistance / Zero WidthTests for denial of service resistance using zero width mutation
vijil.scenarios.availabilityavailabilityvijil.probes.availability.denial_of_service_resistance_diacriticsDenial Of Service Resistance / DiacriticsTests for denial of service resistance using diacritics mutation
vijil.scenarios.availabilityavailabilityvijil.probes.availability.denial_of_service_resistance_spacesDenial Of Service Resistance / SpacesTests for denial of service resistance using spaces mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malware_generation_fullwidthMalware Generation / FullwidthTests for malware generation using fullwidth mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malware_generation_authority_endorsementMalware Generation / Authority EndorsementTests for malware generation using authority endorsement mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malware_generation_logical_appealMalware Generation / Logical AppealTests for malware generation using logical appeal mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malware_generation_soft_mutationMalware Generation / Soft MutationTests for malware generation using soft mutation mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malware_generation_embed_jsonMalware Generation / Embed JsonTests for malware generation using embed json mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.exploit_generation_fullwidthExploit Generation / FullwidthTests for exploit generation using fullwidth mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.exploit_generation_authority_endorsementExploit Generation / Authority EndorsementTests for exploit generation using authority endorsement mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.exploit_generation_logical_appealExploit Generation / Logical AppealTests for exploit generation using logical appeal mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.exploit_generation_misrepresentationExploit Generation / MisrepresentationTests for exploit generation using misrepresentation mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.exploit_generation_soft_mutationExploit Generation / Soft MutationTests for exploit generation using soft mutation mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malicious_tool_generation_authority_endorsementMalicious Tool Generation / Authority EndorsementTests for malicious tool generation using authority endorsement mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malicious_tool_generation_logical_appealMalicious Tool Generation / Logical AppealTests for malicious tool generation using logical appeal mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malicious_tool_generation_misrepresentationMalicious Tool Generation / MisrepresentationTests for malicious tool generation using misrepresentation mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malicious_tool_generation_embed_jsonMalicious Tool Generation / Embed JsonTests for malicious tool generation using embed json mutation
vijil.scenarios.security_integrity_manipulationresistanceManipulation Resistancevijil.probes.malicious_code.malicious_tool_generation_embed_xmlMalicious Tool Generation / Embed XmlTests for malicious tool generation using embed xml mutation
Last modified on June 2, 2026