Amazon Discussions

Exam AWS Certified AI Practitioner AIF-C01 All Questions

View all questions & answers for the AWS Certified AI Practitioner AIF-C01 exam

Go to Exam

Exam AWS Certified AI Practitioner AIF-C01 topic 1 question 86 discussion

Exam question from Amazon's AWS Certified AI Practitioner AIF-C01

Question #: 86
Topic #: 1

[All AWS Certified AI Practitioner AIF-C01 Questions]

Which prompting attack directly exposes the configured behavior of a large language model (LLM)?

A. Prompted persona switches
B. Exploiting friendliness and trust
C. Ignoring the prompt template
D. Extracting the prompt template

Show Suggested Answer

Suggested Answer: D 🗳️

by aws_Tamilan at Dec. 28, 2024, 4:04 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

Submit Cancel

Rcosmos

1 month, 1 week ago

Selected Answer: D

A resposta correta é D. Extraindo o modelo de prompt. Esse ataque ocorre quando um usuário consegue obter partes ou até mesmo o texto completo do prompt interno usado para configurar um modelo de linguagem grande (LLM). Esse prompt pode conter instruções, regras e configurações que influenciam o comportamento do modelo. Se um atacante descobrir esses detalhes, ele pode ajustar suas perguntas para manipular as respostas do modelo ou explorar vulnerabilidades na configuração.

upvoted 1 times

...

Rcosmos

2 months, 3 weeks ago

Selected Answer: D

Explicação: Esse tipo de ataque é conhecido como prompt extraction (extração de prompt). Ele tem como objetivo revelar o prompt-base ou instruções internas utilizadas para orientar o comportamento do LLM. Isso pode incluir regras, identidade fictícia, políticas de segurança, entre outros aspectos que definem como o modelo deve se comportar. Esse ataque expõe diretamente a configuração do modelo, tornando vulneráveis as proteções e instruções projetadas para garantir respostas seguras ou neutras.

upvoted 1 times

...

kopper2019

4 months, 2 weeks ago

D. Extracting the prompt template

upvoted 1 times

...

Jessiii

4 months, 3 weeks ago

Selected Answer: D

Extracting the prompt template refers to a situation where the attacker tries to reveal or access the underlying structure or instructions used to configure the behavior of the large language model (LLM). This type of attack can expose how the model has been trained or how it responds to certain inputs, effectively giving the attacker insight into how the LLM has been directed to generate responses. This type of attack could potentially lead to misuse, such as causing the model to behave in unintended ways, or even allow an attacker to manipulate the behavior of the model by crafting specific inputs based on the extracted prompt template.

upvoted 2 times

...

dspd

5 months, 1 week ago

Selected Answer: D

D. Extracting the prompt template

upvoted 1 times

...

AzureDP900

5 months, 1 week ago

Selected Answer: B

B. Exploiting friendliness and trust Exploiting friendliness and trust involves manipulating the LLM to respond in a way that appears friendly or trustworthy, potentially causing it to deviate from its intended behavior. This type of attack directly exposes how the LLM has been configured to interact with users, often leading it to provide information or make decisions that align more closely with the attacker's intentions rather than its original programming.

upvoted 1 times

...

Moon

6 months ago

Selected Answer: D

D: Extracting the prompt template Explanation: Extracting the prompt template is a prompting attack where an attacker intentionally crafts inputs to reveal the underlying configuration or instructions (prompt template) used to guide the large language model (LLM). This exposes the internal behavior or design of the model, potentially revealing sensitive or proprietary information about how the LLM is configured. Why not the other options? A: Prompted persona switches: This attack involves manipulating the LLM to adopt a different persona or role than intended but does not directly expose the prompt template.

upvoted 2 times

...

aws_Tamilan

6 months, 1 week ago

Selected Answer: D

D. Extracting the prompt template Explanation: Extracting the prompt template is a prompting attack where the attacker directly attempts to reveal the underlying configured behavior or instructions of the large language model (LLM). This can expose sensitive configurations, system instructions, or contextual prompts that guide the model's behavior.

upvoted 1 times

...