<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Guardrails on Break Lib Site</title>
        <link>https://break.mlibia.xyz/tags/guardrails/</link>
        <description>Recent content in Guardrails on Break Lib Site</description>
        <generator>Hugo -- gohugo.io</generator>
        <language>en</language>
        <copyright>Lib Copyright</copyright>
        <lastBuildDate>Sun, 03 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://break.mlibia.xyz/tags/guardrails/index.xml" rel="self" type="application/rss+xml" /><item>
        <title>Overcoming LLM gender bias</title>
        <link>https://break.mlibia.xyz/p/overcoming-llm-gender-bias/</link>
        <pubDate>Sun, 03 May 2026 00:00:00 +0000</pubDate>
        
        <guid>https://break.mlibia.xyz/p/overcoming-llm-gender-bias/</guid>
        <description>&lt;p&gt;&lt;strong&gt;Description&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;I made this post is to summarize my learning about security set up Foundational Models of AWS Bedrock.
I was living almost one month before where i started  thinking in security in AI so now i am training for AIF-CO1 certification. I was surprise about guardrails so i decided to make an experiment to filter gender bias.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i1.jpg&#34;
	width=&#34;630&#34;
	height=&#34;420&#34;
	srcset=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i1_hu9218297684969550490.jpg 480w, https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i1_hu3439343307651708119.jpg 1024w&#34;
	loading=&#34;lazy&#34;
	
		alt=&#34;chatgpt&#34;
	
	
		class=&#34;gallery-image&#34; 
		data-flex-grow=&#34;150&#34;
		data-flex-basis=&#34;360px&#34;
	
&gt;&lt;/p&gt;
&lt;p&gt;Well some phrases can be tricky like:&lt;/p&gt;
&lt;p&gt;Emotional &amp;amp; Behavioral Stereotypes:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&amp;ldquo;X are too emotional and dramatic.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&amp;ldquo;X are passive and submissive.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&amp;ldquo;X is too aggressive/bossy for a Y(she/he).&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;requirements&#34;&gt;Requirements
&lt;/h3&gt;&lt;ul&gt;
&lt;li&gt;AWS account.&lt;/li&gt;
&lt;li&gt;AWS User Bedrock.&lt;/li&gt;
&lt;li&gt;&lt;a class=&#34;link&#34; href=&#34;https://docs.aws.amazon.com/bedrock/latest/userguide/guardrails-permissions.html&#34;  target=&#34;_blank&#34; rel=&#34;noopener&#34;
    &gt;AWS policies for guardrails and Bedrock&lt;/a&gt;.
&lt;a class=&#34;link&#34; href=&#34;https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-examples.html&#34;  target=&#34;_blank&#34; rel=&#34;noopener&#34;
    &gt;link how to set up&lt;/a&gt;: &lt;a class=&#34;link&#34; href=&#34;https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-examples.html&#34;  target=&#34;_blank&#34; rel=&#34;noopener&#34;
    &gt;https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-examples.html&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;steps&#34;&gt;Steps
&lt;/h3&gt;&lt;ol&gt;
&lt;li&gt;Choose a model and try it in the Open Playground.
In my case i am using meta.llama3-8b-instruct-v1:0.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;code&gt;Amazon Bedrock &amp;gt; Model Catalog &amp;gt; Llama 3 8B Instruct&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;Then you will see the chat which you can interact&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i2.jpg&#34;
	width=&#34;1094&#34;
	height=&#34;216&#34;
	srcset=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i2_hu1389501783939327657.jpg 480w, https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i2_hu7864208598553000461.jpg 1024w&#34;
	loading=&#34;lazy&#34;
	
	
		class=&#34;gallery-image&#34; 
		data-flex-grow=&#34;506&#34;
		data-flex-basis=&#34;1215px&#34;
	
&gt;&lt;/p&gt;
&lt;ol start=&#34;2&#34;&gt;
&lt;li&gt;Then, create a script to test the model to check the correct access with the user.&lt;/li&gt;
&lt;/ol&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;div class=&#34;chroma&#34;&gt;
&lt;table class=&#34;lntable&#34;&gt;&lt;tr&gt;&lt;td class=&#34;lntd&#34;&gt;
&lt;pre tabindex=&#34;0&#34; class=&#34;chroma&#34;&gt;&lt;code&gt;&lt;span class=&#34;lnt&#34;&gt; 1
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 2
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 3
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 4
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 5
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 6
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 7
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 8
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 9
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;10
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;11
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;12
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;13
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/td&gt;
&lt;td class=&#34;lntd&#34;&gt;
&lt;pre tabindex=&#34;0&#34; class=&#34;chroma&#34;&gt;&lt;code class=&#34;language-fallback&#34; data-lang=&#34;fallback&#34;&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;client = session.client(&amp;#34;bedrock-runtime&amp;#34;)
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;response = client.invoke_model(
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    modelId=model_id,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    contentType=&amp;#34;application/json&amp;#34;,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    accept=&amp;#34;application/json&amp;#34;,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    body=json.dumps({
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        &amp;#34;prompt&amp;#34;: in_user,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        &amp;#34;max_gen_len&amp;#34;: 200
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    })
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;)
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;print(response[&amp;#34;body&amp;#34;].read())
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;
&lt;/div&gt;
&lt;/div&gt;&lt;p&gt;&lt;img src=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i4.png&#34;
	width=&#34;1249&#34;
	height=&#34;77&#34;
	srcset=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i4_hu5266173050395182168.png 480w, https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i4_hu6422785299713697077.png 1024w&#34;
	loading=&#34;lazy&#34;
	
	
		class=&#34;gallery-image&#34; 
		data-flex-grow=&#34;1622&#34;
		data-flex-basis=&#34;3892px&#34;
	
&gt;&lt;/p&gt;
&lt;ol start=&#34;3&#34;&gt;
&lt;li&gt;Add the guardrails in the model invocation.&lt;/li&gt;
&lt;/ol&gt;
&lt;div class=&#34;highlight&#34;&gt;&lt;div class=&#34;chroma&#34;&gt;
&lt;table class=&#34;lntable&#34;&gt;&lt;tr&gt;&lt;td class=&#34;lntd&#34;&gt;
&lt;pre tabindex=&#34;0&#34; class=&#34;chroma&#34;&gt;&lt;code&gt;&lt;span class=&#34;lnt&#34;&gt; 1
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 2
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 3
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 4
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 5
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 6
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 7
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 8
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt; 9
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;10
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;11
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;12
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;13
&lt;/span&gt;&lt;span class=&#34;lnt&#34;&gt;14
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/td&gt;
&lt;td class=&#34;lntd&#34;&gt;
&lt;pre tabindex=&#34;0&#34; class=&#34;chroma&#34;&gt;&lt;code class=&#34;language-fallback&#34; data-lang=&#34;fallback&#34;&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;bedrock_runtime = session.client(service_name=&amp;#39;bedrock-runtime&amp;#39;)
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;try:
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    response = bedrock_runtime.converse(
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        modelId=model_id,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        messages=conversation,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        guardrailConfig={
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;            &amp;#39;guardrailIdentifier&amp;#39;: guardrail_id,
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;            &amp;#39;guardrailVersion&amp;#39;: guardrail_version
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;        }
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    )
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    print(response[&amp;#39;output&amp;#39;][&amp;#39;message&amp;#39;][&amp;#39;content&amp;#39;][0][&amp;#39;text&amp;#39;])
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    print(&amp;#34;Guardrail Action:&amp;#34;, response[&amp;#39;guardrailAction&amp;#39;])
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;except Exception as e:
&lt;/span&gt;&lt;/span&gt;&lt;span class=&#34;line&#34;&gt;&lt;span class=&#34;cl&#34;&gt;    print(e)
&lt;/span&gt;&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;
&lt;/div&gt;
&lt;/div&gt;&lt;p&gt;note: i am using the function converse.&lt;/p&gt;
&lt;p&gt;&lt;img src=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i3.png&#34;
	width=&#34;1265&#34;
	height=&#34;88&#34;
	srcset=&#34;https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i3_hu16000485279008194722.png 480w, https://break.mlibia.xyz/p/overcoming-llm-gender-bias/i3_hu11515996183846355073.png 1024w&#34;
	loading=&#34;lazy&#34;
	
	
		class=&#34;gallery-image&#34; 
		data-flex-grow=&#34;1437&#34;
		data-flex-basis=&#34;3450px&#34;
	
&gt;&lt;/p&gt;
&lt;h3 id=&#34;conclusion&#34;&gt;Conclusion
&lt;/h3&gt;&lt;p&gt;Take your time when implementing LLMs, as there can be pitfalls. It&amp;rsquo;s important to set them up securely to ensure others can use them safely.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tip&lt;/strong&gt; I wasted my tome to invoke my Guardrail becausde it has the default version 1 so i had to create a version of my guardrail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;IMPORTANT&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;m currently focused on my studies while actively looking for job opportunities to apply my skills and gain experience. 🥺&lt;/p&gt;
&lt;h3 id=&#34;offtopic&#34;&gt;Offtopic
&lt;/h3&gt;&lt;p&gt;Lastly, if you have a dog, please remember to take good care of them.🐶&lt;/p&gt;

</description>
        </item>
        
    </channel>
</rss>
