21:[["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"$2b"}}],["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"{\"@context\":\"https://schema.org\",\"@type\":\"BreadcrumbList\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Learn\",\"item\":\"https://www.ainews.tech/learn\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Prompt Engineering\",\"item\":\"https://www.ainews.tech/learn/prompt-engineering\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Prompt fundamentals\",\"item\":\"https://www.ainews.tech/learn/prompt-engineering/foundations\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Few-shot examples: when they help, how to write them\",\"item\":\"https://www.ainews.tech/learn/prompt-engineering/foundations/examples-few-shot-when-and-how\"}]}"}}],null,["$","$L2c",null,{"intro":[["$","nav",null,{"aria-label":"Breadcrumb","className":"flex items-center gap-1.5 text-xs text-gray-500 mb-6 flex-wrap","children":[["$","$L7",null,{"href":"/learn","className":"hover:text-gray-700 dark:hover:text-gray-300 transition-colors","children":"Learn"}],["$","$L6",null,{"ref":"$undefined","iconNode":[["path",{"d":"m9 18 6-6-6-6","key":"mthhwq"}]],"className":"lucide-chevron-right text-gray-700","size":12}],["$","$L7",null,{"href":"/learn/prompt-engineering","className":"hover:text-gray-700 dark:hover:text-gray-300 transition-colors","children":"Prompt Engineering"}],["$","$L6",null,{"ref":"$undefined","iconNode":"$21:3:props:intro:0:props:children:1:props:iconNode","className":"lucide-chevron-right text-gray-700","size":12}],["$","$L7",null,{"href":"/learn/prompt-engineering/foundations","className":"hover:text-gray-700 dark:hover:text-gray-300 transition-colors","children":"Prompt fundamentals"}],["$","$L6",null,{"ref":"$undefined","iconNode":"$21:3:props:intro:0:props:children:1:props:iconNode","className":"lucide-chevron-right text-gray-700","size":12}],["$","span",null,{"className":"text-gray-700 dark:text-gray-300 truncate max-w-[200px]","children":"Few-shot examples: when they help, how to write them"}]]}],["$","header",null,{"className":"mb-10 pb-6 border-b border-gray-200 dark:border-gray-800","children":[["$","h1",null,{"className":"text-3xl sm:text-4xl font-bold mb-3 leading-tight tracking-tight [overflow-wrap:anywhere]","children":"Few-shot examples: when they help, how to write them"}],["$","p",null,{"className":"text-gray-600 dark:text-gray-400 text-lg leading-relaxed mb-4","children":"When examples are the single biggest unlock you have, and when they hurt. The rules: relevant, diverse, structured. How many is enough. Why one bad example can corrupt the whole output."}],"$L2d"]}]],"pathContext":"$L2e","content":"$2f","lesson":{"stages":[{"title":"Overview","cards":[{"id":"s1c0","kind":"text","md":"If you can only learn one prompting technique, learn this one. Showing the model what good output looks like — with 1–5 actual examples — outperforms almost any other intervention."},{"id":"s1c1","kind":"text","md":"It's called few-shot prompting (one example = \"one-shot,\" several = \"few-shot,\" zero = \"zero-shot,\" which is what you do by default)."}]},{"title":"When few-shot is the single biggest unlock","cards":[{"id":"s2c0","kind":"text","md":"The technique earns its keep when:"},{"id":"s2c1","kind":"text","md":"- **The output format is non-obvious.** \"Extract all the dates\" — easy. \"Extract all the dates as ISO strings, but only if they're in the past, and group them by quarter\" — show an example.\n- **The tone is specific.** Marketing copy in a specific brand voice, code review comments in a specific style, customer support responses in a specific register. Examples carry tone better than descriptions.\n- **You want consistency across runs.** Few-shot is the most reliable way to get repeatable output shape across many runs of the same prompt.\n- **The task involves judgment.** Classifying customer feedback into categories, deciding whether a PR is risky enough to flag, grading content quality. The model needs to see what each category looks like."}]},{"title":"When few-shot doesn't help (or hurts)","cards":[{"id":"s3c0","kind":"text","md":"- **Creative tasks where you want variety.** Examples bias the output toward looking like the examples. If you want range, fewer examples or none.\n- **Tasks the model already does well by default.** Asking for a Python function to sort a list doesn't need an example.\n- **When you don't have good examples.** A bad example is worse than no example. The model will faithfully match the wrong pattern."}]},{"title":"The three rules for good examples","cards":[{"id":"s4c0","kind":"text","md":"**Rule 1: Relevant**\n\nExamples should look like your actual use case. If you're processing real customer feedback, your example should look like real customer feedback — not synthetic, not generic, not \"Customer A said X.\""},{"id":"s4c1","kind":"text","md":"Bad example for a sentiment classification task:"},{"id":"s4c2","kind":"code","md":"```\n\nInput: \"This is great!\"\nOutput: positive\n\n```"},{"id":"s4c3","kind":"text","md":"Better:"},{"id":"s4c4","kind":"code","md":"```\n\nInput: \"ngl was skeptical at first but the migration went way smoother than i expected. one minor thing on the docs - the env var naming is inconsistent w/ what's in the github readme. fixable. 8/10\"\nOutput: positive (with a specific feature complaint to flag — docs inconsistency)\n\n```"},{"id":"s4c5","kind":"text","md":"The second example shows the model the messy reality it'll actually encounter: casual language, abbreviations, mixed sentiment, embedded specific feedback. The first example will produce a classifier that handles \"this is great\" perfectly and fails on everything else."},{"id":"s4c6","kind":"text","md":"**Rule 2: Diverse**\n\nIf all your examples have the same structure, the model assumes that structure is part of the spec. Cover edge cases. Include examples that show what to do with unusual inputs."},{"id":"s4c7","kind":"text","md":"For a 5-example set on customer-feedback classification, include:"},{"id":"s4c8","kind":"text","md":"- A clearly positive one\n- A clearly negative one\n- A mixed-sentiment one\n- An ambiguous one\n- An off-topic one (where the model should refuse or flag)"},{"id":"s4c9","kind":"text","md":"If all 5 are unambiguously positive, the model will struggle the first time it sees ambiguity."},{"id":"s4c10","kind":"text","md":"**Rule 3: Structured**\n\nWrap each example in `` tags. Wrap the group in ``. This makes it unambiguous to the model that these are patterns to learn from — not instructions to follow literally."},{"id":"s4c11","kind":"code","md":"```\n\n\n\n{a realistic input}\n\n\n{the correct output for that input}\n\n\n\n\n\n{a different realistic input — different structure, different edge case}\n\n\n{the correct output}\n\n\n\n```"},{"id":"s4c12","kind":"text","md":"The model treats this as \"here are some demonstrations of the task.\" Without the tags, it might treat the examples as part of the input or part of the instructions."}]},{"title":"How many examples is enough","cards":[{"id":"s5c0","kind":"text","md":"The data is consistent across models: 3–5 examples is the sweet spot. After that, returns diminish quickly. After about 10, you can actively hurt output quality by overfitting the model to surface patterns in the examples."},{"id":"s5c1","kind":"text","md":"If you have 1 example, use 1. If you have 20, pick the 5 most diverse ones."}]},{"title":"Why one bad example can ruin everything","cards":[{"id":"s6c0","kind":"text","md":"The model is a faithful pattern-matcher. If your examples all happen to:"},{"id":"s6c1","kind":"text","md":"- Start outputs with \"Here is your...\"\n- End outputs with a question\n- Use 3-bullet lists exactly\n- Avoid certain words by coincidence"},{"id":"s6c2","kind":"text","md":"...it will assume that's part of the task. It will reproduce those patterns even when they don't make sense for a particular input."},{"id":"s6c3","kind":"text","md":"Inspect your examples for accidental patterns. If 3 of your 5 examples end with a question mark, the model will think questions are mandatory. Either fix the pattern or break it deliberately in one example."}]},{"title":"The meta-trick: ask the model to evaluate your examples","cards":[{"id":"s7c0","kind":"text","md":"Once you have a few examples, paste them to the model and ask:"},{"id":"s7c1","kind":"code","md":"```\nHere are the examples I'm planning to use for a few-shot prompt:\n\n\n{your examples}\n\n\nCritique them. Specifically:\n- Are they diverse enough to cover the actual range of inputs I'll see?\n- Are there any accidental patterns across examples that would mislead a model?\n- Is there an obvious edge case missing?\n\nIf you'd add a 6th example, what would it look like and why?\n```"},{"id":"s7c2","kind":"text","md":"This turns the model into a prompt critic. It catches patterns you missed and surfaces gaps in coverage you'd otherwise discover in production."}]},{"title":"A worked example: extracting structured data from emails","cards":[{"id":"s8c0","kind":"text","md":"Goal: extract meeting details from forwarded calendar invites."},{"id":"s8c1","kind":"text","md":"Without few-shot, you'd write a long description of the format. With few-shot, you show:"},{"id":"s8c2","kind":"code","md":"$30"},{"id":"s8c3","kind":"text","md":"Three examples teach the model: how to handle reschedules, confirms, and recurring creates. They show what to do with missing info. They establish the JSON shape. They demonstrate the \"use null, don't invent\" rule."},{"id":"s8c4","kind":"text","md":"Try writing that prompt without examples. It will be three times as long and produce worse output."}]},{"title":"What to read next","cards":[{"id":"s9c0","kind":"text","md":"- [What makes a prompt actually work](/learn/ai/prompt-craft/what-makes-a-prompt-work) — the foundations\n- [Structuring prompts with XML, roles, and sections](/learn/ai/prompt-craft/structuring-prompts-xml-roles-sections) — the formatting framework that makes examples work"}]}],"cardCount":33},"interactiveLesson":{"slug":"examples-few-shot-when-and-how","version":1,"title":"The few-shot example design lab","mission":"Choose when examples help, build a relevant and diverse set, remove accidental patterns, and save a reusable few-shot template.","estimatedMinutes":17,"journey":{"title":"Prompt fundamentals","position":3,"total":4},"stages":[{"id":"orient","title":"Teach by demonstration","shortTitle":"Model","objective":"Use examples when the task depends on pattern, judgement or consistency.","steps":[{"id":"fewshot-brief","type":"concept","eyebrow":"Few-shot lab · Briefing","title":"Examples are executable specifications","instruction":"A good example demonstrates decisions that would take paragraphs to describe.","body":"Few-shot prompting is most useful when the output format is unusual, the tone is specific, the task requires judgement or consistency across runs matters. Bad examples faithfully teach the wrong pattern.","points":["Relevant to real inputs.","Diverse across edge cases.","Structured consistently."],"visual":"mission"}]},{"id":"when","title":"Choose when examples help","shortTitle":"When","objective":"Avoid examples that constrain tasks where variety is the goal.","steps":[{"id":"fewshot-sort","type":"sort","eyebrow":"Signal 01 · Technique fit","title":"Should this task use examples?","instruction":"Sort by whether examples add useful signal.","required":true,"zones":[{"id":"helps","label":"Few-shot helps","description":"Pattern, judgement or consistency matters"},{"id":"optional","label":"Usually optional","description":"The model already handles the obvious task"},{"id":"hurts","label":"Can hurt","description":"Examples narrow desired variety or teach weak patterns"}],"items":[{"id":"sentiment","label":"Classify messy mixed-sentiment support feedback","zoneId":"helps","rationale":"Examples show how nuanced categories should be applied."},{"id":"voice","label":"Write customer replies in a very specific brand voice","zoneId":"helps","rationale":"Real examples carry voice better than adjective lists."},{"id":"sort","label":"Write a Python function that sorts integers","zoneId":"optional","rationale":"The standard task is already well represented and objectively testable."},{"id":"wild","label":"Generate radically different campaign concepts","zoneId":"hurts","rationale":"Examples can anchor outputs to the demonstrated surface pattern."},{"id":"bad","label":"Match a format when you only have one poor example","zoneId":"hurts","rationale":"A bad demonstration is worse than no demonstration."}]}]},{"id":"relevance","title":"Use real-shaped examples","shortTitle":"Relevant","objective":"Teach the model the input complexity it will actually encounter.","steps":[{"id":"relevance-choice","type":"choice","eyebrow":"Signal 02 · Relevance","title":"Which example prepares the classifier for reality?","instruction":"Choose the example that demonstrates the hard judgement.","required":true,"prompt":"The classifier must label customer feedback and separately flag concrete product complaints.","options":[{"id":"toy","label":"“This is great!” → positive","detail":"Clean, obvious and unlike the production input.","feedback":"The example teaches the easy case while leaving the actual ambiguity undefined."},{"id":"real","label":"“migration was smooth, docs naming is inconsistent, 8/10” → positive + docs complaint","detail":"Mixed signal, casual language and an embedded actionable issue.","recommended":true,"feedback":"The example demonstrates the judgement the task truly requires."}]}]},{"id":"diversity","title":"Cover the decision boundary","shortTitle":"Diverse","objective":"Build examples that span normal and edge cases.","steps":[{"id":"diversity-set","type":"multiSelect","eyebrow":"Signal 03 · Diversity","title":"Choose the five-example set","instruction":"Select at least four cases that teach different behaviour.","required":true,"minSelections":4,"prompt":"A support-feedback classifier must handle the range users actually send.","options":[{"id":"positive","label":"Clearly positive","detail":"A clean anchor for the positive class.","recommended":true,"feedback":"Useful as a simple reference point."},{"id":"negative","label":"Clearly negative","detail":"A clean anchor for the negative class.","recommended":true,"feedback":"Balances the positive anchor."},{"id":"mixed","label":"Mixed sentiment with one concrete issue","detail":"Requires more than one output signal.","recommended":true,"feedback":"Teaches the important nuanced case."},{"id":"ambiguous","label":"Ambiguous or sarcastic","detail":"Should produce low confidence or escalation.","recommended":true,"feedback":"Defines behaviour near the decision boundary."},{"id":"offtopic","label":"Off-topic input","detail":"Should refuse classification or flag irrelevance.","recommended":true,"feedback":"Shows what not to force into a category."},{"id":"duplicate","label":"Five nearly identical positive reviews","detail":"Repetition without broader coverage.","feedback":"Surface variety is low even if the wording differs."}]}]},{"id":"patterns","title":"Remove accidental rules","shortTitle":"Patterns","objective":"Notice surface patterns the model may mistake for requirements.","steps":[{"id":"accidental-radar","type":"spotRisk","eyebrow":"Signal 04 · Accidental patterns","title":"Mark what the examples teach by accident","instruction":"Select repeated features unrelated to the real task.","required":true,"lead":"Across five approved examples:","segments":[{"id":"schema","text":"Every output uses the documented JSON fields. ","risk":false},{"id":"three","text":"Every explanation happens to contain exactly three bullets. ","risk":true,"explanation":"The model may infer a three-bullet requirement that was never intended."},{"id":"question","text":"Four outputs end with an unnecessary question. ","risk":true,"explanation":"A repeated stylistic accident can become a copied rule."},{"id":"labels","text":"Each example uses one of the allowed classification labels. ","risk":false},{"id":"length","text":"All inputs are under twenty words, while production inputs average three paragraphs.","risk":true,"explanation":"The examples fail to demonstrate the real input shape."}]}]},{"id":"count","title":"Use enough, not every example","shortTitle":"Count","objective":"Prefer a compact diverse set over a large repetitive set.","steps":[{"id":"example-count","type":"choice","eyebrow":"Signal 05 · Example budget","title":"You have twenty approved examples. What goes into the prompt?","instruction":"Choose the set that gives broad signal without unnecessary context.","required":true,"prompt":"The examples vary in quality and several repeat the same obvious case.","options":[{"id":"all","label":"Use all twenty","detail":"More examples must create more accuracy.","feedback":"Large repetitive sets consume context and reinforce accidental surface patterns."},{"id":"diverse","label":"Choose the three to five most diverse strong examples","detail":"Cover normal cases, ambiguity and one edge case.","recommended":true,"feedback":"A compact set preserves context while teaching the useful decision boundary."}]}]},{"id":"build","title":"Build the demonstration set","shortTitle":"Build","objective":"Create a reusable few-shot structure for a real judgement task.","steps":[{"id":"fewshot-builder","type":"promptBuilder","eyebrow":"Signal 06 · Example builder","title":"Author the few-shot template","instruction":"Use realistic but non-confidential examples.","required":true,"artifactLabel":"Your reusable few-shot example set","fields":[{"id":"task","label":"Task and labels","placeholder":"Classify feedback as positive, negative, mixed or irrelevant…","hint":"Define the judgement being demonstrated."},{"id":"example1","label":"Typical example","placeholder":"Real-shaped input → correct structured output","hint":"Anchor the normal case."},{"id":"example2","label":"Contrasting example","placeholder":"A clearly different input and label…","hint":"Expand coverage."},{"id":"edge","label":"Edge-case example","placeholder":"Ambiguous, mixed or off-topic input → expected handling…","hint":"Teach behaviour near the boundary."},{"id":"schema","label":"Output schema","placeholder":"{ label, confidence, specific_issue, needs_review }","hint":"Keep outputs structurally comparable."}],"template":"TASK\n{{task}}\n\n\n \n {{example1}}\n {{schema}}\n \n\n \n {{example2}}\n {{schema}}\n \n\n \n {{edge}}\n {{schema}}\n \n\n\nNow apply the demonstrated pattern to the next input. Do not invent fields."}]},{"id":"exit","title":"Teach the pattern","shortTitle":"Exit","objective":"Carry a compact, reviewed example set into repeated work.","steps":[{"id":"fewshot-checkpoint","type":"checkpoint","eyebrow":"Examples ready","title":"Your demonstrations teach the intended task","instruction":"Complete the core signals to activate this checkpoint.","required":true,"outcomes":["You know when examples help or constrain the task.","Your examples resemble real production inputs.","The set covers normal, ambiguous and edge cases.","Accidental patterns have been identified.","Your tagged few-shot template is saved locally."]}]}]},"initialView":"interactive","slug":"examples-few-shot-when-and-how","category":"prompt-engineering","pillar":"foundations","title":"Few-shot examples: when they help, how to write them","href":"/learn/prompt-engineering/foundations/examples-few-shot-when-and-how","accentText":"group-hover:text-violet-600 dark:group-hover:text-violet-400","next":"How to prompt: 5 patterns that work in any model","nextHref":"/learn/prompt-engineering/foundations/prompting-patterns"}]]