AI with Michal

Few-shot prompting

Giving a language model a small set of completed examples (input plus desired output) so it infers tone, format, and constraints instead of you describing them with adjectives alone.

Michal Juhas · Last reviewed May 2, 2026

What is few-shot prompting?

Few-shot prompting means you show the AI a few finished examples, each with an input and the answer you want, before you ask for something new. The model copies the pattern, tone, and layout from those examples instead of guessing from a long rule list.

Illustration: Few-shot prompting with example cards teaching an assistant before it writes a new draft
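
If your team calls a model through an API, the pattern is the same: worked pairs stacked before the new request. Below is a minimal sketch in Python, assuming a generic chat-style messages format; the example roles and message texts are invented placeholders, not any specific vendor's API:

    # A few-shot prompt is worked examples stacked before the real ask.
    # Each exemplar is an (input, desired output) pair the model can imitate.
    examples = [
        ("Role: Senior Data Engineer, Berlin, hybrid.",
         "Subject: Quick question about your Spark work\n"
         "Hi {name}, your pipeline post caught my eye..."),
        ("Role: Staff iOS Engineer, remote EU.",
         "Subject: Your SwiftUI migration talk\n"
         "Hi {name}, loved the rollout story you shared..."),
    ]

    messages = [{"role": "system",
                 "content": "Write recruiting outreach in our warm, concise voice."}]
    for req, reply in examples:
        messages.append({"role": "user", "content": req})         # example input
        messages.append({"role": "assistant", "content": reply})  # example output
    # The new task goes last; the model continues the demonstrated pattern.
    messages.append({"role": "user",
                     "content": "Role: Platform SRE, Warsaw, on-site 2 days."})
    # Send messages to your provider's chat endpoint here.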

In practice

  • Before you ask for twenty outreach variants, you paste three real messages your team already sent that got replies. Trainers often say "show it a few good examples first" instead of using the term few-shot.
  • When you onboard a new recruiter, you share a doc with a bad email and a good email and tell them to drop those at the top of each ChatGPT or Claude session. That is the same habit with a simple file, not a lab notebook.
  • For job ad rewrites, you might paste one paragraph you like and one you hate so the model sees the tone gap side by side. Hiring managers recognize that flow even if they never name the technique.

Quick read, then how hiring teams use it

This is for recruiters, sourcers, TA, and HR partners who need the same vocabulary in debriefs, vendor calls, and policy reviews. Skim the first section when you need a fast shared picture. Use the second when you are deciding how it shows up in the ATS, sourcing tools, or candidate communications.

Plain-language summary

  • What it means for you: You show the AI two or three short examples of "good" output before you ask for a new one, like showing a new hire an old ticket.
  • How you would use it: You paste anonymized samples with the tone you want, then you ask for the next version in that style.
  • How to get started: Save one great intake note and one bad one. Label them. Ask the model to rewrite a third case like the great one (see the sketch after this list).
  • When it is a good time: When long instructions did not work and the format still drifts between recruiters.
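
A minimal sketch of that starter exercise as a single pasted prompt, assuming a plain chat window; the note texts here are invented placeholders:

    # One labeled good example, one labeled bad, then the new case to rewrite.
    raw_notes = "manager call: wants python person, flexible on seniority"  # your third case
    prompt = (
        "GOOD intake note (match this style):\n"
        "Req: Backend Engineer. Must-have: Go, Postgres. Comp band confirmed.\n"
        "Manager wants signal on ownership, not years.\n\n"
        "BAD intake note (avoid this style):\n"
        "talked to manager, seems fine, needs someone good asap\n\n"
        "Rewrite the following call notes in the GOOD style:\n" + raw_notes
    )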

When you are running live reqs and tools

  • What it means for you: Few-shot is in-context learning: exemplars steer formatting and tone without fine-tuning. It competes with long prose rules and with system instructions for the same token budget.
  • When it is a good time: When you need consistent bullets, score snippets, or outreach variants under one brand voice.
  • How to use it: Rotate fresh examples, anonymize aggressively, and version the pack when comp or policy language changes (a minimal pack format is sketched after this list).
  • How to get started: Read How to write better AI prompts and build a three-example library for your highest-volume ask.
  • What to watch for: Overfitting to three old reqs, leaking PII inside "good" examples, and stale shots nobody updates.
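
One way to make "version the pack" concrete is to keep each pack as a small reviewed file instead of loose chat snippets. A sketch, assuming JSON on a shared drive; the field names are a convention invented here, not a standard:

    import datetime
    import json

    # A pack is data, not prose: it can be diffed, reviewed, and rolled back.
    pack = {
        "name": "outreach-engineering",
        "version": "q2.1",  # bump when comp or policy language changes
        "owner": "talent-enablement",
        "updated": datetime.date.today().isoformat(),
        "examples": [
            {"input": "Role: Senior Data Engineer, Berlin.",
             "output": "Subject: Quick question about your Spark work..."},
        ],
    }
    with open("outreach-engineering.json", "w") as f:
        json.dump(pack, f, indent=2)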

Where we talk about this

Live sessions compare few-shot packs to Gems and skills: same idea, different packaging. AI in recruiting focuses on tone and fairness; sourcing automation focuses on stable fields feeding prompts. Try both angles at Workshops.

Few-shot versus long instructions

Style         When it wins                   Watch out
Few-shot      Tone, format, micro-patterns   Hidden bias in samples
Long rubric   Legal must-nots, compliance    Token cost, skim risk
Hybrid        Production prompts             Needs an owner to edit both
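
The hybrid row usually means stable rules in one system block with exemplars rotating after it. A sketch of how the two layers combine, assuming the same generic chat-message format as above; the rule text is illustrative only:

    # Global must-nots live in one stable system block;
    # exemplars rotate per req family without touching the rules.
    SYSTEM_RULES = (
        "Never mention age, family status, or visa guesses. "
        "Use only facts present in the provided profile."
    )

    def build_messages(exemplars, new_task):
        msgs = [{"role": "system", "content": SYSTEM_RULES}]
        for inp, out in exemplars:  # few-shot layer: tone and format
            msgs.append({"role": "user", "content": inp})
            msgs.append({"role": "assistant", "content": out})
        msgs.append({"role": "user", "content": new_task})
        return msgs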

Frequently asked questions

How many examples is "few" in practice?
Two to five input/output pairs usually beat a wall of prose: enough to show tone, structure, and edge handling without crowding candidate facts out of the model's token budget. Live workshops show sourcers pasting three strong messages and getting usable fourth variants in seconds, then hitting diminishing returns by seven. Quality and recency beat count: retire examples when your bar or brand voice moves. Track which exemplar set produced each batch so you can roll back a bad week. Run a quarterly blind review where two reviewers score outputs from different example packs so weak sets retire with data, not office politics.
Where do few-shot prompts help recruiting most?
High-repeat artifacts with visible "gold" rows: outbound sequences, intake summaries, scorecard rationales, JD cleanup, and screening summaries where a sheet already holds ideal answers. Pair with How to write better AI prompts so each example encodes constraints hiring managers actually enforce. Few-shot shines when reviewers can diff output against a known anchor. It helps less for one-off executive searches where exemplars would be fake. Name a single owner per pack (often enablement or a lead sourcer) so updates do not fork across Slack threads, and publish where each pack may be used so GDPR reviews know which examples ever touched vendor logs.
What is the main downside of few-shot prompting?
Overfitting: the model copies quirks you did not mean to canonize (odd sign-offs, illegal phrasing someone once slipped through) or ignores edge cases your samples never covered. Without versioning, two recruiters silently maintain different example packs and downstream quality diverges. Refresh examples when comp, remote policy, or diversity language changes, and log updates like code. Pair few-shot with system instructions so global rules stay stable while examples rotate per req family. After a bad send, capture the exemplar row that misled the model so legal and TA see the same root cause instead of blaming "the AI" abstractly.
How is this different from a saved system prompt or Gem?
Few-shot teaches inside a turn or thread with fresh pairs; Gems and custom GPTs persist that teaching as system instructions across sessions. In practice you stack them: stable global rules plus three fresh exemplars for this quarter's reqs. If you only few-shot without persistence, new hires reinvent tone weekly. If you only systemize without examples, abstract adjectives creep back. Document which layer owns legal must-nots so updates do not fall through cracks. When IT rotates API keys or vendors ship new defaults, regression-test both layers with the same five anonymized profiles so drift shows up before candidates do.
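
A sketch of that regression habit, reusing the build_messages helper from the hybrid sketch above; call_model is a hypothetical wrapper around whatever provider you use, and the profiles are invented:

    # Run the same anonymized profiles through the stack after any
    # key rotation or vendor default change, then diff against baseline.
    TEST_PROFILES = [
        "Profile A: Go developer, 6 yrs, fintech, Krakow.",
        "Profile B: iOS engineer, bootcamp grad, remote.",
        "Profile C: Data analyst moving into engineering.",
    ]

    def regression_check(exemplars, call_model):
        """Return outputs keyed by profile for eyeball diffing."""
        return {p: call_model(build_messages(exemplars, p)) for p in TEST_PROFILES}
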
Can few-shot reduce hallucinations?
It can reduce style drift and missing sections, but it does not stop factual invention: the model can still invent employers or dates that were not in the profile you pasted. Keep verify-before-send habits from the hallucination entry, especially for multilingual titles and stealth startups. Use few-shot to show the shape of a truthful answer ("only facts from the resume bullet"), not to imply omniscience. Pair numeric claims with a human spot-check until metrics say otherwise. Teach coordinators that exemplars are guardrails on tone and structure, not proof that every new candidate sentence is true.
Which tools support few-shot well?
Any chat UI that tolerates long prompts, plus Claude and ChatGPT when you pin examples above the user task. API users can separate system, developer, and user blocks for cleaner budgets. Pick tooling based on audit logs and data residency, not only example slots. For a guided path, join a workshop or take Starting with AI: the foundations in recruiting so you build packs with anonymization habits baked in. Security reviews should include a sample thread that shows exactly which block holds examples so approvers understand what leaves your boundary on each call.
Should I anonymize real candidate examples?
Yes: always strip names, emails, employers, and identifiable projects before examples land in third-party models or shared repos. Treat few-shot packs like internal documentation with the same retention and access rules as your CRM. Rotating real snippets without redaction is how accidental PII becomes training folklore. If you need realism, synthesize plausible composites with hiring manager review instead of copy-pasting a finalist's mail. Log who approved each composite and when, so if a candidate ever asks how their story was used you can answer without guessing.
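
A minimal redaction pass, assuming examples live as plain strings; the regexes below catch emails and phone numbers, while names and employers still need a human or NER review, as the sample output shows:

    import re

    # Catches obvious identifiers; names, employers, and project names
    # still need a human pass before an example enters a shared pack.
    EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
    PHONE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

    def redact(text: str) -> str:
        text = EMAIL.sub("[email]", text)
        return PHONE.sub("[phone]", text)

    print(redact("Reach Jan at jan.kowalski@example.com or +48 600 123 456"))
    # -> Reach Jan at [email] or [phone]   (the name survives: review by hand)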

← Back to AI glossary in practice