AI with Michal

Hallucination

When a language model produces fluent but false or ungrounded details (employers, dates, URLs, policy claims) that look credible until you verify them.

Michal Juhas · Last reviewed May 2, 2026

What is a hallucination?

A hallucination is when the AI writes something that sounds confident, but a fact in it is wrong or was never in the data you gave it. In recruiting that can mean wrong employers, dates, or links, so you check facts before anything goes to a candidate.

Illustration: AI output checked against a profile card to catch a factual mismatch before sending

In practice

  • After a model drafts a candidate blurb, someone in the debrief says "double-check the dates and employer" because once in a while the city or seniority level is wrong. You hear "hallucination" in AI safety talks, and more often now in recruiter Slack groups.
  • Legal or HR may ask "how do we know the assistant did not invent this sentence" when they review an internal FAQ or an offer-letter draft that passed through AI at any step.
  • Before you paste an InMail into LinkedIn, you scroll the profile next to the draft even when the mail reads smoothly. That side-by-side habit is how teams catch mistakes without saying a jargon word at every step.

Quick read, then how hiring teams use it

This is for recruiters, sourcers, TA, and HR partners who need the same vocabulary in debriefs, vendor calls, and policy reviews. Skim the first section when you need a fast shared picture. Use the second when you are deciding how it shows up in the ATS, sourcing tools, or candidate communications.

Plain-language summary

  • What it means for you: The model can sound confident while being wrong about a date, a job title, or a city. You treat polished sentences like a junior hire's draft that still needs a fact check.
  • How you would use it: You keep LinkedIn or the ATS open beside the draft and fix mismatches before sending.
  • How to get started: Run five real profiles through your prompt and mark every line that is not directly supported by the profile text; a sketch of that check follows this list.
  • When it is a good time: Always for candidate-facing text; especially when you personalize outreach at scale.
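
Here is a minimal sketch of that getting-started check, assuming profiles and drafts are plain text. The token-overlap heuristic, the threshold, and the sample names are illustrative assumptions, not a real grounding check or real candidate data.

```python
# Minimal sketch: flag draft sentences with little support in the source
# profile. The overlap threshold is a crude illustrative heuristic; lines
# it flags still need a human fact check, not automatic rejection.
import re

def unsupported_lines(draft: str, profile: str, min_overlap: float = 0.5) -> list[str]:
    """Flag draft sentences that share too little vocabulary with the profile."""
    profile_tokens = set(re.findall(r"[a-z0-9]+", profile.lower()))
    flagged = []
    for sentence in re.split(r"(?<=[.!?])\s+", draft.strip()):
        tokens = set(re.findall(r"[a-z0-9]+", sentence.lower()))
        if not tokens:
            continue
        overlap = len(tokens & profile_tokens) / len(tokens)
        if overlap < min_overlap:  # too little shared vocabulary: verify by hand
            flagged.append(sentence)
    return flagged

draft = "Maria led the Berlin office. She holds a PMP certification."
profile = "Maria Novak. Project manager, Berlin office, 2019-2024."
for line in unsupported_lines(draft, profile):
    print("CHECK:", line)
# CHECK: She holds a PMP certification.
```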

When you are running live reqs and tools

  • What it means for you: Hallucination is confident error: plausible language without grounded facts. Risk rises with open-ended personalization and long contexts.
  • When it is a good time: When you automate enrichment, prompt chains, or RAG without an error budget.
  • How to use it: Constrain outputs (structured output), cite only verified fields, and log incidents the way you log offer-letter typos; see the sketch after this list.
  • How to get started: Re-read workshop notes on verify-before-send and tighten prompts so they restate facts rather than invent narrative.
  • What to watch for: Polished Markdown that hides a wrong seniority level, multilingual title mismatches, and reviewers skimming under time pressure.
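
A minimal sketch of the constrain-and-verify idea, assuming the verified record is a simple dict copied from your ATS. The field names, record shape, and sample output are illustrative assumptions, not any particular ATS schema or model API.

```python
# Minimal sketch: accept a model's structured draft only where each field
# matches the verified record; everything else gets flagged for review.
import json

VERIFIED = {"name": "Maria Novak", "title": "Project Manager", "city": "Berlin"}

def check_draft(model_json: str) -> dict:
    """Return per-field issues; an empty dict means every field is grounded."""
    draft = json.loads(model_json)
    issues = {}
    for field, value in draft.items():
        if field not in VERIFIED:
            issues[field] = "not a verified field"
        elif value != VERIFIED[field]:
            issues[field] = f"mismatch: draft={value!r}, record={VERIFIED[field]!r}"
    return issues

model_output = '{"name": "Maria Novak", "title": "Senior PM", "fun_fact": "loves hiking"}'
print(check_draft(model_output))
# {'title': "mismatch: draft='Senior PM', record='Project Manager'",
#  'fun_fact': 'not a verified field'}
```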

Where we talk about this

Participants often meet hallucinations first in outbound: a wrong office city or product name. The fix is rarely "a better model"; it is usually verify-before-send plus shorter prompts. We rehearse that live in Workshops.

Around the web (opinions and rabbit holes)

Third-party creators move fast. Treat these as starting points, not endorsements, and double-check anything before you wire candidate data into it.

Hallucination risk by task

| Task | Relative risk | Mitigation |
| --- | --- | --- |
| Outreach personalization | High | Facts from profile only |
| Intake summary | Medium | Quote the hiring manager |
| Boolean string | Lower | Still test in the tool |
| Policy interpretation | High | Legal review, not the LLM |

Frequently asked questions

What do hallucinations look like in recruiting work?
Fluent lies about employers, dates, titles, certifications, or URLs that look plausible until you open the profile or ATS field beside the draft. Co-pilot style drafting on thin profiles is especially risky because the model fills gaps the way humans guess, then states guesses confidently. Teams catch them fastest when they keep source tabs pinned: LinkedIn next to InMail, policy PDF next to internal FAQ. Log incidents with the prompt version so you can see whether a template change caused a spike, not random bad luck.
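
A minimal sketch of that incident log, assuming a JSONL file is enough for your volume; the file name and fields are illustrative assumptions, not a standard format.

```python
# Minimal sketch: log each hallucination incident with the prompt template
# version, so a spike can be traced to a template change rather than luck.
import datetime
import json

LOG_PATH = "hallucination_incidents.jsonl"

def log_incident(prompt_version: str, field: str, draft_value: str, true_value: str) -> None:
    """Append one incident record with the prompt version for later trend checks."""
    entry = {
        "ts": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "prompt_version": prompt_version,  # e.g. the template tag, "outreach-v7"
        "field": field,                    # e.g. "employer", "dates", "url"
        "draft_value": draft_value,
        "true_value": true_value,
    }
    with open(LOG_PATH, "a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")

log_incident("outreach-v7", "employer", "Acme GmbH", "Acme AG")
```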
Why do models hallucinate if they are "trained on the internet"?
They optimize for plausible continuation, not verified lookup against your private candidate truth. Training on broad text does not grant live access to your CRM or to yesterday's policy addendum. Even with RAG, wiring mistakes or stale chunks can surface wrong snippets with confidence. Teach stakeholders the "pattern completion" mental model so they stop treating fluent paragraphs as citations. Pair vendor claims with your own eval set of twenty tricky profiles from last quarter. When executives ask for a demo, show a deliberate miss next to a fix so they internalize that fluency is not the same as accuracy in hiring.
What is the minimum viable verification loop?
For candidate-facing text: keep the authoritative profile or ATS record visible, spot-check employers, dates, and locations, and require URLs or quotes pulled directly from source text rather than memory. For internal drafts, tag every claim that needs a citation before it leaves the team channel. Add a second reader on high-risk segments (exec outreach, visa-sensitive wording). A minimum viable loop still needs an owner: who is allowed to click send when a deadline looms and the draft "looks fine"? Publish that escalation path beside your templates and review it after every near-miss retro so new hires do not improvise under pressure.
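
One way to automate the "URLs from source text, not memory" rule is a sketch like this; the URL regex is deliberately simple and the sample data is illustrative, not a vetted validator.

```python
# Minimal sketch: hold any draft whose URLs do not appear verbatim in the
# source profile text, so links come from the record rather than memory.
import re

def urls_not_in_source(draft: str, source: str) -> list[str]:
    """Return every URL in the draft that is absent from the source text."""
    urls = re.findall(r"https?://\S+", draft)
    return [u for u in urls if u not in source]

draft = "Portfolio: https://marianovak.dev and talk at https://conf.example/2023"
source = "Maria Novak, PM. Portfolio https://marianovak.dev"
print(urls_not_in_source(draft, source))
# ['https://conf.example/2023']  -> hold the draft until a human verifies it
```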
Does RAG eliminate hallucinations?
It reduces unsupported claims when retrieval returns the right chunk, but models can still misread tables, merge two policies, or cite an outdated file with equal confidence. Treat RAG as assist, not oracle, and keep hallucination checks on numbers, URLs, and eligibility language. Run quarterly corpus audits so "grounded" answers are not grounded in 2019. Log retrieval misses separately from model mistakes so engineering and TA each know what to fix. When retrieval confidence scores exist, teach recruiters what "low confidence" means in plain language before those fields drive automation.
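
If the retrieval-miss vs. model-mistake split is to be more than a slogan, even a tally this simple works, assuming a reviewer labels each incident; the labels and sample data are illustrative assumptions.

```python
# Minimal sketch: count reviewer-labeled incidents by cause so engineering
# (retrieval misses) and TA (model mistakes) each see their own queue.
from collections import Counter

incidents = [
    {"id": 1, "cause": "retrieval_miss"},  # stale 2019 policy chunk surfaced
    {"id": 2, "cause": "model_mistake"},   # right chunk, two policies merged
    {"id": 3, "cause": "retrieval_miss"},
]
print(Counter(i["cause"] for i in incidents))
# Counter({'retrieval_miss': 2, 'model_mistake': 1})
```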
How do workshops talk about this with hiring managers?
Plain language: models draft structure and phrasing fast; humans own facts, fairness, and tone that matches the team they will join. That framing prevents "the computer said so" approvals in debriefs and sets expectations on turnaround time (review is a step, not overhead). We share anonymized misses so hiring managers feel why verify-before-send matters for their brand, not only compliance. Tie the conversation to score anchors so quality discussions stay behavioral, not mystical. Close with one concrete habit they can adopt this week, such as keeping the ATS tab open while approving outreach, so the lesson survives the slide deck.
Which blog posts should the team read together?
Start with AI candidate screening and How to use AI in recruiting as a pair: one on funnel risk, one on operating norms. Then align on tools with ChatGPT for recruiters so procurement hears the same limits engineering does. Reading as a group surfaces disagreements early (what counts as public data, who approves drafts) and turns policy into behavior, not PDFs on a shelf. Capture three decisions per session in your Markdown for AI knowledge base with owners and dates so assistants and new hires inherit the same story six months later.
When should we avoid generative models entirely?
Skip generative passes for high-stakes compliance narratives, redundancy selections, compensation communications, or anything you cannot audit under current policy. Prefer deterministic templates, official legal review paths, or vendor features with contractual guarantees there. Temporary bans are fine while you build rubrics; permanent bans without alternatives just drive shadow IT. Document the decision with names and dates so future leaders know it was intentional, not ignorance. Revisit bans when retrieval, logging, or human-in-the-loop controls mature, because blocking tools without a safe lane rarely stops motivated recruiters.
