AI with Michal

Large language model (LLM)

A neural network trained on broad text to predict the next token; wrapped in product guardrails, it powers chat assistants, drafting, classification, and tool-using agents.

Michal Juhas · Last reviewed May 2, 2026

What is a large language model (LLM)?

A large language model is software trained on lots of text so it can predict the next words in a reply, which is what powers assistants like ChatGPT and Claude. It helps you draft and sort recruiting work, but it is not your ATS and it can still be wrong.

Illustration: An LLM core turning job inputs into drafts, summaries, and tags for recruiting work

In practice

  • When someone says "we need an LLM for scheduling," they usually mean "we want ChatGPT-style help," not a lecture on neural nets. The acronym shows up on procurement forms, vendor blogs, and IT approval emails.
  • In a hiring stand-up, people compare "the model guessed" with "the ATS field says" when quality drops. They may not name one vendor model versus another, but everyone knows an assistant wrote the text.
  • IT sends a note like "approved assistants are X and Y for customer data," which is how many companies first introduce the term LLM to recruiters who only care that the button works.

Quick read, then how hiring teams use it

This is for recruiters, sourcers, TA, and HR partners who need the same vocabulary in debriefs, vendor calls, and policy reviews. Skim the first section when you need a fast shared picture. Use the second when you are deciding how it shows up in the ATS, sourcing tools, or candidate communications.

Plain-language summary

  • What it means for you: An LLM is software that finishes sentences, like autocomplete all grown up: it can draft email, summarize notes, and still be wrong about facts.
  • How you would use it: You give short instructions plus the facts you already trust, you read the draft, you send.
  • How to get started: Try the same task in two products you already have licenses for; compare tone, not only speed.
  • When it is a good time: When repeat writing eats your week but you still have a human who owns quality.

When you are running live reqs and tools

  • What it means for you: An LLM is a next-token predictor over a context window, tuned for helpful chat and tool use. APIs expose token counts, temperature, and structured output modes (see the sketch after this list).
  • When it is a good time: When you move from ad hoc chat to shared system instructions, RAG, or automation.
  • How to use it: Pick evaluation sets (intake, outreach, screening notes), log failures, and separate model swaps from prompt changes.
  • How to get started: Read How to use AI in recruiting and align TA and IT on data handling before you expand.
  • What to watch for: Treating the model like a database, shipping prose where you needed structured output, and skipping red-team time on multilingual inputs.
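
A minimal sketch of what "structured output modes" look like in practice, using the OpenAI Python SDK as one example; the model name, JSON keys, and prompts are illustrative assumptions, and other vendors expose equivalent knobs under different names.

```python
# pip install openai
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Ask for JSON instead of prose so downstream automation can parse the reply.
# Model name and schema are placeholders, not recommendations.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    temperature=0.2,  # low temperature: more repeatable drafts, less flair
    response_format={"type": "json_object"},
    messages=[
        {
            "role": "system",
            "content": (
                "Extract fields from intake notes. Reply as JSON with keys "
                '"title", "location", and "must_haves" (a list).'
            ),
        },
        {
            "role": "user",
            "content": "Senior data engineer, Berlin hybrid, needs Spark and strong SQL.",
        },
    ],
)
print(response.choices[0].message.content)  # e.g. {"title": "Senior data engineer", ...}
```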

Where we talk about this

AI in recruiting blocks use LLMs as the shared layer hiring managers actually see. Sourcing automation blocks ask which calls are chat-only versus API-backed. Both show up at Workshops with different homework.

Around the web (opinions and rabbit holes)

Third-party creators move fast. Treat YouTube explainers, Reddit threads, and Quora answers as starting points, not endorsements, and double-check anything before you wire candidate data.

Chat versus API depth

Layer                 You get        You still need
Chat UI               Fast drafts    Copy-paste hygiene
Saved skills / Gems   Consistency    Owners and updates
API + automation      Scale          Keys, monitoring, GDPR

Frequently asked questions

What should recruiters actually know about how an LLM works?
Enough to calibrate trust: it predicts likely next tokens, it does not query your ATS unless wired through tools, and it has a finite context window that trades off with instructions and attachments. That mental model prevents magical thinking in debriefs when someone says "it should know our policy." Pair the overview with your vendor's logging story: who can see prompts, where data is processed, and how long transcripts persist. When hiring managers ask for certainty, translate model behavior into review steps you already staff instead of impossible guarantees.
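If you want the context window trade-off made concrete, a few lines with the tiktoken tokenizer show how shared instructions and pasted notes eat the same budget as the reply; the 8,000-token limit below is an illustrative assumption, since real limits vary by model.

```python
# pip install tiktoken
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # tokenizer used by many OpenAI models

system_prompt = "You write outreach in our tone. Keep it under 120 words. " * 10
pasted_notes = "Candidate: 6 yrs Spark, Berlin, open to hybrid, notice 3 months. " * 200

used = len(enc.encode(system_prompt)) + len(enc.encode(pasted_notes))
budget = 8_000  # illustrative limit; check your model's documented context size

print(f"{used} of {budget} tokens used; {budget - used} left for the reply")
```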
How do I choose between vendors (OpenAI, Anthropic, Google, and others)?
Start from governance and workflow fit: EU data handling, SSO, audit logs, retention controls, and whether you need API access for workflow automation. Benchmarks help compare draft quality, but procurement should weight incident response and subprocessors as highly as leaderboard scores. Run the same twenty recruiting prompts from your last quarter across finalists and score factual errors, not only fluency. Involve IT and legal before you standardize, or you will re-platform six months later when security review finally runs. Capture a side-by-side matrix of logging, residency, and red-team results so executives compare evidence, not slogans.
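A minimal sketch of that twenty-prompt bake-off, assuming a hypothetical call_model adapter you would wire to each finalist's real SDK; the point is that a human scores factual errors afterward, and fluency is not the score.

```python
import csv

def call_model(vendor: str, prompt: str) -> str:
    # Hypothetical adapter: replace with each finalist's real SDK call.
    return f"[{vendor} draft for: {prompt}]"

vendors = ["vendor_a", "vendor_b"]  # your shortlist
prompts = [
    "Draft outreach for a senior data engineer in Berlin (Spark, hybrid).",
    "Summarize these screening notes into five bullet points.",
    # ...the rest of your ~20 real prompts from last quarter
]

with open("bakeoff.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["vendor", "prompt", "output", "factual_errors"])
    for vendor in vendors:
        for prompt in prompts:
            # A human reviewer fills in factual_errors later, prompt by prompt.
            writer.writerow([vendor, prompt, call_model(vendor, prompt), ""])
```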
Is a bigger model always better for recruiting text?
Not always. Smaller models with strong system instructions, curated Markdown for AI, retrieval, and few-shot prompting often beat a frontier model fed a vague paragraph. Cost and latency matter when you scale across reqs and languages. Measure end-to-end time including human review, not only tokens per second. Sometimes the right answer is two specialized calls in a prompt chain instead of one giant completion. Re-evaluate after major vendor releases, because the cheapest accurate stack this quarter may not stay cheapest after pricing or safety filters change.
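One way to read "two specialized calls in a prompt chain": extract structure first, then draft from only that structure, so each step stays small and checkable. The ask wrapper below is a hypothetical stand-in for whichever approved chat API you use, with canned replies so the sketch runs offline.

```python
import json

def ask(system: str, user: str) -> str:
    # Hypothetical wrapper around your approved chat API. Canned replies here
    # keep the sketch runnable without credentials; swap in the real call.
    if system.startswith("Extract"):
        return '{"role": "Senior data engineer", "location": "Berlin", "hook": "Spark"}'
    return "Hi! Your Spark pipeline work caught our eye for a Berlin-based team..."

# Call 1: small, checkable extraction into a fixed schema.
facts = json.loads(ask(
    system="Extract role, location, and one hook as JSON.",
    user="Req 142: Senior data engineer, Berlin hybrid, Spark-heavy pipeline team.",
))

# Call 2: drafting, constrained to the verified facts from call 1.
draft = ask(
    system=f"Write a 90-word outreach using only these facts: {json.dumps(facts)}",
    user="Tone: warm and direct, no buzzwords.",
)
print(draft)
```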
What is the difference between an LLM and automation?
The LLM proposes text, labels, or summaries; automation (Make, n8n, webhooks) moves structured data between systems and triggers actions. Workshops separate "just prompting" from skills in project folders and APIs because integration depth changes GDPR surface area and who gets paged at night. Mixing the two without boundaries yields ghost sends or silent CRM corruption. Document which nodes are allowed to write candidate-visible fields versus draft-only fields. Add monitoring on automation branches that call models so token spikes or schema errors page someone before candidates see broken templates.
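A minimal sketch of that draft-only boundary, assuming made-up field names: an allowlist gate between model output and anything an automation node may write, so a schema surprise fails loudly instead of silently corrupting candidate-visible records.

```python
# Fields an automation branch may write; candidate-visible fields stay human-gated.
DRAFT_ONLY_FIELDS = {"draft_outreach", "draft_summary", "suggested_tags"}

def gate_crm_write(model_output: dict) -> dict:
    """Pass through allowlisted draft fields only; refuse anything else loudly."""
    blocked = set(model_output) - DRAFT_ONLY_FIELDS
    if blocked:
        raise ValueError(f"Model tried to write non-draft fields: {sorted(blocked)}")
    return model_output

# Drafts pass; anything that drifted beyond drafts is rejected before the write.
print(gate_crm_write({"draft_outreach": "Hi Sam...", "suggested_tags": ["spark"]}))
try:
    gate_crm_write({"draft_outreach": "Hi...", "stage": "rejected"})
except ValueError as err:
    print(err)  # this is the failure you want paged, not silently written
```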
Where do maturity models help?
They align TA, HRBPs, sourcing, and finance on how deep you go this quarter versus next, so budget asks map to observable milestones instead of "more AI." Read AI adoption maturity levels and pair it with the AI adoption ladder glossary entry for concrete artifacts per stage. Name owners per stage (prompt library, automation keys, corpus hygiene) or the model slides backward after one busy month. Maturity models fail when they are only marketing; they work when tied to metrics and risk reviews.
Which on-site tools should we standardize on first?
Most teams pick one chat assistant plus one automation path so recruiters are not juggling five stacks with different retention rules. Compare ChatGPT, Claude, and n8n in the directory against your regions and SSO needs before you buy overlapping seats. Run a thirty-day pilot on two real reqs with sourcers and recruiters logging friction daily. Standardization beats feature sprawl when compliance asks for a single DPA map. Publish the approved stack list internally so shadow tools lose their "everyone else uses it" excuse.
Do we need an engineer to use LLMs responsibly?
For chat-first drafting with human send gates, no dedicated engineer is required if TA partners with IT on accounts and logging. For synced CRM writes, webhooks, or bulk processing, you need engineering or a strong ops partner who treats scripts like production. The Starting with AI: the foundations in recruiting course stays recruiter-native first so you earn governance habits before API complexity. Document escalation paths when a model misbehaves in automation, not only when chat feels off. Name a liaison who can read vendor status pages during incidents so recruiters are not guessing alone at 2 a.m.
