AI with Michal

Agent knowledge base

A curated set of Markdown or text files (often in a Claude project, repo, or shared folder) that gives an assistant stable facts about how you hire: tone, templates, scorecard rules, and disallowed phrases, without retyping them each session.

Michal Juhas · Last reviewed May 2, 2026

What is an agent knowledge base?

An agent knowledge base is a small set of files your team keeps fresh so an AI assistant can read the same facts you rely on. It cuts repeat questions and keeps tone, roles, and rules in one place.

Illustration: A curated knowledge shelf feeding an assistant that cites internal binders

In practice

  • A Confluence or Drive folder called "How we hire at X" holds tone, email limits, and visa basics, and the team tells new hires to link it when they use the assistant. Vendors may call it a "knowledge base for agents" in release notes.
  • IT pilots an internal chatbot and asks TA to upload five canonical pages first. Recruiters experience it as "the bot finally knows our acronyms" instead of guessing from the open web.
  • Stand-ups say "update the knowledge base" when comp bands or remote policy change so the assistant does not keep quoting last year's file.

Quick read, then how hiring teams use it

This is for recruiters, sourcers, TA, and HR partners who need the same vocabulary in debriefs, vendor calls, and policy reviews. Skim the first section when you need a fast shared picture. Use the second when you are deciding how it shows up in the ATS, sourcing tools, or candidate communications.

Plain-language summary

  • What it means for you: It is the short library your AI is allowed to read before it answers, like the binder on your desk with tone rules, acronyms, and the latest comp policy.
  • How you would use it: You upload a few approved pages, you tell the bot "only use this," and you still check anything that goes to a candidate.
  • How to get started: List five questions the team asks every week. Find the one canonical answer for each. Put those answers in Markdown files with dates in the title; a sketch of one such file follows this list.
  • When it is a good time: When everyone gets different answers from the same tool because nobody shared the same context.
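
To make the last bullet concrete, here is a minimal sketch of one such file. The folder name, filename, headings, and owner line are placeholders to adapt, not a required format:

    File: how-we-hire/outreach-tone-2026-05.md

    Owner: TA ops (rotating)
    Last reviewed: 2026-05-02

    ## What this covers
    Tone and length rules for first outreach by InMail and email.

    ## Do not say
    - Comp figures that legal has not approved for external use.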

When you are running live reqs and tools

  • What it means for you: A knowledge base is curated retrieval: files or chunks the model can cite, paired with owners, diffs, and refresh rules. It is the recruiter-facing side of RAG before you buy a vector database.
  • When it is a good time: When LLM tokens spike from "upload everything" dumps but quality does not move.
  • How to use it: Prefer Markdown for AI, version in Git or a shared drive with permissions, and tie updates to policy owners. Forked copies per recruiter are a smell.
  • How to get started: Mirror one internal FAQ, measure fewer repeat questions in Slack, then widen.
  • What to watch for: Stale PDFs, pasted screenshots of old rules, and duplicate "final" files that disagree; a staleness-check sketch follows this list.
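
Teams that keep the base in Git or a shared drive sometimes script the refresh rule. The sketch below is one way to do it, assuming each Markdown file carries a "Last reviewed: YYYY-MM-DD" line and lives under a knowledge-base/ folder; the folder name and the 90-day window are placeholders, not a standard.

    # Flag files whose "Last reviewed:" date is older than the refresh window,
    # so the owner on rotation knows what to re-approve or retire.
    from datetime import date, timedelta
    from pathlib import Path
    import re

    REFRESH_WINDOW = timedelta(days=90)   # placeholder: match your review cadence
    REVIEWED = re.compile(r"Last reviewed:\s*(\d{4}-\d{2}-\d{2})")

    def stale_files(base_dir: str = "knowledge-base") -> list[str]:
        """Return files missing a review line or reviewed too long ago."""
        stale = []
        for path in Path(base_dir).rglob("*.md"):
            match = REVIEWED.search(path.read_text(encoding="utf-8"))
            if match is None:
                stale.append(f"{path}: no 'Last reviewed' line")
            elif date.today() - date.fromisoformat(match.group(1)) > REFRESH_WINDOW:
                stale.append(f"{path}: last reviewed {match.group(1)}")
        return stale

    if __name__ == "__main__":
        for line in stale_files():
            print(line)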

Where we talk about this

AI in recruiting blocks treat the knowledge base as the bridge between "we bought licenses" and "hiring managers trust the answers." Sourcing automation days connect the same files to API-heavy stacks. Bring your real folder chaos to Workshops.

Around the web (opinions and rabbit holes)

Third-party creators move fast. Treat these as starting points, not endorsements, and double-check anything before you wire candidate data.

Knowledge base versus chat-only memory

Pattern                 Strength              Weakness
Chat-only               Fast start            Context lost, hard to audit
Shared Markdown base    Portable, diffable    Needs owners
Drive dump              Easy upload           Noisy, expensive tokens

Frequently asked questions

How is this different from RAG?
RAG usually implies dynamic retrieval over a larger corpus at query time, with chunking and ranking tuned per question. An agent knowledge base is often smaller, hand-maintained, and versioned like internal product docs your team trusts day to day. You can combine both: stable "always read" files plus retrieval for long archives. The distinction is really about ownership: who deletes outdated comp guidance matters more than whether you used vectors. Start small so reviewers can actually read everything in the base quarterly.
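
A rough sketch of that combination is below; the file names and the retrieve_from_archive helper are placeholders for whatever search your stack actually provides, not a specific product's API.

    from pathlib import Path

    ALWAYS_READ = ["tone.md", "scorecard-anchors.md", "do-not-say.md"]  # placeholders

    def retrieve_from_archive(question: str, top_k: int = 3) -> list[str]:
        # Placeholder: swap in your vector store, Drive search, or vendor retrieval.
        return []

    def build_context(question: str, base_dir: str = "knowledge-base") -> str:
        # Stable "always read" files first, then a few retrieved chunks for this question.
        stable = [Path(base_dir, name).read_text(encoding="utf-8") for name in ALWAYS_READ]
        return "\n\n---\n\n".join(stable + retrieve_from_archive(question))
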
What files belong in the first version?
Ship a minimum lovable corpus: employer or agency positioning, channel rules (InMail versus email), three anonymized strong outreach examples, scorecard anchors with observable behaviors, booking links, and a short, legal-approved "do not say" list. Use Markdown for AI so changes are diffable, and mirror key bullets into system instructions inside vendor UIs for consistency. Avoid dumping every PDF from 2019; stale files are how assistants confidently cite wrong visa lines. Tag v1 with a README that links to the approval ticket so future editors know which files counsel already blessed.
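
A hypothetical v1 layout, with every name a placeholder to rename to your own vocabulary:

    knowledge-base/
      README.md               # links the approval ticket, names owners and review cadence
      positioning.md
      channel-rules.md         # InMail versus email limits
      outreach-examples.md     # three anonymized strong examples
      scorecard-anchors.md
      do-not-say.md            # the legal-approved list
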
Who maintains it?
Name a rotating owner (sourcer, recruiter, or TA ops) with quarterly review on the calendar, not "the team." Without ownership, enthusiasm from one workshop decays into conflicting copies in personal drives. Maintenance includes deletes, not only adds: when comp bands or remote policy change, retire old files loudly. Pair maintenance with metrics recruiters feel: fewer repeated Slack questions, faster HM alignment on tone. Escalate access issues to IT early so contractors are not editing canonical tone files anonymously. Publish the on-call rotation beside the repo README so PTO does not pause every update.
What data should never live there?
Unredacted candidate PII, unreleased compensation bands you cannot defend in audit, secrets without legal review, or anything you would not paste into a vendor support ticket. Treat the folder like HR documentation with retention and access rules, especially if workflow automation later pipes excerpts into API calls. If a file is "for AI only," it still counts as processing personal data. When unsure, ask counsel before you optimize for convenience. Keep a short blocklist of file types (raw CRM exports, full comp grids) that assistants should refuse even when someone uploads them in a hurry.
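
One way to make that blocklist executable rather than aspirational is a small pre-upload gate; the suffixes and keywords below are examples only, and legal or IT should own the real list.

    BLOCKED_SUFFIXES = {".csv", ".xlsx"}                       # e.g. raw CRM exports, comp grids
    BLOCKED_KEYWORDS = ("comp_grid", "crm_export", "candidate_list")

    def allowed_to_upload(filename: str) -> bool:
        # Refuse blocklisted file types before anything reaches the assistant.
        name = filename.lower()
        if any(name.endswith(suffix) for suffix in BLOCKED_SUFFIXES):
            return False
        return not any(keyword in name for keyword in BLOCKED_KEYWORDS)
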
How does this connect to automation?
Once files are stable, workflow automation can pass excerpts or structured fields into model calls per ATS row, but only after you have logging, retries, and a human inbox for failures. Automation should read the same canonical Markdown humans edit, not forked snippets in a Zapier field nobody tracks. Version the prompt templates that wrap excerpts so you can roll back when a hiring manager flags tone. Test with synthetic rows before you touch real candidates. Add a dry-run mode that logs proposed payloads without sending until a named reviewer flips production on.
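
A minimal sketch of that dry-run gate, assuming a hypothetical send_to_model call and an ATS row passed in as a plain dict; none of the names map to a specific vendor API.

    import json
    import logging

    logging.basicConfig(level=logging.INFO)
    DRY_RUN = True   # a named reviewer flips this for production

    def send_to_model(payload: dict) -> None:
        # Placeholder for the real vendor call, wrapped in logging and retries.
        raise NotImplementedError("wire this to your model call")

    def process_row(ats_row: dict, excerpt: str, prompt_version: str = "v1") -> None:
        payload = {
            "prompt_version": prompt_version,   # versioned so tone changes can be rolled back
            "excerpt": excerpt,                 # read from the same canonical Markdown humans edit
            "row_id": ats_row.get("id"),
        }
        if DRY_RUN:
            logging.info("DRY RUN, not sent: %s", json.dumps(payload))
            return
        send_to_model(payload)
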
Where can we learn the habits around it?
Read AI-native for the operating style, climb the AI adoption ladder deliberately, with artifacts rather than slogans, and practice in a workshop or the Starting with AI: the foundations in recruiting course. Bring your worst "folder of final" story so peers can help you design naming and review habits that survive PTO. If you are first in the company, publish a short charter that names approvers, review cadence, and which directories are canonical versus sandbox experiments so IT and legal know where truth lives.

← Back to AI glossary in practice