Question 1

Where do AI engineers actually spend time online?

Accepted Answer

Most are active on GitHub (check repository stars, pull-request velocity, and research-adjacent forks), Kaggle (competition rankings reveal practical ML skill), arXiv (submitted or cited papers signal research depth), and niche Slack or Discord communities tied to frameworks like PyTorch or JAX. LinkedIn is a lagging indicator for this cohort; many update it only when they are already open to roles. Conference attendee lists from NeurIPS, ICML, ICLR, and CVPR are among the richest sources for senior AI research engineers. Cross-reference two or three of these before reaching out: one strong GitHub project without peer engagement is worth verifying with a quick arXiv or Kaggle check.
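The cross-referencing step can be sketched as a small checklist in Python. This is only an illustration: the `CandidateSignals` fields, the function names, and the default threshold of two sources are assumptions made for the example, not part of any sourcing tool.

```python
from dataclasses import dataclass


@dataclass
class CandidateSignals:
    """Hypothetical checklist: one flag per source named in the answer."""
    github: bool = False      # e.g. an actively maintained repository
    kaggle: bool = False      # e.g. a competition ranking
    arxiv: bool = False       # e.g. a submitted or cited paper
    community: bool = False   # e.g. visible in a framework Slack or Discord
    conference: bool = False  # e.g. listed for NeurIPS, ICML, ICLR, or CVPR


def corroborating_sources(signals: CandidateSignals) -> list[str]:
    """Return the names of the sources where a signal was found."""
    return [name for name, found in vars(signals).items() if found]


def ready_for_outreach(signals: CandidateSignals, minimum: int = 2) -> bool:
    """True when at least `minimum` independent sources agree (assumed threshold)."""
    return len(corroborating_sources(signals)) >= minimum
```

A candidate with only `github=True` would fail this check, which matches the advice to verify a lone GitHub project against arXiv or Kaggle first.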

Question 2

What signals distinguish a strong AI engineer from a general software engineer?

Accepted Answer

Look for domain-specific code: custom model architecture commits, experiment tracking in MLflow or Weights & Biases, and distributed training experience. Research publications or preprints, even as second or third author, indicate familiarity with rigorous evaluation. Kaggle gold medals in modeling competitions signal practical optimization skill under constraints. Open-source contributions to TensorFlow, PyTorch, Hugging Face, or LangChain are strong markers. General SWE credentials without at least one of these signals rarely indicate the depth AI roles need. Verify claims against public profiles before adding a candidate to a shortlist: [hallucination](/ai-glossary-in-practice/hallucination) risk applies to AI-generated profile summaries too.

Question 3

How should I write outreach to an AI engineer who is not actively looking?

Accepted Answer

Reference something specific they built or published: a repository, a Kaggle notebook, a paper you actually read. Avoid generic "exciting opportunity" language; these candidates receive dozens of such messages and filter them quickly. Mention what your team works on at the model or data level (architecture, dataset scale, inference latency targets) so they can self-select. Keep the first message under 80 words: a brief framing of what makes your problem interesting, the stack in one line, and a low-friction ask. Tools for [AI outreach drafting](/ai-glossary-in-practice/ai-outreach-drafting) can speed personalized notes from a profile summary, but always review for hallucinated project details before sending.
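The 80-word limit is easy to check before sending. A minimal sketch in Python, assuming words are simply whitespace-separated tokens; the function names are made up for this example.

```python
MAX_WORDS = 80  # limit taken from the answer above


def word_count(message: str) -> int:
    """Count whitespace-separated words in a draft."""
    return len(message.split())


def under_limit(message: str, limit: int = MAX_WORDS) -> bool:
    """True when the draft is strictly under the word limit."""
    return word_count(message) < limit
```

Running `under_limit(draft)` on each AI-generated draft catches length only; the review for hallucinated project details still has to be done by a person.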

Question 4

What compensation context do I need before sourcing senior AI engineers?

Accepted Answer

In competitive markets, senior ML engineers and research scientists routinely command total compensation packages that exceed general SWE bands at the same level once equity, compute budgets, and publication time are factored in. Benchmarks from Levels.fyi and the Pragmatic Engineer salary surveys provide current data points. Teams that lowball initial outreach waste sourcing effort because AI engineers talk to each other and compensation signals spread fast. Agree on comp bands with your HRBP before the first sourcing wave, not after a verbal offer conversation has started. Skipping this step can close pipelines that took months to build.

Question 5

How does AI actually help with AI engineer sourcing?

Accepted Answer

Semantic search tools can surface GitHub profiles, arXiv authors, and Kaggle contributors who match a target capability profile even when their titles do not include the word "AI." [Boolean search](/ai-glossary-in-practice/boolean-search) strings built with model help can target specific framework expertise across LinkedIn Recruiter and GitHub search. AI drafting tools speed personalized outreach at scale once you have a shortlist. One caveat: AI tools can confuse researchers with practitioners and vice versa, so a human technical reviewer should validate shortlists before scheduling calls. See [AI sourcing tools](/ai-glossary-in-practice/ai-sourcing-tools) for an overview of what holds up in production versus what demos well.
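To make the Boolean string idea concrete, here is a hedged Python sketch that assembles a query from lists of terms. The helper names are hypothetical, and the output uses the generic `AND` / `OR` / `NOT` form with quoted phrases; operator support differs between platforms, so check each platform's own search syntax before pasting a string in.

```python
from typing import Sequence


def quote(term: str) -> str:
    """Wrap multi-word terms in double quotes so they match as a phrase."""
    return f'"{term}"' if " " in term else term


def boolean_query(
    any_of: Sequence[str],
    all_of: Sequence[str] = (),
    none_of: Sequence[str] = (),
) -> str:
    """Build a generic Boolean string: (A OR B) AND C NOT D."""
    parts = []
    if any_of:
        # Alternatives go in one parenthesized OR group.
        parts.append("(" + " OR ".join(quote(t) for t in any_of) + ")")
    # Required terms are joined with AND.
    parts.extend(quote(t) for t in all_of)
    query = " AND ".join(parts)
    # Exclusions are appended one by one.
    for term in none_of:
        query += f" NOT {quote(term)}"
    return query.strip()


print(boolean_query(["PyTorch", "JAX"], all_of=["distributed training"], none_of=["recruiter"]))
# prints: (PyTorch OR JAX) AND "distributed training" NOT recruiter
```

Keeping the term lists in code rather than in a hand-edited string makes it easier to swap frameworks per req and to see exactly what a model-suggested query changed.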

Question 6

What GDPR and data concerns apply when scraping public AI engineer profiles?

Accepted Answer

Scraping public GitHub profiles, arXiv author listings, and Kaggle leaderboards is a gray area in many EU jurisdictions. GDPR's legitimate interest test requires you to weigh candidate privacy expectations against your recruitment need. Always use a compliant [candidate data enrichment](/ai-glossary-in-practice/candidate-data-enrichment) vendor or an internal process that stores minimum fields, sets a clear retention period, and can delete records on request. Do not aggregate personal data across platforms without a legal basis and a documented record of processing. For practical guidance, see [GDPR and first-touch outreach](/ai-glossary-in-practice/gdpr-first-touch-outreach) on email and message consent flows for cold sourcing.
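An internal process with minimum fields, a retention period, and deletion on request might look roughly like the Python sketch below. Everything here is an assumption for illustration: the field list, the 180-day `RETENTION_DAYS` value, and the use of the profile URL as the key for erasure requests. None of it is legal advice, and the actual retention period should come from your legal or privacy team.

```python
from dataclasses import dataclass
from datetime import date, timedelta

RETENTION_DAYS = 180  # hypothetical value; agree the real period with legal


@dataclass
class CandidateRecord:
    """Minimum fields only; anything not needed for first contact is left out."""
    name: str
    profile_url: str
    source: str                 # e.g. "github", "arxiv", "kaggle"
    collected_on: date
    legal_basis: str = "legitimate interest"  # documented per record

    def expires_on(self) -> date:
        return self.collected_on + timedelta(days=RETENTION_DAYS)

    def is_expired(self, today: date) -> bool:
        return today >= self.expires_on()


def purge(records, today, erasure_requests=frozenset()):
    """Keep records that are unexpired and not subject to a deletion request."""
    return [
        r for r in records
        if not r.is_expired(today) and r.profile_url not in erasure_requests
    ]
```

Running `purge` on a schedule, with the set of profile URLs that asked for deletion, covers both the retention rule and the delete-on-request rule in one pass.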

Question 7

Where can I build AI engineer sourcing skills with a community?

Accepted Answer

Live workshops at [AI with Michal](/workshops) cover the sourcing stack for technical roles, including Boolean string construction for GitHub and arXiv, how to read a commit history as a technical signal, and how to structure outreach sequences without damaging employer brand with a cynical candidate pool. The [Starting with AI: foundations in recruiting](/store/courses/starting-with-ai-foundation) course covers prompting and outreach workflows that apply directly to technical sourcing. Bring a real req to a live session rather than a hypothetical one: hands-on practice with your actual job description surfaces what tools actually help versus what is demo-ware.

| Dimension | General tech sourcing | AI engineer sourcing |
| --- | --- | --- |
| Primary signal | LinkedIn title, years of experience | GitHub commits, papers, competition rankings |
| Candidate pool | Wide | Narrow and globally distributed |
| Outreach trigger | Role and compensation | Specific technical problem or dataset |
| Research needed before outreach | Low | High (read one project or paper) |
| Comp alignment needed before sourcing | Optional | Required |

AI engineer sourcing
