Lead scraping is the foundation of every successful cold email campaign. Without a high-quality list of verified prospects, even the best copy and infrastructure will underperform. At imisofts, we have helped agencies and B2B companies build lead lists of 10,000+ verified contacts across multiple industries and geographies.
The challenge is not finding leads — the internet has billions of them. The challenge is finding the right leads: decision-makers at companies that actually need your solution, with verified contact information, at scale and speed.
This guide covers the complete lead scraping workflow we use for our clients, from tool selection to data enrichment to list optimization.
What is Lead Scraping?
Lead scraping is the systematic extraction of prospect data from public sources. This includes company websites, LinkedIn, industry directories, job boards, and other publicly available databases. The data you extract includes names, job titles, company names, email addresses, phone numbers, LinkedIn profiles, and company metadata like industry, size, location, and revenue.
Lead scraping is different from list buying. When you buy a list, you get a pre-compiled set of leads that may or may not be relevant to your ICP (Ideal Customer Profile). When you scrape, you define your exact targeting criteria first, then extract leads that match those criteria precisely.
For example, if you are targeting "VP of Sales at mid-market SaaS companies with Series B+ funding in the US," lead scraping lets you find exactly those people. List buying would send you thousands of generic leads with marginal fit.
The Lead Scraping Tools We Use
We test and maintain relationships with multiple lead scraping platforms because each has strengths in different scenarios.
Apollo is our primary tool for most campaigns. Apollo advertises a database of 250+ million B2B contacts across tens of millions of companies. The search interface lets you filter by company size, industry, title, technology stack, location, hiring intent, recent funding, and dozens of other parameters.
The advantage of Apollo is depth and breadth. You can find niche titles, filter by specific technologies in the company's stack, and access historical hiring data. Apollo also integrates directly with most campaign platforms, so you can pull leads straight into Instantly or SmartLead without manual upload.
SmartLead Supersearch is our second tool of choice. SmartLead has a slightly smaller database (around 200 million contacts) but offers superior filtering for European markets and GDPR-compliant sourcing. If you are targeting Germany, UK, France, Netherlands, or other EU countries, SmartLead often delivers cleaner, more verified data.
Supersearch also excels at finding professional email addresses — the native domain email addresses that have higher deliverability than generic Gmail or Outlook accounts. For our European scrape project that generated 26,500+ leads across 30+ countries, SmartLead was the primary tool.
Clay is our enrichment platform. After scraping initial leads from Apollo or SmartLead, we use Clay to add missing data — phone numbers, company websites, LinkedIn profiles, recent funding information, and technographic data. Clay integrates with 40+ data sources (Apollo, Hunter, Clearbit, LinkedIn, Crunchbase, and others) and runs waterfall enrichment automatically.
Hunter.io is our backup email finder. If Apollo or SmartLead does not have an email for a contact, Hunter can often find it by pattern-matching against the company domain. Hunter is particularly strong for European leads and smaller companies.
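To make the idea of pattern-matching against a company domain concrete, here is a minimal sketch that generates candidate addresses from a name and a domain. This is not Hunter's actual method. The function name `candidate_emails` and the list of naming conventions are assumptions chosen for illustration, and every candidate would still need to be verified before sending.

```python
def candidate_emails(first_name: str, last_name: str, domain: str) -> list:
    """Return guessed addresses for one person at one company domain.

    The conventions and their order are illustrative assumptions,
    not a ranking taken from any email-finder tool.
    """
    first = first_name.strip().lower()
    last = last_name.strip().lower()
    domain = domain.strip().lower()
    if not first or not last or not domain:
        return []  # not enough information to guess anything
    local_parts = [
        f"{first}.{last}",     # ann.lee
        f"{first}{last}",      # annlee
        f"{first[0]}{last}",   # alee
        f"{first}_{last}",     # ann_lee
        first,                 # ann
    ]
    return [f"{local}@{domain}" for local in local_parts]


guesses = candidate_emails("Ann", "Lee", "acme.com")
```

In practice, a known address at the same domain tells you which convention the company uses, so only one candidate needs checking.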
The Lead Scraping Workflow
Here is the exact workflow we follow for every campaign:
Step 1: Define Your ICP
Before scraping a single lead, define exactly who you want to reach. This includes job titles, company size, industry, location, technologies used, recent funding, and growth indicators.
For example, one of our agency clients targeting marketing automation buyers defined their ICP as:
- Title: VP of Marketing, CMO, Director of Marketing, Head of Marketing
- Company size: 50-500 employees
- Annual revenue: $5M-$100M
- Industry: B2B SaaS, marketing tech, business services
- Location: United States
- Technologies: HubSpot, Marketo, or Salesforce with Pardot
This level of specificity is crucial. Vague ICPs result in bloated lists full of mismatched leads.
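One way to keep an ICP this specific is to write it down as data and test every lead against it. The sketch below encodes the example ICP above. The class name `ICP`, the function `matches_icp`, and the lead field names (`title`, `employees`, `revenue_usd`, and so on) are assumptions for illustration, not the export schema of any particular tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ICP:
    """Targeting criteria. Field names are hypothetical."""
    title_keywords: tuple
    min_employees: int
    max_employees: int
    min_revenue_usd: int
    max_revenue_usd: int
    industries: tuple
    countries: tuple
    technologies: tuple


def matches_icp(lead: dict, icp: ICP) -> bool:
    """Return True only if the lead satisfies every criterion."""
    title = lead.get("title", "").lower()
    lead_tech = {t.lower() for t in lead.get("technologies", [])}
    return (
        any(k.lower() in title for k in icp.title_keywords)
        and icp.min_employees <= lead.get("employees", 0) <= icp.max_employees
        and icp.min_revenue_usd <= lead.get("revenue_usd", 0) <= icp.max_revenue_usd
        and lead.get("industry", "").lower() in {i.lower() for i in icp.industries}
        and lead.get("country", "") in icp.countries
        and bool(lead_tech & {t.lower() for t in icp.technologies})
    )


# The example ICP from the list above. "Pardot" stands in for the
# Salesforce/Pardot criterion.
MARKETING_ICP = ICP(
    title_keywords=("VP of Marketing", "CMO", "Director of Marketing", "Head of Marketing"),
    min_employees=50,
    max_employees=500,
    min_revenue_usd=5_000_000,
    max_revenue_usd=100_000_000,
    industries=("B2B SaaS", "marketing tech", "business services"),
    countries=("United States",),
    technologies=("HubSpot", "Marketo", "Pardot"),
)
```

Requiring every criterion to pass is a deliberate choice here: a lead that matches on title but not on company size is exactly the kind of loose match that bloats a list.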
Step 2: Scrape Using Apollo or SmartLead
Log in to Apollo, build out your search filters, and set parameters for titles, company size, industry, location, and any technology filters relevant to your pitch. For the ICP above, the Apollo search would include all of those parameters. Apollo will return thousands of matching contacts. Export them to a CSV file.
If you are targeting European markets, switch to SmartLead Supersearch and repeat the same process. SmartLead often has superior data quality for EU prospects, especially regarding email verification and GDPR compliance.
Step 3: Deduplication
If you scraped from both Apollo and SmartLead, or if you are combining new scrapes with older lead lists, deduplicate. Use the email address as the primary dedup key, then fall back to name plus company as a secondary key to catch the same person listed with a missing or different email. Normalize case and whitespace before comparing. We use simple Python scripts or Google Sheets formulas for deduplication.
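A minimal sketch of a two-key deduplication pass might look like the following. The field names (`email`, `first_name`, `last_name`, `company`) are assumed column names rather than a fixed export format, and the first occurrence of each person is the one that is kept.

```python
def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not hide duplicates."""
    return " ".join(value.lower().split())


def deduplicate(leads: list) -> list:
    """Drop leads whose email, or whose name + company, was already seen."""
    seen_emails = set()
    seen_people = set()
    unique = []
    for lead in leads:
        email = normalize(lead.get("email", ""))
        full_name = normalize(lead.get("first_name", "") + " " + lead.get("last_name", ""))
        person = (full_name, normalize(lead.get("company", "")))

        # Primary key: email address.
        if email and email in seen_emails:
            continue
        # Secondary key: name + company, only when both parts are present.
        if all(person) and person in seen_people:
            continue

        if email:
            seen_emails.add(email)
        if all(person):
            seen_people.add(person)
        unique.append(lead)
    return unique
```

If the newer scrape is expected to be more accurate, put its rows first in the input list, since later duplicates are the ones discarded.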
Step 4: Data Enrichment with Clay
Now you have your raw lead list. Use Clay to run a waterfall enrichment that automatically pulls phone numbers, company websites, recent funding, technographic data, and LinkedIn profiles for each lead. Clay tries multiple data sources in sequence, so if Apollo does not have a phone number, Clay checks Hunter, then Clearbit, then other sources.
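The waterfall logic itself is simple: try each source in order and stop at the first one that returns a value. The sketch below illustrates that control flow only. It does not call Clay or any real provider API; the lookup functions are hypothetical stubs backed by small in-memory tables.

```python
from typing import Callable, Optional

Lookup = Callable[[dict], Optional[str]]


def waterfall(lead: dict, sources: list) -> tuple:
    """Try each (name, lookup) pair in order.

    Returns (value, source_name) for the first non-empty result,
    or (None, None) if every source comes back empty.
    """
    for name, lookup in sources:
        value = lookup(lead)
        if value:
            return value, name
    return None, None


def _stub(table: dict) -> Lookup:
    """Hypothetical stand-in for a provider: looks up the lead's email in a dict."""
    return lambda lead: table.get(lead.get("email"))


# Order mirrors the sequence described above. The labels are names only;
# no real provider is contacted.
PHONE_SOURCES = [
    ("apollo", _stub({})),
    ("hunter", _stub({})),
    ("clearbit", _stub({"ann@acme.com": "+1-555-0100"})),
]

phone, source = waterfall({"email": "ann@acme.com"}, PHONE_SOURCES)
```

Returning the source name alongside the value makes it possible to audit later which provider supplied each field.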
A properly enriched lead has: first name, last name, email, phone, company, title, company website, company size, industry, LinkedIn URL, recent funding, and key technologies in use.
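A quick completeness check against that field list could be sketched as follows. The snake_case field names are assumptions that map one-to-one onto the fields listed above.

```python
# Assumed column names for the twelve fields of a properly enriched lead.
REQUIRED_FIELDS = (
    "first_name", "last_name", "email", "phone", "company", "title",
    "company_website", "company_size", "industry", "linkedin_url",
    "recent_funding", "technologies",
)


def missing_fields(lead: dict) -> list:
    """Return the required fields that are absent or empty, in checklist order."""
    return [field for field in REQUIRED_FIELDS if not lead.get(field)]
```

Running this over the whole list after enrichment shows which fields are worth another enrichment pass and which leads are too sparse to keep.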
Step 5: List Validation and Segmentation
Before running a campaign, validate the list. Remove any leads with invalid email addresses, mismatched titles, or companies outside your actual ICP. A smaller list of high-quality, tightly matched leads will outperform a large list of loose matches.
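As a rough sketch, a first validation pass can filter on email syntax and title match. Note the assumption: the regular expression below only checks that an address is well-formed, which is weaker than mailbox verification by a dedicated service. The function name `validate` and the field names are hypothetical.

```python
import re

# Syntactic check only: something@something.tld with no spaces.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate(leads: list, allowed_title_keywords: list) -> tuple:
    """Split leads into (kept, rejected) by email syntax and title match."""
    kept, rejected = [], []
    for lead in leads:
        email_ok = bool(EMAIL_RE.match(lead.get("email", "").strip()))
        title = lead.get("title", "").lower()
        title_ok = any(k.lower() in title for k in allowed_title_keywords)
        if email_ok and title_ok:
            kept.append(lead)
        else:
            rejected.append(lead)
    return kept, rejected
```

Keeping the rejected rows, rather than silently dropping them, makes it easy to spot a filter that is too strict.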
Segment the list into campaigns based on company size, industry, or other factors. This allows you to craft tailored messaging for each segment.
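Segmentation can be sketched as grouping leads by a key function and writing one CSV per group. The size bands in `size_band` are hypothetical thresholds picked for illustration, as are the function and field names.

```python
import csv
from collections import defaultdict
from pathlib import Path


def segment(leads: list, key) -> dict:
    """Group leads by whatever label the key function returns."""
    groups = defaultdict(list)
    for lead in leads:
        groups[key(lead)].append(lead)
    return dict(groups)


def size_band(lead: dict) -> str:
    """Hypothetical size bands; adjust the thresholds to your own ICP."""
    employees = lead.get("employees", 0)
    if employees < 100:
        return "small"
    if employees < 250:
        return "medium"
    return "large"


def write_segments(groups: dict, out_dir: str) -> list:
    """Write one CSV file per segment and return the file paths."""
    paths = []
    for name, rows in groups.items():
        path = Path(out_dir) / f"{name}.csv"
        # Use the union of keys so rows with missing fields still fit the header.
        fieldnames = sorted({k for row in rows for k in row})
        with path.open("w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=fieldnames)
            writer.writeheader()
            writer.writerows(rows)
        paths.append(path)
    return paths
```

Swapping `size_band` for a function that returns an industry label gives industry segments with no other changes.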
Real-World Examples: Lead Scraping at Scale
One of our European agency clients ran a lead scraping project that produced 26,500+ leads across 30+ countries in just three weeks. Approximately 5,000 of those leads reached verified email status. After data enrichment with Clay, the client had phone numbers, funding information, and technographic data for most leads.
The leads were then segmented into 3 campaign-ready CSV files: Agency/White-label, SaaS/Tech, and Other verticals. This segmentation allowed the client to craft industry-specific pitches rather than one generic message to all 26,500 leads.
Using our infrastructure packages, the client was able to run these segmented campaigns simultaneously across multiple domains and inboxes, achieving 50-80% open rates and 1-3% reply rates.
Economics of Lead Scraping
Most clients ask: "How much does lead scraping cost?"
Here is the breakdown:
- Apollo or SmartLead subscription: $150-$500 per month depending on export limits
- Clay enrichment: $200-$1,000 per month depending on enrichment volume
- Email finder tools (Hunter, etc): $50-$200 per month
- Time cost: 5-10 hours per campaign for scraping, dedup, and setup
Total tool cost per campaign: approximately $400-$1,700 for one month of subscriptions, plus the 5-10 hours of labor. For a client scraping 10,000 leads, the tool cost works out to roughly $0.04-$0.17 per lead, which is dramatically cheaper than list-buying services that often charge $0.50-$2.00 per lead.
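The per-lead arithmetic can be reproduced from the monthly tool ranges listed above. This sketch assumes one month of each subscription per campaign and leaves labor out, since the breakdown gives hours but no hourly rate. Summing the upper ends of the three ranges gives $1,700.

```python
# Monthly tool cost ranges in USD, (low, high), taken from the list above.
TOOL_COSTS_USD = {
    "Apollo or SmartLead subscription": (150, 500),
    "Clay enrichment": (200, 1000),
    "Email finder tools": (50, 200),
}


def tool_cost_range(costs: dict) -> tuple:
    """Sum the low ends and the high ends of every range."""
    low = sum(lo for lo, _ in costs.values())
    high = sum(hi for _, hi in costs.values())
    return low, high


def cost_per_lead(total_usd: float, leads: int) -> float:
    """Total spend divided by the number of leads scraped."""
    return total_usd / leads


low, high = tool_cost_range(TOOL_COSTS_USD)
per_lead_low = cost_per_lead(low, 10_000)
per_lead_high = cost_per_lead(high, 10_000)
print(f"Tools: ${low}-${high}; per lead: ${per_lead_low:.2f}-${per_lead_high:.2f}")
```

To include labor, add hours multiplied by your own hourly rate to the total before dividing.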
Combining Scraping with imisofts Infrastructure
Once you have your scraped, enriched, verified lead list, the next step is campaign execution. At imisofts, we help you run this list through our cold email infrastructure to maximize deliverability and replies.
Our infrastructure handles the technical side — DNS authentication, warmup, inbox rotation, sending limits — while you focus on copy and targeting. Combined with a high-quality scraped list, this drives 50-80% open rates and 1-3% reply rates.
Learn more about our cold email infrastructure packages at imisofts.com/cold-email-marketing#packages.
Conclusion
Lead scraping is a core skill for any B2B outbound operation. Start with a clearly defined ICP, use Apollo or SmartLead to extract matching leads, enrich the data with Clay, validate quality, and segment for personalized campaigns. Done right, you can build 10,000+ verified, tightly matched leads in 2-3 weeks for a fraction of what list-buying would cost.