
How Rodz Produces Intent Signals: 350+ Scrapers

Peter Cools · Updated on May 3, 2026 · 5 min read

The Collection Infrastructure

350+ scrapers running continuously

An intent signal doesn’t appear out of thin air. Behind every detected event sits a large collection infrastructure. Rodz runs more than 350 scrapers that continuously query public and partner data sources, using platforms like Apify to orchestrate and run crawlers at scale. Each scraper targets a specific source type, whether that’s a business registry, a job board, a professional social network, or a sector-specific database, and it’s tuned to extract relevant information without hammering the source server.

These scrapers run around the clock. Query frequency varies by source: some registries update daily, others in real time. The point is to catch every event within hours of publication. Rodz fully rebuilds each scraper four or five times a year, because sources change their structure constantly and a scraper that isn’t maintained drifts back to producing noise.

250+ cross-referenced data sources

The 250+ sources queried cover the full digital footprint of a business.

Legal and administrative sources

  • SIRENE: registrations, address changes, statutory modifications
  • Commercial registries: financial filings, capital increases
  • BODACC: legal notices, transfers, liquidations
  • Legal announcement journals: official publications

Employment and HR sources

  • General job boards (Indeed, HelloWork) and specialist ones (Welcome to the Jungle, Talent.io)
  • LinkedIn: job postings, executive role changes
  • Company career pages

Financial sources

  • Press releases: funding rounds, partnerships, financial results
  • Investor databases: funding rounds, valuations
  • Industry publications

Media and social sources

  • LinkedIn: posts, comments, engagement signals
  • Industry press: articles mentioning target companies
  • Google Maps: new locations, address changes

Cross-referencing sources

The value Rodz produces doesn’t come from any single source in isolation. Anyone can check SIRENE or Indeed, or spin up a scraper on Apify. The difference comes from cross-referencing: a hiring surge on Indeed, matched against a funding round picked up in a press release and a leadership change on LinkedIn, forms a composite signal that is considerably stronger than any one of those events read alone. That stacking is where the operational value comes from.
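The stacking idea can be sketched in a few lines. This is a minimal illustration, not Rodz's implementation: the `Event` record, the 30-day window, and the rule that a composite needs events from at least two distinct sources are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Event:
    company_id: str      # hypothetical identifier, e.g. a registry number
    source: str          # e.g. "indeed", "press", "linkedin"
    kind: str            # e.g. "hiring_surge", "funding_round"
    seen_at: datetime


def composite_signals(events, window=timedelta(days=30), min_sources=2):
    """Group events per company and keep companies whose events within
    `window` of their most recent event come from at least `min_sources` sources."""
    by_company = defaultdict(list)
    for ev in events:
        by_company[ev.company_id].append(ev)

    composites = {}
    for company_id, evs in by_company.items():
        latest = max(ev.seen_at for ev in evs)
        recent = [ev for ev in evs if latest - ev.seen_at <= window]
        # Assumed rule: a composite requires corroboration from distinct sources.
        if len({ev.source for ev in recent}) >= min_sources:
            composites[company_id] = sorted(ev.kind for ev in recent)
    return composites
```

A company with a hiring surge on one source and a funding round on another inside the window would be returned with both event kinds; a company seen on a single source would not.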

The 108 Signal Types

A complete taxonomy of commercial events

Rodz categorizes events into 108 signal types organized into major families:

  • Growth signals: hiring, capital increases, office openings, headcount growth
  • Transformation signals: executive appointments, mergers/acquisitions, reorganizations, strategy changes
  • Intent signals: solution research, event attendance, tech stack changes
  • Regulatory signals: compliance initiatives, certifications, patent filings
  • Financial signals: funding rounds, published results, investments

Each signal type has its own characteristics: how often it’s detected, how strong a need it reveals, how long it stays relevant, and which sectors it applies to.

222 configurations per signal

The same signal type reads differently depending on context: hiring three people doesn’t mean the same thing as hiring thirty. That’s why each signal can be configured across 222 parameters:

  • Trigger thresholds: minimum number of hires, minimum funding amount
  • Sector filters: industry codes, keywords in business description
  • Geographic filters: departments, regions, countries
  • Size filters: minimum/maximum headcount, revenue
  • Role criteria: type of position being hired, seniority level of the new executive
  • Combinations: pairing multiple signals to produce composite signals

This granularity lets each company configure detection that actually fits its offering and market, rather than receiving a generic feed it has to filter manually afterward.
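To make the filter families concrete, here is a hedged sketch of how a hiring-signal configuration might be expressed and applied. Every field name, default value, and the dict-based event shape are hypothetical; the actual 222 parameters are not listed in this article.

```python
from dataclasses import dataclass, field


@dataclass
class HiringSignalConfig:
    # All field names are hypothetical; they mirror the filter families listed above.
    min_hires: int = 10                                # trigger threshold
    sector_codes: set = field(default_factory=set)     # empty set = any sector
    regions: set = field(default_factory=set)          # empty set = any region
    min_headcount: int = 0                             # size filter, lower bound
    max_headcount: int = 10**9                         # size filter, upper bound


def matches(config, event):
    """Return True when a raw hiring event (a plain dict) passes every filter."""
    if event["hires"] < config.min_hires:
        return False
    if config.sector_codes and event["sector_code"] not in config.sector_codes:
        return False
    if config.regions and event["region"] not in config.regions:
        return False
    return config.min_headcount <= event["headcount"] <= config.max_headcount
```

With `min_hires=10`, an event reporting thirty hires passes the threshold while one reporting three is dropped before it ever reaches a sales rep.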

The Verification Process

From raw signal to actionable signal

A raw event isn’t an actionable signal. Between detection and delivery to the sales rep, several processing steps run in sequence.

  1. Deduplication: the same event often surfaces across multiple sources. Rodz eliminates duplicates so only one signal per event gets delivered.

  2. Contextualization: the signal is enriched with company information (size, sector, history) so the sales rep understands the context at a glance, without extra research.

  3. Balance Scoring: a proprietary model assigns a score combining the nature of the signal (strength of revealed intent) and its recency. That recency coefficient decays after 48 hours.

  4. Deep Search Enrichment: the right contact’s details are identified via a SIRENE, Google Maps, and LinkedIn cascade, with 80 to 85% accuracy on professional emails.

  5. Delivery: the enriched, scored, and contextualized signal goes to the client’s CRM or appears in the Rodz interface.
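The deduplication step can be illustrated with a short sketch. It assumes that two raw events describe the same real-world event when they share a company, a signal kind, and a date; that matching key and the dict keys used here are assumptions for the example, not a description of Rodz's matching logic.

```python
def deduplicate(events):
    """Keep one record per (company, kind, date) triple.

    `events` is an iterable of dicts with hypothetical keys
    "company_id", "kind", "date", and "source". The first occurrence is kept,
    and every source that reported the event is recorded under "sources".
    """
    seen = {}
    for ev in events:
        key = (ev["company_id"], ev["kind"], ev["date"])
        if key not in seen:
            seen[key] = {**ev, "sources": [ev["source"]]}
        elif ev["source"] not in seen[key]["sources"]:
            seen[key]["sources"].append(ev["source"])
    return list(seen.values())
```

Keeping the list of corroborating sources, rather than discarding the duplicates outright, preserves the information needed later for cross-referencing.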

The 48-hour rule

The full process, detection through delivery, must complete within 48 hours of the event. That deadline isn’t arbitrary. Rodz data shows a signal loses 80% of its value past that threshold. The prospect has already been contacted by someone else, has started their own research, or has been pulled into other priorities. An intent signal is only valuable inside that window. Within it, reply rates run 4x cold-outbound levels. Past it, the message lands like any other cold email.
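As a rough illustration of how a recency coefficient could enter a score, the sketch below applies the simplest possible reading of the figures above: full weight inside the 48-hour window, and a flat 0.2 multiplier afterward (the "loses 80% of its value" figure). The Balance model is proprietary, so the step shape, the function names, and the 0-to-1 strength scale are all assumptions.

```python
from datetime import datetime, timedelta

FRESH_WINDOW = timedelta(hours=48)   # threshold stated in the article
STALE_FACTOR = 0.2                   # assumption: "loses 80% of its value" read as a flat multiplier


def recency_coefficient(event_time, now):
    """Return 1.0 inside the 48-hour window, STALE_FACTOR once it has passed."""
    age = now - event_time
    return 1.0 if age <= FRESH_WINDOW else STALE_FACTOR


def score(intent_strength, event_time, now):
    """Combine a hypothetical signal-strength value in [0, 1] with the recency coefficient."""
    return intent_strength * recency_coefficient(event_time, now)
```

Under these assumptions, a strong signal detected 72 hours ago scores lower than a moderate one detected this morning, which is the ordering the 48-hour rule is meant to produce.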

The Numbers That Matter

The Rodz infrastructure produces measurable results:

  • 4 actionable signals per company per year on average
  • 4x qualified meetings compared to cold prospecting
  • +74% closing rate because of timing relevance
  • 15 hours saved per week per sales rep

One KPI measures effectiveness: the positive response rate, rather than open or click rates, which tracking pixels and security filters skew beyond usefulness.

Want to use this infrastructure yourself? The getting started guide with the Rodz API covers authentication and your first request, and the complete API reference lists all available endpoints.

Frequently Asked Questions

How long does it take to configure the signals?

The creation phase takes one to two weeks. It includes an audit of your offering, selection of the relevant signals from the 108 available, and setup of the 222 configurations. The first signals are delivered as soon as that phase is done.

Is the collected data GDPR-compliant?

Yes. Rodz only collects data from public sources: registries, press releases, job postings, professional profiles. Each of those publication events is, by construction, the legitimate interest that justifies the outreach. Deep Search enrichment uses professional contact details only. The right to object is offered in every outreach.

What happens if a signal is false or outdated?

Multi-source cross-referencing reduces false positives considerably. A signal detected on a single source gets verified against others before delivery. The Balance model’s recency coefficient automatically downgrades signals that age past the 48-hour threshold, so stale data doesn’t reach the sales rep as if it were fresh.

